Change the existing code, so that it prints (the necessary data) for a learning curve.
Plot the learning curves for all three data sets (monk-1, monk-2, monk-3) and interpret them.
Are the trees that are learned consistent with the training data? If not, what could be the reason?
Look at the true functions for the three data sets in monk.names (Section 9). Design a good decision tree for each of the concepts by hand.
Compare these decision trees to the ones that were learned by the algorithm (on the whole training data).