Based on a mix of real world data and a simulated dataset for predicting the students’ academic performance, we study/compare various decision tree (DT) based algorithms (which include ID3, C4.5 and CART) with different choices of information entropy metrics (which include Shannon, Quadratic, Havrda and Charvát, Rényi, Taneja, Trigonometric and R-norm entropies) to build a decision tree in order to provide appropriate counseling/advise at an earlier stage. DT is one such important technique in educational data mining (EDM) which creates hierarchical structures of classification rules “If ⋯, Then ⋯” building a tree structure by incrementally breaking down the datasets in smaller subsets. The results suggest that basic training of the students has no significant predictive power on performance, while information about their abilities, diligence, motivation and activity in the learning process can predict their grades. As such, the resulting forecasts can be used by the instructor in optimizing the learning process and designing the course content and schedule.