Data Mining Analysis of K-means Algorithm and Decision Tree for Early Detection of Students at Risk of Dropping Out
Sumber Resmi Research : journal.ittelkom-pwt.ac.id/index.php/inista/article/view/1630
Published May 24, 2025DOI @5/inista.v7i2.1630
Imam AkbarSains and Technology Faculty, Muhammadiyah University of Enrekang, IndonesiaIta Sarmita SamadLanguage and Literature Faculty, State University of Makassar, IndonesiaRahmat RahmatTeacher Training and Education, Muhammadiyah University of Enrekang, IndonesiaSri RosmianaTeacher Training and Education, Muhammadiyah University of Enrekang, Indonesia
Abstract
Dropout occurs in higher education, where students are unable to complete their studies within a specified timeframe. It has become a significant concern in education due to its substantial impact on individuals, institutions, and society. This study aims to develop a model for predicting the early potential for students' dropout using the K-Means Algorithm and decision trees. The research method consists of a Dataset, Data Preprocessing, K-means implementation, labeling student data, and Decision Tree implementation. This study resulted in 4 clusters. The students in Cluster 1 have an excellent average GPA, a substantial number of credits, and are very active. The students in Cluster 2 have a lower average GPA and are less active than in Cluster 1. The students in Cluster 3 show a relatively good average GPA, which is lower than in Clusters 1 and 2. The number of active students indicates that students in this cluster are much less active or at risk of D.O. than those in clusters 1 and 2. Cluster 4 indicates that the average GPA of students is very low, often close to zero, and they are generally inactive in academic activities. Thus, they are significantly at risk of D.O. at Universitas Muhammadiyah Enrekang. This research provides significant results, both in terms of accuracy and data interpretation. The resulting insights enable universities to make more strategic and targeted decisions, thereby reducing the risk of university dropout rates, increasing resource efficiency, and supporting the overall educational success of students. The accuracy of the resulting model is 98.52% which indicates that the model has excellent performance in classifying students at risk of D.O.
How to Cite
Akbar, I., Samad, I., Rahmat, R., & Rosmiana, S. (2025). Data Mining Analysis of K-means Algorithm and Decision Tree for Early Detection of Students at Risk of Dropping Out. Journal of Informatics Information System Software Engineering and Applications (INISTA), 7(2), 148-162. https://doi.org/10.20895/inista.v7i2.1630
Issue
@hp/inista/issue/view/68
Section
Articles
This work is licensed under a @s/by-sa/4.0/.
Authors who publish with this journal agree to the following terms:
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
References
[1] Márquez-Vera C, Cano A, Romero C, Noaman AYM, Mousa Fardoun H, Ventura S. Early dropout prediction using data mining: A case study with high school students. Expert Syst. 2016;33(1):107–24.
[2] Casanova JR, Cervero A, Núñez JC, Almeida LS, Bernardo A. Factors that determine the persistence and dropout of university students. Psicothema. 2018;30(4):408–14.
[3] Agrusti F, Bonavolontà G, Mezzini M. University dropout prediction through educational data mining techniques: A systematic review. J E-Learning Knowl Soc. 2019;15(3):161–82.
[4] Gitto L, Minervini LF, Monaco L. University dropouts in Italy: Are supply side characteristics part of the problem? Econ Anal Policy. 2016;49(February):108–16.
[5] Panggabean DD, Motlan, Harahap MH, Irfandi, Sirait AP. Development of Accelerating Strategy on Improvement of Study Program Accreditation in Accordance With 9 Criterias of Ban-Pt in the State University of Medan. Adv Soc Sci Res J. 2020;7(1):483–91.
[6] Utari M, Warsito B, Kusumaningrum R. Implementation of Data Mining for Dropout Prediction using Random Forest Method. 2020 8th Int Conf Inf Commun Technol ICoICT 2020. 2020;
Информация по комментариям в разработке