Vol.10, No.4, November 2021.                                                                                                                                                                           ISSN: 2217-8309

                                                                                                                                                                                                                          eISSN: 2217-8333

 

TEM Journal

 

TECHNOLOGY, EDUCATION, MANAGEMENT, INFORMATICS

Association for Information Communication Technology Education and Science


Comparison of Classification Data Mining C4.5 and Naïve Bayes Algorithms of EDM Dataset

 

Joseph Teguh Santoso, Ni Luh Wiwik Sri Rahayu Ginantra, Muhammad Arifin, R Riinawati, Dadang Sudrajat, Robbi Rahim

 

© 2021 Robbi Rahim, published by UIKTEN. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. (CC BY-NC-ND 4.0)

 

Citation Information: TEM Journal. Volume 10, Issue 4, Pages 1738-1744, ISSN 2217-8309, DOI: 10.18421/TEM104-34, November 2021.

 

Received: 04 July 2021.

Revised:  26 October 2021.
Accepted: 30 October 2021.
Published: 26 November 2021.

 

Abstract:

 

The purpose of this research is to choose the best method by comparing two classification methods of data mining C4.5 and Naïve Bayes on Educational Data Mining, in which the data used is student graduation data consisting of 79 records. Both methods are tested for validation with 10-ford X Validation and perform a T-Test difference test to produce a table that contains the best method ranking. Different results were obtained for each method. Based on the results of these two methods, it is very influential on the dataset and the value of the area under curve in the Naïve Bayes method is better than the C4.5 method in various datasets. Comparison of the method with the 10-Ford X Validation test and the T-Test difference test is that the Naïve Bayes method is better than C4.5 with an average accuracy value of 73.41% and an under-curve area of 0.664.

 

Keywords –Comparison, data mining, Classification, C4.5, Naive Bayes, Performance, EDM.

 

-----------------------------------------------------------------------------------------------------------

Full text PDF >  

-----------------------------------------------------------------------------------------------------------

 


Copyright © 2021 UIKTEN
Copyright licence: All articles are licenced via Creative Commons CC BY-NC-ND 4.0 licence