Classification of Heuristic Information by Using Machine Learning Algorithms

Murat KOKLU, Kadir SABANCI, Muhammed Fahri UNLERSEN
  • Kadir SABANCI
    Affiliation not present
  • Muhammed Fahri UNLERSEN
    Affiliation not present


The User Knowledge Modelling dataset in the UCI machine learning repository was used in this study. The students were classified into 4 class (very low, low, middle, and high) due to the 5 performance data in the dataset. 258 data of 403 data in the dataset were used for training and 145 of them were used for tests. The Weka (Waikato Environment for Knowledge Analysis) software was used for classification. In classification Multilayer Perceptron (MLP), k Nearest Neighbors (kNN), J48, NativeBayes, BayesNet, KStar, RBFNetwork and RBFClassifier machine learning algorithms were used and success rates and error rates were calculated. In this study 8 different data mining algorithm were used and the best classification success rate was obtained by MLP. With Multilayer perceptron neural network model the classification success rates was calculated when there are different number of neurons in the hidden layer of MLP. The best classification success rate was achieved as 97.2414% when there was 8 neurons in the hidden layer. MAE and RMSE values were obtained for this classification success rate as 0.0242 and 0.1094 respectively.


Machine learning;Weka;MLP;kNN; J48

Full Text:

Submitted: 2018-12-21 11:33:14
Published: 2016-12-26 00:00:00
Search for citations in Google Scholar
Related articles: Google Scholar


Clifton, Christopher (2010). "Encyclopædia Britannica: Definition of Data Mining". Retrieved 2010-12-09.

B., Peterson, P., Baker, E. (Eds.) International Encyclopedia of Education (3rd edition). Oxford, UK: Elsevier.

Superby J. F., Vandamme J. P., Meskens N. Determination of factors influencing the achievement of the first-year university students using data mining methods. In International conference on intelligent tutoring systems, Educational Data Mining Workshop, Taiwan, 2006:1 – 8.

Márquez-Vera, C., Cano, A., Romero, C., Ventura, S. Predicting student failure at school using genetic programming and different data mining approaches with high dimensional and imbalanced data, Applied Intelligence, April 2013, Volume 38, Issue 3, pp 315-330.

Sen B., Ucar E., Delen D. Predicting and analyzing secondary education placement-test scores: A data mining approach, Expert Systems with Applications, Vol. 39, No. 10, pp. 9468-9476, 2012.

WEKA, Last access: 10.04.2015.

Rohit Arora and Suman, Comparative Analysis of Classification Algorithms on Different Datasets using WEKA, International Journal of Computer Applications (0975 – 8887) Volume 54– No.13, September 2012.

J. Wang, P. Neskovic, and L. N. Cooper, “Improving nearest neighbor rule with a simple adaptive distance measure”, Pattern Recognition Letters, 28(2):207-213, 2007.

Y. Zhou, Y. Li, and S. Xia, “An improved KNN text classification algorithm based on clustering”, Journal of computers, 4(3):230-237, 2009.

John G. Cleary, Leonard E. Trigg: “K*: An Instance based Learner Using an Entropic Distance Measure”, 12th International Conference on Machine Learning, 108-114, 1995.

Abstract views:


Copyright (c) 2018 International Journal of Intelligent Systems and Applications in Engineering

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
© Prof.Dr. Ismail SARITAS 2013-2019     -    Address: Selcuk University, Faculty of Technology 42031 Selcuklu, Konya/TURKEY.