ANALYSIS OF BAYESIAN CLASSIFIER ACCURACY

Felipe Schneider Costa; Maria Marlene De Souza Pires; Silvia Modesto Nassar

doi:10.3844/jcssp.2013.1487.1495

Research Article Open Access

ANALYSIS OF BAYESIAN CLASSIFIER ACCURACY

Felipe Schneider Costa¹, Maria Marlene De Souza Pires¹ and Silvia Modesto Nassar¹

¹ Universidade Federal de Santa Catarina, Brazil

Abstract

The naïve Bayes classifier is considered one of the most effective classification algorithms today, competing with more modern and sophisticated classifiers. Despite being based on unrealistic (naïve) assumption that all variables are independent, given the output class, the classifier provides proper results. However, depending on the scenario utilized (network structure, number of samples or training cases, number of variables), the network may not provide appropriate results. This study uses a process variable selection, using the chi-squared test to verify the existence of dependence between variables in the data model in order to identify the reasons which prevent a Bayesian network to provide good performance. A detailed analysis of the data is also proposed, unlike other existing work, as well as adjustments in case of limit values between two adjacent classes. Furthermore, variable weights are used in the calculation of a posteriori probabilities, calculated with mutual information function. Tests were applied in both a naïve Bayesian network and a hierarchical Bayesian network. After testing, a significant reduction in error rate has been observed. The naïve Bayesian network presented a drop in error rates from twenty five percent to five percent, considering the initial results of the classification process. In the hierarchical network, there was not only a drop in fifteen percent error rate, but also the final result came to zero.

Journal of Computer Science

Volume 9 No. 11, 2013, 1487-1495

DOI: https://doi.org/10.3844/jcssp.2013.1487.1495

Submitted On: 27 August 2013 Published On: 26 September 2013

How to Cite: Costa, F. S., Pires, M. M. D. S. & Nassar, S. M. (2013). ANALYSIS OF BAYESIAN CLASSIFIER ACCURACY. Journal of Computer Science, 9(11), 1487-1495. https://doi.org/10.3844/jcssp.2013.1487.1495

Copyright: © 2013 Felipe Schneider Costa, Maria Marlene De Souza Pires and Silvia Modesto Nassar. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

5,442 Views
3,693 Downloads
4 Citations

Download

Keywords

Bayesian Network
Entropy
Feature Weighting
Mutual Information
Small Sample Set