UNBIASEDNESS OF FEATURE SELECTION BY HYBRID FILTERING
DOI:
https://doi.org/10.47839/ijc.10.1.735Keywords:
Feature selection, hybrid algorithms, machine learning, classification.Abstract
In this article we examine characteristics of feature selection algorithms by introducing their aspects important in practice. We will focus on the unbiasedness, analyse it and investigate a robust hybrid method of feature selection, being a composition of several feature filters, that could ensure unbiased results of selection. Using parallel multi-measures and voting, we reduce the risk of selecting non-optimal features, a common situation when we select attributes using single evaluation based on one evaluation criterion. To test this method we selected a personal bankruptcy dataset, containing various types of attributes and one of the popular machine learning benchmarks. By the performed experiments we will demonstrate that an approach of multi-evaluation used for features filtering may lead to the creation of effective and fast methods of features selection with an unbiased outcome.References
S. Avidan. Joint feature-basis subset selection. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004.
W. Duch, T. Wieczorek, J. Biesiada, M. Blachnik. Comparison of feature ranking methods based on information en-tropy. Proceeding of International Joint Conference on Neural Networks, 2004.
A. Frank, A. Asuncion. UCI Machine Learning Repository http://archive.ics.uci.edu/ml. Irvine, CA: University of California, School of Information and Computer Science, 2010.
M. Gonen. Analyzing Receiver Operating Characteristic Curves Using SAS. SAS Press, 2007.
P. E. Greenwood, M. S. Nikulin. A Guide to Chi-Squared Testing. John Wiley Sons, New York, 1996
I. Guyon, A. Elisseeff. An introduction to variable and feature selection. Journal of Machine Learning Research (3) (2003), pp. 1157-1182.
M. A. Hall, L. A. Smith. Feature selection for machine learning: Comparing a correlation-based filter approach to the wrapper. Proceedings of the Twelfth International FLAIRS Conference, 1999.
D. Koller, M. Sahami. Toward optimal feature selection. Proceedings of the Thirteenth International Conference on Machine Learning, 1996.
L. Kuncheva. Combining Pattern Classi?ers: Methods and Algorithms. Wiley-IEEE, Hoboken, 2004.
H. Liu, R. Setiono. Incremental feature selection. Applied Intelligence (9) (1998), pp. 217-230.
J. R. Quinlan. C4.5: programs for machine learning. Morgan Kaufmann. San Francisco, 1993.
J. Ren, Z. Qiu, W. Fan, H. Cheng, P. S. Yu. Forward semi-supervised feature selection. Paci?c-Asia Conference on Knowledge Discovery and Data Mining. 2008.
R. Rojas. Neural Networks – A Systematic Introduction. Springer-Verlag. Berlin, 1996.
L. Rozenberg, W. Pietruszkiewicz. The methodic of diagnosis and prognosis of household bankruptcy. Di?n. Warszawa, 2008.
Y. Saeys, I. Inza, P. Larranaga, D. J. Wren. A review of feature selection techniques in bioinformatics. Bioinformatics 23 (19) (2007).
D. Ververidis. C. Kotropoulos. Sequential forward feature selection with low computational cost. Proceedings of European Signal Processing Conference, 2005.
I. H. Witten. E. Frank. Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann. San Francisco, 2005.
Z. Zheng. Feature selection for text categorization on imbalanced data. ACM SIGKDD Explorations Newsletter archive 6 (1) (2004), pp. 80-89.
J. Zhou, D. P. Foster, R. A. Stine, L. H. Ungar. Streamwise feature selection. Journal of Machine Learning Research (7) (2006), pp. 1861-1885.
Downloads
Published
How to Cite
Issue
Section
License
International Journal of Computing is an open access journal. Authors who publish with this journal agree to the following terms:• Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
• Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
• Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.