A Weighted Majority Voting Ensemble Model for Disease Prediction Boosted by PSO: The Case of Type 2 Diabetes
DOI:
https://doi.org/10.47839/ijc.22.4.3354Keywords:
Diabetes, Machine learning, Ensemble majority voting, PSOAbstract
Early detection of diabetes is critical to reducing the number of cases, which continues to rise year after year. Many approaches to diagnosis have been used, but they still have flaws in making clinical decisions that are more effective and efficient. The use of intelligent systems is very effective in assisting in data analysis and clinical decision support. The purpose of this article is to develop a model to predict diabetes mellitus using the Pima Indian Diabetes Dataset (PIDD). The ensemble method has shown to be quite effective at increasing accuracy, but it has the issue of determining the optimal weight. As a result, to improve prediction accuracy, this study employs PSO optimization in the selection of ensemble majority voting weights. The test results show that predictions for ensemble majoritarian voting using PSO perform well, with an accuracy rate of 0.9322, precision of 0.9412, recall of 0.8421, and F1-score of 0.8889. PSO accuracy is improved by 4% and 7%, respectively. This demonstrates that applying PSO to the ensemble model can improve accuracy.
References
S. Li, J. Wang, B. Zhang, X. Li, and Y. Liu, “Diabetes mellitus and cause-specific mortality: A population-based study,” Diabetes Metab. J., vol. 43, no. 3, p. 319, 2019. https://doi.org/10.4093/dmj.2018.0060.
M. S. Rahman et al., “Role of insulin in health and disease: An update,” Int. J. Mol. Sci., vol. 22, no. 12, p. 6403, 2021. https://doi.org/10.3390/ijms22126403.
A. Z. Woldaregay et al., “Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes,” Artif. Intell. Med., vol. 98, no. April 2018, pp. 109–134, 2019. https://doi.org/10.1016/j.artmed.2019.07.007.
H. Qteat and M. Awad, “Using hybrid model of particle swarm optimization and multi-layer perceptron neural networks for classification of diabetes,” Int. J. Intell. Eng. Syst., vol. 14, no. 3, pp. 11–22, 2021. https://doi.org/10.22266/ijies2021.0630.02.
N. Esser, K. M. Utzschneider, and S. E. Kahn, “On the causal relationships between hyperinsulinaemia, insulin resistance, obesity and dysglycaemia in type 2 diabetes: Reply to Johnson JD [letter],” Diabetologia, vol. 64, no. 10, pp. 2345–2347, 2021. https://doi.org/10.1007/s00125-021-05511-6.
P. Kostagiolas, P. Tsiligros, P. Theodorou, N. Tentolouris, and D. Niakas, “A cross-sectional survey interconnecting health information seeking behavior with clinical data of type 2 diabetes mellitus patients,” Libr. Hi Tech, vol. 39, no. 2, pp. 448–461, 2020. https://doi.org/10.1108/LHT-02-2020-0030.
International Diabetes Federation, IDF Diabetes Atlas, 9th edition, 2019.
A. U. Haq et al., “Intelligent machine learning approach for effective recognition of diabetes in e-healthcare using clinical data,” Sensors (Switzerland), vol. 20, no. 9, article no. 2649, 2020. https://doi.org/10.3390/s20092649.
G. Battineni, G. G. Sagaro, C. Nalini, F. Amenta, and S. K. Tayebati, “Comparative machine-learning approach: A follow-up study on type 2 diabetes predictions by cross-validation methods,” Machines, vol. 7, no. 4, pp. 1–11, 2019. https://doi.org/10.3390/machines7040074.
Y. Jian, M. Pasquier, A. Sagahyroon, and F. Aloul, “A machine learning approach to predicting diabetes complications,” Healthcare, vol. 9, no. 12, p. 1712, 2021. https://doi.org/10.3390/healthcare9121712.
D. D. Rufo, T. G. Debelee, A. Ibenthal, and W. G. Negera, “Diagnosis of diabetes mellitus using gradient boosting machine (Lightgbm),” Diagnostics, vol. 11, no. 9, pp. 1–14, 2021. https://doi.org/10.3390/diagnostics11091714.
H. Jafarzadeh, M. Mahdianpari, E. Gill, F. Mohammadimanesh, and S. Homayouni, “Bagging and boosting ensemble classifiers for classification of multispectral, hyperspectral and PolSAR data: A comparative evaluation,” Remote Sens., vol. 13, no. 21, p. 4405, 2021. https://doi.org/10.3390/rs13214405,
L. J. Muhammad, E. A. Algehyne, and S. S. Usman, “Predictive supervised machine learning models for diabetes mellitus,” SN Comput. Sci., vol. 1, no. 5, pp. 1–10, 2020. https://doi.org/10.1007/s42979-020-00250-8,
S. K. Kalagotla, S. V. Gangashetty, and K. Giridhar, “A novel stacking technique for prediction of diabetes,” Comput. Biol. Med., vol. 135, no. June, p. 104554, 2021. https://doi.org/10.1016/j.compbiomed.2021.104554,
S. Kumari, D. Kumar, and M. Mittal, “An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier,” Int. J. Cogn. Comput. Eng., vol. 2, no. November 2020, pp. 40–46, 2021. https://doi.org/10.1016/j.ijcce.2021.01.001.
M. Shahhosseini, G. Hu, and H. Pham, “Optimizing ensemble weights and hyperparameters of machine learning models for regression problems,” Mach. Learn. with Appl., vol. 7, p. 100251, 2022. https://doi.org/10.1016/j.mlwa.2022.100251,
Y. Xu, C. Wu, K. Zheng, X. Wang, X. Niu, and T. Lu, “Computing adaptive feature weights with PSO to improve android malware detection,” Secur. Commun. Networks, vol. 2017, Article ID 3284080, 2017. https://doi.org/10.1155/2017/3284080.
M. Maniruzzaman et al., “Accurate diabetes risk stratification using machine learning: Role of missing value and outliers,” J. Med. Syst., vol. 42, no. 5, p. 92, 2018. https://doi.org/10.1007/s10916-018-0940-7.
G. Aksu, C. O. Güzeller, and M. T. Eser, “The effect of the normalization method used in different sample sizes on the success of artificial neural network model,” Int. J. Assess. Tools Educ., vol. 6, no. 2, pp. 170–192, 2019. https://doi.org/10.21449/ijate.479404.
M. S. Sainin, R. Alfred, and F. Ahmad, “Ensemble Meta Classifier with Sampling and Feature Selection for Data with Multiclass Imbalance Problem,” J. Inf. Commun. Technol., vol. 20, no. 2, pp. 103–133, 2021. https://doi.org/10.32890/jict2021.20.2.1.
S. Y. Kim and A. Upneja, “Majority voting ensemble with a decision trees for business failure prediction during economic downturns,” J. Innov. Knowl., vol. 6, no. 2, pp. 112–123, 2021. https://doi.org/10.1016/j.jik.2021.01.001.
H. Bouziane, B. Messabih, and A. Chouarfia, “Profiles and majority voting-based ensemble method for protein secondary structure prediction,” Evol. Bioinforma., vol. 7, no. 7, p. EBO.S7931, 2011. https://doi.org/10.4137/EBO.S7931.
J. Kennedy and R. Eberhart, “Particle swarm optimization,” Proceedings of the International Conference on Neural Networks ICNN’95, 1995, vol. 4, pp. 1942–1948.
D. Ratnawati, M. Marjono, W. Widodo, and S. Anam, “PSO-ELM with time-varying inertia weight for classification of SMILES codes,” Int. J. Intell. Eng. Syst., vol. 13, no. 6, pp. 522–532, 2020. https://doi.org/10.22266/ijies2020.1231.46.
Y. Hayashi and S. Yukita, “Rule extraction using recursive-rule extraction algorithm with J48graft combined with sampling selection techniques for the diagnosis of type 2 diabetes mellitus in the Pima Indian dataset,” Informatics Med. Unlocked, vol. 2, pp. 92–104, 2016. https://doi.org/10.1016/j.imu.2016.02.001.
Downloads
Published
How to Cite
Issue
Section
License
International Journal of Computing is an open access journal. Authors who publish with this journal agree to the following terms:• Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
• Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
• Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.