Customer Churn Prediction: A Machine Learning Approach with Data Balancing for Telecom Industry

Authors

  • Anurag Bhatnagar
  • Sumit Srivastava

DOI:

https://doi.org/10.47839/ijc.24.1.3873

Keywords:

Churn, Classification models, Imbalanced data, Hyper parameter tuning, Attributes, Validation, Accuracy

Abstract

Churn prediction is the process of identifying customers who stop using services. Churn is not only the problem in Telecom industry but also banking, insurance, gaming companies, and internet service providers are also facing this challenge. This study focuses on churn prediction in telecom industry to determine the best classification model and reduce the number of attributes in the dataset. Machine learning models like Random Forest, K-Nearest Neighbour, Decision Tree, Support Vector Machine, Logistic Regression, Bagging Classifier, Extreme Gradient Boosting, Stochastic Gradient Descent Classifier, Gaussian Naive Bayes were used. To handle imbalance data and for hyper parameter tuning, techniques like SMOTE, ENN, Under-Sampling, Over-Sampling and K-cross fold validation were used. Random Forest classifier performed exceptionally well in forecasting customer churn in the telecom sector, as evidenced by the results. Its accuracy rate was 90.30% with all attributes, and 90.90% with reduced attributes dataset. This implies that the dataset with reduced attributes may be useful for churn prediction tasks in a variety of industries, offering useful information to companies trying to reduce customer attrition. This work validates itself by comparing with four previously published research.

References

A. Bhatnagar and S. Srivastava, “Performance analysis of hoeffding and logisitic algorithm for churn prediction in telecom sector,” in 2020 International Conference on Computation, Automation and Knowledge Management (ICCAKM). IEEE, 2020, pp. 377–380.

A. Bhatnagar and S. Srivastava, “A robust model for churn prediction using supervised machine learning,” in 2019 IEEE 9th international conference on advanced computing (IACC). IEEE, 2019, pp. 45–49.

A. Saran Kumar and D. Chandrakala, “A survey on customer churn prediction using machine learning techniques,” International Journal of Computer Applications, vol. 975, p. 8887, 2016.

A. J. Petkovski, B. L. Risteska Stojkoska, K. V. Trivodaliev, and S. A. Kalajdziski, “Analysis of churn prediction: A case study on telecommunication services in macedonia,” in 2016 24th Telecommunications Forum (TELFOR), 2016, pp. 1–4.

P. Lalwani, M. K. Mishra, J. S. Chadha, and P. Sethi, “Customer churn prediction system: a machine learning approach,” Computing, vol. 104, no. 2, pp. 271–294, 2022.

T. Vafeiadis, K. I. Diamantaras, G. Sarigiannidis, and K. C. Chatzisavvas, “A comparison of machine learning techniques for customer churn prediction,” Simulation Modelling Practice and Theory, vol. 55, pp. 1–9, 2015.

I. Ullah, B. Raza, A. K. Malik, M. Imran, S. U. Islam, and S. W. Kim, “A churn prediction model using random forest: analysis of machine learning techniques for churn prediction and factor identification in telecom sector,” IEEE access, vol. 7, pp. 60 134–60 149, 2019.

B. Prabadevi, R. Shalini, and B. R. Kavitha, “Customer churning analysis using machine learning algorithms,” International Journal of Intelligent Networks, vol. 4, pp. 145–154, 2023.

T. A. R. Akbar and C. Apriono, “Machine learning predictive models analysis on telecommunications service churn rate,” Green Intelligent Systems and Applications, vol. 3, no. 1, pp. 22–34, 2023.

W. Bi, M. Cai, M. Liu, and G. Li, “A big data clustering algorithm for mitigating the risk of customer churn,” IEEE Transactions on Industrial Informatics, vol. 12, no. 3, pp. 1270–1281, 2016.

I. Mitkees, S. Badr, and A. Elseddawy, “Customer churn prediction model using data mining techniques,” 12 2017, pp. 262–268.

M. Saghir, Z. Bibi, S. Bashir, and F. H. Khan, “Churn prediction using neural network based individual and ensemble models,” in 2019 16th International Bhurban Conference on Applied Sciences and Technology (IBCAST). IEEE, 2019, pp. 634–639.

V. Kavitha, G. H. Kumar, S. M. Kumar, and M. Harish, “Churn prediction of customer in telecom industry using machine learning algorithms,” International Journal of Engineering Research & Technology (2278-0181), vol. 9, no. 05, pp. 181–184, 2020.

K. S. Rani, S. Thaslima, N. Prasanna, R. Vindhya, and P. Srilakshmi, “Analysis of customer churn prediction in telecom industry using logistic regression,” International Journal of Innovative Research in Computer Science & Technology (IJIRCST) ISSN, pp. 2347–5552, 2021.

B. Nigam, H. Dugar, and M. Niranjanamurthy, “Effectual predicting telecom customer churn using deep neural network,” Int J Eng Adv Technol (IJEAT), vol. 8, no. 5, 2019.

S. W. Fujo, S. Subramanian, M. A. Khder et al., “Customer churn prediction in telecommunication industry using deep learning,” Information Sciences Letters, vol. 11, no. 1, p. 24, 2022.

A. Khattak, Z. Mehak, H. Ahmad, M. U. Asghar, M. Z. Asghar, and A. Khan, “Customer churn prediction using composite deep learning technique,” Scientific Reports, vol. 13, no. 1, p. 17294, 2023.

S. S. Poudel, S. Pokharel, and M. Timilsina, “Explaining customer churn prediction in telecom industry using tabular machine learning models,” Machine Learning with Applications, vol. 17, p. 100567, 2024. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S2666827024000434

V. Muthupriya, R. Narayanan, S. Nakeeb, and A. Abhishek, “Customer churn analysis using xgboosted decision trees,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 25, no. 1, pp. 488–495, 2022.

H.-A. Park, “An introduction to logistic regression: from basic concepts to interpretation with particular attention to nursing domain,” Journal of Korean academy of nursing, vol. 43, no. 2, pp. 154–164, 2013.

D. Ramadhanti, A. Larasati, A. Muid, and E. Mohamad, “Building customer churn prediction models in indonesian telecommunication company using decision tree algorithm,” in AIP Conference Proceedings, vol. 2654, no. 1. AIP Publishing, 2023.

J. Balogun, F. Kasali, and I. Akinyemi, “Development of classification model for the prediction of churn among customers using decision tree algorithm,” Journal of Computer Science and Its Application, vol. 28, pp. 45–53, 08 2022.

M. K. Awang, M. Makhtar, N. Udin, and N. F. Mansor, “Improving customer churn classification with ensemble stacking method,” International Journal of Advanced Computer Science and Applications, vol. 12, no. 11, 2021.

L. Makurumidze, W. S. Manjoro, and W. Makondo, “Implementing random forest to predict churn,” International Journal of Computer Science and Mobile C

omputing, vol. 11, no. 2, pp. 75–84, 2022.

S. Gore, Y. Chibber, M. Bhasin, S. Mehta, and S. Suchitra, “Customer churn prediction using neural networks and smote-enn for data sampling,” in 2023 3rd international conference on artificial intelligence and signal processing (AISP). IEEE, 2023, pp. 1–5.

Downloads

Published

2025-03-31

How to Cite

Bhatnagar, A., & Srivastava, S. (2025). Customer Churn Prediction: A Machine Learning Approach with Data Balancing for Telecom Industry. International Journal of Computing, 24(1), 9-18. https://doi.org/10.47839/ijc.24.1.3873

Issue

Section

Articles