Performance Evaluation of Classification Algorithm for Movie Review Sentiment Analysis
DOI:
https://doi.org/10.47839/ijc.22.1.2873Keywords:
Decision Tree, K-Nearest Neighbor, Support Vector Machine, Naive Bayes, Sentiment Analysis, Performance EvaluationAbstract
The majority of the current research on sentiment analysis, which covers topics like political reviews, movie reviews, and product reviews, has developed quickly. The classification and clustering stage of sentiment analysis research involves a number of subjects. Some of them cover text classification comparison research and algorithm performance optimization. An intricate issue in sentiment analysis research is dealing with unstructured or semi-structured data. The sentiment analysis procedure and improving the efficacy of the classifier’s algorithm are both hampered by unstructured data. In order to manage unstructured data successfully and provide accurate and relevant information, unique strategies are required. The proposed classification model performance evaluation using Support Vector Machine, Naive Bayes, K-Nearest Neighbor, and Decision Tree is specifically covered in this paper. According to the study’s findings, SVM has an accuracy rate of 96% and Naive Bayes is 86%. While the decision tree’s gain accuracy is 78 percent and the kNN classification model’s gain accuracy is 78 percent, respectively. The test results demonstrate that SVM is superior to other classification models in terms of accuracy performance.
References
S. Widya Sihwi, I. Prasetya Jati, and R. Anggrainingsih, “Twitter sentiment analysis of movie reviews using information gain and Naïve Bayes classifier,” Proceedings of the 2018 Int. Semin. Appl. Technol. Inf. Commun. Creat. Technol. Hum. Life, iSemantic 2018, pp. 190–195, 2018. https://doi.org/10.1109/ISEMANTIC.2018.8549757.
B. Das and S. Chakraborty, “An improved text sentiment classification model using TF-IDF and next word negation,” 2018, [Online]. Available at: http://arxiv.org/abs/1806.06407
R. I. Mukhamediev et al., “Review of artificial intelligence and machine learning technologies: Classification, restrictions, opportunities and challenges,” Mathematics, vol. 10, no. 15, pp. 1–25, 2022. https://doi.org/10.3390/math10152552.
P. Kalaivani and K. L. Shunmuganathan, “An improved K-nearest-neighbor algorithm using genetic algorithm for sentiment classification,” Proceedings of the 2014 Int. Conf. Circuits, Power Comput. Technol. ICCPCT 2014, pp. 1647–1651, 2014. https://doi.org/10.1109/ICCPCT.2014.7054826.
T. T. Thet, J.-C. Na, and C. S. G. Khoo, “Aspect-based sentiment analysis of movie reviews on discussion boards,” J. Inf. Sci., vol. 36, no. 6, pp. 823–848, 2010. https://doi.org/10.1177/0165551510388123.
P. Chaovalit and L. Thou, “Movie review mining: A comparison between supervised and unsupervised classification approaches,” Proc. Annu. Hawaii Int. Conf. Syst. Sci., vol. 00, no. C, p. 112, 2005. https://doi.org/10.1109/HICSS.2005.445.
R. Maulana, P. A. Rahayuningsih, W. Irmayani, D. Saputra, and W. E. Jayanti, “Improved accuracy of sentiment analysis movie review using support vector machine based information gain,” J. Phys. Conf. Ser., vol. 1641, no. 1, 2020. https://doi.org/10.1088/1742-6596/1641/1/012060.
K. Lee, J. Park, I. Kim, and Y. Choi, “Predicting movie success with machine learning techniques: ways to improve accuracy,” Inf. Syst. Front., vol. 20, no. 3, pp. 577–588, 2018. https://doi.org/10.1007/s10796-016-9689-z.
A. H. Aliwy and E. H. A. Ameer, “Comparative study of five text classification algorithms with their improvements,” Int. J. Appl. Eng. Res., vol. 12, no. 14, pp. 4309–4319, 2017. https://files2.shewaya.com/files/62055.pdf.
A. I. Kadhim, “Survey on supervised machine learning techniques for automatic text classification,” Artif. Intell. Rev., vol. 52, no. 1, pp. 273–292, 2019. https://doi.org/10.1007/s10462-018-09677-1.
V. B. Vaghela, B. M. Jadav, “Analysis of various sentiment classification techniques,” Int. J. Comput. Appl., vol. 140, no. 3, pp. 22–27, 2016. https://doi.org/10.5120/ijca2016909259.
M. Karim and S. Das, “Sentiment analysis on textual reviews,” IOP Conf. Ser. Mater. Sci. Eng., vol. 396, no. 1, pp. 122–127, 2018. https://doi.org/10.1088/1757-899X/396/1/012020.
J. Miharja, J. L. Putra, and N. Hadianto, “Comparison of machine learning classification algorithm on hotel review sentiment analysis (Case study: Luminor hotel Pecenongan),” J. Pilar Nusa Mandiri, vol. 16, no. 1, pp. 59–64, 2020. https://doi.org/10.33480/pilar.v16i1.1131.
R. S. Jagdale, V. S. Shirsat, and S. N. Deshmukh, “Sentiment analysis on product reviews using machine learning techniques,” Adv. Intell. Syst. Comput., vol. 768, pp. 639–647, 2019. https://doi.org/10.1007/978-981-13-0617-4_61.
B. Saberi and S. Saad, “Sentiment analysis or opinion mining: A review,” Int. J. Adv. Sci. Eng. Inf. Technol., vol. 7, no. 5, pp. 1660–1666, 2017. https://doi.org/10.18517/ijaseit.7.5.2137.
Y. Nurdiansyah, S. Bukhori, and R. Hidayat, “Sentiment analysis system for movie review in Bahasa Indonesia using Naive Bayes classifier method,” J. Phys. Conf. Ser., vol. 1008, no. 1, 2018. https://doi.org/10.1088/1742-6596/1008/1/012011.
R. I. Pristiyanti, M. A. Fauzi, and L. Muflikhah, “Sentiment analysis summarizing film reviews using information gain and K-nearest neighbor methods,” J. Pengemb. Teknol. Inf. dan Ilmu Komput. Univ. Brawijaya, vol. 2, no. 3, pp. 1179–1186, 2018. (in Indonesian), [Online]. Available at: http://j-ptiik.ub.ac.id/index.php/j-ptiik/article/view/1140.
R. Hidayat and S. Minati, “Comparative Analysis of Text Mining Classification Algorithms for English and Indonesian Qur’an Translation,” IJID (International J. Informatics Dev., vol. 8, no. 1, p. 47, 2019. https://doi.org/10.14421/ijid.2019.08108.
B. V. Indriyono, E. Utami, and A. Sunyoto, “Utilization of the Porter Stemmer Algorithm for Indonesian in the process of classifying book types,” J. Buana Inform., vol. 6, no. 4, pp. 301–310, 2015. (in Indonesian) https://doi.org/10.24002/jbi.v6i4.462.
A. Tripathy, A. Agrawal, and S. K. Rath, “Classification of sentimental reviews using machine learning techniques,” Procedia Comput. Sci., vol. 57, no. March, pp. 821–829, 2015. https://doi.org/10.1016/j.procs.2015.07.523.
D. Kim, D. Seo, S. Cho, and P. Kang, “Multi-co-training for document classification using various document representations: TF–IDF, LDA, and Doc2Vec,” Inf. Sci. (Ny)., vol. 477, pp. 15–29, 2019. https://doi.org/10.1016/j.ins.2018.10.006.
P. Dellia and A. Tjahyanto, “Tax complaints classification on twitter using text mining,” IPTEK J. Sci., vol. 2, no. 1, p. 11, 2017. https://doi.org/10.12962/j23378530.v2i1.a2254.
C. J. Huang, Y. J. Yang, D. X. Yang, and Y. J. Chen, “Frog classification using machine learning techniques,” Expert Syst. Appl., vol. 36, no. 2 PART 2, pp. 3737–3743, 2009. https://doi.org/10.1016/j.eswa.2008.02.059.
K. Kowsari, K. J. Meimandi, M. Heidarysafa, S. Mendu, L. Barnes, and D. Brown, “Text classification algorithms: A survey,” Inf., vol. 10, no. 4, pp. 1–68, 2019. https://doi.org/10.3390/info10040150.
P. Y. Pawar and S. H. Gawande, “A comparative study on different types of approaches to text categorization,” Int. J. Mach. Learn. Comput., vol. 2, no. 4, pp. 423–426, 2012. https://doi.org/10.7763/IJMLC.2012.V2.158.
M. Mamtesh and S. Mehla, “Sentiment analysis of movie reviews using machine learning classifiers,” Int. J. Comput. Appl., vol. 182, no. 50, pp. 25–28, 2019. https://doi.org/10.5120/ijca2019918756.
V. C. Pande and A. S. Khandelwal, “Comparative analysis of various text classification algorithms,” Int. J. Comput. Sci. Inf. Technol., vol. 8, no. 5, pp. 574–577, 2017.
V. Korde, “Text classification and classifiers: A survey,” Int. J. Artif. Intell. Appl., vol. 3, no. 2, pp. 85–99, 2012. https://doi.org/10.5121/ijaia.2012.3208.
R. B. Purnomoputra and U. N. Wisesty, “Sentiment analysis of movie reviews using Naïve Bayes method with Gini index feature selection,” no. July, pp. 85–94, 2019. https://doi.org/10.34818/jdsa.2019.2.36.
Downloads
Published
How to Cite
Issue
Section
License
International Journal of Computing is an open access journal. Authors who publish with this journal agree to the following terms:• Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
• Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
• Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.