A Novel Approach to Spoken Arabic Number Recognition Based on Developed Ant Lion Algorithm
DOI:
https://doi.org/10.47839/ijc.20.2.2175
Keywords:
Pre-processing, Feature Extraction, Recognition System, Ant-lion algorithm
Abstract
An intelligent system is constructed to recognize numbers spoken in Arabic by different people. Each audio file first passes through a series of pre-processing stages that prepare it for the recognition step. A novel feature extraction approach, called Max Mean Log, is applied to reduce the dimensionality of the audio files efficiently. Recognition then uses an advanced ant lion intelligence algorithm to determine which Arabic number was spoken, and the result is converted to text representing the value of that number. The proposed method is relatively fast and very effective: on 1,800 different audio files, it achieved a recognition rate of 99% (an error rate of 1%). As a further test of the system's ability to recognize audio files, 40 additional files were used that differ from those of the people in the original dataset; the recognition rate for these files was 72.5%.
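The reported rates can be translated into file counts with simple arithmetic. The sketch below is illustrative only: it assumes each percentage applies to whole files in the stated set, and the helper name `files_at_rate` is hypothetical, not taken from the paper.

```python
# Hypothetical helper: convert a reported percentage into a whole-file count.
# Assumption: the rates in the abstract are computed over whole audio files.
def files_at_rate(total_files: int, rate_percent: float) -> int:
    return round(total_files * rate_percent / 100)


# Main dataset: 1,800 files at a 99% recognition rate.
main_total = 1800
main_correct = files_at_rate(main_total, 99.0)
main_errors = main_total - main_correct

# Additional test set: 40 files at a 72.5% recognition rate.
extra_total = 40
extra_correct = files_at_rate(extra_total, 72.5)
extra_errors = extra_total - extra_correct

print(f"Main set: {main_correct} recognized, {main_errors} misrecognized")
print(f"Additional set: {extra_correct} recognized, {extra_errors} misrecognized")
```

Under these assumptions, the 1% error rate corresponds to 18 of the 1,800 files, and the 72.5% rate corresponds to 29 of the 40 additional files.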
License
International Journal of Computing is an open access journal. Authors who publish with this journal agree to the following terms:
• Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
• Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
• Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.