A NEW VECTOR QUANTIZATION APPROACH FOR DISCRETE HMM SPEECH RECOGNITION SYSTEM

Authors

  • Mohamed Debyeche
  • Jean Paul Haton
  • Amrane Houacine

DOI:

https://doi.org/10.47839/ijc.5.1.384

Keywords:

Arabic Language, Hidden Markov Model, Vector Quantization, Neural Network, Speech Recognition

Abstract

In order to address accuracy issues of discrete Hidden Markov Models (HMMs), in this paper, a new vector quantization (VQ) approach is presented. This new VQ approach performs an optimal distribution of VQ codebook components on HMM states. This technique that we named the distributed vector quantization (DVQ) of hidden Markov models, succeeds in unifying acoustic micro-structure and phonetic macro-structure, when the estimation of HMM parameters is performed. The DVQ technique is implemented through two variants. The first variant uses the K-means algorithm (K-means-DVQ) to optimize the VQ, while the second variant exploits the benefits of the classification behavior of neural networks (NN-DVQ) for the same purpose. The proposed variants are compared with the HMMbased baseline system by experiments of specific Arabic consonants recognition. The results show that the distributed vector quantization technique increase the performance of the discrete HMM system.

References

X.D. Huang. H.W. Hon. M.Y. Hwang. K.F. Lee. A comparative study of discrete, semi continuous, and continuous hidden Markov models, Computer Speech and Language (7) (1993), p.359-368

N. Morgan. H. Bourlard. Continuous speech recognition, IEEE signal processing magazine 12(3) (1995).

J.C. Segura et al. Multiple VQ hidden Markov modeling for speech recognition, Speech Communication, (14) (1994). P.163-170

Q. Huo. C. Chan. Contextual vector quantization for speech recognition with discrete hidden Markov model, Pattern Recognition, 28(4), (1995). p.513-517.

V. Digalakis. S. Tsakalidis. C. Harizakis. L. Neumeyer. Efficient speech recognition using sub vector quantization and discrete-mixture HMMs, Computer Speech and Language, (2000). p.33-46

F. Lefevre, Non parametric probability estimation for HMM-based automatic speech recognition, Computer Speech and Language, (17) (2003). p.113-136

A. Bernard. A. Alwan. Low-Bit-rate Distributed Speech Recognition for Packet-Based and Wireless Communication, IEEE Transactions on Speech and Audio Processing, 10(8) 2002. p.570-580

R. Ethan. D. A. Subramaniam. B. D. Rao. Improved quantization structures using generalized HMM modelling with application to wideband speech coding, IEEE International Conference on Audio Speech and Signal Processing, Montreal (1) 2004 p.161-164.

M.A. Elkhouli. Hearing distinction of speech sound, Arabic linguistic and computer science, publication of Tunis University, (1989) p.267-295.

S. Davis. P. Mermelstein. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Trans. on Acoustics, Speech and Signal Processing, 28(4) (1980). p.357-366.

Y. Linde. A. Buzo. R.M. Gray. An algorithm for Vector Quantizer, IEEE Trans. on Communication , 28(1) 1980.

L.R. Rabiner. A tutorial on hidden Markov Models and Selected applications in speech recognition, IEEE Trans. Speech Process 77(2) (1989). p.257-285.

P. Hedelin. J. Skoglound. Vector quantization based on Gaussian Mixture models, IEEE trans. on Speech and Audio Processing, 8(4) (2000). p.385-401.

A. Likas, N. Vlassis & J. J. Verbeek, The global k-means clustering algorithm, Pattern Recognition, 36(2) (2003). P. 451-461.

Downloads

Published

2014-08-01

How to Cite

Debyeche, M., Haton, J. P., & Houacine, A. (2014). A NEW VECTOR QUANTIZATION APPROACH FOR DISCRETE HMM SPEECH RECOGNITION SYSTEM. International Journal of Computing, 5(1), 72-78. https://doi.org/10.47839/ijc.5.1.384

Issue

Section

Articles