AN ANTHROPOMORPHIC SPEECH PROCESSING BASED ON THE COCHLEAR MODEL AND ITS APPLICATION FOR CODING TASK

Authors

  • A. A. Petrovsky
  • D. S. Likhachov
  • W. Wan

DOI:

https://doi.org/10.47839/ijc.3.1.256

Keywords:

Cochlear model, basilar membrane, tuning cochlear filter, auditory model, sinusoidal modeling

Abstract

According to antisymmetry of basilar membrane (BM) movements, a new mathematical model of cochlea is derived using viscous cochlear fluid theory, and then transformed into a digital cochlear model with bilinear transformation. The frequency responses are found to be quite consistent with the experimental data, especially the high frequency slope is much more improved. A new cochlear map and 3 dB bandwidth characteristics for cochlear filter banks are obtained and presented, which will make applications of cochlear model more quantitative and accurate. Due to simplicity of its structure and reality of its characteristics, it will be proved the model can be used easily in speech processing system.

References

B.Allen, "Two-dimensional cochlear fluid model: new results", J.Acoust. Soc. Am. 61., 1971, pp.110-119.

B.Allen, M.M.Sondhi, "Cochlear macromechanics: time domain solutions", J.Acoust. Soc. Am. 66., 1979, pp.123-132.

R.Diependaal, "Nonlinear and active cochlear models: analysis and solution methods", Ph.D.thejsis, Delft University, 1988.

J.W.Matthews, "Mechanical modeling of nonlinear phenomena observed in the peripheral auditory system", Ph.D.thesis, Washington University, St.Louis, MD, 1980.

S.T.Neely, D.O.Kim, "A model for active elements in cochlear biomechanics", J.Acoust. Soc. Am. 79, 1986, pp.1472-1480.

S.Koshigoe, W.K.Kwok, and A.Tubis, "Effects of perilymph viscosity on low-frequency intracochlear pressures and the cochlear input impedance of the cat", J.Acoust. Soc. Am. 74, 1983, pp.486-492.

W.G.Wan, C.X.Fan, "A new solution for cochlear macromechanics", Acustica 75, 1991, pp.79-82.

W.G.Wan, C.X.Fan, "A new cochlear model: viscous fluid motion", J.Acoust. Soc. Am. 89, 1991, pp.1865.

L.Robles, M.A.Ruggero, N.C.Rich, "Basilar membrane mechanics at the base of the chinchilla cochlea. I. Input-output functions, tuning curves and response phases", J.Acoust. Soc. Am. 80., 1986, pp.1364-1374.

W.S.Rhode, "Observation of the vibration of the basilar membrane in squirrel monkeys using the Mossbauer technique", J.Accoust. Soc. Am. 49, 1971, pp.1218-1231.

W.S.Rhode, L.Robles, "Evidence from Mossbauer experiments for nonlinear vibrations in the cochlea", J.Accoust. Soc. Am. 55, 1974, pp.588-596.

J.B.Allen, "Cochlear modeling", IEEE ASSP Magazine 1, 1985, pp.3-29.

R.J.McAulay, T.F.Quatieni, "Speech analysis/synthesis based on a sinusoidal representations", IEEE Trans. on ASSP 34, 1986, pp.744-754.

Monderer, "Exploring the space-time structure at the output of a cochlear model", Ph.D. thesis, Columbia University, New York, 1988.

D.Johnson, J.Johnson, and H.Moore, A bandbook of active filters, Prentice-Hall, Inc. Englewood Gliffs, N.J., 1980.

A.A.Petrovsky, "Methods and microprocessor means of processing broadband and fastflowing processes in real time”, Nauka i Tekhnika, Minsk, 1988.

S.Puria, J.B.Allen, "A parametric study of cochlear input impendance", J.Acoust. Soc. Am. 89, 1991, pp.287-309.

D.H.Eldredge, J.D.Miller, and D.A.Bohne, "A frequency-position map for the chinchilla cochlea", J.Acoust. Soc. Am. 69, 1981, pp.1091-1095.

D.Greenwood, "A cochlear frequency-position function for several species -- 29 years later", J.Acoust. Soc. Am. 87, 1990, pp.2592-2605.

M.C.Liberman, "The cochlear frequency map for the cat: Labeling auditory nerve fibers of known characteristics frequency", J.Acoust. Soc. Am. 72, 1982, pp.1441-1449.

A.A.Petrovsky, J.A. Ganushkin. “Syntheses of digital bandpass filters. Radiotechnika”, Radiotekh. Electronik, 1985, vol. 10, pp. 24-25, 1985.

A.Petrovsky. “The synthesis of high order digital bandpass filter with tunable center frequency and bandwidth”, VIII Europ. Sig. Proc. Conf. (EUSIPCO 96). Trieste, Italy, 10-13 September 1996, pp. 1527-1530.

Alexander A. Petrovsky, Jaroslaw Baszun. “Design of Digital Tunable Bandpass Filters: Synthethis and Applications”, Instytut Informatyki, Politechnika Bialostocka, Kwartalnik electroniki i telekomunikacji, 2001, pp. 5-25.

Likhachov D.S., Petrovsky A.A. “Improved auditory-based speech coding using psychoacoustic model based on a cochlear filter bank and an average localized synchrony detection”, CISIM’2003, pp.11-19.

A. M. A. Ali et al., “Robust auditory-based speech processing using the average localized synchrony detection”, IEEE Transactions on Speech and Audio Processing, vol.10, no.5, July 2002, pp.1447-1459.

Oded Ghitza, “Auditory Models and Human Performance in Tasks Related to Speech Coding and Speech Recognition”, IEEE Transactions on Speech and Audio Processing, vol.2, no.1, part II, 1994 - pp. 115-132.

Downloads

Published

2014-08-01

How to Cite

Petrovsky, A. A., Likhachov, D. S., & Wan, W. (2014). AN ANTHROPOMORPHIC SPEECH PROCESSING BASED ON THE COCHLEAR MODEL AND ITS APPLICATION FOR CODING TASK. International Journal of Computing, 3(1), 75-83. https://doi.org/10.47839/ijc.3.1.256

Issue

Section

Articles