AN ANTHROPOMORPHIC SPEECH PROCESSING BASED ON THE COCHLEAR MODEL AND ITS APPLICATION FOR CODING TASK
DOI:
https://doi.org/10.47839/ijc.3.1.256Keywords:
Cochlear model, basilar membrane, tuning cochlear filter, auditory model, sinusoidal modelingAbstract
According to antisymmetry of basilar membrane (BM) movements, a new mathematical model of cochlea is derived using viscous cochlear fluid theory, and then transformed into a digital cochlear model with bilinear transformation. The frequency responses are found to be quite consistent with the experimental data, especially the high frequency slope is much more improved. A new cochlear map and 3 dB bandwidth characteristics for cochlear filter banks are obtained and presented, which will make applications of cochlear model more quantitative and accurate. Due to simplicity of its structure and reality of its characteristics, it will be proved the model can be used easily in speech processing system.References
B.Allen, "Two-dimensional cochlear fluid model: new results", J.Acoust. Soc. Am. 61., 1971, pp.110-119.
B.Allen, M.M.Sondhi, "Cochlear macromechanics: time domain solutions", J.Acoust. Soc. Am. 66., 1979, pp.123-132.
R.Diependaal, "Nonlinear and active cochlear models: analysis and solution methods", Ph.D.thejsis, Delft University, 1988.
J.W.Matthews, "Mechanical modeling of nonlinear phenomena observed in the peripheral auditory system", Ph.D.thesis, Washington University, St.Louis, MD, 1980.
S.T.Neely, D.O.Kim, "A model for active elements in cochlear biomechanics", J.Acoust. Soc. Am. 79, 1986, pp.1472-1480.
S.Koshigoe, W.K.Kwok, and A.Tubis, "Effects of perilymph viscosity on low-frequency intracochlear pressures and the cochlear input impedance of the cat", J.Acoust. Soc. Am. 74, 1983, pp.486-492.
W.G.Wan, C.X.Fan, "A new solution for cochlear macromechanics", Acustica 75, 1991, pp.79-82.
W.G.Wan, C.X.Fan, "A new cochlear model: viscous fluid motion", J.Acoust. Soc. Am. 89, 1991, pp.1865.
L.Robles, M.A.Ruggero, N.C.Rich, "Basilar membrane mechanics at the base of the chinchilla cochlea. I. Input-output functions, tuning curves and response phases", J.Acoust. Soc. Am. 80., 1986, pp.1364-1374.
W.S.Rhode, "Observation of the vibration of the basilar membrane in squirrel monkeys using the Mossbauer technique", J.Accoust. Soc. Am. 49, 1971, pp.1218-1231.
W.S.Rhode, L.Robles, "Evidence from Mossbauer experiments for nonlinear vibrations in the cochlea", J.Accoust. Soc. Am. 55, 1974, pp.588-596.
J.B.Allen, "Cochlear modeling", IEEE ASSP Magazine 1, 1985, pp.3-29.
R.J.McAulay, T.F.Quatieni, "Speech analysis/synthesis based on a sinusoidal representations", IEEE Trans. on ASSP 34, 1986, pp.744-754.
Monderer, "Exploring the space-time structure at the output of a cochlear model", Ph.D. thesis, Columbia University, New York, 1988.
D.Johnson, J.Johnson, and H.Moore, A bandbook of active filters, Prentice-Hall, Inc. Englewood Gliffs, N.J., 1980.
A.A.Petrovsky, "Methods and microprocessor means of processing broadband and fastflowing processes in real time”, Nauka i Tekhnika, Minsk, 1988.
S.Puria, J.B.Allen, "A parametric study of cochlear input impendance", J.Acoust. Soc. Am. 89, 1991, pp.287-309.
D.H.Eldredge, J.D.Miller, and D.A.Bohne, "A frequency-position map for the chinchilla cochlea", J.Acoust. Soc. Am. 69, 1981, pp.1091-1095.
D.Greenwood, "A cochlear frequency-position function for several species -- 29 years later", J.Acoust. Soc. Am. 87, 1990, pp.2592-2605.
M.C.Liberman, "The cochlear frequency map for the cat: Labeling auditory nerve fibers of known characteristics frequency", J.Acoust. Soc. Am. 72, 1982, pp.1441-1449.
A.A.Petrovsky, J.A. Ganushkin. “Syntheses of digital bandpass filters. Radiotechnika”, Radiotekh. Electronik, 1985, vol. 10, pp. 24-25, 1985.
A.Petrovsky. “The synthesis of high order digital bandpass filter with tunable center frequency and bandwidth”, VIII Europ. Sig. Proc. Conf. (EUSIPCO 96). Trieste, Italy, 10-13 September 1996, pp. 1527-1530.
Alexander A. Petrovsky, Jaroslaw Baszun. “Design of Digital Tunable Bandpass Filters: Synthethis and Applications”, Instytut Informatyki, Politechnika Bialostocka, Kwartalnik electroniki i telekomunikacji, 2001, pp. 5-25.
Likhachov D.S., Petrovsky A.A. “Improved auditory-based speech coding using psychoacoustic model based on a cochlear filter bank and an average localized synchrony detection”, CISIM’2003, pp.11-19.
A. M. A. Ali et al., “Robust auditory-based speech processing using the average localized synchrony detection”, IEEE Transactions on Speech and Audio Processing, vol.10, no.5, July 2002, pp.1447-1459.
Oded Ghitza, “Auditory Models and Human Performance in Tasks Related to Speech Coding and Speech Recognition”, IEEE Transactions on Speech and Audio Processing, vol.2, no.1, part II, 1994 - pp. 115-132.
Downloads
Published
How to Cite
Issue
Section
License
International Journal of Computing is an open access journal. Authors who publish with this journal agree to the following terms:• Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
• Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
• Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.