A cepstrum domain HMM-based speech enhancement method applied to nonstationary noise
| Document type: | Conference Papers |
|---|---|
| Peer reviewed: | Yes |
| Author(s): | Mikael Nilsson, Mattias Dahl, Ingvar Claesson |
| Title: | A cepstrum domain HMM-based speech enhancement method applied to nonstationary noise |
| Conference name: | 7th International Symposium on Digital Signal Processing for Communication, Systems, DEC, 2003 |
| Year: | 2005 |
| Pagination: | 1-13 |
| ISBN: | 0-387-22847-0 |
| Publisher: | Springer |
| City: | Coolangatta, AUSTRALIA |
| ISI number: | 000226081100001 |
| Organization: | Blekinge Institute of Technology |
| Department: | School of Engineering - Dept. of Signal Processing (Sektionen för teknik – avd. för signalbehandling) School of Engineering S- 372 25 Ronneby +46 455 38 50 00 http://www.tek.bth.se/ |
| Language: | English |
| Abstract: | This paper presents a Hidden Markov Model (HMM)-based speech enhancement method, aiming at reducing non-stationary noise from speech signals. The system is based on the assumption that the speech and the noise are additive and uncorrelated. Cepstral features are used to extract statistical information from both the speech and the noise. A-priori statistical information is collected from long training sequences into ergodic hidden Markov models. Given the ergodic models for the speech and the noise, a compensated speech-noise model is created by means of parallel model combination, using a log-normal approximation. During the compensation, the mean of every mixture in the speech and noise model is stored. The stored means are then used in the enhancement process to create the most likely speech and noise power spectral distributions using the forward algorithm combined with mixture probability. The distributions are used to generate a Wiener filter for every observation. The paper includes a performance evaluation of the speech enhancer for stationary as well as non-stationary noise environments. |
| Subject: | Signal Processing\General Signal Processing\Detection and Classification |
| Keywords: | HMM, PMC, speech enhancement, log-normal |
| Note: | SIGNAL PROCESSING FOR TELECOMMUNICATIONS AND MULTIMEDIA Book Series: MULTIMEDIA SYSTEMS AND APPLICATIONS (SERIES) Vol. 27 |
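
The abstract names two standard building blocks: parallel model combination (PMC) with a log-normal approximation, and a Wiener filter built from speech and noise power spectra. The following is a minimal sketch of both for a single pair of Gaussian mixture components, not the authors' implementation. It assumes diagonal covariances and works directly in the log-spectral domain, so the cepstrum-to-log-spectrum transform used in the paper is omitted. All function names and the `floor` parameter are hypothetical and chosen for illustration.

```python
import math


def lognormal_to_linear(mu_log, var_log):
    """Map per-bin log-spectral Gaussian moments to linear-spectral moments,
    using the mean and variance of a log-normal distribution."""
    mu_lin = [math.exp(m + v / 2.0) for m, v in zip(mu_log, var_log)]
    var_lin = [ml * ml * (math.exp(v) - 1.0) for ml, v in zip(mu_lin, var_log)]
    return mu_lin, var_lin


def linear_to_lognormal(mu_lin, var_lin):
    """Inverse mapping: approximate the linear-domain moments by a log-normal."""
    var_log = [math.log(v / (m * m) + 1.0) for m, v in zip(mu_lin, var_lin)]
    mu_log = [math.log(m) - 0.5 * vl for m, vl in zip(mu_lin, var_log)]
    return mu_log, var_log


def combine_speech_noise(speech, noise):
    """Combine one speech and one noise component, each a (mu_log, var_log) pair.

    Additive, uncorrelated speech and noise (the abstract's assumption) means
    that linear-domain means and variances simply add.
    Returns the compensated (mu_log, var_log) plus the linear speech and noise
    means, which a caller could store for the later filtering step.
    """
    s_mu, s_var = lognormal_to_linear(*speech)
    n_mu, n_var = lognormal_to_linear(*noise)
    y_mu = [a + b for a, b in zip(s_mu, n_mu)]
    y_var = [a + b for a, b in zip(s_var, n_var)]
    return linear_to_lognormal(y_mu, y_var), s_mu, n_mu


def wiener_gain(speech_psd, noise_psd, floor=1e-12):
    """Per-bin Wiener gain S / (S + N); `floor` guards against division by zero."""
    return [s / max(s + n, floor) for s, n in zip(speech_psd, noise_psd)]


# Illustrative values only: two frequency bins, made-up model parameters.
speech_model = ([0.0, 1.0], [0.1, 0.2])
noise_model = ([-1.0, 0.5], [0.05, 0.1])
(noisy_mu, noisy_var), speech_psd, noise_psd = combine_speech_noise(
    speech_model, noise_model
)
gain = wiener_gain(speech_psd, noise_psd)
```

In the method described in the abstract, the speech and noise spectra fed to the Wiener filter are weighted by state and mixture probabilities obtained from the forward algorithm. This sketch skips that weighting and filters with a single component pair.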












