A robust feature extraction for automatic speech recognition in noisy environments

This paper presents a method for extraction of speech robust features when the external noise is additive and has white noise characteristics. The process consists of a short time power normalisation which goal is to preserve as much as possible, the speech features against noise. The proposed norma...

Full description

Bibliographic Details
Main Author:	Lima, C. S. (author)
Other Authors:	Almeida, Luís B. (author), Monteiro, João L. (author)
Format:	conferencePaper
Language:	eng
Published:	2002
Subjects:	Features robustness Features extraction Robust speech recognition HMM modelling Science & Technology
Online Access:	http://hdl.handle.net/1822/2142
Country:	Portugal
Oai:	oai:repositorium.sdum.uminho.pt:1822/2142

Description
Summary:	This paper presents a method for extraction of speech robust features when the external noise is additive and has white noise characteristics. The process consists of a short time power normalisation which goal is to preserve as much as possible, the speech features against noise. The proposed normalisation will be optimal if the corrupted process has, as the noise process white noise characteristics. With optimal normalisation we can mean that the corrupting noise does not change at all the means of the observed vectors of the corrupted process. As most of the speech energy is contained in a relatively small frequency band being most of the band composed by noise or noise-like power, this normalisation process can still capture most of the noise distortions. For Signal to Noise Ratio greater than 5 dB the results show that for stationary white noise, the normalisation process where the noise characteristics are ignored at the test phase, outperforms the conventional Markov models composition where the noise is known. If the noise is known, a reasonable approximation of the inverted system can be easily obtained performing noise compensation still increasing the recogniser performance.

A robust feature extraction for automatic speech recognition in noisy environments

Similar Items

Need Help?