Kalman tracking linear predictor for vowel intelligibility enhancement on european portuguese HMM based speech synthesis

The recent developments on Hidden Markov Models (HMM) based speech synthesis showed that this is a promising technology fully capable of competing with other established techniques. However some issues still lack a solution. Several authors report an over-smoothing phenomenon on both time and freque...

ver descrição completa

Detalhes bibliográficos
Autor principal: Coelho, Luís (author)
Outros Autores: Braga, Daniela (author), Garcia-Mateo, Carmen (author)
Formato: conferenceObject
Idioma:eng
Publicado em: 2016
Assuntos:
Texto completo:http://hdl.handle.net/10400.22/7636
País:Portugal
Oai:oai:recipp.ipp.pt:10400.22/7636
Descrição
Resumo:The recent developments on Hidden Markov Models (HMM) based speech synthesis showed that this is a promising technology fully capable of competing with other established techniques. However some issues still lack a solution. Several authors report an over-smoothing phenomenon on both time and frequencies which decreases naturalness and sometimes intelligibility. In this work we present a new vowel intelligibility enhancement algorithm that uses a discrete Kalman filter (DKF) for tracking frame based parameters. The inter-frame correlations are modelled by an autoregressive structure which provides an underlying time frame dependency and can improve time-frequency resolution. The system’s performance has been evaluated using objective and subjective tests and the proposed methodology has led to improved results.