Kalman tracking linear predictor for vowel intelligibility enhancement on european portuguese HMM based speech synthesis

The recent developments on Hidden Markov Models (HMM) based speech synthesis showed that this is a promising technology fully capable of competing with other established techniques. However some issues still lack a solution. Several authors report an over-smoothing phenomenon on both time and freque...

Full description

Bibliographic Details
Main Author: Coelho, Luís (author)
Other Authors: Braga, Daniela (author), Garcia-Mateo, Carmen (author)
Format: conferenceObject
Language:eng
Published: 2016
Subjects:
Online Access:http://hdl.handle.net/10400.22/7636
Country:Portugal
Oai:oai:recipp.ipp.pt:10400.22/7636
Description
Summary:The recent developments on Hidden Markov Models (HMM) based speech synthesis showed that this is a promising technology fully capable of competing with other established techniques. However some issues still lack a solution. Several authors report an over-smoothing phenomenon on both time and frequencies which decreases naturalness and sometimes intelligibility. In this work we present a new vowel intelligibility enhancement algorithm that uses a discrete Kalman filter (DKF) for tracking frame based parameters. The inter-frame correlations are modelled by an autoregressive structure which provides an underlying time frame dependency and can improve time-frequency resolution. The system’s performance has been evaluated using objective and subjective tests and the proposed methodology has led to improved results.