Kalman tracking linear predictor for vowel intelligibility enhancement on european portuguese HMM based speech synthesis
The recent developments on Hidden Markov Models (HMM) based speech synthesis showed that this is a promising technology fully capable of competing with other established techniques. However some issues still lack a solution. Several authors report an over-smoothing phenomenon on both time and freque...
Main Author: | |
---|---|
Other Authors: | , |
Format: | conferenceObject |
Language: | eng |
Published: |
2016
|
Subjects: | |
Online Access: | http://hdl.handle.net/10400.22/7636 |
Country: | Portugal |
Oai: | oai:recipp.ipp.pt:10400.22/7636 |
Summary: | The recent developments on Hidden Markov Models (HMM) based speech synthesis showed that this is a promising technology fully capable of competing with other established techniques. However some issues still lack a solution. Several authors report an over-smoothing phenomenon on both time and frequencies which decreases naturalness and sometimes intelligibility. In this work we present a new vowel intelligibility enhancement algorithm that uses a discrete Kalman filter (DKF) for tracking frame based parameters. The inter-frame correlations are modelled by an autoregressive structure which provides an underlying time frame dependency and can improve time-frequency resolution. The system’s performance has been evaluated using objective and subjective tests and the proposed methodology has led to improved results. |
---|