Sibilant consonants classification with deep neural networks

Bibliographic details
Main author: Anjos, Ivo (author)
Other authors: Marques, Nuno (author), Grilo, Ana Margarida (author), Guimarães, Isabel (author), Magalhães, João (author), Cavaco, Sofia (author)
Format: conferenceObject
Language: eng
Published in: 2022
Subjects:
Full text: http://hdl.handle.net/10400.26/40076
Country: Portugal
OAI: oai:comum.rcaap.pt:10400.26/40076
Description
Abstract: Many children suffering from speech sound disorders cannot pronounce the sibilant consonants correctly. We have developed a serious game that is controlled by the children's voices in real time and that allows children to practice the European Portuguese sibilant consonants. For this, the game uses a sibilant consonant classifier. Since the game does not require any type of adult supervision, children can practice the production of these sounds more often, which may lead to faster improvements of their speech. Recently, the use of deep neural networks has brought considerable improvements in classification for a variety of use cases, from image classification to speech and language processing. Here we propose to use deep convolutional neural networks to classify sibilant phonemes of European Portuguese in our serious game for speech and language therapy. We compared the performance of several different artificial neural networks that used Mel frequency cepstral coefficients or log Mel filterbanks. Our best deep learning model achieves classification scores of 95.48% using a 2D convolutional model with log Mel filterbanks as input features.