Classificação prosódica de marcadores discursivos

This work describes the discourse markers present in two corpora for European Portuguese, in different domains (university lectures and map-task dialogues). In this study, we also perform a multiclass automatic classification task based on prosodic features to verify in both corpora which words are...

ver descrição completa

Detalhes bibliográficos
Autor principal: Cabarrão, V. (author)
Outros Autores: Moniz, H. (author), Ferreira, J. (author), Batista, F. (author), Trancoso, I. (author), Mata, Ana I. (author), Curto, S. (author)
Formato: article
Idioma:por
Publicado em: 2017
Assuntos:
Texto completo:http://hdl.handle.net/10071/12800
País:Portugal
Oai:oai:repositorio.iscte-iul.pt:10071/12800
Descrição
Resumo:This work describes the discourse markers present in two corpora for European Portuguese, in different domains (university lectures and map-task dialogues). In this study, we also perform a multiclass automatic classification task based on prosodic features to verify in both corpora which words are discourse markers, which are disfluencies, and which are sentence like-units (SUs). Results show that the selection of discourse markers varies across domain and between speakers. As for the classification task, results show that the discourse markers are better classified in the lectures corpus (87%) than in the dialogue corpus (84%). However, cross?domain experiments evidenced that data trained with the dialogue corpus predicts better the events in the lecture corpus, since this domain displays more speakers and therefore complex patterns. In both corpora, markers are more easily classified as SUs than as disfluencies.