Entity Relation Extraction from News Articles in Portuguese for Competitive Intelligence based on BERT

Bibliographic details
Main author: De Los Reyes, Daniel (author)
Other authors: Trajano, Douglas (author), Manssour, Isabel (author), Vieira, Renata (author), Bordini, Rafael (author)
Format: article
Language: eng
Published: 2021
Subjects:
Full text: http://hdl.handle.net/10174/30462
Country: Portugal
OAI: oai:dspace.uevora.pt:10174/30462
Description
Abstract: Competitive intelligence (CI) is a relevant area of a corporation that can support strategic business decisions by informing those responsible for positioning the organization in the market. This work uses Bidirectional Encoder Representations from Transformers (BERT) to process a sentence and its named entities and to extract the parts of the sentence that represent or describe the semantic relationship between these named entities. The approach was developed for the Portuguese language, targeting the financial domain and exploring deep linguistic representations without using other lexical-semantic resources. The experiments show a precision of 73.5% using the Jaccard metric, which measures the similarity between sentences. A second contribution of this work is a manually constructed dataset with more than 4,500 annotated tuples (phrase, entity, entity).
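
To illustrate the Jaccard metric mentioned in the abstract, the sketch below computes the Jaccard similarity between two text spans as the size of the intersection of their token sets divided by the size of their union. This record does not say how the authors tokenize text or aggregate scores into the reported figure, so the lowercased whitespace tokenization, the function name `jaccard_similarity`, and the example phrases are all assumptions made for illustration only.

```python
def jaccard_similarity(predicted: str, reference: str) -> float:
    """Jaccard similarity |A ∩ B| / |A ∪ B| over lowercased whitespace tokens.

    Assumption: whitespace tokenization; the original work may tokenize differently.
    """
    a = set(predicted.lower().split())
    b = set(reference.lower().split())
    if not a and not b:
        # Convention chosen for this sketch: two empty spans count as identical.
        return 1.0
    return len(a & b) / len(a | b)


# Hypothetical example: an extracted relation descriptor vs. a gold annotation.
score = jaccard_similarity("anunciou a compra da", "anunciou compra da")
print(score)  # 3 shared tokens out of 4 distinct tokens -> 0.75
```

Under this set-based definition, word order and repeated tokens are ignored, so the score only reflects which tokens the two spans have in common.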