Document details

Entity Relation Extraction from News Articles in Portuguese for Competitive Intelligence based on BERT

Author(s): De Los Reyes, Daniel ; Trajano, Douglas ; Manssour, Isabel ; Vieira, Renata ; Bordini, Rafael

Date: 2021

Persistent ID: http://hdl.handle.net/10174/30462

Origin: Repositório Científico da Universidade de Évora

Subject(s): Entidades nomeadas; Business Intelligence; Extração de Informação


Description

Competitive intelligence (CI) is a relevant area of a corporation and can support the strategic business area by showing those responsible, helping decision making on how to position an organization in the market. This work uses the Bidirectional Transformer Encoding Representations (BERT) to process a sentence and its named entities and extract the parts of the sentences that represent or describe the semantic relationship between these named entities. The approach was developed for the Portuguese language, considering the financial domain and exploring deep linguistic representations without using other lexical-semantic resources. The results of the experiments show a precision of 73.5% using the Jaccard metric that measures the similarity between sentences. A second contribution of this work is the manually constructed dataset with more than 4.500 tuples (phrase, entity, entity) annotated.

FCT CEECIND/01997/2017, UIDB/00057/2020, CAPES

Document Type Journal article
Language English
facebook logo  linkedin logo  twitter logo 
mendeley logo

Related documents

No related documents