Publicação

Named Entities in the QTLeap Corpus of Online Helpdesk Interactions

Ver documento

Detalhes bibliográficos
Resumo:In this paper we present the annotation of a corpus with named entities that are classified into semantic types and disambiguated by linking them to their corresponding entry in the Portuguese DBpedia. This corpus, QTLeap Corpus, is a multilingual collection of question and answer pairs from a chat-based helpdesk service for Information and Communication Technologies. The resulting annotated corpus is a gold-standard named entity annotated lexical resource that is useful in supporting the training and evaluation of named entity annotation and disambiguation tools for Portuguese.
Autores principais:Querido, Andreia
Outros Autores:Carvalho, Rita de; Rodrigues, João; Silva, João; Neale, Steven; Pereira, Rita; Gomes, Patrícia; Correia, Catarina; Amaral, Diana; Branco, António
Assunto:Annotated corpus QTLeap Corpus Named entities Annotation task Disambiguation task
Ano:2016
País:Portugal
Tipo de documento:artigo
Tipo de acesso:acesso aberto
Instituição associada:Universidade de Lisboa
Idioma:inglês
Origem:Repositório da Universidade de Lisboa
Descrição
Resumo:In this paper we present the annotation of a corpus with named entities that are classified into semantic types and disambiguated by linking them to their corresponding entry in the Portuguese DBpedia. This corpus, QTLeap Corpus, is a multilingual collection of question and answer pairs from a chat-based helpdesk service for Information and Communication Technologies. The resulting annotated corpus is a gold-standard named entity annotated lexical resource that is useful in supporting the training and evaluation of named entity annotation and disambiguation tools for Portuguese.