Publicação

CQPWeb: Uma nova plataforma de pesquisa para o CRPC

Ver documento

Detalhes bibliográficos
Resumo:We present a newly available online resource for Portuguese, a new version of the Reference Corpus of Contemporary Portuguese, now searchable via a user-friendly web interface. We report on work carried out on the corpus previous to its publication online, namely how the corpus was built, our choice of metadata and the processes and tools involved for the cleaning, preparation and annotation to make the corpus suitable for linguistic inquiries. We also describe the web platform and resume the extensive search options available for linguistic or NLP studies.
Autores principais:Mendes, Amália
Outros Autores:Généreux, Michel; Hendrickx, Iris; Pereira, Luísa; Bacelar do Nascimento, Maria Fernanda; Antunes, Sandra
Assunto:Corpus Limpeza Pré-processamento linguístico Pesquisa online
Ano:2012
País:Portugal
Tipo de documento:documento de conferência
Tipo de acesso:acesso aberto
Instituição associada:Universidade de Lisboa
Idioma:português
Origem:Repositório da Universidade de Lisboa
Descrição
Resumo:We present a newly available online resource for Portuguese, a new version of the Reference Corpus of Contemporary Portuguese, now searchable via a user-friendly web interface. We report on work carried out on the corpus previous to its publication online, namely how the corpus was built, our choice of metadata and the processes and tools involved for the cleaning, preparation and annotation to make the corpus suitable for linguistic inquiries. We also describe the web platform and resume the extensive search options available for linguistic or NLP studies.