Publication
CQPWeb: Uma nova plataforma de pesquisa para o CRPC [CQPWeb: A new search platform for the CRPC]
| Abstract: | We present a newly available online resource for Portuguese: a new version of the Reference Corpus of Contemporary Portuguese, now searchable via a user-friendly web interface. We report on the work carried out on the corpus prior to its online publication, namely how the corpus was built, our choice of metadata, and the processes and tools involved in cleaning, preparing and annotating the corpus to make it suitable for linguistic inquiry. We also describe the web platform and summarize the extensive search options available for linguistic or NLP studies. |
|---|---|
| Main authors: | Mendes, Amália |
| Other authors: | Généreux, Michel; Hendrickx, Iris; Pereira, Luísa; Bacelar do Nascimento, Maria Fernanda; Antunes, Sandra |
| Subject: | Corpus; Cleaning; Linguistic pre-processing; Online search |
| Year: | 2012 |
| Country: | Portugal |
| Document type: | conference paper |
| Access type: | open access |
| Associated institution: | Universidade de Lisboa |
| Language: | Portuguese |
| Source: | Repositório da Universidade de Lisboa |