Publicação
NER in archival finding aids: extended
| Resumo: | The amount of information preserved in Portuguese archives has increased over the years. These documents represent a national heritage of high importance, as they portray the country’s history. Currently, most Portuguese archives have made their finding aids available to the public in digital format, however, these data do not have any annotation, so it is not always easy to analyze their content. In this work, Named Entity Recognition solutions were created that allow the identification and classification of several named entities from the archival finding aids. These named entities translate into crucial information about their context and, with high confidence results, they can be used for several purposes, for example, the creation of smart browsing tools by using entity linking and record linking techniques. In order to achieve high result scores, we annotated several corpora to train our own Machine Learning algorithms in this context domain. We also used different architectures, such as CNNs, LSTMs, and Maximum Entropy models. Finally, all the created datasets and ML models were made available to the public with a developed web platform, NER@DI. |
|---|---|
| Autores principais: | Cunha, Luís Filipe da Costa |
| Outros Autores: | Ramalho, José Carlos |
| Assunto: | named entity recognition archival search aids machine learning deep learning maximum entropy |
| Ano: | 2022 |
| País: | Portugal |
| Tipo de documento: | artigo |
| Tipo de acesso: | acesso aberto |
| Instituição associada: | Universidade do Minho |
| Idioma: | inglês |
| Origem: | RepositóriUM - Universidade do Minho |
Registos relacionados
article NER in Archival Finding Aids
por: Cunha, Luís Filipe da Costa
Publicado em: (2021)
por: Cunha, Luís Filipe da Costa
Publicado em: (2021)
article Towards Entity Linking, NER in archival finding aids
por: Cunha, Luís Filipe da Costa
Publicado em: (2021)
por: Cunha, Luís Filipe da Costa
Publicado em: (2021)
school Entity recognition in archival descriptions
por: Cunha, Luís Filipe da Costa
Publicado em: (2022)
por: Cunha, Luís Filipe da Costa
Publicado em: (2022)
article Development of a machine learning framework for biomedical text mining
por: Rodrigues, Rúben
Publicado em: (2016)
por: Rodrigues, Rúben
Publicado em: (2016)
article Desenvolvimento e avaliação de um modelo NER no domínio da análise cultural e do turismo
por: Sotelo Docío, Susana
Publicado em: (2023)
por: Sotelo Docío, Susana
Publicado em: (2023)
school Anonimização automática
por: Santos, Luís Bernardo Crisóstomo e Silva Rodrigues Esteves dos
Publicado em: (2025)
por: Santos, Luís Bernardo Crisóstomo e Silva Rodrigues Esteves dos
Publicado em: (2025)
school CEPAD: Classificação e processamento automatizado de documento
por: Borges, Rui Pedro Pinto
Publicado em: (2022)
por: Borges, Rui Pedro Pinto
Publicado em: (2022)
article Dyscalculia: A behavioural vision
por: Ferraz, Filipa Tinoco
Publicado em: (2019)
por: Ferraz, Filipa Tinoco
Publicado em: (2019)
article Applying recognition of emotions in speech to extend the impact of brand slogan research
por: Chien, Charles S.
Publicado em: (2007)
por: Chien, Charles S.
Publicado em: (2007)
article A text mining approach for the extraction of kinetic information from literature
por: Freitas, Ana A.
Publicado em: (2015)
por: Freitas, Ana A.
Publicado em: (2015)
article Bringing named entity recognition on Drupal content management system
por: Fernandes, José
Publicado em: (2014)
por: Fernandes, José
Publicado em: (2014)
school Extraction of kinetic information from literature
por: Freitas, Ana Alão
Publicado em: (2014)
por: Freitas, Ana Alão
Publicado em: (2014)
category A text mining approach for the extraction of kinetic information from literature
por: Freitas, Ana A.
Publicado em: (2015)
por: Freitas, Ana A.
Publicado em: (2015)
article A survey on the semi supervised learning paradigm in the context of speech emotion recognition
por: Andrade, Guilherme
Publicado em: (2022)
por: Andrade, Guilherme
Publicado em: (2022)
article Inertial data-based AI approaches for ADL and fall recognition
por: Martins, Luís M.
Publicado em: (2022)
por: Martins, Luís M.
Publicado em: (2022)
article Violence detection in audio: evaluating the effectiveness of deep learning models and data augmentation
por: Durães, Dalila
Publicado em: (2023)
por: Durães, Dalila
Publicado em: (2023)
article Weakness evaluation on in-vehicle violence detection: an assessment of X3D, C2D and I3D against FGSM and PGD
por: Santos, Flávio
Publicado em: (2022)
por: Santos, Flávio
Publicado em: (2022)
article Evaluation of chemical and gene/protein entity recognition systems at BioCreative V.5: the CEMP and GPRO patents tracks
por: Pérez-Pérez, Martin
Publicado em: (2017)
por: Pérez-Pérez, Martin
Publicado em: (2017)
article Biomedical text mining applied to document retrieval and semantic indexing
por: Lourenço, Anália
Publicado em: (2009)
por: Lourenço, Anália
Publicado em: (2009)
article BioDR: semantic indexing networks for biomedical document retrieval
por: Lourenço, Anália
Publicado em: (2010)
por: Lourenço, Anália
Publicado em: (2010)
article Use of biochemical tests and machine learning in the search for potential diagnostic biomarkers of COVID-19, HIV/AIDS, and pulmonary tuberculosis
por: Cobre, Alexandre
Publicado em: (2024)
por: Cobre, Alexandre
Publicado em: (2024)
article A Comparison of AutoML Tools for Machine Learning, Deep Learning and XGBoost
por: Ferreira, Luís
Publicado em: (2021)
por: Ferreira, Luís
Publicado em: (2021)
draft Data confession in the portuguese EDM
por: Costa, Leonardo
Publicado em: (2008)
por: Costa, Leonardo
Publicado em: (2008)
article An approach using entropy and supervised classifications to disaggregate agricultural data at a local level
por: Xavier, Antonio
Publicado em: (2019)
por: Xavier, Antonio
Publicado em: (2019)
school Visualizing neural network architectures
por: Tavares, Diogo de Oliveira Campos
Publicado em: (2023)
por: Tavares, Diogo de Oliveira Campos
Publicado em: (2023)
article Computer-aided diagnosis in Brain Computer Tomography screening
por: Peixoto, Hugo
Publicado em: (2009)
por: Peixoto, Hugo
Publicado em: (2009)
article Computer vision-based wood identification: a review
por: Silva, José Luís
Publicado em: (2022)
por: Silva, José Luís
Publicado em: (2022)
article A regression deep learning approach for fashion compatibility
por: Silva, Luís
Publicado em: (2024)
por: Silva, Luís
Publicado em: (2024)
article A BiLSTM approach to outfit compatibility and image similarity
por: Silva, Luís
Publicado em: (2025)
por: Silva, Luís
Publicado em: (2025)
article @Note: a workbench for biomedical text mining
por: Lourenço, Anália
Publicado em: (2009)
por: Lourenço, Anália
Publicado em: (2009)
science Intra- and inter-regional complexity in multi-channel awake EEG through multivariate multiscale dispersion entropy for assessing sleep quality and aging
por: Zandbagleh, Ahmad
Publicado em: (2025)
por: Zandbagleh, Ahmad
Publicado em: (2025)
book Lexical semantics annotation for enriched Portuguese corpora
por: Neale, Steven
Publicado em: (2016)
por: Neale, Steven
Publicado em: (2016)
article Automated computer-aided design of cranial implants using a deep volumetric convolutional denoising autoencoder
por: Morais, Ana
Publicado em: (2019)
por: Morais, Ana
Publicado em: (2019)
article Archives in Portuguese public policies: a steady place
por: Silva, Carlos Guardado da, 1971-
Publicado em: (2022)
por: Silva, Carlos Guardado da, 1971-
Publicado em: (2022)
article From the archival bond to the informational bond
por: Pacheco, André Miguel Pereira, 1991-
Publicado em: (2023)
por: Pacheco, André Miguel Pereira, 1991-
Publicado em: (2023)
article Creating an Earth Archive
por: Fisher, Christopher
Publicado em: (2022)
por: Fisher, Christopher
Publicado em: (2022)
school Classificação de preferências no retalho alimentar: uma abordagem supervisionada
por: Rocha, Rodrigo Filipe Rodrigues
Publicado em: (2023)
por: Rocha, Rodrigo Filipe Rodrigues
Publicado em: (2023)
article Analysis of machine learning algorithms for violence detection in audio
por: Veloso, Bruno
Publicado em: (2022)
por: Veloso, Bruno
Publicado em: (2022)
article Blind people: clothing category classification and stain detection using transfer learning
por: Rocha, Daniel
Publicado em: (2023)
por: Rocha, Daniel
Publicado em: (2023)
article Named Entities in the QTLeap Corpus of Online Helpdesk Interactions
por: Querido, Andreia
Publicado em: (2016)
por: Querido, Andreia
Publicado em: (2016)
Registos relacionados
-
article NER in Archival Finding Aids
por: Cunha, Luís Filipe da Costa
Publicado em: (2021) -
article Towards Entity Linking, NER in archival finding aids
por: Cunha, Luís Filipe da Costa
Publicado em: (2021) -
school Entity recognition in archival descriptions
por: Cunha, Luís Filipe da Costa
Publicado em: (2022) -
article Development of a machine learning framework for biomedical text mining
por: Rodrigues, Rúben
Publicado em: (2016) -
article Desenvolvimento e avaliação de um modelo NER no domínio da análise cultural e do turismo
por: Sotelo Docío, Susana
Publicado em: (2023)