Annotating, analysing and learning named entities in Portuguese historical text...

Vieira, Renata; Olival, Fernanda; Cameron, Helena; Farrica, Fátima; Santos, Joaquim; Reyes, Daniel

This article presents a study based on 18th-century Portuguese texts, focusing on the analysis of named entities to enhance their value for historical research. For that, an annotated corpus was developed using a primary source (the Parish Memories), which was transcribed, revised, and standardised. The distribution of named entities in the source was then analysed to reflect on the variations in the defined ca...

Date: 2025   |   Origin: Linguamática

Assessing European and Brazilian Portuguese LLMs for NER in Specialised Domains

Nunes, Rafael Oleques; Santos, Joaquim; Spritzer, André; Balreira, Dennis G.; Freitas, Carla M. Dal Sasso; Olival, Fernanda; Cameron, Helena Freire

This paper discusses the impact of Portuguese variants in Large Language Models for the task of named entity recognition (NER) in specialised domains. The tests were run on a Brazilian Portuguese legal corpus and a European Portuguese historical corpus. The models considered are BERTimbau (PT-BR), Albertina (PT-PT and PT-BR), and XLM-R (multilingual). The impact was more evident in the Portuguese historical...


From the text sources to the map: the Parish Memories, a para cadastre of Portu...

Ribeiro, Ana Sofia; Olival, Fernanda; Cameron, Helena Freire; Vieira, Renata; Farrica, Fátima

This study explores the cartographic reconstruction of 18th-century Portuguese parishes and municipalities, using Vila Viçosa as a case study. Based on the 1758 Parish Memories and other historical sources, it employs Natural Language Processing and QGIS to extract and map geographical and administrative data. The research highlights challenges such as overlapping or vanished parishes, particularly in rural are...


Anotação, análise e aprendizagem de Entidades Nomeadas em textos históricos por...

Vieira, Renata; Olival, Fernanda; Cameron, Helena; Santos, Joaquim; Reyes, Daniel

This article presents a study based on 18th-century Portuguese texts, analysing named entities with a view to enhancing their value for historical analysis. To that end, an annotated corpus was built from a source (the Memórias Paroquiais, or Parish Memories) that was transcribed, revised, and standardised. The distribution of named entities in this source was then analysed to reflect on ...


Provas de Habilitação para o Exercício de Funções de Coordenação Científica, Li...

Vieira, Renata

This document, submitted for the Habilitation examination for the Exercise of Scientific Coordination Functions, jointly presents a proposal for a research programme (Part 1) and for a postgraduate programme (Part 2). The document elaborates on the role of the Portuguese language and its technologies in the development of the Digital Humanities. The proposal is contextualised in the applicant's research, and...


A Pipeline for the Analysis of User Interactions in YouTube Comments: A Hybridi...

Bassi, Davide; Maggini, Michele; Vieira, Renata; Pereira-Farina, Martin

This study presents a novel approach to analyze user interactions on YouTube, addressing the platform's API limitations in capturing comprehensive conversation chains. By combining Large Language Models (LLMs) and rule-based methods, we developed a pipeline to reconstruct comment threads and analyze user stances on controversial topics. We applied this approach to examine immigration debates across 27,000 comme...


Preserving Intangible Cultural Heritage of Megalithic Sites using Immersive Mob...

Masoodian, Masood; Aula, Inkeri; Vieira, Renata; Rodrigues, Aurea; Santos, Ivo; Diniz, António; Campos, Camila; Prezado, Rafael; Rocha, Leonor

This poster introduces the INT-ACT project, which aims to investigate the use of immersive XR environments for presenting the emotional, experiential and environmental dimensions of Intangible Cultural Heritage (ICH) associated with tangible cultural heritage sites. It also presents a mobile XR demonstrator, developed as part of INT-ACT, that focuses on the ICH related to a megalithic site. The INT-ACT project ...


Assessing European and Brazilian Portuguese LLMs for NER in Specialised Domains

Nunes, Rafael; Santos, Joaquim; Balreira, Dennis; Freitas, Carla; Olival, Fernanda; Cameron, Helena; Vieira, Renata

This paper discusses the impact of Portuguese variants in Large Language Models for the task of named entity recognition (NER) in specialised domains. The tests were run on a Brazilian Portuguese legal corpus and a European Portuguese historical corpus. The models considered are BERTimbau (PT-BR), Albertina (PT-PT and PT-BR), and XLM-R (multilingual). The impact was more evident in the Portuguese historical ...


Linguistic Markers of Population Replacement Conspiracy Theories in YouTube Imm...

Marino, Erik; Bassi, Davide; Vieira, Renata

This paper presents a linguistic analysis of YouTube comments related to immigration discourse, analyzing the contrasts between standard anti-immigration comments and those linked to Population Replacement Conspiracy Theories (PRCT). Using a dataset of 71,137 YouTube comments classified into three stance categories (PRO, NEUTRAL, CONTRA) and PRCT annotation, we analyze the linguistic features of each group thro...


Explaining Machine Learning: A Deeper Look into Admission Prediction

Consoli, Bernardo; Pedroso, Vinicius; Kniest, Artur; Vieira, Renata; Bordini, Rafael; Manssour, Isabel

The popularization of artificial intelligence solutions in both research and industry, driven by the rise of large language models such as GPT, Gemini and Claude, has revitalized research in the area. There are many possible uses within the medical field, but a key determinant of the adoption of new tools by medical professionals is trust. To increase trust in a tool, the tool must be mad...

