78 documents found, page 1 of 8

Sort by Issue Date

Cross-genre argument mining: Can language models automatically fill in missing ...

Rocha, G; Henrique Lopes Cardoso; Belouadi, J; Eger, S

Available corpora for Argument Mining differ along several axes, and one of the key differences is the presence (or absence) of discourse markers to signal argumentative content. Exploring effective ways to use discourse markers has received wide attention in various discourse parsing tasks, from which it is well-known that discourse markers are strong indicators of discourse relations. To improve the robustnes...


Historical Portuguese corpora: a survey

Osório, TF; Henrique Lopes Cardoso

This survey aims to thoroughly examine and evaluate the current landscape of electronic corpora in historical Portuguese. This is achieved through a comprehensive analysis of existing resources. The article makes two main contributions. The first is an exhaustive cataloguing of existing Portuguese historical corpora, where each corpus is meticulously detailed regarding linguistic periods, geographic origins, an...


FairytaleQA Translated: Enabling Educational Question and Answer Generation in ...

Leite, B; Osório, TF; Henrique Lopes Cardoso

Question Answering (QA) datasets are crucial in assessing reading comprehension skills for both machines and humans. While numerous datasets have been developed in English for this purpose, a noticeable void exists in less-resourced languages. To alleviate this gap, our paper introduces machine-translated versions of FairytaleQA, a renowned QA dataset designed to assess and enhance narrative comprehension skill...


On Few-Shot Prompting for Controllable Question-Answer Generation in Narrative ...

Leite, B; Henrique Lopes Cardoso

Question Generation aims to automatically generate questions based on a given input provided as context. A controllable question generation scheme focuses on generating questions with specific attributes, allowing better control. In this study, we propose a few-shot prompting strategy for controlling the generation of question-answer pairs from childrens narrative texts. We aim to control two attributes: the qu...


Expanding FLORES+ Benchmark for More Low-Resource Settings: Portuguese-Emakhuwa...

António Ali, FDM; Henrique Lopes Cardoso; Silva, RS

As part of the Open Language Data Initiative shared tasks, we have expanded the FLORES+ evaluation set to include Emakhuwa, a low-resource language widely spoken in Mozambique. We translated the dev and devtest sets from Portuguese into Emakhuwa, and we detail the translation process and quality assurance measures used. Our methodology involved various quality checks, including post-editing and adequacy assessm...


Fostering the Ecosystem of Open Neural Encoders for Portuguese with Albertina P...

Rodrigo Santos; João Rodrigues; Luís Gomes; João Ricardo Silva; António Branco; Henrique Lopes Cardoso; Tomás Freitas Osório; Bernardo Leite



Fostering the Ecosystem of Open Neural Encoders for Portuguese with Albertina P...

Santos, R; Rodrigues, J; Gomes, L; Silva, J; Branco, A; Henrique Lopes Cardoso; Osório, TF; Leite, B

To foster the neural encoding of Portuguese, this paper contributes foundation encoder models that represent an expansion of the still very scarce ecosystem of large language models specifically developed for this language that are fully open, in the sense that they are open source and openly distributed for free under an open license for any purpose, thus including research and commercial usages. Like most lan...



PTPARL-V: Portuguese Parliamentary Debates for Voting Behaviour Study

Sousa, A; Henrique Lopes Cardoso

We present a new dataset, PTPARL-V, that is a valuable resource for advancing discourse analysis of parliamentary debates in Portuguese and their alignment with voting behaviour. This is achieved by processing the open-access information available at the official Portuguese Parliament website and scraping the debate minutes concerning legislative initiatives, together with meta-data related to voting positions....


78 Results

Queried text

Refine Results

Author





















Date





















Document Type






Access rights



Resource


Subject