This paper investigates the processing and interpretation of non-culmination sentences and subjects' agentivity in European Portuguese. A five-point Likert scale is employed to test native speakers' judgments regarding the acceptability of five types of non-culminating accomplishment sentences put forward in Guéron and Vogeleer (2021), providing evidence that speakers accept non-culminating accomplishments t...
The definition of rigorous and well-structured annotation schemes is a key element in the advancement of Natural Language Processing (NLP). This paper aims to compare the performance of a general-purpose annotation scheme - Text2Story, based on the ISO 24617-1 standard - with that of a domain-specific scheme - i2b2 - in the context of clinical narrative annotation; and to assess the feasibility of harmoni...
High-quality annotation is essential for the effective predictions of machine learning models. When annotations are dense, achieving accurate human labeling can be challenging since the most used annotation tools present an overloaded visualization of labels. Thus, we present Vitra (Visualizer of temporal relation annotations), a tool designed for viewing annotations made in corpora, specifically focusing...
We present an annotation scheme designed to capture information related to the maintenance or change in the price of some goods (fuels, water, and vehicles) in news articles in Portuguese. The methodology we used involved adapting an existing annotation scheme, the Text2Story scheme (Silvano et al., 2021; Leal et al., 2022), which is based on different parts of ISO 24617 to capture the essential information f...
The development of a robust annotation scheme and corresponding guidelines is crucial for producing annotated datasets that advance both linguistic and computational research. This paper presents a case study that outlines a methodology for designing an annotation scheme and its guidelines, specifically aimed at representing morphosyntactic and semantic information regarding temporal features, as well a...
Temporal reasoning has been the focus of several studies during the past years, both in linguistics and computational studies. Although advances on this topic are undeniable, there are still improvements to be made and new avenues to pursue. One relevant problem concerns the temporal ordering of the events, particularly asserting and representing how events are temporally related and how the story told in the n...
Narratives have been the subject of extensive research across various scientific fields such as linguistics and computer science. However, the scarcity of freely available datasets, essential for studying this genre, remains a significant obstacle. Furthermore, datasets annotated with narrative components and their morphosyntactic and semantic information are even scarcer. To address this gap, we developed the...
The main objective of this study is to contribute to multilingual discourse research by employing ISO-24617 Part 8 (Semantic Relations in Discourse, Core Annotation Schema - DR-core) for annotating discourse relations. Centering around a parallel discourse relations corpus that includes English, Polish, and European Portuguese, we initiate one of the few ISO-based comparative analyses through a multilingual cor...