This paper presents an ongoing large-scale classification and description of Brazilian Portuguese adjectives. The 3,367 most frequent adjective lemmas in a reference corpus, corresponding to 92.09% of the occurrences of adjectives, were classified as predicative or non-predicative. The former were further classified based on argument number (one or two) and type (noun phrase or clause), which led to ...
Verbal idioms (or verbal idiomatic expressions) are multiword expressions in which the main verb is distributionally frozen with one or more of its arguments (subject or complements). For the most part, they convey a non-compositional meaning that cannot be inferred from the individual meanings of their constituents when used separately. The primary goal of this project is the creation of a system capable of pro...
Automatic Essay Scoring is a field that has been receiving considerable attention for Portuguese. Among the available datasets, one stands out: a corpus of narrative essays written by students from 5th to 9th grade in Brazil. These essays were evaluated according to four traits: formal register, thematic coherence, narrative rhetorical structure, and textual cohesion. This work explores the development of a sy...
Verbal idiomatic expressions are multiword expressions in which the main verb is distributionally fixed with one or more of its arguments. The overall meaning of these expressions is generally non-compositional, that is, it cannot be regularly inferred from the individual meanings of their constituents when used separately. The primary goal of this work is the construction of a s...
This article describes an ongoing large-scale classification and description of Brazilian Portuguese adjectives. We classified as predicative or non-predicative the 3,367 most frequent adjective lemmas in a reference corpus, which account for 92.09% of the adjective occurrences in that corpus. The predicative adjectives were further classified based on the number (one or two) and ...
Based on a semi-automatically annotated dataset from the Corpus de Textos Antigos (CTA), this article analyzes the results obtained on two phenomena: the syncope of intervocalic -d- in the 2nd person plural morpheme, with the consequent resolution of the hiatus, and the past participle endings -udo/-ido in verbs etymologically derived from the Latin 2nd and 3rd conjugations. The novelty of this article lies in the recur...
Artificial Intelligence (AI), particularly Large Language Models (LLMs), has been transforming language education. This study explores the application of LLMs in the teaching of Portuguese as a Foreign Language (PFL), focusing on the automatic creation, classification, and validation of a corpus of short narratives based on Portuguese proverbs. The objectives are: (i) to automatically generate short narratives ...
This paper examines the syntactic properties of Portuguese multiword adverbs, focusing specifically on disjunctive adverbs of style (PS). Also known as enunciative, metalinguistic, or illocutionary adverbs, PS adverbs function as sentence-external modifiers, typically expressing the speaker’s attitude toward a given statement. Our research has cataloged approximately 3,700 multiword adverbs in Brazilian and Europea...