Publicação
Open-Source Language Models for News Classification: Implementing Small Models in Low-Resource Environments
| Resumo: | Pre-Trained Models (PTMs) or Large Language Models (LLMs) are deep neural networks trained on vast amounts of text data, enabling them to make predictions based on learned knowledge. Google has played a significant role in this field, particularly through popularizing the Transformers Architecture. However, the landscape evolved dramatically with the release of ChatGPT by OpenAI in November 2022, marking the advent of the universal artificial intelligence era. This event sparked significant interest and efforts in studying LLMs, prompting industries to adapt their operations, software providers to refine their skills, and society to contemplate ethical implications. This research delves into the use of Open-Source LLMs, focusing particularly on text classification — a critical task in Natural Language Processing (NLP). The study employs techniques such as fine-tuning and model quantization, which are essential for leveraging LLMs effectively in practical applications. Key questions addressed include evaluating the comparability of open-source models with established benchmarks across different text classification approaches. The research aims to identify primary challenges and limitations associated with running modern open-source LLMs in lowresource environments. By exploring these topics, the research aims to contribute insights into optimizing the deployment of open-source LLMs, enhancing their accessibility, and addressing practical constraints that affect their widespread adoption across various sectors and applications in NLP. |
|---|---|
| Autores principais: | Figueiredo, Fernando Niglio de |
| Assunto: | Large Language Models (LLM) Pre-trained models (PTM) Text classification News classification Fine-tuning Open Source LoRA QLoRA low-resource environment SDG 8 - Decent work and economic growth |
| Ano: | 2024 |
| País: | Portugal |
| Tipo de documento: | dissertação de mestrado |
| Tipo de acesso: | acesso aberto |
| Instituição associada: | Universidade Nova de Lisboa |
| Idioma: | inglês |
| Origem: | Repositório Institucional da UNL |
Registos relacionados
school BestLoRaNet: firmware to optimize the selection of LoRa networks
por: Teixeira, Joana da Silva
Publicado em: (2022)
por: Teixeira, Joana da Silva
Publicado em: (2022)
school Nó sensor reconfigurável para redes LoRa
por: Gouveia, Filipa da Conceição
Publicado em: (2021)
por: Gouveia, Filipa da Conceição
Publicado em: (2021)
school Wireless IoT – Avaliação de desempenho da camada MAC
por: Silva, Tiago Chagas da
Publicado em: (2019)
por: Silva, Tiago Chagas da
Publicado em: (2019)
school Mobile LoRa gateway for communication and sensing on the railway
por: Soares, João Pedro Correia
Publicado em: (2022)
por: Soares, João Pedro Correia
Publicado em: (2022)
school Tiny ground station for satellite communication: forest applications
por: Barbosa, Miguel Madeira Bidarra Oliveira
Publicado em: (2024)
por: Barbosa, Miguel Madeira Bidarra Oliveira
Publicado em: (2024)
school Deteção de colisões em rails de estradas
por: Ferreira, José Pedro de Almeida
Publicado em: (2020)
por: Ferreira, José Pedro de Almeida
Publicado em: (2020)
school Controlo de acessos com comunicação LoRa
por: Santos, José Domingos Reis Pinho Correia dos
Publicado em: (2021)
por: Santos, José Domingos Reis Pinho Correia dos
Publicado em: (2021)
school Medium access control in LoRa networks with multiple low-cost gateways
por: Figueiredo, Alexandre Daniel Gomes
Publicado em: (2021)
por: Figueiredo, Alexandre Daniel Gomes
Publicado em: (2021)
school Medium access control for large scale LoRa networks
por: Fernandes, Rui Pedro Castro
Publicado em: (2019)
por: Fernandes, Rui Pedro Castro
Publicado em: (2019)
school A communication network for sensing by a self-adaptive team of aquatic drones
por: Sousa, Daniela Casal de
Publicado em: (2018)
por: Sousa, Daniela Casal de
Publicado em: (2018)
school Soluções de acesso a zonas históricas com base em comunicações sem fios
por: Ferreira, Luis Miguel Oliveira
Publicado em: (2019)
por: Ferreira, Luis Miguel Oliveira
Publicado em: (2019)
school Applying LLM-based entity matching for hierarchical product categorization in e-commerce
por: Markwardt, Elias
Publicado em: (2025)
por: Markwardt, Elias
Publicado em: (2025)
article Normalized effect size (NES): a novel feature selection model for Urdu fake news classification
por: Wasim, Muhammad
Publicado em: (2023)
por: Wasim, Muhammad
Publicado em: (2023)
school Comunicações multi-tecnologia IoT para comboios conectados
por: Lima, Tiago Daniel Almeida
Publicado em: (2023)
por: Lima, Tiago Daniel Almeida
Publicado em: (2023)
book A comparative study of loRaWAN, sigFox, and NB-IoT for smart water grid
por: Lalle, Yandja
Publicado em: (2020)
por: Lalle, Yandja
Publicado em: (2020)
article A new hotel classification model combining guest reviews with official hotel classification systems: bridging expert and consumer ratings
por: Messias, Ana
Publicado em: (2025)
por: Messias, Ana
Publicado em: (2025)
school Pequena estação terrestre para comunicações por satélite: aplicações marítimas
por: Almeida, Diogo Henrique da Silva
Publicado em: (2023)
por: Almeida, Diogo Henrique da Silva
Publicado em: (2023)
school Features for the Classification and Clustering of Music in Symbolic Format
por: Bernardo, Alexandre Miguel Entradas
Publicado em: (2008)
por: Bernardo, Alexandre Miguel Entradas
Publicado em: (2008)
school Employing retrieval augmented generation to optimize LIMS for the legal domain: evaluating methods to improve chatbot performance
por: Schumann, Lorenzo Oliver
Publicado em: (2024)
por: Schumann, Lorenzo Oliver
Publicado em: (2024)
school Features for the Classification and Clustering of Music in Symbolic Format
por: Bernardo, Alexandre
Publicado em: (2008)
por: Bernardo, Alexandre
Publicado em: (2008)
school Discrete to dimensional physiological emotion classification
por: Alves, Carolina Fernandes
Publicado em: (2021)
por: Alves, Carolina Fernandes
Publicado em: (2021)
school Wetland Habitat Studies using various Classification Techniques on Multi-Spectral Landsat Imagery: Case study: Tram chim National Park, Dong Thap Vietnam
por: Luu, Thi Phuong Mai
Publicado em: (2009)
por: Luu, Thi Phuong Mai
Publicado em: (2009)
school Applying text mining techniques to forecast the stock market fluctuations of large it companies with twitter data: descriptive and predictive approaches to enhance the research of stock market predictions with textual and semantic data
por: Zois, Christos
Publicado em: (2019)
por: Zois, Christos
Publicado em: (2019)
school Machine learning and deep learning in healthcare: advancing cardiac arrhythmia classification in healthcare analytics
por: Mehler, Alexander
Publicado em: (2024)
por: Mehler, Alexander
Publicado em: (2024)
school Realidade aumentada em manutenção: estudo de uma abordagem multi-dispositivo
por: Esteves, Rafael Gonçalves
Publicado em: (2018)
por: Esteves, Rafael Gonçalves
Publicado em: (2018)
school Identifying Key Drivers and Predicting Technology Adoption Using Classification Algorithms
por: Santos, Pedro Almeida Madureira
Publicado em: (2025)
por: Santos, Pedro Almeida Madureira
Publicado em: (2025)
school A comparative study of data augmentation techniques for image classification: generative models vs. classical transformations
por: Gonçalves, Guilherme Marques
Publicado em: (2020)
por: Gonçalves, Guilherme Marques
Publicado em: (2020)
groups AdRA and Prokura: two Portuguese companies on the path to digital transformation
por: Simões, Anabela
Publicado em: (2019)
por: Simões, Anabela
Publicado em: (2019)
article Sustainability of large language models: user perspective
por: Pipek, Pavel
Publicado em: (2025)
por: Pipek, Pavel
Publicado em: (2025)
school Asymptotic Treatment for Multinomial Models and Applications
por: Akoto, Isaac
Publicado em: (2022)
por: Akoto, Isaac
Publicado em: (2022)
article Avaliação da influência de fatores associados a alterações climáticas nas fases larvares de rã-verde (Pelophylax perezi)
por: Marques, Carlos A.
Publicado em: (2019)
por: Marques, Carlos A.
Publicado em: (2019)
draft Are asset price data informative about news shocks? A DSGE perspective
por: Iskrev, Nikolay
Publicado em: (2018)
por: Iskrev, Nikolay
Publicado em: (2018)
school Content analysis and semantic enrichment of financial news
por: António, João Alexandre Mateus Luna
Publicado em: (2024)
por: António, João Alexandre Mateus Luna
Publicado em: (2024)
school Multipurpose sensing platform for improved road safety
por: Carvalhosa, Miguel Filipe Pereira de Freitas
Publicado em: (2021)
por: Carvalhosa, Miguel Filipe Pereira de Freitas
Publicado em: (2021)
school Online Reviews Analysis with Large Language Models
por: Ferreira, Henrique Marques
Publicado em: (2024)
por: Ferreira, Henrique Marques
Publicado em: (2024)
groups A semantic tool for indexation and classification of music and sound objects in the Lusophone World: a theoretical foundation and practices
por: Duarte, Andreia
Publicado em: (2023)
por: Duarte, Andreia
Publicado em: (2023)
school Assessment of the effects of abiotic factors related to climate change on larval stages of Pelophylax perezi
por: Matos, Ana Beatriz Moura Rodrigues
Publicado em: (2019)
por: Matos, Ana Beatriz Moura Rodrigues
Publicado em: (2019)
article From the classification of quadrilaterals to the classification of prisms: An experiment with prospective teachers
por: Brunheira, Lina
Publicado em: (2019)
por: Brunheira, Lina
Publicado em: (2019)
article Shaping the periodic classification in Portugal through (text)books and charts
por: Malaquias, Isabel
Publicado em: (2019)
por: Malaquias, Isabel
Publicado em: (2019)
article Semantic features analysis for biomedical lexical answer type prediction using ensemble learning approach
por: Hussain, Fiza Gulzar
Publicado em: (2024)
por: Hussain, Fiza Gulzar
Publicado em: (2024)
Registos relacionados
-
school BestLoRaNet: firmware to optimize the selection of LoRa networks
por: Teixeira, Joana da Silva
Publicado em: (2022) -
school Nó sensor reconfigurável para redes LoRa
por: Gouveia, Filipa da Conceição
Publicado em: (2021) -
school Wireless IoT – Avaliação de desempenho da camada MAC
por: Silva, Tiago Chagas da
Publicado em: (2019) -
school Mobile LoRa gateway for communication and sensing on the railway
por: Soares, João Pedro Correia
Publicado em: (2022) -
school Tiny ground station for satellite communication: forest applications
por: Barbosa, Miguel Madeira Bidarra Oliveira
Publicado em: (2024)