Publicação
MapIntel
| Resumo: | Competitive Intelligence allows an organization to keep up with market trends and foresee business opportunities. This practice is mainly performed by analysts scanning for any piece of valuable information in a myriad of dispersed and unstructured sources. Here we present MapIntel, a system for acquiring intelligence from vast collections of text data by representing each document as a multidimensional vector that captures its own semantics. The system is designed to handle complex Natural Language queries and visual exploration of the corpus, potentially aiding overburdened analysts in finding meaningful insights to help decision-making. The system searching module uses a retriever and re-ranker engine that first finds the closest neighbours to the query embedding and then sifts the results through a cross-encoder model that identifies the most relevant documents. The browsing or visualization module also leverages the embeddings by projecting them onto two dimensions while preserving the multidimensional landscape, resulting in a map where semantically related documents form topical clusters which we capture using topic modelling. This map aims at promoting a fast overview of the corpus while allowing a more detailed exploration and interactive information encountering process. We evaluate the system and its components on the 20 newsgroups data set, using the semantic document labels provided, and demonstrate the superiority of Transformer-based components. Finally, we present a prototype of the system in Python and show how some of its features can be used to acquire intelligence from a news article corpus we collected during a period of 8 months. |
|---|---|
| Autores principais: | Silva, David |
| Outros Autores: | Bação, Fernando |
| Assunto: | competitive intelligence information retrieval sentence embeddings topic modelling transformer architecture visual analytics Control and Systems Engineering Theoretical Computer Science Computational Theory and Mathematics Artificial Intelligence |
| Ano: | 2023 |
| País: | Portugal |
| Tipo de documento: | artigo |
| Tipo de acesso: | acesso aberto |
| Instituição associada: | Universidade Nova de Lisboa |
| Idioma: | inglês |
| Origem: | Repositório Institucional da UNL |
Registos relacionados
article A goal-directed implementation of query answering for hybrid MKNF knowledge bases
por: Gomes, Ana Sofia
Publicado em: (2014)
por: Gomes, Ana Sofia
Publicado em: (2014)
article An algebra of behavioural types
por: Ravara, António
Publicado em: (2012)
por: Ravara, António
Publicado em: (2012)
article Geometric SMOTE a geometrically enhanced drop-in replacement for SMOTE
por: Douzas, Georgios
Publicado em: (2019)
por: Douzas, Georgios
Publicado em: (2019)
article Cleaning ECG with Deep Learning
por: Dias, Mariana
Publicado em: (2024)
por: Dias, Mariana
Publicado em: (2024)
school Lifelog and information retrieval from daily digital data
por: Ribeiro, Ricardo Ferreira
Publicado em: (2024)
por: Ribeiro, Ricardo Ferreira
Publicado em: (2024)
groups Using Taxonomy Tree to Generalize a Fuzzy Thematic Cluster
por: Frolov, Dmitry
Publicado em: (2019)
por: Frolov, Dmitry
Publicado em: (2019)
school Customer Review Analysis
por: Tueschen, Philipp
Publicado em: (2022)
por: Tueschen, Philipp
Publicado em: (2022)
article Identities in plactic, hypoplactic, sylvester, Baxter, and related monoids
por: Cain, Alan J.
Publicado em: (2018)
por: Cain, Alan J.
Publicado em: (2018)
article Inclusive Intelligent Learning Management System Framework
por: Machado, David Sotto-Mayor
Publicado em: (2023)
por: Machado, David Sotto-Mayor
Publicado em: (2023)
article Reconstructing Young tableaux
por: Cain, Alan J.
Publicado em: (2022)
por: Cain, Alan J.
Publicado em: (2022)
article Specializing Context-Free Grammars with a (1 + 1)-EA
por: Manzoni, Luca
Publicado em: (2020)
por: Manzoni, Luca
Publicado em: (2020)
groups Vectorial GP for Alzheimer’s Disease Prediction Through Handwriting Analysis
por: Azzali, Irene
Publicado em: (2022)
por: Azzali, Irene
Publicado em: (2022)
article Earth-fixed trajectory and map online estimation: Building on GES sensor-based SLAM filters
por: Lourenço, Pedro
Publicado em: (2020)
por: Lourenço, Pedro
Publicado em: (2020)
article Correcting gene tree by removal and modification
por: Beretta, Stefano
Publicado em: (2015)
por: Beretta, Stefano
Publicado em: (2015)
groups Verifying real-world software with contracts for concurrency
por: Lourenço, João M.
Publicado em: (2018)
por: Lourenço, João M.
Publicado em: (2018)
article A distance between populations for one-point crossover in genetic algorithms
por: Manzoni, Luca
Publicado em: (2012)
por: Manzoni, Luca
Publicado em: (2012)
groups Preserving strong equivalence while forgetting
por: Knorr, Matthias
Publicado em: (2014)
por: Knorr, Matthias
Publicado em: (2014)
groups Unlabeled multi-target regression with genetic programming
por: Lopez, Uriel
Publicado em: (2020)
por: Lopez, Uriel
Publicado em: (2020)
article A Study on the Dynamics and Effectiveness of the Deflate Geometric Semantic Mutation
por: Farinati, Davide
Publicado em: (2025)
por: Farinati, Davide
Publicado em: (2025)
newspaper Editorial
por: Barresi, Giacinto
Publicado em: (2024)
por: Barresi, Giacinto
Publicado em: (2024)
book Ambient Intelligence - Software and Applications
por: Mohamed, Amr
Publicado em: (2015)
por: Mohamed, Amr
Publicado em: (2015)
school Chatbot for the University of Aveiro
por: Trigo, José Pedro Marta
Publicado em: (2024)
por: Trigo, José Pedro Marta
Publicado em: (2024)
groups Universal learning machine with genetic programming
por: Re, Alessandro
Publicado em: (2019)
por: Re, Alessandro
Publicado em: (2019)
groups A Strategic Model and Framework for Intelligent Process Automation
por: Feio, Iris Cláudia Lebre
Publicado em: (2022)
por: Feio, Iris Cláudia Lebre
Publicado em: (2022)
article Marketing database knowledge extraction - towards a domain ontology
por: Pinto, Filipe Mota
Publicado em: (2009)
por: Pinto, Filipe Mota
Publicado em: (2009)
article Structural similarity index (SSIM) revisited
por: Bakurov, Illya
Publicado em: (2022)
por: Bakurov, Illya
Publicado em: (2022)
article SBML2HYB
por: Pinto, José
Publicado em: (2023)
por: Pinto, José
Publicado em: (2023)
rate_review Recent progress in optoelectronic memristors for neuromorphic and in-memory computation
por: Pereira, Maria Elias
Publicado em: (2023)
por: Pereira, Maria Elias
Publicado em: (2023)
school Large language models for enhanced technical support in Tridonic business operations
por: Carvalho, Rodrigo Silva
Publicado em: (2024)
por: Carvalho, Rodrigo Silva
Publicado em: (2024)
groups Combining Bayesian approaches and evolutionary techniques for the inference of breast cancer networks
por: Beretta, Stefano
Publicado em: (2016)
por: Beretta, Stefano
Publicado em: (2016)
book Algorithmic Cities
por: Neto, Miguel de Castro
Publicado em: (2021)
por: Neto, Miguel de Castro
Publicado em: (2021)
groups Semi-automatic tool to identify heterogeneity zones in lge-cmr and incorporate the result into a 3d model of the left ventricle
por: Narciso, Maria
Publicado em: (2020)
por: Narciso, Maria
Publicado em: (2020)
article The power of GenAI nudges
por: Richarde, Ana Paula Merenda
Publicado em: (2025)
por: Richarde, Ana Paula Merenda
Publicado em: (2025)
school Implementing an SQL Based ETL Platform for Business Intelligence Solution
por: Silva, André Vieira da
Publicado em: (2023)
por: Silva, André Vieira da
Publicado em: (2023)
school Employing retrieval augmented generation to optimize LIMS for the legal domain: evaluating methods to improve chatbot performance
por: Schumann, Lorenzo Oliver
Publicado em: (2024)
por: Schumann, Lorenzo Oliver
Publicado em: (2024)
article The impact of big data analytics on firms’ high value business performance
por: Popovič, Aleš
Publicado em: (2018)
por: Popovič, Aleš
Publicado em: (2018)
article Influence of the time of occlusion on the quantitative parameters obtained by modelling trans-epidermal water loss curves to describe the human cutaneous barrier function in vivo
por: Pinto, PC
Publicado em: (2005)
por: Pinto, PC
Publicado em: (2005)
article Spatial-behavioral types for concurrency and resource control in distributed systems
por: Caires, Luís
Publicado em: (2008)
por: Caires, Luís
Publicado em: (2008)
school NovaIntell - projecto de text-Mining para a língua portuguesa numa empresa de Gestão de Informação e Conhecimento
por: Rolim, Pedro Gonçalo Jorge
Publicado em: (2011)
por: Rolim, Pedro Gonçalo Jorge
Publicado em: (2011)
article An energy-focused model for batteryless IoT
por: Hosseinzadeh, Mehdi
Publicado em: (2025)
por: Hosseinzadeh, Mehdi
Publicado em: (2025)
Registos relacionados
-
article A goal-directed implementation of query answering for hybrid MKNF knowledge bases
por: Gomes, Ana Sofia
Publicado em: (2014) -
article An algebra of behavioural types
por: Ravara, António
Publicado em: (2012) -
article Geometric SMOTE a geometrically enhanced drop-in replacement for SMOTE
por: Douzas, Georgios
Publicado em: (2019) -
article Cleaning ECG with Deep Learning
por: Dias, Mariana
Publicado em: (2024) -
school Lifelog and information retrieval from daily digital data
por: Ribeiro, Ricardo Ferreira
Publicado em: (2024)