Publication
How much data is enough to track tourists? The tradeoff between data granularity and storage costs
| Abstract: | In an increasingly technology-dependent world, data is one of the key strategic resources for organizations. Decision-makers often struggle to determine which data to collect, how much of it, and what to keep in storage. The challenge is to preserve enough information to inform decisions without incurring excessive storage and processing costs. This thesis studies that challenge in the context of mobile signaling data collected to study tourists’ behavioral patterns. Given the number of mobile phones in use and the frequency with which they interact with network infrastructure and report their location, mobile data sets are a rich source of information for mobility studies. The objective of this research is to analyze the extent to which individual trajectories can be reconstructed when only a fraction of the original location data is preserved, providing insight into the tradeoff between the volume of data available and the accuracy of the reconstructed paths. To this end, signaling data from 277,093 anonymized foreign travelers is sampled at different sampling rates, and the full trajectories are reconstructed using three completion methods: last seen, linear interpolation, and cubic interpolation. The results of the comparison are discussed from the perspective of data management and of their implications for research, especially research that relies on mobile phone data with lower temporal density. |
|---|---|
| Main authors: | Pereira, Inês Correia |
| Subject: | Tourism Mobility Trajectory Reconstruction Call Detail Records Signaling Data Data Sparsity |
| Year: | 2020 |
| Country: | Portugal |
| Document type: | master's dissertation |
| Access type: | open access |
| Associated institution: | Universidade Nova de Lisboa |
| Language: | English |
| Source: | Repositório Institucional da UNL |
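
The experiment described in the abstract (thin out a trajectory, rebuild it, measure how far the rebuilt path drifts from the original) can be illustrated with a minimal sketch. This is not the thesis's actual pipeline: the toy track, the planar `(t, x, y)` coordinates, and the helper names `downsample`, `last_seen`, `linear`, and `mean_error` are all assumptions made for illustration, and cubic interpolation is left out to keep the example dependency-free.

```python
from bisect import bisect_right
from math import hypot

def downsample(track, keep_every):
    """Keep every `keep_every`-th point, always retaining the last one."""
    kept = track[::keep_every]
    if kept[-1] != track[-1]:
        kept.append(track[-1])
    return kept

def last_seen(kept, t):
    """Position of the most recent kept observation at or before time t."""
    times = [p[0] for p in kept]
    i = max(bisect_right(times, t) - 1, 0)
    return kept[i][1], kept[i][2]

def linear(kept, t):
    """Linear interpolation between the two kept observations around t."""
    times = [p[0] for p in kept]
    i = bisect_right(times, t) - 1
    if i < 0:
        return kept[0][1], kept[0][2]
    if i >= len(kept) - 1:
        return kept[-1][1], kept[-1][2]
    (t0, x0, y0), (t1, x1, y1) = kept[i], kept[i + 1]
    w = (t - t0) / (t1 - t0)  # assumes strictly increasing timestamps
    return x0 + w * (x1 - x0), y0 + w * (y1 - y0)

def mean_error(track, kept, method):
    """Mean Euclidean distance between true and reconstructed positions."""
    total = 0.0
    for t, x, y in track:
        rx, ry = method(kept, t)
        total += hypot(x - rx, y - ry)
    return total / len(track)

# Hypothetical toy track: one unit of distance per time step along a line.
track = [(float(t), float(t), 0.0) for t in range(9)]
kept = downsample(track, keep_every=4)  # keeps the points at t = 0, 4, 8

print("last seen:", mean_error(track, kept, last_seen))
print("linear:   ", mean_error(track, kept, linear))
```

On this constant-speed toy track, linear interpolation recovers the path exactly while last seen lags behind the traveler. Real signaling data is irregular in both time and space, so the gap between methods would have to be measured rather than assumed.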
Related records
school The tradeoff between data granularity and storage costs
by: Pereira, Inês Correia
Published: (2020)
school WhereToGo: Identification and classification of places of interest using anonymized mobile communication data
by: Ferreira, Gonçalo Francisco
Published: (2021)
article Identification and Classification of Routine Locations Using Anonymized Mobile Communication Data
by: Ferreira, Gonçalo
Published: (2022)
article Cold-start and data sparsity problems in a digital twin based recommendation system
by: Pires, Flávia
Published: (2024)
article Face-to-face interactions estimated using mobile phone data to support contact tracing operations
by: Cumbane, Silvino
Published: (2025)
school Anomaly Detection in Managed Connectivity M2M
by: Gomes, André Daniel Dinis
Published: (2021)
article Bicomplex signals with sparsity constraints
by: Cerejeiras, P.
Published: (2018)
school A Data-Driven Approach to Mobility Modelling of Urban Spaces: Inferring Commuting Routes and Travel Modes
by: Pires, Joel Filipe Rogão
Published: (2019)
article Compressed sensing for quaternionic signals
by: Kähler, Uwe
Published: (2017)
school CDR-based location analytics & gender prediction from subscribers’ list of installed mobile applications
by: Sheikh, Dahmane
Published: (2020)
article Investigating patterns of tourist movement using multiple data sources
by: Abreu Novais, Margarida
Published: (2026)
school DEVELOPMENT OF NEW METHODOLOGIES FOR ROAD ACCIDENT RECONSTRUCTION WITH CDR TOOL
by: Francisco, André Gaspar
Published: (2022)
article Robust identification of target genes and outliers in triple-negative breast cancer data
by: Segaert, Pieter
Published: (2018)
article A robust sparse linear approach for contaminated data
by: Shahriari, S
Published: (2019)
school My places identification of user’s geographic map
by: Rodrigues, Cláudia Beatriz Almeida
Published: (2021)
article A large memory, high transfer rate VME data acquisition system for the JET correlation reflectometer
by: Cruz, Nuno
Published: (2002)
article A robust sparce linear approach for contamined data
by: Shahriari, Shirin
Published: (2019)
school DEVELOPMENT OF A PRE-IMPACT DYNAMICS SIMULATOR FOR A VEHICLE INVOLVED IN A ROAD ACCIDENT
by: Gomes, Luís Miguel Marques
Published: (2024)
groups Back to the past to charter the vinyl electronic market: A data mining approach
by: Lousão, S.
Published: (2020)
groups Structured and unstructured data integration with electronic medical records
by: Baptista, D.
Published: (2019)
school Household Identification Using Call Records
by: Paiva, Ricardo José Monteiro
Published: (2021)
article Data integration issues in the reconstruction of the genome-scale metabolic model of Zymomonas mobillis
by: Pinto, José P.
Published: (2009)
article Data quality in tuberculosis: the case study of two ambulatories in the state of São Paulo, Brazil
by: Yamaguti, Verena Hokino
Published: (2017)
article Data warehouse and medical research
by: Martins, Thiago Gonçalves dos Santos
Published: (2022)
article A granularity theory for modelling spatio-temporal phenomena at multiple levels of detail
by: Silva, Ricardo
Published: (2015)
article New distribution data on spanish autochthonous species of freshwater fish
by: Perea, Silvia
Published: (2011)
article Sensing Mobility and Routine Locations through Mobile Phone and Crowdsourced Data: Analyzing Travel and Behavior during COVID-19
by: Rodrigues, Cláudia
Published: (2023)
article Enhanced neutron diagnostics data acquisition system based on a time digitizer and transient recorder hybrid module
by: Pereira, R. C.
Published: (2006)
article Merging data diversity of clinical medical records to improve effectiveness
by: Helgheim, B. I.
Published: (2019)
article Data analysis for trajectory generation for a robot manipulator using data from a 2D industrial laser
by: Gomes, Diogo
Published: (2022)
article Visual analytics for spatiotemporal events
by: Silva, Ricardo Almeida
Published: (2019)
groups Visualising hidden spatiotemporal patterns at multiple levels of detail
by: Silva, Ricardo Almeida
Published: (2018)
article Droughts in Portugal in the 18th century: a study based on newly found documentary data
by: Fragoso, Marcelo
Published: (2018)
school Study and validation of data recorded in the vehicles’ EDR in order to perform a road accident’s dynamic reconstruction
by: Laranjeira, Francisco Gonçalo Mendes
Published: (2022)
article Tracking people and equipment simulation inside healthcare units
by: Salgado, Catia
Published: (2013)
school Creation of Synthetic Patient Data for Health Care Research
by: Abreu, Luís Miguel da Torre
Published: (2025)
article Improvements in data quality for decision support in intensive care
by: Portela, Filipe
Published: (2011)
school Internship at the Bank of Portugal: Development of a dashboard to track corrections in granular credit data and a gender classification model
by: Vigueras, Julio César Rojas
Published: (2024)
article Interpolation of monogenic functions by using reproducing kernel Hilbert spaces
by: Cerejeiras, Paula
Published: (2018)