Publicação

How much data is enough to track tourists? The tradeoff between data granularity and storage costs

Ver documento

Detalhes bibliográficos
Resumo:In the increasingly technology-dependent world, data is one of the key strategic resources for organizations. Often, the challenge that many decision-makers face is to determine which data and how much to collect, and what needs to be kept in their data storage. The challenge is to preserve enough information to inform decisions but doing so without overly high costs of storage and data processing cost. In this thesis, this challenge is studied in the context of a collection of mobile signaling data for studying tourists’ behavioral patterns. Given the number of mobile phones in use, and frequency of their interaction with network infrastructure and location reporting, mobile data sets represent a rich source of information for mobility studies. The objective of this research is to analyze to what extent can individual trajectories be reconstructed if only a fraction of the original location data is preserved, providing insights about the tradeoff between the volume of data available and the accuracy of reconstructed paths. To achieve this, a signaling data of 277,093 anonymized foreign travelers is sampled with different sampling rates, and the full trajectories are reconstructed, using the last seen, linear, and cubic interpolations completion methods. The results of the comparison are discussed from the perspective of data management and implications on the research, especially the results of research with lower time-density mobile phone data.
Autores principais:Pereira, Inês Correia
Assunto:Tourism Mobility Trajectory Reconstruction Call Detail Records Signaling Data Data Sparsity
Ano:2020
País:Portugal
Tipo de documento:dissertação de mestrado
Tipo de acesso:acesso aberto
Instituição associada:Universidade Nova de Lisboa
Idioma:inglês
Origem:Repositório Institucional da UNL
_version_ 1868415465249832960
author Pereira, Inês Correia
author_facet Pereira, Inês Correia
author_role author
contributor_name_str_mv Zejnilović, Leid
Vanneschi, Leonardo
RUN
country_str PT
creators_json_txt [{\"Person.name\":\"Pereira, Inês Correia\"}]
datacite.contributors.contributor.contributorName.fl_str_mv Zejnilović, Leid
Vanneschi, Leonardo
RUN
datacite.creators.creator.creatorName.fl_str_mv Pereira, Inês Correia
datacite.date.Accepted.fl_str_mv 2020-01-07T00:00:00Z
datacite.date.available.fl_str_mv 2020-10-28T14:45:03Z
datacite.date.embargoed.fl_str_mv 2020-10-28T14:45:03Z
datacite.rights.fl_str_mv http://purl.org/coar/access_right/c_abf2
datacite.subjects.subject.fl_str_mv Tourism Mobility
Trajectory Reconstruction
Call Detail Records
Signaling Data
Data Sparsity
datacite.titles.title.fl_str_mv How much data is enough to track tourists? The tradeoff between data granularity and storage costs
dc.contributor.none.fl_str_mv Zejnilović, Leid
Vanneschi, Leonardo
RUN
dc.creator.none.fl_str_mv Pereira, Inês Correia
dc.date.Accepted.fl_str_mv 2020-01-07T00:00:00Z
dc.date.available.fl_str_mv 2020-10-28T14:45:03Z
dc.date.embargoed.fl_str_mv 2020-10-28T14:45:03Z
dc.format.none.fl_str_mv application/pdf
dc.identifier.none.fl_str_mv http://hdl.handle.net/10362/106282
dc.language.none.fl_str_mv eng
dc.rights.cclincense.fl_str_mv http://creativecommons.org/licenses/by/4.0/
dc.rights.none.fl_str_mv http://purl.org/coar/access_right/c_abf2
dc.subject.none.fl_str_mv Tourism Mobility
Trajectory Reconstruction
Call Detail Records
Signaling Data
Data Sparsity
dc.title.fl_str_mv How much data is enough to track tourists? The tradeoff between data granularity and storage costs
dc.type.none.fl_str_mv http://purl.org/coar/resource_type/c_bdcc
description In the increasingly technology-dependent world, data is one of the key strategic resources for organizations. Often, the challenge that many decision-makers face is to determine which data and how much to collect, and what needs to be kept in their data storage. The challenge is to preserve enough information to inform decisions but doing so without overly high costs of storage and data processing cost. In this thesis, this challenge is studied in the context of a collection of mobile signaling data for studying tourists’ behavioral patterns. Given the number of mobile phones in use, and frequency of their interaction with network infrastructure and location reporting, mobile data sets represent a rich source of information for mobility studies. The objective of this research is to analyze to what extent can individual trajectories be reconstructed if only a fraction of the original location data is preserved, providing insights about the tradeoff between the volume of data available and the accuracy of reconstructed paths. To achieve this, a signaling data of 277,093 anonymized foreign travelers is sampled with different sampling rates, and the full trajectories are reconstructed, using the last seen, linear, and cubic interpolations completion methods. The results of the comparison are discussed from the perspective of data management and implications on the research, especially the results of research with lower time-density mobile phone data.
dirty 0
eu_rights_str_mv openAccess
format masterThesis
fulltext.url.fl_str_mv https://run.unl.pt/bitstreams/1c3c2051-85e5-4dad-b6a7-96ce7ad52d45/download
id run_d773fa72a2118370ca27ecb507a2b706
identifier.url.fl_str_mv http://hdl.handle.net/10362/106282
instacron_str unl
institution Universidade Nova de Lisboa
instname_str Universidade Nova de Lisboa
language eng
network_acronym_str run
network_name_str Repositório Institucional da UNL
oai_identifier_str oai:run.unl.pt:10362/106282
organization_str_mv urn:organizationAcronym:unl
person_str_mv Pereira, Inês Correia
publishDate 2020
reponame_str Repositório Institucional da UNL
repository_id_str urn:repositoryAcronym:run
service_str_mv urn:repositoryAcronym:run
spelling engpt_PTIn the increasingly technology-dependent world, data is one of the key strategic resources for organizations. Often, the challenge that many decision-makers face is to determine which data and how much to collect, and what needs to be kept in their data storage. The challenge is to preserve enough information to inform decisions but doing so without overly high costs of storage and data processing cost. In this thesis, this challenge is studied in the context of a collection of mobile signaling data for studying tourists’ behavioral patterns. Given the number of mobile phones in use, and frequency of their interaction with network infrastructure and location reporting, mobile data sets represent a rich source of information for mobility studies. The objective of this research is to analyze to what extent can individual trajectories be reconstructed if only a fraction of the original location data is preserved, providing insights about the tradeoff between the volume of data available and the accuracy of reconstructed paths. To achieve this, a signaling data of 277,093 anonymized foreign travelers is sampled with different sampling rates, and the full trajectories are reconstructed, using the last seen, linear, and cubic interpolations completion methods. The results of the comparison are discussed from the perspective of data management and implications on the research, especially the results of research with lower time-density mobile phone data.application/pdfpt_PTHow much data is enough to track tourists? The tradeoff between data granularity and storage costsPereira, Inês CorreiaZejnilović, LeidVanneschi, LeonardoHostingInstitutionOrganizationalRUNe-mailmailto:run@unl.ptrun@unl.ptURNurn:tid:2025300272020-10-28T14:45:03Z2020-01-072020-01-07T00:00:00ZHandlehttp://hdl.handle.net/10362/106282http://purl.org/coar/access_right/c_abf2open accessTourism MobilityTrajectory ReconstructionCall Detail RecordsSignaling DataData Sparsity3067385 bytesliteraturehttp://purl.org/coar/resource_type/c_bdccmaster thesis2020-01-07http://creativecommons.org/licenses/by/4.0/http://purl.org/coar/access_right/c_abf2application/pdffulltexthttps://run.unl.pt/bitstreams/1c3c2051-85e5-4dad-b6a7-96ce7ad52d45/download
spellingShingle How much data is enough to track tourists? The tradeoff between data granularity and storage costs
Pereira, Inês Correia
Tourism Mobility
Trajectory Reconstruction
Call Detail Records
Signaling Data
Data Sparsity
status SINGLETON
subject.fl_str_mv Tourism Mobility
Trajectory Reconstruction
Call Detail Records
Signaling Data
Data Sparsity
title How much data is enough to track tourists? The tradeoff between data granularity and storage costs
title_full How much data is enough to track tourists? The tradeoff between data granularity and storage costs
title_fullStr How much data is enough to track tourists? The tradeoff between data granularity and storage costs
title_full_unstemmed How much data is enough to track tourists? The tradeoff between data granularity and storage costs
title_short How much data is enough to track tourists? The tradeoff between data granularity and storage costs
title_sort How much data is enough to track tourists? The tradeoff between data granularity and storage costs
topic Tourism Mobility
Trajectory Reconstruction
Call Detail Records
Signaling Data
Data Sparsity
topic_facet Tourism Mobility
Trajectory Reconstruction
Call Detail Records
Signaling Data
Data Sparsity
url http://hdl.handle.net/10362/106282
visible 1