Document details

Spatio-temporal pattern mining from global positioning systems (GPS) trajectories dataset

Author(s): Akal, Tigabu Dagne

Date: 2015

Persistent ID: http://hdl.handle.net/10362/31766

Origin: Repositório Institucional da UNL

Subject(s): Spatio; Trajectories dataset


Description

Dissertation submitted in partial fulfilment of the requirements for the degree of Master of Science in Geospatial Technologies

The increasing frequency of use location-acquisition technology like the Global Positioning System is leading to the collection of large spatio-temporal datasets. The prospect of discovering usable knowledge about movement behavior, which encourages for the discovery of interesting relationships and characteristics users that may exist implicitly in spatial databases. Therefore spatial data mining is emerging as a novel area of research. In this study, the experiments were conducted following the Knowledge Discovery in Database process model. The Knowledge Discovery in Database process model starts from selection of the datasets. The GPS trajectory dataset for this research collected from Microsoft Research Asia Geolife project. After taking the data, it has been preprocessed. The major preprocessing activities include:  Fill in missed values and remove outliers;  Resolve inconsistencies, integration of data that contains both labeled and unlabeled datasets,  Dimensionality reduction, size reduction and data transformation activity like discretization tasks were done for this study. A total of 4,273 trajectory dataset are used for training the models. For validating the performance of the selected model a separate 1,018 records are used as a testing set. For building a spatiotemporal model of this study the K-nearest Neighbors (KNN), decision tree and Bayes algorithms have been tasted as supervised approach. The model that was created using 10-fold cross validation with K value 11 and other default parameter values showed the best classification accuracy. The model has a prediction accuracy of 98.5% on the training datasets and 93.12% on the test dataset to classify the new instances as bike, bus, car, subway, train and walk classes. The findings of this study have shown that the spatiotemporal data mining methods help to classify user mobility transportation modes. Future research directions are forwarded to come up an applicable system in the area of the study.

Document Type Master thesis
Language English
Advisor(s) Henriques, Roberto André Pereira
Contributor(s) RUN
CC Licence
facebook logo  linkedin logo  twitter logo 
mendeley logo

Related documents

No related documents