Publication
Text Mining Techniques for Car Price Prediction
| Abstract: | Modern data sources routinely contain information in both unstructured and structured forms, combining text with the usual numerical and categorical data. For instance, on websites dedicated to buying and selling cars, listings typically include a textual description of the car. Others also include a detailed list of numerical or categorical attributes, such as the car's total mileage in kilometers or its model. In this work project, we apply text mining techniques to create predictors for car price regression from unstructured data, namely the textual description in car listings. Two types of predictors were studied: the tf-idf features obtained from the n-gram count matrix, and the singular vectors derived from the decomposition of the tf-idf matrix. We also examine how performance changes when the vocabulary size is reduced by applying stemming or lemmatization, compared with applying neither. In addition, we compare the effects of creating the initial n-gram count matrix with only unigrams, with unigrams and bigrams, or with only bigrams. Our regression experiment shows that Support Vector Regression performs best at car price prediction using text data as predictors, with R² = 0.77, MSE = 0.19 and MAE = 0.32. These results can be seen as respectable given the complex nature of the task. |
|---|---|
| Main authors: | Gonçalves, Ricardo Miguel Galvão |
| Subject: | Text Mining; Regression Analysis; Car Price Prediction |
| Year: | 2022 |
| Country: | Portugal |
| Document type: | master's dissertation |
| Access type: | open access |
| Associated institution: | Universidade Nova de Lisboa |
| Language: | English |
| Source: | Repositório Institucional da UNL |
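
As a rough illustration of the first type of predictor described in the abstract, the following Python sketch builds tf-idf features from an n-gram count matrix. It is not the dissertation's code: the whitespace tokenizer, the `log(N / df) + 1` weighting, the function names `ngrams` and `tfidf`, and the example listings are all assumptions made for illustration. The stemming, lemmatization, singular value decomposition and regression steps are omitted.

```python
import math
from collections import Counter


def ngrams(tokens, n_min=1, n_max=1):
    """Return all n-grams with n_min <= n <= n_max as space-joined strings."""
    return [
        " ".join(tokens[i:i + n])
        for n in range(n_min, n_max + 1)
        for i in range(len(tokens) - n + 1)
    ]


def tfidf(docs, n_min=1, n_max=1):
    """Build a dense tf-idf matrix from per-document n-gram counts.

    Assumed weighting (one common variant, not necessarily the one used
    in the dissertation): raw count * (log(N / df) + 1).
    """
    # Naive tokenizer (assumption): lowercase, then split on whitespace.
    counts = [Counter(ngrams(d.lower().split(), n_min, n_max)) for d in docs]
    vocab = sorted(set().union(*counts))
    n_docs = len(docs)
    # Document frequency: number of documents that contain each n-gram.
    df = {t: sum(1 for c in counts if t in c) for t in vocab}
    idf = {t: math.log(n_docs / df[t]) + 1.0 for t in vocab}
    matrix = [[c[t] * idf[t] for t in vocab] for c in counts]
    return vocab, matrix


# Hypothetical listing descriptions, invented for this example.
listings = [
    "diesel engine low mileage",
    "petrol engine full service history",
    "low mileage one owner",
]

# The three vocabulary settings compared in the abstract.
unigram_vocab, unigram_features = tfidf(listings, 1, 1)
mixed_vocab, mixed_features = tfidf(listings, 1, 2)
bigram_vocab, bigram_features = tfidf(listings, 2, 2)
```

Each row of the returned matrix corresponds to one listing and each column to one n-gram in the sorted vocabulary. In practice the matrix would be stored in sparse form and passed to a regressor, or first reduced to a small number of singular vectors.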
Related records
Applying text mining techniques to forecast the stock market fluctuations of large IT companies with Twitter data: descriptive and predictive approaches to enhance the research of stock market predictions with textual and semantic data
by: Zois, Christos
Published: (2019)
Automatically detect diagnostic patterns based on clinical notes through Text Mining
by: Ribeiro, Joao
Published: (2019)
Big data for stock market by means of mining techniques
by: Lima, Luciana
Published: (2015)
Towards of automatically detecting brain death patterns through text mining
by: Silva, Antonio
Published: (2016)
Sentiment analysis with Text Mining in contexts of Big Data
by: Andrade, Carina Sofia Marinho
Published: (2017)
Automatic information retrieval through text-mining
by: Viana, Hugo Henrique Amorim
Published: (2013)
Forecasting real estate prices in Portugal based on a data science approach
by: Samadani, Sanam
Published: (2021)
Classifying heart sounds using SAX motifs, random forests and text mining techniques
by: Gomes, Elsa Ferreira
Published: (2014)
Pump it : Twitter sentiment analysis for cryptocurrency price prediction
by: Koltun, Vladyslav
Published: (2022)
The impact of microblogging data for stock market prediction: Using Twitter to predict returns, volatility, trading volume and survey sentiment indices
by: Oliveira, Nuno Ernesto Salgado
Published: (2017)
Social media analytics : optimizing Facebook campaign’s performance using text mining
by: Gouveia, Lia Isabel Morais
Published: (2019)
Text mining na análise de sentimentos em contextos de big data
by: Andrade, Carina Sofia Marinho de
Published: (2015)
Attribute selection in hedonic pricing modeling applied to the Portuguese urban housing market
by: Batista, Paulo
Published: (2011)
Development of text mining tools for information retrieval from patents
by: Alves, T.
Published: (2017)
Research trends on Big Data in Marketing: A text mining and topic modeling based literature analysis
by: Amado, Alexandra
Published: (2018)
Using text mining to diagnose and classify epilepsy in children
by: Luis Pereira
Published: (2013)
Insights from a text mining survey on Expert Systems research from 2000 to 2016
by: Cortez, Paulo
Published: (2018)
Development of a machine learning framework for biomedical text mining
by: Rodrigues, Rúben
Published: (2016)
A text mining based supervised learning algorithm for classification of manufacturing suppliers
by: Manupati, V. K.
Published: (2018)
Energy prices and CO2 emission allowance prices : a quantile regression approach
by: Hammoudeh, Shawkat
Published: (2014)
A text mining and topic modelling perspective of ethnic marketing research
by: Moro, Sérgio
Published: (2019)
Applying a text mining framework to the extraction of numerical parameters from scientific literature in the biotechnology domain
by: Santos, André Fernandes
Published: (2012)
Technological Innovations in Decarbonisation Strategies: A Text-Mining Approach to Technological Readiness and Potential
by: Costa, Paulo Moisés
Published: (2024)
Data and text mining from online reviews
by: Moro, Sérgio
Published: (2022)
Integration of Automatic Text Mining and Genomic and Proteomic Analysis to Unravel Prostate Cancer Biomarkers
by: Lima, Tânia
Published: (2022)
Business intelligence in banking: A literature analysis from 2002 to 2013 using Text Mining and latent Dirichlet allocation
by: Moro, Sérgio
Published: (2015)
Intelligent energy management using data mining techniques at Bosch Car Multimedia Portugal facilities
by: Mosavi, Nasim Sadat
Published: (2022)
Text Mining Applied to Electronic Medical Records
by: Pereira, Luis
Published: (2015)
Advanced text mining for annotation of genomic variants
by: Monteiro, Ana Rita Patrício
Published: (2018)
Framework for classroom student grading with open-ended questions: a text-mining approach
by: Vairinhos, Valter Martins
Published: (2022)
Prediction of restrained shrinkage crack width of slag mortar composites using data mining techniques
by: Martins, Francisco F.
Published: (2019)
Estimating the capital cost of underground car parking projects
by: Bastos, Mónica
Published: (2005)
A text mining approach for the extraction of kinetic information from literature
by: Freitas, Ana A.
Published: (2015)
Using sensitivity analysis and visualization techniques to open black box data mining models
by: Cortez, Paulo
Published: (2013)
Improving international attractiveness of higher education institutions based on text mining and sentiment analysis
by: Santos, Carolina Leana
Published: (2018)
Using text mining techniques for classical music scores analysis
by: Simões, Alberto
Published: (2007)
A proactive intelligent decision support system for predicting the popularity of online news
by: Kelwin, Fernandes
Published: (2015)
Hourly prediction of organ failure and outcome in intensive care based on data mining techniques
by: Santos, Manuel Filipe
Published: (2010)