Publicação

Customer churn prediction

Ver documento

Detalhes bibliográficos
Resumo:Customer churn prediction is a critical task for businesses operating in competitive markets, especially in the context of online retail. Identifying customers at risk of leaving a service or product allows businesses to implement proactive retention strategies and maintain long-term profitability. This thesis aims to investigate the factors influencing customer churn in online retail and develop predictive models to anticipate churn behavior. Leveraging machine learning techniques, interpretability, and explainability, this study explores the impact of various customer attributes such as demographic information, purchasing behavior, and satisfaction scores on churn prediction. The analysis uses a comprehensive dataset containing customer attributes, transaction history, and response to marketing campaigns. By employing logistic regression models, gradient boosting models and advanced interpretability techniques such as SHAP (SHapley Additive exPlanations), this research aims to provide actionable insights for businesses to mitigate churn and enhance customer retention strategies in the online retail landscape. The findings highlight the significance of features such as average transaction amount, annual income, and recency of last purchase in predicting customer churn, and demonstrate the superior performance of gradient boosting models over logistic regression models in this context.
Autores principais:Fumo, Dalton
Assunto:Churn prediction Logistic regression Gradient boosting Interpretability & explainability SHAP Previsão de churn Regressão logística Interpretabilidade e explicabilidade
Ano:2024
País:Portugal
Tipo de documento:dissertação de mestrado
Tipo de acesso:acesso aberto
Instituição associada:Universidade Católica Portuguesa
Idioma:inglês
Origem:Veritati - Repositório Institucional da Universidade Católica Portuguesa
Descrição
Resumo:Customer churn prediction is a critical task for businesses operating in competitive markets, especially in the context of online retail. Identifying customers at risk of leaving a service or product allows businesses to implement proactive retention strategies and maintain long-term profitability. This thesis aims to investigate the factors influencing customer churn in online retail and develop predictive models to anticipate churn behavior. Leveraging machine learning techniques, interpretability, and explainability, this study explores the impact of various customer attributes such as demographic information, purchasing behavior, and satisfaction scores on churn prediction. The analysis uses a comprehensive dataset containing customer attributes, transaction history, and response to marketing campaigns. By employing logistic regression models, gradient boosting models and advanced interpretability techniques such as SHAP (SHapley Additive exPlanations), this research aims to provide actionable insights for businesses to mitigate churn and enhance customer retention strategies in the online retail landscape. The findings highlight the significance of features such as average transaction amount, annual income, and recency of last purchase in predicting customer churn, and demonstrate the superior performance of gradient boosting models over logistic regression models in this context.