Publicação

Evolutionary feature selection for time-series forecasting

Ver documento

Detalhes bibliográficos
Resumo:In machine learning, feature selection is crucial for pinpointing the key subset of features that enhances interpretability and preserves or boosts the model’s original performance. Filter methods, which assess features using statistical metrics, are particularly notable. Recently, a novel metric called Conditional Dependence Coefficient has been proposed to measure the dependence between subsets of variables. This paper introduces a novel filter feature selection method that integrates the Conditional Dependence Coefficient metric with an evolutionary algorithm to find the optimal feature subset. This approach combines the adaptability of genetic algorithms with the strength of an intuitive metric. Unlike many filter-based methods, our technique does not rely on parameters directly linked to the number of features (like thresholds). Moreover, it evaluates the collective merit of feature subsets rather than individual significance. We conducted tests on six different multivariate time-series datasets to address the forecasting challenge, determining the relevant lags. Considering no selection as baseline, experimental results indicate that our approach is competitive in terms of efficacy while demonstrating a reduction in the number of features selected.
Autores principais:Linares-Barrera, Maria Lourdes
Outros Autores:Jimenez-Navarro, Manuel J.; Brito, Isabel; Riquelme, José; Martínez-Ballesteros, María
Assunto:Machine learning Feature selection Genetic algorithm Regression Time-series forecasting
Ano:2024
País:Portugal
Tipo de documento:outro
Tipo de acesso:acesso aberto
Instituição associada:Instituto Politécnico de Beja
Idioma:inglês
Origem:Repositório Institucional do IPBeja
Descrição
Resumo:In machine learning, feature selection is crucial for pinpointing the key subset of features that enhances interpretability and preserves or boosts the model’s original performance. Filter methods, which assess features using statistical metrics, are particularly notable. Recently, a novel metric called Conditional Dependence Coefficient has been proposed to measure the dependence between subsets of variables. This paper introduces a novel filter feature selection method that integrates the Conditional Dependence Coefficient metric with an evolutionary algorithm to find the optimal feature subset. This approach combines the adaptability of genetic algorithms with the strength of an intuitive metric. Unlike many filter-based methods, our technique does not rely on parameters directly linked to the number of features (like thresholds). Moreover, it evaluates the collective merit of feature subsets rather than individual significance. We conducted tests on six different multivariate time-series datasets to address the forecasting challenge, determining the relevant lags. Considering no selection as baseline, experimental results indicate that our approach is competitive in terms of efficacy while demonstrating a reduction in the number of features selected.