Publicação

On the improvement of feature selection techniques: the fitness filter

Ver documento

Detalhes bibliográficos
Resumo:The need for feature selection (FS) techniques is central in many machine learning and pattern recognition problems. FS is a vast research field and therefore we now have many FS techniques proposed in the literature, applied in the context of quite different problems. Some of these FS techniques follow the relevance-redundancy (RR) framework to select the best subset of features. In this paper, we propose a supervised filter FS technique, named as fitness filter, that follows the RR framework and uses data discretization. This technique can be used directly on low or medium dimensional data or it can be applied as a post-processing technique to other FS techniques. Specifically, when used as a post-processing technique, it further reduces the dimensionality of the feature space found by common FS techniques and often improves the classification accuracy.
Autores principais:J. Ferreira, Artur
Outros Autores:Figueiredo, Mario
Assunto:Machine learning Feature selection Dimensionality reduction Relevance-redundancy Classification
Ano:2021
País:Portugal
Tipo de documento:documento de conferência
Tipo de acesso:acesso restrito
Instituição associada:Instituto Politécnico de Lisboa
Idioma:inglês
Origem:Repositório Científico do Instituto Politécnico de Lisboa
Descrição
Resumo:The need for feature selection (FS) techniques is central in many machine learning and pattern recognition problems. FS is a vast research field and therefore we now have many FS techniques proposed in the literature, applied in the context of quite different problems. Some of these FS techniques follow the relevance-redundancy (RR) framework to select the best subset of features. In this paper, we propose a supervised filter FS technique, named as fitness filter, that follows the RR framework and uses data discretization. This technique can be used directly on low or medium dimensional data or it can be applied as a post-processing technique to other FS techniques. Specifically, when used as a post-processing technique, it further reduces the dimensionality of the feature space found by common FS techniques and often improves the classification accuracy.