Publicação

Exploring the Impact of Data Scale on Mutation Step Size in SLIM-GSGP

Ver documento

Detalhes bibliográficos
Resumo:The Semantic Learning algorithm based on Inflate and deflate Mutation (SLIM) is a promising recent variant of Geometric Semantic Genetic Programming (GSGP) that introduces a new Deflate Geometric Semantic Mutation (DGSM). This operator maintains the key feature of the standard Geometric Semantic Mutation (GSM), inducing a unimodal error surface for any supervised learning problem, while generating smaller offspring than their parents, and thus allowing SLIM to generate compact, and potentially interpretable, final solutions. A key parameter controlling the evolution process in both GSGP and SLIM is the Mutation Step (MS), which regulates the extent of perturbation to the parent semantics. While it is intuitive that the optimal value of MS has a relationship with the scale of the dataset features, to the best of our knowledge no prior research has extensively explored this relationship. In this work, we provide the first comprehensive investigation into this topic. First, we hypothesize a general rule by analyzing results from artificial datasets, and then we confirm these findings with more complex, real-world datasets. This approach offers a solid alternative to the typical hyperparameter tuning approach.
Autores principais:Farinati, Davide
Outros Autores:Pietropolli, Gloria; Vanneschi, Leonardo
Assunto:Genetic Programming Geometric Semantic Genetic Programming Geometric Mutation Mutation Step Symbolic Regression Theoretical Computer Science General Computer Science
Ano:2025
País:Portugal
Tipo de documento:documento de conferência
Tipo de acesso:acesso aberto
Instituição associada:Universidade Nova de Lisboa
Idioma:inglês
Origem:Repositório Institucional da UNL
Descrição
Resumo:The Semantic Learning algorithm based on Inflate and deflate Mutation (SLIM) is a promising recent variant of Geometric Semantic Genetic Programming (GSGP) that introduces a new Deflate Geometric Semantic Mutation (DGSM). This operator maintains the key feature of the standard Geometric Semantic Mutation (GSM), inducing a unimodal error surface for any supervised learning problem, while generating smaller offspring than their parents, and thus allowing SLIM to generate compact, and potentially interpretable, final solutions. A key parameter controlling the evolution process in both GSGP and SLIM is the Mutation Step (MS), which regulates the extent of perturbation to the parent semantics. While it is intuitive that the optimal value of MS has a relationship with the scale of the dataset features, to the best of our knowledge no prior research has extensively explored this relationship. In this work, we provide the first comprehensive investigation into this topic. First, we hypothesize a general rule by analyzing results from artificial datasets, and then we confirm these findings with more complex, real-world datasets. This approach offers a solid alternative to the typical hyperparameter tuning approach.

Atividades financiadas

Carregando projetos financiados...