Publicação
Exploring the Impact of Data Scale on Mutation Step Size in SLIM-GSGP
| Resumo: | The Semantic Learning algorithm based on Inflate and deflate Mutation (SLIM) is a promising recent variant of Geometric Semantic Genetic Programming (GSGP) that introduces a new Deflate Geometric Semantic Mutation (DGSM). This operator maintains the key feature of the standard Geometric Semantic Mutation (GSM), inducing a unimodal error surface for any supervised learning problem, while generating smaller offspring than their parents, and thus allowing SLIM to generate compact, and potentially interpretable, final solutions. A key parameter controlling the evolution process in both GSGP and SLIM is the Mutation Step (MS), which regulates the extent of perturbation to the parent semantics. While it is intuitive that the optimal value of MS has a relationship with the scale of the dataset features, to the best of our knowledge no prior research has extensively explored this relationship. In this work, we provide the first comprehensive investigation into this topic. First, we hypothesize a general rule by analyzing results from artificial datasets, and then we confirm these findings with more complex, real-world datasets. This approach offers a solid alternative to the typical hyperparameter tuning approach. |
|---|---|
| Autores principais: | Farinati, Davide |
| Outros Autores: | Pietropolli, Gloria; Vanneschi, Leonardo |
| Assunto: | Genetic Programming Geometric Semantic Genetic Programming Geometric Mutation Mutation Step Symbolic Regression Theoretical Computer Science General Computer Science |
| Ano: | 2025 |
| País: | Portugal |
| Tipo de documento: | documento de conferência |
| Tipo de acesso: | acesso aberto |
| Instituição associada: | Universidade Nova de Lisboa |
| Idioma: | inglês |
| Origem: | Repositório Institucional da UNL |
| Resumo: | The Semantic Learning algorithm based on Inflate and deflate Mutation (SLIM) is a promising recent variant of Geometric Semantic Genetic Programming (GSGP) that introduces a new Deflate Geometric Semantic Mutation (DGSM). This operator maintains the key feature of the standard Geometric Semantic Mutation (GSM), inducing a unimodal error surface for any supervised learning problem, while generating smaller offspring than their parents, and thus allowing SLIM to generate compact, and potentially interpretable, final solutions. A key parameter controlling the evolution process in both GSGP and SLIM is the Mutation Step (MS), which regulates the extent of perturbation to the parent semantics. While it is intuitive that the optimal value of MS has a relationship with the scale of the dataset features, to the best of our knowledge no prior research has extensively explored this relationship. In this work, we provide the first comprehensive investigation into this topic. First, we hypothesize a general rule by analyzing results from artificial datasets, and then we confirm these findings with more complex, real-world datasets. This approach offers a solid alternative to the typical hyperparameter tuning approach. |
|---|
Atividades financiadas
Carregando projetos financiados...