Document details

Exploring SLUG

Author(s): Rodrigues, Nuno M. ; Batista, João E. ; La Cava, William ; Vanneschi, Leonardo ; Silva, Sara

Date: 2024

Persistent ID: http://hdl.handle.net/10362/161429

Origin: Repositório Institucional da UNL

Project/scholarship: info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F00408%2F2020/PT; info:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDP%2F00408%2F2020/PT; info:eu-repo/grantAgreement/FCT/3599-PPCDT/DSAIPA%2FDS%2F0113%2F2019/PT;

Subject(s): Feature selection; Epistasis; Genetic Programming; Genetic algorithms; Wrapper; Learner; Machine learning; Computer Science(all); Computer Science Applications; Computer Networks and Communications; Computer Graphics and Computer-Aided Design; Computational Theory and Mathematics; Artificial Intelligence


Description

Rodrigues, N. M., Batista, J. E., La Cava, W., Vanneschi, L., & Silva, S. (2024). Exploring SLUG: Feature Selection Using Genetic Algorithms and Genetic Programming. SN Computer Science, 5(1), 1-17. [91]. https://doi.org/10.1007/s42979-023-02106-3 --- Open access funding provided by FCT|FCCN (b-on). This work was partially supported by the FCT, Portugal, through funding of the LASIGE Research Unit (UIDB/00408/2020 and UIDP/00408/2020); MAR2020 program via project MarCODE (MAR$$-$$01.03.01-FEAMP-0047); project AICE (DSAIPA/DS/0113/2019). Nuno Rodrigues and João Batista were supported by PhD Grants 2021/05322/BD and SFRH/BD/143972/2019, respectively; William La Cava was supported by the National Library Of Medicine of the National Institutes of Health under Award Number R00LM012926

We present SLUG, a recent method that uses genetic algorithms as a wrapper for genetic programming and performs feature selection while inducing models. SLUG was shown to be successful on different types of classification tasks, achieving state-of-the-art results on the synthetic datasets produced by GAMETES, a tool for embedding epistatic gene–gene interactions into noisy datasets. SLUG has also been studied and modified to demonstrate that its two elements, wrapper and learner, are the right combination that grants it success. We report these results and test SLUG on an additional six GAMETES datasets of increased difficulty, for a total of four regular and 16 epistatic datasets. Despite its slowness, SLUG achieves the best results and solves all but the most difficult classification tasks. We perform further explorations of its inner dynamics and discover how to improve the feature selection by enriching the communication between wrapper and learner, thus taking the first step toward a new and more powerful SLUG.

Document Type Journal article
Language English
Contributor(s) NOVA Information Management School (NOVA IMS); Information Management Research Center (MagIC) - NOVA Information Management School; RUN
facebook logo  linkedin logo  twitter logo 
mendeley logo

Related documents