Document details

Stable variable selection method with shrinkage regression applied to the selection of genetic variants associated with Alzheimer’s Disease

Author(s): Afreixo, Vera ; Tavares, Ana Helena ; Enes, Vera ; Pinheiro, Miguel ; Rodrigues, Leonor ; Moura, Gabriela

Date: 2024

Persistent ID: http://hdl.handle.net/10773/41173

Origin: RIA - Repositório Institucional da Universidade de Aveiro

Subject(s): Penalized regression; Akaike’s information criterion; High-dimensional data; Stability; Overall weighted coefficients; Alzheimer’s disease; SNP


Description

In this work, we aimed to establish a stable and accurate procedure with which to perform feature selection in datasets with a much higher number of predictors than individuals, as in genomewide association studies. Due to the instability of feature selection where many potential predictors are measured, a variable selection procedure is proposed that combines several replications of shrinkage regression models. A weighted formulation is used to define the final predictors. The procedure is applied for the investigation of single nucleotide polymorphism (SNP) predictors associated with Alzheimer’s disease in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset. Furthermore, the two following data scenarios are investigated: one that solely considers the set of SNPs, and another with the covariates of age, sex, educational level, and ε4 allele of the Apolipoprotein E (APOE4) genotype. The SNP rs2075650 and the APOE4 genotype are provided as risk factors for Alzheimer’s disease, which is in line with the literature, and another four new SNPs are indicated, thus cultivating new hypotheses for in vivo analyses. These experiments demonstrate the potential of the new method for stable feature selection.

Document Type Journal article
Language English
facebook logo  linkedin logo  twitter logo 
mendeley logo

Related documents