Publication
Variable selection methods in high-dimensional regression: a simulation study
| Abstract: | A challenging problem in the analysis of high-dimensional data is variable selection. In this study, we describe a bootstrap-based technique for selecting predictors in partial least-squares regression (PLSR) and principal component regression (PCR) for high-dimensional data. Using bootstrap-based significance tests of the regression coefficients, a subset of the original variables can be selected for inclusion in the regression, yielding a more parsimonious model with smaller prediction errors. We compare the bootstrap approach with several other variable selection approaches (jackknife and sparse formulation-based methods) for PCR and PLSR on simulated and real data. |
|---|---|
| Main authors: | Shahriari, Shirin |
| Other authors: | Faria, Susana; Gonçalves, A. Manuela |
| Subject: | High-dimensional data; Partial least-squares regression; Principal component regression; Variable selection; Bootstrap |
| Year: | 2015 |
| Country: | Portugal |
| Document type: | article |
| Access type: | restricted access |
| Associated institution: | Universidade do Minho |
| Language: | English |
| Source: | RepositóriUM - Universidade do Minho |
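
The bootstrap-based selection described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: the record does not give the exact test, so the sketch assumes a percentile bootstrap interval on PCR coefficients and keeps a variable when its interval excludes zero. The function names `pcr_coefficients` and `bootstrap_select`, the defaults, and the toy data are all hypothetical.

```python
import numpy as np

def pcr_coefficients(X, y, n_components):
    """Fit principal component regression; return coefficients on the centered predictors."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    # Rows of Vt are the principal directions of the centered predictors.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    V = Vt[:n_components].T
    scores = Xc @ V
    # Regress the response on the retained component scores.
    gamma, *_ = np.linalg.lstsq(scores, yc, rcond=None)
    # Map the score coefficients back to the original variables.
    return V @ gamma

def bootstrap_select(X, y, n_components=2, n_boot=500, alpha=0.05, seed=0):
    """Return indices of variables whose percentile bootstrap interval excludes zero."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    boot = np.empty((n_boot, p))
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample observations with replacement
        boot[b] = pcr_coefficients(X[idx], y[idx], n_components)
    lower = np.quantile(boot, alpha / 2, axis=0)
    upper = np.quantile(boot, 1 - alpha / 2, axis=0)
    return np.flatnonzero((lower > 0) | (upper < 0))

# Toy usage (assumed data): only the first two predictors drive the response.
rng = np.random.default_rng(42)
X_demo = rng.normal(size=(80, 6))
y_demo = 3 * X_demo[:, 0] - 2 * X_demo[:, 1] + rng.normal(size=80)
selected = bootstrap_select(X_demo, y_demo, n_components=6, n_boot=200)
print(selected)
```

The selected indices would then define the reduced predictor set on which PCR (or, analogously, PLSR) is refitted. A percentile interval is only one of several possible bootstrap tests.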
Related records
Variable selection in linear regression models with large number of predictors
by: Shahriari, Shirin
Published: (2014)
Outlier detection and robust variable selection for least angle regression
by: Shahriari, Shirin
Published: (2014)
Robust linear model selection in high dimensional data
by: Shahriari, Shirin
Published: (2013)
High-throughput FTIR-based bioprocess analysis of recombinant cyprosin production
by: Sampaio, Pedro
Published: (2017)
Predicting SO2 pollution incidents by means of additive models with optimum variable selection
by: Sestelo, Marta
Published: (2014)
UV spectrophotometry method for the monitoring of galacto-oligosaccharides production
by: Dias, Luís G.
Published: (2009)
In situ near infrared spectroscopy monitoring of cyprosin production by recombinant Saccharomyces cerevisiae strains
by: Sampaio, Pedro
Published: (2014)
Evaluation of green coffee beans quality using near infrared spectroscopy: A quantitative approach
by: Santos, João Rodrigo
Published: (2012)
Forecasting in data-rich environments
by: Conraria, Luís Aguiar
Published: (2004)
Evaluation of quality parameters of apple juices using near-infrared spectroscopy and chemometrics
by: Wlodarska, Katarzyna
Published: (2018)
Automatic identification of activated sludge disturbances and assessment of operational parameters
by: Amaral, A. L.
Published: (2013)
The impact of banking crises on public debt : the case-study of Portugal (1970-2015)
by: Pinto, Nuno Dias Duarte Parraça
Published: (2021)
Estimation of effluent quality parameters from an activated sludge system using quantitative image analysis
by: Mesquita, D. P.
Published: (2016)
Automatic selection of indicators in a fully saturated regression
by: Santos, Carlos
Published: (2008)
Monitoring morphological changes from activated sludge to aerobic granular sludge under distinct organic loading rates and increasing minimal imposed sludge settling velocities through quantitative image analysis
by: Silva, Sérgio Alves
Published: (2022)
Nonparametric regression with doubly truncated data
by: Moreira, Carla
Published: (2016)
Correlation between sludge settleability and image analysis information using Partial Least Squares
by: Mesquita, D. P.
Published: (2008)
Comparison of partial least squares-discriminant analysis and soft independent modeling of class analogy methods for classification of Saccharomyces cerevisiae cells based on mid-infrared spectroscopy
by: Sampaio, Pedro
Published: (2021)
Classifying High-Dimensional Data with the HiDimDA package
by: Duarte Silva, A. P.
Published: (2013)
Two-way relationship between inequality and growth within fiscal policy channel : an empirical assessment for European countries
by: Coelho, José Carlos
Published: (2021)
Estimation of composition of quinoa (Chenopodium quinoa Willd.) grains by Near-Infrared Transmission spectroscopy
by: Encina-Zelada, Christian René
Published: (2017)
Application of image analysis to the prediction of EBC barley kernel weight distribution
by: Amaral, A. L.
Published: (2009)
A Chemometric Analysis of Soil Health Indicators Derived from Mid-Infrared Spectra
by: Almendros, Gonzalo
Published: (2025)
Consistency and efficiency of ordinary least squares, maximum likelihood, and three type II linear regression models: A Monte-Carlo simulation study
by: Maroco, João
Published: (2007)
A deep regression model with low-dimensional feature extraction for multi-parameter manufacturing quality prediction
by: Deng, Jun
Published: (2020)
Predicting site index from climate and soil variables for cork oak (Quercus suber L.) stands in Portugal
by: Paulo, Joana Amaral
Published: (2015)
Correlation between sludge settling ability and image analysis information using partial least squares
by: Mesquita, D. P.
Published: (2009)
Quantitative image analysis for assessing extracellular polymeric substances in activated sludge under atrazine exposure
by: Melo, Antonio
Published: (2024)
Identification with averaged data and implications for hedonic regression studies
by: Silva, João Santos
Published: (2010)
Growth accounting and regressions : new approach and results
by: Sequeira, Tiago
Published: (2020)
Morphological characterisation of biomass in wastewater treatment using partial least squares
by: Amaral, A. L.
Published: (2002)
Classification of recombinant Saccharomyces cerevisiae cells using PLS-DA modelling based on MIR spectroscopy
by: Sampaio, Pedro
Published: (2019)
Comparative analysis of anatomical characteristics and phenolic compounds of two highbush blueberry (Vaccinium corymbosum L.) cultivars with different rooting ability of semi-hardwood cuttings
by: Santos-Rufo, Antonio
Published: (2024)
Family management and firm performance in family SMEs: the mediating roles of management control systems and technological innovation
by: Ruiz-Palomo, Daniel
Published: (2019)
Efficient feature selection filters for high-dimensional data
by: J. Ferreira, Artur
Published: (2012)
Valuing biodiversity enhancement in New Zealand's planted forests: socioeconomic and spatial determinants of willingness-to-pay
by: Yao, Richard T.
Published: (2014)
Strategic alliances and competitive performance in the pharmaceutical industry
by: Rocha-Gonçalves, Francisco
Published: (2008)
Creating value from intellectual capital : an approach based on the specification of models
by: Cabrita, Maria do Rosário
Published: (2008)
Spectra Fusion of Mid-Infrared (MIR) and X-ray Fluorescence (XRF) Spectroscopy for Estimation of Selected Soil Fertility Attributes
by: Kandpal, Lalit M.
Published: (2022)
Prediction of intracellular storage polymers using quantitative image analysis in enhanced biological phosphorus removal systems
by: Mesquita, D. P.
Published: (2013)