Publicação
Enabling network inference methods to handle missing data and outliers
| Resumo: | The inference of complex networks from data is a challenging problem in biological sciences, as well as in a wide range of disciplines such as chemistry, technology, economics, or sociology. The quantity and quality of the data greatly affect the results. While many methodologies have been developed for this task, they seldom take into account issues such as missing data or outlier detection and correction, which need to be properly addressed before network inference. Results Here we present an approach to (i) handle missing data and (ii) detect and correct outliers based on multivariate projection to latent structures. The method, called trimmed scores regression (TSR), enables network inference methods to analyse incomplete datasets by imputing the missing values coherently with the latent data structure. Furthermore, it substitutes the faulty values in a dataset by proper estimations. We provide an implementation of this approach, and show how it can be integrated with any network inference method as a preliminary data curation step. This functionality is demonstrated with a state of the art network inference method based on mutual information distance and entropy reduction, MIDER. Conclusion The methodology presented here enables network inference methods to analyse a large number of incomplete and faulty datasets that could not be reliably analysed so far. Our comparative studies show the superiority of TSR over other missing data approaches used by practitioners. Furthermore, the method allows for outlier detection and correction. |
|---|---|
| Autores principais: | Folch-Fortuny, Abel |
| Outros Autores: | Villaverde, Alejandro F.; Ferrer, Alberto; Banga, Julio R. |
| Assunto: | Network inference Missing data Outlier detection Projection to latent structures Trimmed scores regression Information theory Mutual information |
| Ano: | 2015 |
| País: | Portugal |
| Tipo de documento: | artigo |
| Tipo de acesso: | acesso aberto |
| Instituição associada: | Universidade do Minho |
| Idioma: | inglês |
| Origem: | RepositóriUM - Universidade do Minho |
Registos relacionados
article Outlier detection and robust variable selection for least angle regression
por: Shahriari, Shirin
Publicado em: (2014)
por: Shahriari, Shirin
Publicado em: (2014)
category A non parametric robust method for the detection of outliers in linear models
por: Faria, Susana
Publicado em: (2006)
por: Faria, Susana
Publicado em: (2006)
school Detection of outliers and outliers clustering on large datasets with distributed computing
por: Pais, Rui Manuel Aleixo
Publicado em: (2012)
por: Pais, Rui Manuel Aleixo
Publicado em: (2012)
article Robust identification of target genes and outliers in triple-negative breast cancer data
por: Segaert, Pieter
Publicado em: (2018)
por: Segaert, Pieter
Publicado em: (2018)
article Robust order selection of mixtures of regression models with random effects
por: Novais, Luísa
Publicado em: (2021)
por: Novais, Luísa
Publicado em: (2021)
groups Palynology of the kingscourt outlier (Ireland)
por: Fernandes, Paulo
Publicado em: (2012)
por: Fernandes, Paulo
Publicado em: (2012)
article Ensemble outlier detection and gene selection in triple-negative breast cancer data
por: Lopes, Marta B.
Publicado em: (2018)
por: Lopes, Marta B.
Publicado em: (2018)
article Outlier detection in interval data
por: Silva, A. Pedro Duarte
Publicado em: (2018)
por: Silva, A. Pedro Duarte
Publicado em: (2018)
school Variable selection in linear regression models with large number of predictors
por: Shahriari, Shirin
Publicado em: (2014)
por: Shahriari, Shirin
Publicado em: (2014)
groups Jurassic palynostratigraphy of the Algarve Basin and the Carrapateira Outlier, Southern Portugal
por: Borges, Marisa
Publicado em: (2011)
por: Borges, Marisa
Publicado em: (2011)
school Improvement of data quality through outlier detection in time series
por: Alves, Bruno Miguel Pinheiro
Publicado em: (2024)
por: Alves, Bruno Miguel Pinheiro
Publicado em: (2024)
article The Jurassic (Pliensbachian to Kimmeridgian) palynology of the Algarve Basin and the Carrapateira outlier, southern Portugal
por: Borges, Marisa
Publicado em: (2011)
por: Borges, Marisa
Publicado em: (2011)
article PREMER: Parallel reverse engineering of biological networks with information theory
por: Villaverde, A. F.
Publicado em: (2016)
por: Villaverde, A. F.
Publicado em: (2016)
article Worldwide interlaboratory study on the determination of ochratoxin A in different wine type samples
por: Ratola, N.
Publicado em: (2006)
por: Ratola, N.
Publicado em: (2006)
groups Middle-Upper Jurassic palynology of the Sagres region and the Carrapateira outlier: southern Portugal
por: Borges, Marisa
Publicado em: (2010)
por: Borges, Marisa
Publicado em: (2010)
article Outliers impact on parameter estimation of gaussian and non-gaussian state space models: a simulation study
por: Pereira, Fernanda Catarina
Publicado em: (2022)
por: Pereira, Fernanda Catarina
Publicado em: (2022)
category Robust linear model selection in high dimensional data
por: Shahriari, Shirin
Publicado em: (2013)
por: Shahriari, Shirin
Publicado em: (2013)
article Incremental volumetric and Dual Kriging remapping methods
por: Miguel, C.
Publicado em: (2018)
por: Miguel, C.
Publicado em: (2018)
article A robust sparce linear approach for contamined data
por: Shahriari, Shirin
Publicado em: (2019)
por: Shahriari, Shirin
Publicado em: (2019)
draft Outlier robust specification of multiplicative time-varying volatility models
por: Amado, Cristina
Publicado em: (2022)
por: Amado, Cristina
Publicado em: (2022)
school Deteção de fraude em telecomunicações através de machine learning
por: Caldas, Luísa Lopes
Publicado em: (2019)
por: Caldas, Luísa Lopes
Publicado em: (2019)
article Systems, economics and neoliberal politics: theories to understand missed nursing care
por: Jones, Terry
Publicado em: (2020)
por: Jones, Terry
Publicado em: (2020)
school Downside risk measures: a comparison of risk models that account for outliers and parameter uncertainty
por: Costa, Luis Fernando Corrêa da
Publicado em: (2020)
por: Costa, Luis Fernando Corrêa da
Publicado em: (2020)
article Making sustainability tensions salient: changing information or people?
por: Manzhynski, Siarhei
Publicado em: (2025)
por: Manzhynski, Siarhei
Publicado em: (2025)
article An implementation of neural simulation-based inference for parameter estimation in ATLAS
por: Castro, Nuno Filipe
Publicado em: (2025)
por: Castro, Nuno Filipe
Publicado em: (2025)
book Becoming Jane e Miss Austen Regrets: representações e projeções de Jane Austen no filme biográfico
por: Pereira, Margarida Esteves
Publicado em: (2017)
por: Pereira, Margarida Esteves
Publicado em: (2017)
article Improving predictive accuracy in the context of dynamic modelling of non-stationary time series with outliers
por: Pereira, Fernanda Catarina
Publicado em: (2023)
por: Pereira, Fernanda Catarina
Publicado em: (2023)
book Dominant set approach to ECG biometrics
por: Lourenço, André Ribeiro
Publicado em: (2013)
por: Lourenço, André Ribeiro
Publicado em: (2013)
school O fear of missing out, a geração e o género como moderadores da relação entre o social media burnout e a intenção de permanência: caso instagram
por: Silva, Juliana Fernandes da
Publicado em: (2021)
por: Silva, Juliana Fernandes da
Publicado em: (2021)
article Acknowledging the role of word-based activation in spontaneous trait inferences
por: Orghian, Diana
Publicado em: (2018)
por: Orghian, Diana
Publicado em: (2018)
article The advantages of Structural Equation Modelling to address the complexity of spatial reference learning
por: Moreira, Pedro Miguel Silva
Publicado em: (2016)
por: Moreira, Pedro Miguel Silva
Publicado em: (2016)
article Data analytics to advance the inference of origin–destination in public transport systems: tracing network vulnerabilities and age-sensitive trip purposes
por: Cerqueira, Sofia
Publicado em: (2025)
por: Cerqueira, Sofia
Publicado em: (2025)
article Multiplicity lists for symmetric matrices whose graphs have few missing edges
por: Johnson, Charles R.
Publicado em: (2018)
por: Johnson, Charles R.
Publicado em: (2018)
article Combining intention and emotional state inference in a dynamic neural field architecture for human-robot joint action
por: Silva, Rui
Publicado em: (2016)
por: Silva, Rui
Publicado em: (2016)
article Cardiac arrhythmia detection by parameters sharing and MMIE training of hidden Markov models
por: Lima, C. S.
Publicado em: (2007)
por: Lima, C. S.
Publicado em: (2007)
article Information theory, synchronization and topological order in complete dynamical networks of discontinuous maps
por: Rocha, J. Leonel
Publicado em: (2021)
por: Rocha, J. Leonel
Publicado em: (2021)
school Controlo da tuberculose latente : intervenção a grupo de enfermeiros de uma unidade de saúde pública
por: Fortunato, Katia Rodrigues Dinis
Publicado em: (2022)
por: Fortunato, Katia Rodrigues Dinis
Publicado em: (2022)
article Too young to correct: A developmental test of the three-stage model of social inference
por: Haga, Sara
Publicado em: (2014)
por: Haga, Sara
Publicado em: (2014)
book A dynamic field approach to goal inference, error detection and anticipatory action selection in human-robot collaboration
por: Bicho, E.
Publicado em: (2011)
por: Bicho, E.
Publicado em: (2011)
article Relationship between oral health and physical activity in a young population aged 6–18 years from Seixal's public schools, Portugal (2011–2014)
por: Rodrigues, Octávio
Publicado em: (2017)
por: Rodrigues, Octávio
Publicado em: (2017)
Registos relacionados
-
article Outlier detection and robust variable selection for least angle regression
por: Shahriari, Shirin
Publicado em: (2014) -
category A non parametric robust method for the detection of outliers in linear models
por: Faria, Susana
Publicado em: (2006) -
school Detection of outliers and outliers clustering on large datasets with distributed computing
por: Pais, Rui Manuel Aleixo
Publicado em: (2012) -
article Robust identification of target genes and outliers in triple-negative breast cancer data
por: Segaert, Pieter
Publicado em: (2018) -
article Robust order selection of mixtures of regression models with random effects
por: Novais, Luísa
Publicado em: (2021)