Author(s): de Andrade, Luis Gustavo Modelli [UNESP] ; Tedesco-Silva, Helio
Date: 2020
Persistent ID: http://hdl.handle.net/11449/198507
Origin: Oasisbr
Author(s): de Andrade, Luis Gustavo Modelli [UNESP] ; Tedesco-Silva, Helio
Date: 2020
Persistent ID: http://hdl.handle.net/11449/198507
Origin: Oasisbr
Made available in DSpace on 2020-12-12T01:14:44Z (GMT). No. of bitstreams: 0 Previous issue date: 2020-02-01
Background One overlooked problem in statistical analysis is lateral collinearity, a phenomenon that may occur when the outcome variable derives from the predictors. In nephrology this issue is seen with the use of estimated glomerular filtration rate (eGFR) as an outcome and age, sex, and ethnicity as predictors. In this study with simulated data, we aim to illustrate this problem. Methods We randomly generated unrelated data to estimate eGFR by common equations. Results Using simulated data, we show that age, gender, and ethnicity (recycled predictors variables) are statistically significantly correlated with eGFR in linear regression analysis. Whereas the initial obvious conclusion is that age, sex, and ethnicity are strong predictors of eGFR, more rigorous interpretation suggests that this is a byproduct of the mathematical model produced when deriving new predictors from another. Conclusion While statistical models have the ability to identify vertical collinearity (predictor-predictor), lateral collinearity (predictor-outcome) is seldom identified and discussed in statistical analysis. Therefore, caution is needed when interpreting the correlation between age, gender, and ethnicity with eGFR derived from regression analyses.
Department of Internal Medicine UNESP Univ Estadual Paulista
Hospital do Rim Universidade Federal de São Paulo
Univ Estadual Paulista
Universidade Federal de São Paulo
Department of Internal Medicine UNESP Univ Estadual Paulista
Univ Estadual Paulista