The "care" package implements the regression approach of Zuber and Strimmer (2011). CAR scores measure the correlation between the response and the Mahalanobis-decorrelated predictors. The squared CAR score is a natural measure of variable importance and provides a canonical ordering of variables. This package provides functions for estimating CAR scores, for variable selection using CAR scores, and for estimating corresponding regression coefficients. Both shrinkage as well as empirical estimators are available.
Current Version: 1.1.11
Authors: Verena Zuber and Korbinian Strimmer.
Documentation and Installation:
- Manual (pdf file) and release history.
- Download of care version 1.1.11 source package.
- Archive of previous versions of care.
- Licensed under the GNU GPL version 3 (or any later version).
Publications:
For categorical response the corresponding statistic is the CAT score - see the "st" package and the "sda" package.- V. Zuber and K. Strimmer. 2011. High-dimensional regression and variable selection using CAR scores. Statist. Appl. Genet. Mol. Biol. 10: 34. (arXiv:1007.5516, original bepress version)
- V. Zuber, A. P. Duarte Silva, and K. Strimmer. 2012. A novel algorithm for simultaneous SNP selection in high-dimensional genome-wide association studies. BMC Bioinformatics 13: 284. (arXiv:1203.3082)
- A. Kessy, A. Lewin, and K. Strimmer. 2018. Optimal whitening and decorrelation. The American Statistician 72: 309-314. (arXiv:1512.00809)
Connection of CAR Scores with Optimal Whitening:
- For a discussion of the optimality of the whitening procedure underlying the CAR score see Kessy et al. (2018).
- Essentially, there exists a unique whitening transformation (ZCA-cor whitening) with the property that the whitened variables are maximally correlated to the original variables. Computing CAR scores for the original data is identical to computing marginal correlations from the optimally whitened data.
R Examples from 2011 Paper:
- R code for reproducing the simulations described in the 2011 methodology paper.
-
CAR regression models on diabetes data - see also the R notebook for analysis of diabetes data and the corresponding PDF vignette:
. - Comparison of regression models based CAR scores and marginal correlations for brain data - see also the R notebook for analysis of the brain data and the corresponding PDF vignette:
.
GAW17 Case Study: Simultaneous SNP Selection with CAR Scores
- Comparison of SNP rankings for phenotype Q1:
. - R code for the analysis of the GAW17 data.