A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank
Published in PLOS Genetics, 2020
In this project led by Junyang Qian, we developed BASIL, a novel algorithm to fit large-scale L1 penalized (Lasso) regression model using an iterative procedure, and implemented R snpnet package specially designed for genetic data. We demonstrate the ability of this approach in an application to UK Biobank dataset.
How to use the snpnet
package
Please check out our GitHub repo for the snpnet
package. It has some sample data and vignette documents that describe the usage of the package.
Reference: J. Qian, Y. Tanigawa, W. Du, M. Aguirre, C. Chang, R. Tibshirani, M. A. Rivas, T. Hastie, A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank. PLoS Genet. 16, e1009141 (2020). https://doi.org/10.1371/journal.pgen.1009141