University of Illinois Chicago
Browse

Using LASSO regression to detect predictive aggregate effects in genetic studies

Download (116.36 kB)
journal contribution
posted on 2012-04-30, 00:00 authored by Joel B Fontanarosa, Yang Dai
We use least absolute shrinkage and selection operator (LASSO) regression to select genetic markers and phenotypic features that are most informative with respect to a trait of interest. We compare several strategies for applying LASSO methods in risk prediction models, using the Genetic Analysis Workshop 17 exome simulation data consisting of 697 individuals with information on genotypic and phenotypic features (smoking, age, sex) in 5-fold cross-validated fashion. The cross-validated averages of the area under the receiver operating curve range from 0.45 to 0.63 for different strategies using only genotypic markers. The same values are improved to 0.69–0.87 when both genotypic and phenotypic information are used. The ability of the LASSO method to find true causal markers is limited, but the method was able to discover several common variants (e.g., FLT1) under certain conditions.

Funding

The Genetic Analysis Workshop is supported by National Institutes of Health grant R01 GM031575.

History

Publisher Statement

© 2011 Fontanarosa and Dai; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. DOI: 10.1186/1753-6561-5-S9-S69

Publisher

BioMed Central

Language

  • en_US

issn

1753-6561

Issue date

2010-10-01

Usage metrics

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC