Hypometric genetics: Improved power in genetic discovery by incorporating quality control flags

Published in The American Journal of Human Genetics, 2024

We introduce “hypometric genetics,” an approach to investigate the genetic basis of binarized traits representing the presence of below-the-limit-of-quantification (BLQ) quality control indicators. We show a joint analysis of BLQ and quantitative traits leads to improved power in genetic discovery, highlighting the value of considering population-based samples and phenotypic extremes in genetic studies. Links to Article full text and MIT CSAIL News.

Abstract

Balancing the tradeoff between quantity and quality of phenotypic data is critical in omics studies. Measurements below the limit of quantification (BLQ) are often tagged in quality control fields, but these flags are currently underutilized in human genetics studies. Extreme phenotype sampling is advantageous for mapping rare variant effects. We hypothesize that genetic drivers, along with environmental and technical factors, contribute to the presence of BLQ flags. Here, we introduce “hypometric genetics” (hMG) analysis and uncover a genetic basis for BLQ flags, indicating an additional source of genetic signal for genetic discovery, especially from phenotypic extremes. Applying our hMG approach to n = 227,469 UK Biobank individuals with metabolomic profiles, we reveal more than 5% heritability for BLQ flags and report biologically relevant associations, for example, at APOC3, APOA5, and PDE3B loci. For common variants, polygenic scores trained only for BLQ flags predict the corresponding quantitative traits with 91% accuracy, validating the genetic basis. For rare coding variant associations, we find an asymmetric 65.4% higher enrichment of metabolite-lowering associations for BLQ flags, highlighting the impact of putative loss-of-function variants with large effects on phenotypic extremes. Joint analysis of binarized BLQ flags and the corresponding quantitative metabolite measurements improves power in Bayesian rare variant aggregation tests, resulting in an average of 181% more prioritized genes. Our approach is broadly applicable to omics profiling. Overall, our results underscore the benefit of integrating quality control flags and quantitative measurements and highlight the advantage of joint analysis of population-based samples and phenotypic extremes in human genetics studies.

Tanigawa and Kellis. Am J Hum Genet. 2024. Graphical abstract

Highlights

Omics profiling coupled with human genetic studies provides excellent opportunities to investigate the genetic basis of disease and intermediate traits. Laboratory measures often include quality control flags, such as below-the-limit-of-quantification (BLQ).

Laboratory measures often include quality control flags, such as below-the-limit-of-quantification (BLQ)

We hypothesize that extremely low measurement values tagged with BLQ flags are influenced by a combination of technical and biological factors. We call those measurements ‘hypometric’ and set to investigate the genetic basis using metabolomics profiles in UK Biobank.

We revealed that the binarized traits representing the presence or absence of BLQ flags have a genetic basis. One may conduct genetic analyses with the binarized BLQ traits, including common and rare association mapping, polygenic score modeling, and gene prioritization.

The binarized traits representing the presence or absence of BLQ flags have a genetic basis

On the common variants, genetic associations are similar between the binarized BLQ traits and the corresponding quantitative traits. Polygenic scores trained only on the binarized BLQ traits predict quantitative traits at ~91% accuracy.

The shared genetic basis between BLQ and quantitative traits allows polygenic prediction from BLQ traits.

On the rare variants, metabolite-lowering effects are enriched for genetic associations in BLQ traits, highlighting the utilities in analyzing individuals flagged with BLQ traits: they carry extremely low concentrations of metabolites, i.e., phenotypic extremes.

Rare variant associations are enriched for metabolite-lowering effects in BLQ traits.

We show that joint analysis of binarized BLQ traits and the corresponding quantitative traits improves the statistical power in genetic discovery, leading to an average of 181% more prioritized genes in Bayesian rare variant aggregation tests.

A joint analysis of binarized BLQ traits and the corresponding quantitative traits improves the statistical power in genetic discovery.

Overall, our results highlight the critical benefits of integrating QC flags with quantitative measurements in human genetics studies. More broadly, our results underscore the benefit of considering the phenotypic extremes and population-based samples in cohort studies. (8/n)

We thank UK Biobank, its participants, amazing colleagues, as well as funding.

The iPGS browser and dataset availability

The dataset is also available in OSFramework. The polygenic score models are also available in the iPGS Browser.

Reference: Tanigawa and Kellis. Hypometric genetics: Improved power in genetic discovery by incorporating quality control flags. The American Journal of Human Genetics. 111 (2024). https://doi.org/10.1016/j.ajhg.2024.09.008 (full text)