Mining the Unknown: A Systems Approach to Metabolite Identification Combining Genetic and Metabolic Information

Jan Krumsiek, Karsten Suhre, Anne M. Evans, Matthew W. Mitchell, Robert P. Mohney, Michael V. Milburn, Brigitte Wägele, Werner Römisch-Margl, Thomas Illig, Jerzy Adamski, Christian Gieger, Fabian J. Theis, Gabi Kastenmüller

Research output: Contribution to journalArticle

88 Citations (Scopus)

Abstract

Recent genome-wide association studies (GWAS) with metabolomics data linked genetic variation in the human genome to differences in individual metabolite levels. A strong relevance of this metabolic individuality for biomedical and pharmaceutical research has been reported. However, a considerable amount of the molecules currently quantified by modern metabolomics techniques are chemically unidentified. The identification of these "unknown metabolites" is still a demanding and intricate task, limiting their usability as functional markers of metabolic processes. As a consequence, previous GWAS largely ignored unknown metabolites as metabolic traits for the analysis. Here we present a systems-level approach that combines genome-wide association analysis and Gaussian graphical modeling with metabolomics to predict the identity of the unknown metabolites. We apply our method to original data of 517 metabolic traits, of which 225 are unknowns, and genotyping information on 655,658 genetic variants, measured in 1,768 human blood samples. We report previously undescribed genotype-metabotype associations for six distinct gene loci (SLC22A2, COMT, CYP3A5, CYP2C18, GBA3, UGT3A1) and one locus not related to any known gene (rs12413935). Overlaying the inferred genetic associations, metabolic networks, and knowledge-based pathway information, we derive testable hypotheses on the biochemical identities of 106 unknown metabolites. As a proof of principle, we experimentally confirm nine concrete predictions. We demonstrate the benefit of our method for the functional interpretation of previous metabolomics biomarker studies on liver detoxification, hypertension, and insulin resistance. Our approach is generic in nature and can be directly transferred to metabolomics data from different experimental platforms.

Original languageEnglish
Article numbere1003005
JournalPLoS Genetics
Volume8
Issue number10
DOIs
Publication statusPublished - Oct 2012
Externally publishedYes

Fingerprint

Metabolomics
metabolomics
Systems Analysis
metabolite
metabolites
Genome-Wide Association Study
genome
Individuality
Cytochrome P-450 CYP3A
hypertension
loci
gene
Human Genome
detoxification
Metabolic Networks and Pathways
insulin resistance
genotyping
Genes
Insulin Resistance
biomarker

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Molecular Biology
  • Genetics
  • Genetics(clinical)
  • Cancer Research

Cite this

Krumsiek, J., Suhre, K., Evans, A. M., Mitchell, M. W., Mohney, R. P., Milburn, M. V., ... Kastenmüller, G. (2012). Mining the Unknown: A Systems Approach to Metabolite Identification Combining Genetic and Metabolic Information. PLoS Genetics, 8(10), [e1003005]. https://doi.org/10.1371/journal.pgen.1003005

Mining the Unknown : A Systems Approach to Metabolite Identification Combining Genetic and Metabolic Information. / Krumsiek, Jan; Suhre, Karsten; Evans, Anne M.; Mitchell, Matthew W.; Mohney, Robert P.; Milburn, Michael V.; Wägele, Brigitte; Römisch-Margl, Werner; Illig, Thomas; Adamski, Jerzy; Gieger, Christian; Theis, Fabian J.; Kastenmüller, Gabi.

In: PLoS Genetics, Vol. 8, No. 10, e1003005, 10.2012.

Research output: Contribution to journalArticle

Krumsiek, J, Suhre, K, Evans, AM, Mitchell, MW, Mohney, RP, Milburn, MV, Wägele, B, Römisch-Margl, W, Illig, T, Adamski, J, Gieger, C, Theis, FJ & Kastenmüller, G 2012, 'Mining the Unknown: A Systems Approach to Metabolite Identification Combining Genetic and Metabolic Information', PLoS Genetics, vol. 8, no. 10, e1003005. https://doi.org/10.1371/journal.pgen.1003005
Krumsiek, Jan ; Suhre, Karsten ; Evans, Anne M. ; Mitchell, Matthew W. ; Mohney, Robert P. ; Milburn, Michael V. ; Wägele, Brigitte ; Römisch-Margl, Werner ; Illig, Thomas ; Adamski, Jerzy ; Gieger, Christian ; Theis, Fabian J. ; Kastenmüller, Gabi. / Mining the Unknown : A Systems Approach to Metabolite Identification Combining Genetic and Metabolic Information. In: PLoS Genetics. 2012 ; Vol. 8, No. 10.
@article{c0ebce6af63541eb9b2b20648a8d59e2,
title = "Mining the Unknown: A Systems Approach to Metabolite Identification Combining Genetic and Metabolic Information",
abstract = "Recent genome-wide association studies (GWAS) with metabolomics data linked genetic variation in the human genome to differences in individual metabolite levels. A strong relevance of this metabolic individuality for biomedical and pharmaceutical research has been reported. However, a considerable amount of the molecules currently quantified by modern metabolomics techniques are chemically unidentified. The identification of these {"}unknown metabolites{"} is still a demanding and intricate task, limiting their usability as functional markers of metabolic processes. As a consequence, previous GWAS largely ignored unknown metabolites as metabolic traits for the analysis. Here we present a systems-level approach that combines genome-wide association analysis and Gaussian graphical modeling with metabolomics to predict the identity of the unknown metabolites. We apply our method to original data of 517 metabolic traits, of which 225 are unknowns, and genotyping information on 655,658 genetic variants, measured in 1,768 human blood samples. We report previously undescribed genotype-metabotype associations for six distinct gene loci (SLC22A2, COMT, CYP3A5, CYP2C18, GBA3, UGT3A1) and one locus not related to any known gene (rs12413935). Overlaying the inferred genetic associations, metabolic networks, and knowledge-based pathway information, we derive testable hypotheses on the biochemical identities of 106 unknown metabolites. As a proof of principle, we experimentally confirm nine concrete predictions. We demonstrate the benefit of our method for the functional interpretation of previous metabolomics biomarker studies on liver detoxification, hypertension, and insulin resistance. Our approach is generic in nature and can be directly transferred to metabolomics data from different experimental platforms.",
author = "Jan Krumsiek and Karsten Suhre and Evans, {Anne M.} and Mitchell, {Matthew W.} and Mohney, {Robert P.} and Milburn, {Michael V.} and Brigitte W{\"a}gele and Werner R{\"o}misch-Margl and Thomas Illig and Jerzy Adamski and Christian Gieger and Theis, {Fabian J.} and Gabi Kastenm{\"u}ller",
year = "2012",
month = "10",
doi = "10.1371/journal.pgen.1003005",
language = "English",
volume = "8",
journal = "PLoS Genetics",
issn = "1553-7390",
publisher = "Public Library of Science",
number = "10",

}

TY - JOUR

T1 - Mining the Unknown

T2 - A Systems Approach to Metabolite Identification Combining Genetic and Metabolic Information

AU - Krumsiek, Jan

AU - Suhre, Karsten

AU - Evans, Anne M.

AU - Mitchell, Matthew W.

AU - Mohney, Robert P.

AU - Milburn, Michael V.

AU - Wägele, Brigitte

AU - Römisch-Margl, Werner

AU - Illig, Thomas

AU - Adamski, Jerzy

AU - Gieger, Christian

AU - Theis, Fabian J.

AU - Kastenmüller, Gabi

PY - 2012/10

Y1 - 2012/10

N2 - Recent genome-wide association studies (GWAS) with metabolomics data linked genetic variation in the human genome to differences in individual metabolite levels. A strong relevance of this metabolic individuality for biomedical and pharmaceutical research has been reported. However, a considerable amount of the molecules currently quantified by modern metabolomics techniques are chemically unidentified. The identification of these "unknown metabolites" is still a demanding and intricate task, limiting their usability as functional markers of metabolic processes. As a consequence, previous GWAS largely ignored unknown metabolites as metabolic traits for the analysis. Here we present a systems-level approach that combines genome-wide association analysis and Gaussian graphical modeling with metabolomics to predict the identity of the unknown metabolites. We apply our method to original data of 517 metabolic traits, of which 225 are unknowns, and genotyping information on 655,658 genetic variants, measured in 1,768 human blood samples. We report previously undescribed genotype-metabotype associations for six distinct gene loci (SLC22A2, COMT, CYP3A5, CYP2C18, GBA3, UGT3A1) and one locus not related to any known gene (rs12413935). Overlaying the inferred genetic associations, metabolic networks, and knowledge-based pathway information, we derive testable hypotheses on the biochemical identities of 106 unknown metabolites. As a proof of principle, we experimentally confirm nine concrete predictions. We demonstrate the benefit of our method for the functional interpretation of previous metabolomics biomarker studies on liver detoxification, hypertension, and insulin resistance. Our approach is generic in nature and can be directly transferred to metabolomics data from different experimental platforms.

AB - Recent genome-wide association studies (GWAS) with metabolomics data linked genetic variation in the human genome to differences in individual metabolite levels. A strong relevance of this metabolic individuality for biomedical and pharmaceutical research has been reported. However, a considerable amount of the molecules currently quantified by modern metabolomics techniques are chemically unidentified. The identification of these "unknown metabolites" is still a demanding and intricate task, limiting their usability as functional markers of metabolic processes. As a consequence, previous GWAS largely ignored unknown metabolites as metabolic traits for the analysis. Here we present a systems-level approach that combines genome-wide association analysis and Gaussian graphical modeling with metabolomics to predict the identity of the unknown metabolites. We apply our method to original data of 517 metabolic traits, of which 225 are unknowns, and genotyping information on 655,658 genetic variants, measured in 1,768 human blood samples. We report previously undescribed genotype-metabotype associations for six distinct gene loci (SLC22A2, COMT, CYP3A5, CYP2C18, GBA3, UGT3A1) and one locus not related to any known gene (rs12413935). Overlaying the inferred genetic associations, metabolic networks, and knowledge-based pathway information, we derive testable hypotheses on the biochemical identities of 106 unknown metabolites. As a proof of principle, we experimentally confirm nine concrete predictions. We demonstrate the benefit of our method for the functional interpretation of previous metabolomics biomarker studies on liver detoxification, hypertension, and insulin resistance. Our approach is generic in nature and can be directly transferred to metabolomics data from different experimental platforms.

UR - http://www.scopus.com/inward/record.url?scp=84868112190&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84868112190&partnerID=8YFLogxK

U2 - 10.1371/journal.pgen.1003005

DO - 10.1371/journal.pgen.1003005

M3 - Article

C2 - 23093944

AN - SCOPUS:84868112190

VL - 8

JO - PLoS Genetics

JF - PLoS Genetics

SN - 1553-7390

IS - 10

M1 - e1003005

ER -