Estimating the single nucleotide polymorphism genotype misclassification from routine double measurements in a large epidemiologic sample

Iris M. Heid, Claudia Lamina, Helmut Küchenhoff, Guido Fischer, Norman Klopp, Melanie Kolz, Harald Grallert, Caren Vollmert, Stefanie Wagner, Cornelia Huth, Julia Müller, Martina Müller, Steven Hunt, Annette Peters, Bernhard Paulweber, H. Erich Wichmann, Florian Kronenberg, Thomas Illig

Research output: Contribution to journal, Article

10 Citations (Scopus)


Estimates of genotype misclassification for single nucleotide polymorphisms (SNPs), as encountered in epidemiologic practice involving thousands of subjects, were previously lacking. The authors collected representative data on approximately 14,000 subjects from 8 studies, comprising 646,558 genotypes assessed in 2005 by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Overall discordance among 57,805 double genotypes from routine quality control was 0.36%. When different misclassification models were fitted by maximum likelihood, assuming identical misclassification for all SNPs, the estimated misclassification probabilities ranged from 0.0000 to 0.0035. The authors then applied the misclassification simulation and extrapolation (MC-SIMEX) method, for the first time on genetic data, to account for misclassification in a reanalysis of associations between SNPs in the adiponectin-encoding (APM1) gene and plasma adiponectin in 1,770 subjects. This small error had no impact on the association estimates, whereas assuming a more substantial error yielded increased estimates. This study is the first to provide large-scale epidemiologic data on SNP genotype misclassification. The estimated misclassification in this example was small and negligible for association estimates, which is reassuring and essential for detecting SNP associations. In situations with more substantial error, the presented approach, which combines duplicate genotyping with the MC-SIMEX method, is practical and helpful for quantifying the genotyping error and its impact.
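As a rough illustration of how a discordance rate from duplicate genotyping relates to a per-measurement misclassification probability, the sketch below uses a deliberately simple model that is an assumption made here, not the authors' maximum likelihood models: each of the two measurements is wrong independently with probability p, and an error lands on either of the two other genotypes with equal chance. Under that assumption the expected discordance is d = 2p - 1.5p^2, which can be inverted for p. The function name and the model are hypothetical; only the 57,805 pairs and the 0.36% discordance come from the abstract.

```python
import math


def misclassification_from_discordance(d: float) -> float:
    """Invert d = 2p - 1.5 p^2 for the smaller root p.

    Assumed toy model (not the paper's): two independent measurements,
    each wrong with probability p, errors split evenly between the two
    other genotypes. Two measurements agree if both are correct,
    (1 - p)^2, or both hit the same wrong genotype, 2 * (p / 2)^2.
    """
    if not 0.0 <= d <= 2.0 / 3.0:
        raise ValueError("discordance must lie in [0, 2/3] under this model")
    return (2.0 - math.sqrt(4.0 - 6.0 * d)) / 3.0


# Figures quoted in the abstract.
n_pairs = 57_805
discordance = 0.0036  # 0.36%

# Approximate count only: the abstract reports the rounded percentage.
approx_discordant_pairs = round(n_pairs * discordance)

p_hat = misclassification_from_discordance(discordance)
print(f"approx. discordant pairs: {approx_discordant_pairs}")
print(f"implied per-measurement misclassification: {p_hat:.4f}")
```

Under this toy model the implied probability is roughly half the discordance rate, which falls inside the 0.0000 to 0.0035 range reported for the fitted models. The models in the paper may allow different error probabilities per genotype, so the figure is only an order-of-magnitude check.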

Original language: English
Pages (from-to): 878-889
Number of pages: 12
Journal: American Journal of Epidemiology
Issue number: 8
Publication status: Published - Oct 2008
Externally published: Yes



  • Bias (epidemiology)
  • Genetics
  • Genotype
  • Likelihood functions
  • Polymorphism, single nucleotide

ASJC Scopus subject areas

  • Epidemiology

Cite this

Heid, I. M., Lamina, C., Küchenhoff, H., Fischer, G., Klopp, N., Kolz, M., Grallert, H., Vollmert, C., Wagner, S., Huth, C., Müller, J., Müller, M., Hunt, S., Peters, A., Paulweber, B., Wichmann, H. E., Kronenberg, F., & Illig, T. (2008). Estimating the single nucleotide polymorphism genotype misclassification from routine double measurements in a large epidemiologic sample. American Journal of Epidemiology, 168(8), 878-889.