Genetic structure of Europeans

A view from the north-east

Mari Nelis, Tõni Esko, Reedik Mägi, Fritz Zimprich, Draga Toncheva, Sena Karachanak, Tereza Piskáčková, Ivan Balaščák, Leena Peltonen, Eveliina Jakkula, Karola Rehnström, Mark Lathrop, Simon Heath, Pilar Galan, Stefan Schreiber, Thomas Meitinger, Arne Pfeufer, H. Erich Wichmann, Béla Melegh, Noémi Polgár, Daniela Toniolo, Paolo Gasparini, Pio D'Adamo, Janis Klovins, Liene Nikitina-Zake, Vaidutis Kučinskas, Jurate Kasnauskiene, Jan Lubinski, Tadeusz Debniak, Svetlana Limborska, Andrey Khrunin, Xavier P. Estivill, Raquel Rabionet, Sara Marsal, Antonio Juliá, Stylianos E. Antonarakis, Samuel Deutsch, Christelle Borel, Homa Attar, Maryline Gagnebin, Milan Macek, Michael Krawczak, Maido Remm, Andres Metspalu

Research output: Contribution to journal (article)

201 Citations (Scopus)

Abstract

Using principal component (PC) analysis, we studied the genetic constitution of 3,112 individuals from Europe as portrayed by more than 270,000 single nucleotide polymorphisms (SNPs) genotyped with the Illumina Infinium platform. In cohorts where the sample size was >100, one hundred randomly chosen samples were used for analysis to minimize the sample size effect, resulting in a total of 1,564 samples. This analysis revealed that the genetic structure of the European population correlates closely with geography. The first two PCs highlight the genetic diversity corresponding to the northwest to southeast gradient and position the populations according to their approximate geographic origin. The resulting genetic map forms a triangular structure with a) Finland, b) the Baltic region, Poland and Western Russia, and c) Italy as its vertices, and with d) Central and Western Europe in its centre. Inter- and intra-population genetic differences were quantified by the inflation factor lambda (λ) (ranging from 1.00 to 4.21), the fixation index (Fst) (ranging from 0.000 to 0.023), and the number of markers exhibiting significant allele frequency differences in pairwise population comparisons. The estimated lambda was used to assess the real diminishing impact on association statistics when two distinct populations are merged directly in an analysis. When the PC analysis was confined to the 1,019 Estonian individuals (0.1% of the Estonian population), a fine structure emerged that correlated with the geography of individual counties. With at least two cohorts available from several countries, genetic substructures were investigated in Czech, Finnish, German, Estonian and Italian populations. Together with previously published data, our results allow the creation of a comprehensive European genetic map that will greatly facilitate inter-population genetic studies, including genome-wide association studies (GWAS).
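
As an illustration of the inflation factor mentioned in the abstract, the following is a minimal sketch of how a genomic-control style lambda could be computed between two populations. It assumes the common definition (median of per-SNP 1-df chi-square statistics divided by the median of the chi-square distribution with one degree of freedom, about 0.455) and a plain allelic 2x2 test; the abstract does not state which test the authors used, and the function names and input layout here are hypothetical.

```python
import statistics

# Median of the chi-square distribution with 1 degree of freedom.
CHI2_1DF_MEDIAN = 0.4549364


def allelic_chi2(a1, n1, a2, n2):
    """1-df chi-square statistic for a 2x2 table of allele counts.

    a1, a2: counts of one allele in populations 1 and 2.
    n1, n2: total allele counts (2 x number of individuals) per population.
    """
    b1, b2 = n1 - a1, n2 - a2          # counts of the other allele
    col_a, col_b = a1 + a2, b1 + b2    # allele totals across populations
    if col_a == 0 or col_b == 0:
        return 0.0                     # monomorphic marker: no information
    total = n1 + n2
    return total * (a1 * b2 - a2 * b1) ** 2 / (n1 * n2 * col_a * col_b)


def inflation_factor(snps):
    """Lambda for an iterable of (a1, n1, a2, n2) tuples, one per SNP."""
    stats = [allelic_chi2(*snp) for snp in snps]
    return statistics.median(stats) / CHI2_1DF_MEDIAN
```

Under this definition, two samples drawn from the same population are expected to give a lambda close to 1, and larger values indicate that merging the two populations directly would inflate association statistics.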

Original language: English
Article number: e5472
Journal: PLoS One
Volume: 4
Issue number: 5
DOI: https://doi.org/10.1371/journal.pone.0005472
Publication status: Published - 8 May 2009
Externally published: Yes

Fingerprint

Genetic Structures
Principal Component Analysis
Polymorphism
Population
Geography
Nucleotides
Genes
Population Genetics
Statistics
Sample Size
Sampling
Inflation
Genome-Wide Association Study
Russia

ASJC Scopus subject areas

  • Medicine(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)

Cite this

Nelis, M., Esko, T., Mägi, R., Zimprich, F., Toncheva, D., Karachanak, S., ... Metspalu, A. (2009). Genetic structure of Europeans: A view from the north-east. PLoS One, 4(5), Article e5472. https://doi.org/10.1371/journal.pone.0005472

@article{4461809d8b3e4ef8825d1f39c6f3df36,
title = "Genetic structure of {E}uropeans: A view from the north-east",
abstract = "Using principal component (PC) analysis, we studied the genetic constitution of 3,112 individuals from Europe as portrayed by more than 270,000 single nucleotide polymorphisms (SNPs) genotyped with the Illumina Infinium platform. In cohorts where the sample size was >100, one hundred randomly chosen samples were used for analysis to minimize the sample size effect, resulting in a total of 1,564 samples. This analysis revealed that the genetic structure of the European population correlates closely with geography. The first two PCs highlight the genetic diversity corresponding to the northwest to southeast gradient and position the populations according to their approximate geographic origin. The resulting genetic map forms a triangular structure with a) Finland, b) the Baltic region, Poland and Western Russia, and c) Italy as its vertices, and with d) Central and Western Europe in its centre. Inter- and intra-population genetic differences were quantified by the inflation factor lambda (λ) (ranging from 1.00 to 4.21), the fixation index (Fst) (ranging from 0.000 to 0.023), and the number of markers exhibiting significant allele frequency differences in pairwise population comparisons. The estimated lambda was used to assess the real diminishing impact on association statistics when two distinct populations are merged directly in an analysis. When the PC analysis was confined to the 1,019 Estonian individuals (0.1{\%} of the Estonian population), a fine structure emerged that correlated with the geography of individual counties. With at least two cohorts available from several countries, genetic substructures were investigated in Czech, Finnish, German, Estonian and Italian populations. Together with previously published data, our results allow the creation of a comprehensive European genetic map that will greatly facilitate inter-population genetic studies, including genome-wide association studies (GWAS).",
author = "Mari Nelis and T{\~o}ni Esko and Reedik M{\"a}gi and Fritz Zimprich and Draga Toncheva and Sena Karachanak and Tereza Pisk{\'a}{\v{c}}kov{\'a} and Ivan Bala{\v{s}}{\v{c}}{\'a}k and Leena Peltonen and Eveliina Jakkula and Karola Rehnstr{\"o}m and Mark Lathrop and Simon Heath and Pilar Galan and Stefan Schreiber and Thomas Meitinger and Arne Pfeufer and Wichmann, {H. Erich} and B{\'e}la Melegh and No{\'e}mi Polg{\'a}r and Daniela Toniolo and Paolo Gasparini and Pio D'Adamo and Janis Klovins and Liene Nikitina-Zake and Vaidutis Ku{\v{c}}inskas and Jurate Kasnauskiene and Jan Lubinski and Tadeusz Debniak and Svetlana Limborska and Andrey Khrunin and Estivill, {Xavier P.} and Raquel Rabionet and Sara Marsal and Antonio Juli{\'a} and Antonarakis, {Stylianos E.} and Samuel Deutsch and Christelle Borel and Homa Attar and Maryline Gagnebin and Milan Macek and Michael Krawczak and Maido Remm and Andres Metspalu",
year = "2009",
month = "5",
day = "8",
doi = "10.1371/journal.pone.0005472",
language = "English",
volume = "4",
journal = "PLoS One",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "5",

}

TY  - JOUR
T1  - Genetic structure of Europeans
T2  - A view from the north-east
AU  - Nelis, Mari
AU  - Esko, Tõni
AU  - Mägi, Reedik
AU  - Zimprich, Fritz
AU  - Toncheva, Draga
AU  - Karachanak, Sena
AU  - Piskáčková, Tereza
AU  - Balaščák, Ivan
AU  - Peltonen, Leena
AU  - Jakkula, Eveliina
AU  - Rehnström, Karola
AU  - Lathrop, Mark
AU  - Heath, Simon
AU  - Galan, Pilar
AU  - Schreiber, Stefan
AU  - Meitinger, Thomas
AU  - Pfeufer, Arne
AU  - Wichmann, H. Erich
AU  - Melegh, Béla
AU  - Polgár, Noémi
AU  - Toniolo, Daniela
AU  - Gasparini, Paolo
AU  - D'Adamo, Pio
AU  - Klovins, Janis
AU  - Nikitina-Zake, Liene
AU  - Kučinskas, Vaidutis
AU  - Kasnauskiene, Jurate
AU  - Lubinski, Jan
AU  - Debniak, Tadeusz
AU  - Limborska, Svetlana
AU  - Khrunin, Andrey
AU  - Estivill, Xavier P.
AU  - Rabionet, Raquel
AU  - Marsal, Sara
AU  - Juliá, Antonio
AU  - Antonarakis, Stylianos E.
AU  - Deutsch, Samuel
AU  - Borel, Christelle
AU  - Attar, Homa
AU  - Gagnebin, Maryline
AU  - Macek, Milan
AU  - Krawczak, Michael
AU  - Remm, Maido
AU  - Metspalu, Andres
PY  - 2009/5/8
Y1  - 2009/5/8
AB  - Using principal component (PC) analysis, we studied the genetic constitution of 3,112 individuals from Europe as portrayed by more than 270,000 single nucleotide polymorphisms (SNPs) genotyped with the Illumina Infinium platform. In cohorts where the sample size was >100, one hundred randomly chosen samples were used for analysis to minimize the sample size effect, resulting in a total of 1,564 samples. This analysis revealed that the genetic structure of the European population correlates closely with geography. The first two PCs highlight the genetic diversity corresponding to the northwest to southeast gradient and position the populations according to their approximate geographic origin. The resulting genetic map forms a triangular structure with a) Finland, b) the Baltic region, Poland and Western Russia, and c) Italy as its vertices, and with d) Central and Western Europe in its centre. Inter- and intra-population genetic differences were quantified by the inflation factor lambda (λ) (ranging from 1.00 to 4.21), the fixation index (Fst) (ranging from 0.000 to 0.023), and the number of markers exhibiting significant allele frequency differences in pairwise population comparisons. The estimated lambda was used to assess the real diminishing impact on association statistics when two distinct populations are merged directly in an analysis. When the PC analysis was confined to the 1,019 Estonian individuals (0.1% of the Estonian population), a fine structure emerged that correlated with the geography of individual counties. With at least two cohorts available from several countries, genetic substructures were investigated in Czech, Finnish, German, Estonian and Italian populations. Together with previously published data, our results allow the creation of a comprehensive European genetic map that will greatly facilitate inter-population genetic studies, including genome-wide association studies (GWAS).
UR  - http://www.scopus.com/inward/record.url?scp=65549126774&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=65549126774&partnerID=8YFLogxK
U2  - 10.1371/journal.pone.0005472
DO  - 10.1371/journal.pone.0005472
M3  - Article
VL  - 4
JO  - PLoS One
JF  - PLoS One
SN  - 1932-6203
IS  - 5
M1  - e5472
ER  - 