Inferring genome-wide patterns of admixture in Qataris using fifty-five ancestral populations

Larsson Omberg, Jacqueline Salit, Neil Hackett, Jennifer Fuller, Rebecca Matthew, Lotfi Chouchane, Juan L. Rodriguez-Flores, Carlos Bustamante, Ronald Crystal, Jason G. Mezey

Research output: Contribution to journalArticle

23 Citations (Scopus)

Abstract

Background: Populations of the Arabian Peninsula have a complex genetic structure that reflects waves of migrations including the earliest human migrations from Africa and eastern Asia, migrations along ancient civilization trading routes and colonization history of recent centuries.Results: Here, we present a study of genome-wide admixture in this region, using 156 genotyped individuals from Qatar, a country located at the crossroads of these migration patterns. Since haplotypes of these individuals could have originated from many different populations across the world, we have developed a machine learning method "SupportMix" to infer loci-specific genomic ancestry when simultaneously analyzing many possible ancestral populations. Simulations show that SupportMix is not only more accurate than other popular admixture discovery tools but is the first admixture inference method that can efficiently scale for simultaneous analysis of 50-100 putative ancestral populations while being independent of prior demographic information.Conclusions: By simultaneously using the 55 world populations from the Human Genome Diversity Panel, SupportMix was able to extract the fine-scale ancestry of the Qatar population, providing many new observations concerning the ancestry of the region. For example, as well as recapitulating the three major sub-populations in Qatar, composed of mainly Arabic, Persian, and African ancestry, SupportMix additionally identifies the specific ancestry of the Persian group to populations sampled in Greater Persia rather than from China and the ancestry of the African group to sub-Saharan origin and not Southern African Bantu origin as previously thought.

Original languageEnglish
Article number49
JournalBMC Genetics
Volume13
DOIs
Publication statusPublished - 26 Jun 2012

Fingerprint

Genome
Qatar
Population
Persia
Human Migration
Civilization
Far East
Genetic Structures
Human Genome
Population Groups
Haplotypes
China
Demography

Keywords

  • Admixture
  • Arabian Peninsula
  • Human migration
  • Qatar
  • Support vector machines

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Cite this

Inferring genome-wide patterns of admixture in Qataris using fifty-five ancestral populations. / Omberg, Larsson; Salit, Jacqueline; Hackett, Neil; Fuller, Jennifer; Matthew, Rebecca; Chouchane, Lotfi; Rodriguez-Flores, Juan L.; Bustamante, Carlos; Crystal, Ronald; Mezey, Jason G.

In: BMC Genetics, Vol. 13, 49, 26.06.2012.

Research output: Contribution to journalArticle

Omberg, L, Salit, J, Hackett, N, Fuller, J, Matthew, R, Chouchane, L, Rodriguez-Flores, JL, Bustamante, C, Crystal, R & Mezey, JG 2012, 'Inferring genome-wide patterns of admixture in Qataris using fifty-five ancestral populations', BMC Genetics, vol. 13, 49. https://doi.org/10.1186/1471-2156-13-49
Omberg, Larsson ; Salit, Jacqueline ; Hackett, Neil ; Fuller, Jennifer ; Matthew, Rebecca ; Chouchane, Lotfi ; Rodriguez-Flores, Juan L. ; Bustamante, Carlos ; Crystal, Ronald ; Mezey, Jason G. / Inferring genome-wide patterns of admixture in Qataris using fifty-five ancestral populations. In: BMC Genetics. 2012 ; Vol. 13.
@article{056dff346b3f4087941e1c6881d222d3,
title = "Inferring genome-wide patterns of admixture in Qataris using fifty-five ancestral populations",
abstract = "Background: Populations of the Arabian Peninsula have a complex genetic structure that reflects waves of migrations including the earliest human migrations from Africa and eastern Asia, migrations along ancient civilization trading routes and colonization history of recent centuries.Results: Here, we present a study of genome-wide admixture in this region, using 156 genotyped individuals from Qatar, a country located at the crossroads of these migration patterns. Since haplotypes of these individuals could have originated from many different populations across the world, we have developed a machine learning method {"}SupportMix{"} to infer loci-specific genomic ancestry when simultaneously analyzing many possible ancestral populations. Simulations show that SupportMix is not only more accurate than other popular admixture discovery tools but is the first admixture inference method that can efficiently scale for simultaneous analysis of 50-100 putative ancestral populations while being independent of prior demographic information.Conclusions: By simultaneously using the 55 world populations from the Human Genome Diversity Panel, SupportMix was able to extract the fine-scale ancestry of the Qatar population, providing many new observations concerning the ancestry of the region. For example, as well as recapitulating the three major sub-populations in Qatar, composed of mainly Arabic, Persian, and African ancestry, SupportMix additionally identifies the specific ancestry of the Persian group to populations sampled in Greater Persia rather than from China and the ancestry of the African group to sub-Saharan origin and not Southern African Bantu origin as previously thought.",
keywords = "Admixture, Arabian Peninsula, Human migration, Qatar, Support vector machines",
author = "Larsson Omberg and Jacqueline Salit and Neil Hackett and Jennifer Fuller and Rebecca Matthew and Lotfi Chouchane and Rodriguez-Flores, {Juan L.} and Carlos Bustamante and Ronald Crystal and Mezey, {Jason G.}",
year = "2012",
month = "6",
day = "26",
doi = "10.1186/1471-2156-13-49",
language = "English",
volume = "13",
journal = "BMC Genetics",
issn = "1471-2156",
publisher = "BioMed Central",

}

TY - JOUR

T1 - Inferring genome-wide patterns of admixture in Qataris using fifty-five ancestral populations

AU - Omberg, Larsson

AU - Salit, Jacqueline

AU - Hackett, Neil

AU - Fuller, Jennifer

AU - Matthew, Rebecca

AU - Chouchane, Lotfi

AU - Rodriguez-Flores, Juan L.

AU - Bustamante, Carlos

AU - Crystal, Ronald

AU - Mezey, Jason G.

PY - 2012/6/26

Y1 - 2012/6/26

N2 - Background: Populations of the Arabian Peninsula have a complex genetic structure that reflects waves of migrations including the earliest human migrations from Africa and eastern Asia, migrations along ancient civilization trading routes and colonization history of recent centuries.Results: Here, we present a study of genome-wide admixture in this region, using 156 genotyped individuals from Qatar, a country located at the crossroads of these migration patterns. Since haplotypes of these individuals could have originated from many different populations across the world, we have developed a machine learning method "SupportMix" to infer loci-specific genomic ancestry when simultaneously analyzing many possible ancestral populations. Simulations show that SupportMix is not only more accurate than other popular admixture discovery tools but is the first admixture inference method that can efficiently scale for simultaneous analysis of 50-100 putative ancestral populations while being independent of prior demographic information.Conclusions: By simultaneously using the 55 world populations from the Human Genome Diversity Panel, SupportMix was able to extract the fine-scale ancestry of the Qatar population, providing many new observations concerning the ancestry of the region. For example, as well as recapitulating the three major sub-populations in Qatar, composed of mainly Arabic, Persian, and African ancestry, SupportMix additionally identifies the specific ancestry of the Persian group to populations sampled in Greater Persia rather than from China and the ancestry of the African group to sub-Saharan origin and not Southern African Bantu origin as previously thought.

AB - Background: Populations of the Arabian Peninsula have a complex genetic structure that reflects waves of migrations including the earliest human migrations from Africa and eastern Asia, migrations along ancient civilization trading routes and colonization history of recent centuries.Results: Here, we present a study of genome-wide admixture in this region, using 156 genotyped individuals from Qatar, a country located at the crossroads of these migration patterns. Since haplotypes of these individuals could have originated from many different populations across the world, we have developed a machine learning method "SupportMix" to infer loci-specific genomic ancestry when simultaneously analyzing many possible ancestral populations. Simulations show that SupportMix is not only more accurate than other popular admixture discovery tools but is the first admixture inference method that can efficiently scale for simultaneous analysis of 50-100 putative ancestral populations while being independent of prior demographic information.Conclusions: By simultaneously using the 55 world populations from the Human Genome Diversity Panel, SupportMix was able to extract the fine-scale ancestry of the Qatar population, providing many new observations concerning the ancestry of the region. For example, as well as recapitulating the three major sub-populations in Qatar, composed of mainly Arabic, Persian, and African ancestry, SupportMix additionally identifies the specific ancestry of the Persian group to populations sampled in Greater Persia rather than from China and the ancestry of the African group to sub-Saharan origin and not Southern African Bantu origin as previously thought.

KW - Admixture

KW - Arabian Peninsula

KW - Human migration

KW - Qatar

KW - Support vector machines

UR - http://www.scopus.com/inward/record.url?scp=84862678584&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84862678584&partnerID=8YFLogxK

U2 - 10.1186/1471-2156-13-49

DO - 10.1186/1471-2156-13-49

M3 - Article

VL - 13

JO - BMC Genetics

JF - BMC Genetics

SN - 1471-2156

M1 - 49

ER -