ALLPATHS 2: Small genomes assembled accurately and with high continuity from short paired reads

Iain MacCallum, Dariusz Przybylski, Sante Gnerre, Joshua Burton, Ilya Shlyakhter, Andreas Gnirke, Joel Malek, Kevin McKernan, Swati Ranade, Terrance P. Shea, Louise Williams, Sarah Young, Chad Nusbaum, David B. Jaffe

Research output: Contribution to journal (Article)

112 Citations (Scopus)

Abstract

We demonstrate that genome sequences approaching finished quality can be generated from short paired reads. Using 36 base (fragment) and 26 base (jumping) reads from five microbial genomes of varied GC composition and sizes up to 40 Mb, ALLPATHS2 generated assemblies with long, accurate contigs and scaffolds. Velvet and EULER-SR were less accurate. For example, for Escherichia coli, the fraction of 10-kb stretches that were perfect was 99.8% (ALLPATHS2), 68.7% (Velvet), and 42.1% (EULER-SR).
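
The accuracy figure quoted above (the fraction of 10-kb stretches that were perfect) can be illustrated with a small sketch. This is not the authors' evaluation code: it assumes the reference is cut into non-overlapping windows and that a window counts as "perfect" when it occurs exactly, on either strand, inside some assembled contig. The function names, the non-overlapping tiling, and the plain substring search are all assumptions made for illustration.

```python
from typing import Sequence


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A/C/G/T only)."""
    table = str.maketrans("ACGTacgt", "TGCAtgca")
    return seq.translate(table)[::-1]


def perfect_window_fraction(
    reference: str, contigs: Sequence[str], window: int = 10_000
) -> float:
    """Fraction of non-overlapping reference windows found exactly in a contig.

    Hypothetical helper: a window is 'perfect' if it, or its reverse
    complement, is an exact substring of at least one contig.
    """
    total = 0
    perfect = 0
    # Tile the reference with non-overlapping windows; a trailing partial
    # window shorter than `window` is ignored.
    for start in range(0, len(reference) - window + 1, window):
        chunk = reference[start:start + window]
        rc = reverse_complement(chunk)
        total += 1
        if any(chunk in contig or rc in contig for contig in contigs):
            perfect += 1
    return perfect / total if total else 0.0
```

With `window=10_000`, a reference genome string and a list of contig strings, the function returns a value between 0 and 1 that is comparable in spirit to the percentages reported in the abstract. A real evaluation would use an aligner rather than exact substring search.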

Original language: English
Article number: 103
Journal: Genome Biology
Volume: 10
Issue number: 10
DOI: https://doi.org/10.1186/gb-2009-10-10-r103
Publication status: Published - 1 Oct 2009

Fingerprint

Microbial Genome
Base Composition
Genome
Escherichia coli
jumping

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Genetics
  • Cell Biology

Cite this

MacCallum, I., Przybylski, D., Gnerre, S., Burton, J., Shlyakhter, I., Gnirke, A., ... Jaffe, D. B. (2009). ALLPATHS 2: Small genomes assembled accurately and with high continuity from short paired reads. Genome Biology, 10(10), [103]. https://doi.org/10.1186/gb-2009-10-10-r103

@article{cdcdd9c4dcb942c894debc36d55a54ff,
title = "ALLPATHS 2: Small genomes assembled accurately and with high continuity from short paired reads",
abstract = "We demonstrate that genome sequences approaching finished quality can be generated from short paired reads. Using 36 base (fragment) and 26 base (jumping) reads from five microbial genomes of varied GC composition and sizes up to 40 Mb, ALLPATHS2 generated assemblies with long, accurate contigs and scaffolds. Velvet and EULER-SR were less accurate. For example, for Escherichia coli, the fraction of 10-kb stretches that were perfect was 99.8{\%} (ALLPATHS2), 68.7{\%} (Velvet), and 42.1{\%} (EULER-SR).",
author = "Iain MacCallum and Dariusz Przybylski and Sante Gnerre and Joshua Burton and Ilya Shlyakhter and Andreas Gnirke and Joel Malek and Kevin McKernan and Swati Ranade and Shea, {Terrance P.} and Louise Williams and Sarah Young and Chad Nusbaum and Jaffe, {David B.}",
year = "2009",
month = "10",
day = "1",
doi = "10.1186/gb-2009-10-10-r103",
language = "English",
volume = "10",
journal = "Genome Biology",
issn = "1474-7596",
publisher = "BioMed Central",
number = "10",

}

TY - JOUR
T1 - ALLPATHS 2
T2 - Small genomes assembled accurately and with high continuity from short paired reads
AU - MacCallum, Iain
AU - Przybylski, Dariusz
AU - Gnerre, Sante
AU - Burton, Joshua
AU - Shlyakhter, Ilya
AU - Gnirke, Andreas
AU - Malek, Joel
AU - McKernan, Kevin
AU - Ranade, Swati
AU - Shea, Terrance P.
AU - Williams, Louise
AU - Young, Sarah
AU - Nusbaum, Chad
AU - Jaffe, David B.
PY - 2009/10/1
Y1 - 2009/10/1
N2 - We demonstrate that genome sequences approaching finished quality can be generated from short paired reads. Using 36 base (fragment) and 26 base (jumping) reads from five microbial genomes of varied GC composition and sizes up to 40 Mb, ALLPATHS2 generated assemblies with long, accurate contigs and scaffolds. Velvet and EULER-SR were less accurate. For example, for Escherichia coli, the fraction of 10-kb stretches that were perfect was 99.8% (ALLPATHS2), 68.7% (Velvet), and 42.1% (EULER-SR).
UR - http://www.scopus.com/inward/record.url?scp=75349109607&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=75349109607&partnerID=8YFLogxK
U2 - 10.1186/gb-2009-10-10-r103
DO - 10.1186/gb-2009-10-10-r103
M3 - Article
VL - 10
JO - Genome Biology
JF - Genome Biology
SN - 1474-7596
IS - 10
M1 - 103
ER -