Assessment of de novo assemblers for draft genomes

A case study with fungal genomes

Mostafa Abbas, Qutaibah M. Malluhi, Ponnuraman Balakrishnan

Research output: Contribution to journalArticle

8 Citations (Scopus)

Abstract

Background: Recently, large bio-projects dealing with the release of different genomes have transpired. Most of these projects use next-generation sequencing platforms. As a consequence, many de novo assembly tools have evolved to assemble the reads generated by these platforms. Each tool has its own inherent advantages and disadvantages, which make the selection of an appropriate tool a challenging task. Results: We have evaluated the performance of frequently used de novo assemblers namely ABySS, IDBA-UD, Minia, SOAP, SPAdes, Sparse, and Velvet. These assemblers are assessed based on their output quality during the assembly process conducted over fungal data. We compared the performance of these assemblers by considering both computational as well as quality metrics. By analyzing these performance metrics, the assemblers are ranked and a procedure for choosing the candidate assembler is illustrated. Conclusions: In this study, we propose an assessment method for the selection of de novo assemblers by considering their computational as well as quality metrics at the draft genome level. We divide the quality metrics into three groups: g1 measures the goodness of the assemblies, g2 measures the problems of the assemblies, and g3 measures the conservation elements in the assemblies. Our results demonstrate that the assemblers ABySS and IDBA-UD exhibit a good performance for the studied data from fungal genomes in terms of running time, memory, and quality. The results suggest that whole genome shotgun sequencing projects should make use of different assemblers by considering their merits.

Original languageEnglish
Article numberS10
JournalBMC Genomics
Volume15
DOIs
Publication statusPublished - 8 Dec 2014
Externally publishedYes

Fingerprint

Fungal Genome
Genome
Firearms

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Cite this

Assessment of de novo assemblers for draft genomes : A case study with fungal genomes. / Abbas, Mostafa; Malluhi, Qutaibah M.; Balakrishnan, Ponnuraman.

In: BMC Genomics, Vol. 15, S10, 08.12.2014.

Research output: Contribution to journalArticle

Abbas, Mostafa ; Malluhi, Qutaibah M. ; Balakrishnan, Ponnuraman. / Assessment of de novo assemblers for draft genomes : A case study with fungal genomes. In: BMC Genomics. 2014 ; Vol. 15.
@article{94018154bd584c5385f3c61294236835,
title = "Assessment of de novo assemblers for draft genomes: A case study with fungal genomes",
abstract = "Background: Recently, large bio-projects dealing with the release of different genomes have transpired. Most of these projects use next-generation sequencing platforms. As a consequence, many de novo assembly tools have evolved to assemble the reads generated by these platforms. Each tool has its own inherent advantages and disadvantages, which make the selection of an appropriate tool a challenging task. Results: We have evaluated the performance of frequently used de novo assemblers namely ABySS, IDBA-UD, Minia, SOAP, SPAdes, Sparse, and Velvet. These assemblers are assessed based on their output quality during the assembly process conducted over fungal data. We compared the performance of these assemblers by considering both computational as well as quality metrics. By analyzing these performance metrics, the assemblers are ranked and a procedure for choosing the candidate assembler is illustrated. Conclusions: In this study, we propose an assessment method for the selection of de novo assemblers by considering their computational as well as quality metrics at the draft genome level. We divide the quality metrics into three groups: g1 measures the goodness of the assemblies, g2 measures the problems of the assemblies, and g3 measures the conservation elements in the assemblies. Our results demonstrate that the assemblers ABySS and IDBA-UD exhibit a good performance for the studied data from fungal genomes in terms of running time, memory, and quality. The results suggest that whole genome shotgun sequencing projects should make use of different assemblers by considering their merits.",
author = "Mostafa Abbas and Malluhi, {Qutaibah M.} and Ponnuraman Balakrishnan",
year = "2014",
month = "12",
day = "8",
doi = "10.1186/1471-2164-15-S9-S10",
language = "English",
volume = "15",
journal = "BMC Genomics",
issn = "1471-2164",
publisher = "BioMed Central",

}

TY - JOUR

T1 - Assessment of de novo assemblers for draft genomes

T2 - A case study with fungal genomes

AU - Abbas, Mostafa

AU - Malluhi, Qutaibah M.

AU - Balakrishnan, Ponnuraman

PY - 2014/12/8

Y1 - 2014/12/8

N2 - Background: Recently, large bio-projects dealing with the release of different genomes have transpired. Most of these projects use next-generation sequencing platforms. As a consequence, many de novo assembly tools have evolved to assemble the reads generated by these platforms. Each tool has its own inherent advantages and disadvantages, which make the selection of an appropriate tool a challenging task. Results: We have evaluated the performance of frequently used de novo assemblers namely ABySS, IDBA-UD, Minia, SOAP, SPAdes, Sparse, and Velvet. These assemblers are assessed based on their output quality during the assembly process conducted over fungal data. We compared the performance of these assemblers by considering both computational as well as quality metrics. By analyzing these performance metrics, the assemblers are ranked and a procedure for choosing the candidate assembler is illustrated. Conclusions: In this study, we propose an assessment method for the selection of de novo assemblers by considering their computational as well as quality metrics at the draft genome level. We divide the quality metrics into three groups: g1 measures the goodness of the assemblies, g2 measures the problems of the assemblies, and g3 measures the conservation elements in the assemblies. Our results demonstrate that the assemblers ABySS and IDBA-UD exhibit a good performance for the studied data from fungal genomes in terms of running time, memory, and quality. The results suggest that whole genome shotgun sequencing projects should make use of different assemblers by considering their merits.

AB - Background: Recently, large bio-projects dealing with the release of different genomes have transpired. Most of these projects use next-generation sequencing platforms. As a consequence, many de novo assembly tools have evolved to assemble the reads generated by these platforms. Each tool has its own inherent advantages and disadvantages, which make the selection of an appropriate tool a challenging task. Results: We have evaluated the performance of frequently used de novo assemblers namely ABySS, IDBA-UD, Minia, SOAP, SPAdes, Sparse, and Velvet. These assemblers are assessed based on their output quality during the assembly process conducted over fungal data. We compared the performance of these assemblers by considering both computational as well as quality metrics. By analyzing these performance metrics, the assemblers are ranked and a procedure for choosing the candidate assembler is illustrated. Conclusions: In this study, we propose an assessment method for the selection of de novo assemblers by considering their computational as well as quality metrics at the draft genome level. We divide the quality metrics into three groups: g1 measures the goodness of the assemblies, g2 measures the problems of the assemblies, and g3 measures the conservation elements in the assemblies. Our results demonstrate that the assemblers ABySS and IDBA-UD exhibit a good performance for the studied data from fungal genomes in terms of running time, memory, and quality. The results suggest that whole genome shotgun sequencing projects should make use of different assemblers by considering their merits.

UR - http://www.scopus.com/inward/record.url?scp=84925265398&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84925265398&partnerID=8YFLogxK

U2 - 10.1186/1471-2164-15-S9-S10

DO - 10.1186/1471-2164-15-S9-S10

M3 - Article

VL - 15

JO - BMC Genomics

JF - BMC Genomics

SN - 1471-2164

M1 - S10

ER -