Review of General Algorithmic Features for Genome Assemblers for Next Generation Sequencers

Bilal Wajid, Erchin Serpedin

Research output: Contribution to journalReview article

16 Citations (Scopus)


In the realm of bioinformatics and computational biology, the most rudimentary data upon which all the analysis is built is the sequence data of genes, proteins and RNA. The sequence data of the entire genome is the solution to the genome assembly problem. The scope of this contribution is to provide an overview on the art of problem-solving applied within the domain of genome assembly in the next-generation sequencing (NGS) platforms. This article discusses the major genome assemblers that were proposed in the literature during the past decade by outlining their basic working principles. It is intended to act as a qualitative, not a quantitative, tutorial to all working on genome assemblers pertaining to the next generation of sequencers. We discuss the theoretical aspects of various genome assemblers, identifying their working schemes. We also discuss briefly the direction in which the area is headed towards along with discussing core issues on software simplicity.

Original languageEnglish
Pages (from-to)58-73
Number of pages16
JournalGenomics, Proteomics and Bioinformatics
Issue number2
Publication statusPublished - 1 Apr 2012



  • Comparative assembly
  • De Bruijn graphs
  • De novo assembly
  • Genome assembly
  • Next-generation sequencing

ASJC Scopus subject areas

  • Biochemistry
  • Molecular Biology
  • Genetics
  • Computational Mathematics

Cite this