Human BAC ends quality assessment and sequence analyses

Shaying Zhao, Joel Malek, Gregory Mahairas, Lily Fu, William Nierman, J. Craig Venter, Mark D. Adams

Research output: Contribution to journalArticle

33 Citations (Scopus)


End sequences from bacterial artificial chromosomes (BACs) provide highly specific sequence markers in large-scale sequencing projects. To date, we have generated >300,000 end sequences from >186,000 human BAC clones with an average read length of >460 bp for a total of 141 Mb covering ~4.7% of the genome. Over 60% of the clones have BAC end sequences (BESs) from both ends representing more than fivefold coverage of the human genome by the paired-end clones. Our quality assessments and sequence analyses indicate that BESs from human BAC libraries developed at The California Institute of Technology (CalTech) and Roswell Park Cancer Institute have similar properties. The analyses have high-lighted differences in insert size for different segments of the CalTech library. Problems with the fidelity of tracking of sequence data back to physical clones have been observed in some subsets of the overall BES dataset. The annotation results of BESs for the contents of available genomic sequences, sequence tagged sites, expressed sequence tags, protein encoding regions, and repeats indicate that this resource will be valuable in many areas of genome research. (C) 2000 Academic Press.

Original languageEnglish
Pages (from-to)321-332
Number of pages12
Issue number3
Publication statusPublished - 1 Feb 2000
Externally publishedYes


ASJC Scopus subject areas

  • Genetics

Cite this

Zhao, S., Malek, J., Mahairas, G., Fu, L., Nierman, W., Venter, J. C., & Adams, M. D. (2000). Human BAC ends quality assessment and sequence analyses. Genomics, 63(3), 321-332.