Human BAC ends quality assessment and sequence analyses

Shaying Zhao, Joel Malek, Gregory Mahairas, Lily Fu, William Nierman, J. Craig Venter, Mark D. Adams

Research output: Contribution to journalArticle

33 Citations (Scopus)

Abstract

End sequences from bacterial artificial chromosomes (BACs) provide highly specific sequence markers in large-scale sequencing projects. To date, we have generated >300,000 end sequences from >186,000 human BAC clones with an average read length of >460 bp for a total of 141 Mb covering ~4.7% of the genome. Over 60% of the clones have BAC end sequences (BESs) from both ends representing more than fivefold coverage of the human genome by the paired-end clones. Our quality assessments and sequence analyses indicate that BESs from human BAC libraries developed at The California Institute of Technology (CalTech) and Roswell Park Cancer Institute have similar properties. The analyses have high-lighted differences in insert size for different segments of the CalTech library. Problems with the fidelity of tracking of sequence data back to physical clones have been observed in some subsets of the overall BES dataset. The annotation results of BESs for the contents of available genomic sequences, sequence tagged sites, expressed sequence tags, protein encoding regions, and repeats indicate that this resource will be valuable in many areas of genome research. (C) 2000 Academic Press.

Original languageEnglish
Pages (from-to)321-332
Number of pages12
JournalGenomics
Volume63
Issue number3
DOIs
Publication statusPublished - 1 Feb 2000
Externally publishedYes

Fingerprint

Human Artificial Chromosomes
Bacterial Artificial Chromosomes
Sequence Analysis
Clone Cells
Sequence Tagged Sites
Genome
Technology
Expressed Sequence Tags
Human Genome
Libraries
Research
Neoplasms
Proteins

ASJC Scopus subject areas

  • Genetics

Cite this

Zhao, S., Malek, J., Mahairas, G., Fu, L., Nierman, W., Venter, J. C., & Adams, M. D. (2000). Human BAC ends quality assessment and sequence analyses. Genomics, 63(3), 321-332. https://doi.org/10.1006/geno.1999.6082

Human BAC ends quality assessment and sequence analyses. / Zhao, Shaying; Malek, Joel; Mahairas, Gregory; Fu, Lily; Nierman, William; Venter, J. Craig; Adams, Mark D.

In: Genomics, Vol. 63, No. 3, 01.02.2000, p. 321-332.

Research output: Contribution to journalArticle

Zhao, S, Malek, J, Mahairas, G, Fu, L, Nierman, W, Venter, JC & Adams, MD 2000, 'Human BAC ends quality assessment and sequence analyses', Genomics, vol. 63, no. 3, pp. 321-332. https://doi.org/10.1006/geno.1999.6082
Zhao S, Malek J, Mahairas G, Fu L, Nierman W, Venter JC et al. Human BAC ends quality assessment and sequence analyses. Genomics. 2000 Feb 1;63(3):321-332. https://doi.org/10.1006/geno.1999.6082
Zhao, Shaying ; Malek, Joel ; Mahairas, Gregory ; Fu, Lily ; Nierman, William ; Venter, J. Craig ; Adams, Mark D. / Human BAC ends quality assessment and sequence analyses. In: Genomics. 2000 ; Vol. 63, No. 3. pp. 321-332.
@article{30c1e0fd1e5e4608b7cfc5d91d0cf97e,
title = "Human BAC ends quality assessment and sequence analyses",
abstract = "End sequences from bacterial artificial chromosomes (BACs) provide highly specific sequence markers in large-scale sequencing projects. To date, we have generated >300,000 end sequences from >186,000 human BAC clones with an average read length of >460 bp for a total of 141 Mb covering ~4.7{\%} of the genome. Over 60{\%} of the clones have BAC end sequences (BESs) from both ends representing more than fivefold coverage of the human genome by the paired-end clones. Our quality assessments and sequence analyses indicate that BESs from human BAC libraries developed at The California Institute of Technology (CalTech) and Roswell Park Cancer Institute have similar properties. The analyses have high-lighted differences in insert size for different segments of the CalTech library. Problems with the fidelity of tracking of sequence data back to physical clones have been observed in some subsets of the overall BES dataset. The annotation results of BESs for the contents of available genomic sequences, sequence tagged sites, expressed sequence tags, protein encoding regions, and repeats indicate that this resource will be valuable in many areas of genome research. (C) 2000 Academic Press.",
author = "Shaying Zhao and Joel Malek and Gregory Mahairas and Lily Fu and William Nierman and Venter, {J. Craig} and Adams, {Mark D.}",
year = "2000",
month = "2",
day = "1",
doi = "10.1006/geno.1999.6082",
language = "English",
volume = "63",
pages = "321--332",
journal = "Genomics",
issn = "0888-7543",
publisher = "Academic Press Inc.",
number = "3",

}

TY - JOUR

T1 - Human BAC ends quality assessment and sequence analyses

AU - Zhao, Shaying

AU - Malek, Joel

AU - Mahairas, Gregory

AU - Fu, Lily

AU - Nierman, William

AU - Venter, J. Craig

AU - Adams, Mark D.

PY - 2000/2/1

Y1 - 2000/2/1

N2 - End sequences from bacterial artificial chromosomes (BACs) provide highly specific sequence markers in large-scale sequencing projects. To date, we have generated >300,000 end sequences from >186,000 human BAC clones with an average read length of >460 bp for a total of 141 Mb covering ~4.7% of the genome. Over 60% of the clones have BAC end sequences (BESs) from both ends representing more than fivefold coverage of the human genome by the paired-end clones. Our quality assessments and sequence analyses indicate that BESs from human BAC libraries developed at The California Institute of Technology (CalTech) and Roswell Park Cancer Institute have similar properties. The analyses have high-lighted differences in insert size for different segments of the CalTech library. Problems with the fidelity of tracking of sequence data back to physical clones have been observed in some subsets of the overall BES dataset. The annotation results of BESs for the contents of available genomic sequences, sequence tagged sites, expressed sequence tags, protein encoding regions, and repeats indicate that this resource will be valuable in many areas of genome research. (C) 2000 Academic Press.

AB - End sequences from bacterial artificial chromosomes (BACs) provide highly specific sequence markers in large-scale sequencing projects. To date, we have generated >300,000 end sequences from >186,000 human BAC clones with an average read length of >460 bp for a total of 141 Mb covering ~4.7% of the genome. Over 60% of the clones have BAC end sequences (BESs) from both ends representing more than fivefold coverage of the human genome by the paired-end clones. Our quality assessments and sequence analyses indicate that BESs from human BAC libraries developed at The California Institute of Technology (CalTech) and Roswell Park Cancer Institute have similar properties. The analyses have high-lighted differences in insert size for different segments of the CalTech library. Problems with the fidelity of tracking of sequence data back to physical clones have been observed in some subsets of the overall BES dataset. The annotation results of BESs for the contents of available genomic sequences, sequence tagged sites, expressed sequence tags, protein encoding regions, and repeats indicate that this resource will be valuable in many areas of genome research. (C) 2000 Academic Press.

UR - http://www.scopus.com/inward/record.url?scp=0034143651&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034143651&partnerID=8YFLogxK

U2 - 10.1006/geno.1999.6082

DO - 10.1006/geno.1999.6082

M3 - Article

VL - 63

SP - 321

EP - 332

JO - Genomics

JF - Genomics

SN - 0888-7543

IS - 3

ER -