Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences

Robert L. Strausberg, Elise A. Feingold, Lynette H. Grouse, Jeffery G. Derge, Richard D. Klausner, Francis S. Collins, Lukas Wagner, Carolyn M. Shenmen, Gregory D. Schuler, Stephen F. Altschul, Barry Zeeberg, Kenneth H. Buetow, Carl F. Schaefer, Narayan K. Bhat, Ralph F. Hopkins, Heather Jordan, Troy Moore, Steve I. Max, Jun Wang, Florence Hsieh & 63 others Luda Diatchenko, Kate Marusina, Andrew A. Farmer, Gerald M. Rubin, Ling Hong, Mark Stapleton, M. Bento Soares, Maria F. Bonaldo, Tom L. Casavant, Todd E. Scheetz, Michael J. Brownstein, Ted B. Usdin, Shiraki Toshiyuki, Piero Carninci, Christa Prange, Sam S. Raha, Naomi A. Loquellano, Garrick J. Peters, Rick D. Abramson, Sara J. Mullahy, Stephanie A. Bosak, Paul J. McEwan, Kevin J. McKernan, Joel Malek, Preethi H. Gunaratne, Stephen Richards, Kim C. Worley, Sarah Hale, Angela M. Garcia, Laura J. Gay, Stephen W. Hulyk, Debbie K. Villalon, Donna M. Muzny, Erica J. Sodergren, Xiuhua Lu, Richard A. Gibbs, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez, Michelle Whiting, Anup Madan, Alice C. Young, Yuriy Shevchenko, Gerard G. Bouffard, Robert W. Blakesley, Jeffrey W. Touchman, Eric D. Green, Mark C. Dickson, Alex C. Rodriguez, Jane Grimwood, Jeremy Schmutzs, Richard M. Myers, Yaron S.N. Butterfield, Martin I. Krzywinski, Ursula Skalska, Duane E. Smailus, Angelique Schnerch, Jacqueline E. Schein, Steven J.M. Jones, Marco A. Marra

Research output: Contribution to journalArticle

1375 Citations (Scopus)

Abstract

National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http://mgc.nci.nih.gov).

Original languageEnglish
Pages (from-to)16899-16903
Number of pages5
JournalProceedings of the National Academy of Sciences of the United States of America
Volume99
Issue number26
DOIs
Publication statusPublished - 24 Dec 2002
Externally publishedYes

Fingerprint

Complementary DNA
Clone Cells
Open Reading Frames
Genes
Expressed Sequence Tags
National Institutes of Health (U.S.)
Libraries
Databases

ASJC Scopus subject areas

  • General

Cite this

Strausberg, R. L., Feingold, E. A., Grouse, L. H., Derge, J. G., Klausner, R. D., Collins, F. S., ... Marra, M. A. (2002). Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. Proceedings of the National Academy of Sciences of the United States of America, 99(26), 16899-16903. https://doi.org/10.1073/pnas.242603899

Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. / Strausberg, Robert L.; Feingold, Elise A.; Grouse, Lynette H.; Derge, Jeffery G.; Klausner, Richard D.; Collins, Francis S.; Wagner, Lukas; Shenmen, Carolyn M.; Schuler, Gregory D.; Altschul, Stephen F.; Zeeberg, Barry; Buetow, Kenneth H.; Schaefer, Carl F.; Bhat, Narayan K.; Hopkins, Ralph F.; Jordan, Heather; Moore, Troy; Max, Steve I.; Wang, Jun; Hsieh, Florence; Diatchenko, Luda; Marusina, Kate; Farmer, Andrew A.; Rubin, Gerald M.; Hong, Ling; Stapleton, Mark; Soares, M. Bento; Bonaldo, Maria F.; Casavant, Tom L.; Scheetz, Todd E.; Brownstein, Michael J.; Usdin, Ted B.; Toshiyuki, Shiraki; Carninci, Piero; Prange, Christa; Raha, Sam S.; Loquellano, Naomi A.; Peters, Garrick J.; Abramson, Rick D.; Mullahy, Sara J.; Bosak, Stephanie A.; McEwan, Paul J.; McKernan, Kevin J.; Malek, Joel; Gunaratne, Preethi H.; Richards, Stephen; Worley, Kim C.; Hale, Sarah; Garcia, Angela M.; Gay, Laura J.; Hulyk, Stephen W.; Villalon, Debbie K.; Muzny, Donna M.; Sodergren, Erica J.; Lu, Xiuhua; Gibbs, Richard A.; Fahey, Jessica; Helton, Erin; Ketteman, Mark; Madan, Anuradha; Rodrigues, Stephanie; Sanchez, Amy; Whiting, Michelle; Madan, Anup; Young, Alice C.; Shevchenko, Yuriy; Bouffard, Gerard G.; Blakesley, Robert W.; Touchman, Jeffrey W.; Green, Eric D.; Dickson, Mark C.; Rodriguez, Alex C.; Grimwood, Jane; Schmutzs, Jeremy; Myers, Richard M.; Butterfield, Yaron S.N.; Krzywinski, Martin I.; Skalska, Ursula; Smailus, Duane E.; Schnerch, Angelique; Schein, Jacqueline E.; Jones, Steven J.M.; Marra, Marco A.

In: Proceedings of the National Academy of Sciences of the United States of America, Vol. 99, No. 26, 24.12.2002, p. 16899-16903.

Research output: Contribution to journalArticle

Strausberg, RL, Feingold, EA, Grouse, LH, Derge, JG, Klausner, RD, Collins, FS, Wagner, L, Shenmen, CM, Schuler, GD, Altschul, SF, Zeeberg, B, Buetow, KH, Schaefer, CF, Bhat, NK, Hopkins, RF, Jordan, H, Moore, T, Max, SI, Wang, J, Hsieh, F, Diatchenko, L, Marusina, K, Farmer, AA, Rubin, GM, Hong, L, Stapleton, M, Soares, MB, Bonaldo, MF, Casavant, TL, Scheetz, TE, Brownstein, MJ, Usdin, TB, Toshiyuki, S, Carninci, P, Prange, C, Raha, SS, Loquellano, NA, Peters, GJ, Abramson, RD, Mullahy, SJ, Bosak, SA, McEwan, PJ, McKernan, KJ, Malek, J, Gunaratne, PH, Richards, S, Worley, KC, Hale, S, Garcia, AM, Gay, LJ, Hulyk, SW, Villalon, DK, Muzny, DM, Sodergren, EJ, Lu, X, Gibbs, RA, Fahey, J, Helton, E, Ketteman, M, Madan, A, Rodrigues, S, Sanchez, A, Whiting, M, Madan, A, Young, AC, Shevchenko, Y, Bouffard, GG, Blakesley, RW, Touchman, JW, Green, ED, Dickson, MC, Rodriguez, AC, Grimwood, J, Schmutzs, J, Myers, RM, Butterfield, YSN, Krzywinski, MI, Skalska, U, Smailus, DE, Schnerch, A, Schein, JE, Jones, SJM & Marra, MA 2002, 'Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences', Proceedings of the National Academy of Sciences of the United States of America, vol. 99, no. 26, pp. 16899-16903. https://doi.org/10.1073/pnas.242603899
Strausberg, Robert L. ; Feingold, Elise A. ; Grouse, Lynette H. ; Derge, Jeffery G. ; Klausner, Richard D. ; Collins, Francis S. ; Wagner, Lukas ; Shenmen, Carolyn M. ; Schuler, Gregory D. ; Altschul, Stephen F. ; Zeeberg, Barry ; Buetow, Kenneth H. ; Schaefer, Carl F. ; Bhat, Narayan K. ; Hopkins, Ralph F. ; Jordan, Heather ; Moore, Troy ; Max, Steve I. ; Wang, Jun ; Hsieh, Florence ; Diatchenko, Luda ; Marusina, Kate ; Farmer, Andrew A. ; Rubin, Gerald M. ; Hong, Ling ; Stapleton, Mark ; Soares, M. Bento ; Bonaldo, Maria F. ; Casavant, Tom L. ; Scheetz, Todd E. ; Brownstein, Michael J. ; Usdin, Ted B. ; Toshiyuki, Shiraki ; Carninci, Piero ; Prange, Christa ; Raha, Sam S. ; Loquellano, Naomi A. ; Peters, Garrick J. ; Abramson, Rick D. ; Mullahy, Sara J. ; Bosak, Stephanie A. ; McEwan, Paul J. ; McKernan, Kevin J. ; Malek, Joel ; Gunaratne, Preethi H. ; Richards, Stephen ; Worley, Kim C. ; Hale, Sarah ; Garcia, Angela M. ; Gay, Laura J. ; Hulyk, Stephen W. ; Villalon, Debbie K. ; Muzny, Donna M. ; Sodergren, Erica J. ; Lu, Xiuhua ; Gibbs, Richard A. ; Fahey, Jessica ; Helton, Erin ; Ketteman, Mark ; Madan, Anuradha ; Rodrigues, Stephanie ; Sanchez, Amy ; Whiting, Michelle ; Madan, Anup ; Young, Alice C. ; Shevchenko, Yuriy ; Bouffard, Gerard G. ; Blakesley, Robert W. ; Touchman, Jeffrey W. ; Green, Eric D. ; Dickson, Mark C. ; Rodriguez, Alex C. ; Grimwood, Jane ; Schmutzs, Jeremy ; Myers, Richard M. ; Butterfield, Yaron S.N. ; Krzywinski, Martin I. ; Skalska, Ursula ; Smailus, Duane E. ; Schnerch, Angelique ; Schein, Jacqueline E. ; Jones, Steven J.M. ; Marra, Marco A. / Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. In: Proceedings of the National Academy of Sciences of the United States of America. 2002 ; Vol. 99, No. 26. pp. 16899-16903.
@article{c558710c214c47619924738a53e8ff66,
title = "Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences",
abstract = "National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http://mgc.nci.nih.gov).",
author = "Strausberg, {Robert L.} and Feingold, {Elise A.} and Grouse, {Lynette H.} and Derge, {Jeffery G.} and Klausner, {Richard D.} and Collins, {Francis S.} and Lukas Wagner and Shenmen, {Carolyn M.} and Schuler, {Gregory D.} and Altschul, {Stephen F.} and Barry Zeeberg and Buetow, {Kenneth H.} and Schaefer, {Carl F.} and Bhat, {Narayan K.} and Hopkins, {Ralph F.} and Heather Jordan and Troy Moore and Max, {Steve I.} and Jun Wang and Florence Hsieh and Luda Diatchenko and Kate Marusina and Farmer, {Andrew A.} and Rubin, {Gerald M.} and Ling Hong and Mark Stapleton and Soares, {M. Bento} and Bonaldo, {Maria F.} and Casavant, {Tom L.} and Scheetz, {Todd E.} and Brownstein, {Michael J.} and Usdin, {Ted B.} and Shiraki Toshiyuki and Piero Carninci and Christa Prange and Raha, {Sam S.} and Loquellano, {Naomi A.} and Peters, {Garrick J.} and Abramson, {Rick D.} and Mullahy, {Sara J.} and Bosak, {Stephanie A.} and McEwan, {Paul J.} and McKernan, {Kevin J.} and Joel Malek and Gunaratne, {Preethi H.} and Stephen Richards and Worley, {Kim C.} and Sarah Hale and Garcia, {Angela M.} and Gay, {Laura J.} and Hulyk, {Stephen W.} and Villalon, {Debbie K.} and Muzny, {Donna M.} and Sodergren, {Erica J.} and Xiuhua Lu and Gibbs, {Richard A.} and Jessica Fahey and Erin Helton and Mark Ketteman and Anuradha Madan and Stephanie Rodrigues and Amy Sanchez and Michelle Whiting and Anup Madan and Young, {Alice C.} and Yuriy Shevchenko and Bouffard, {Gerard G.} and Blakesley, {Robert W.} and Touchman, {Jeffrey W.} and Green, {Eric D.} and Dickson, {Mark C.} and Rodriguez, {Alex C.} and Jane Grimwood and Jeremy Schmutzs and Myers, {Richard M.} and Butterfield, {Yaron S.N.} and Krzywinski, {Martin I.} and Ursula Skalska and Smailus, {Duane E.} and Angelique Schnerch and Schein, {Jacqueline E.} and Jones, {Steven J.M.} and Marra, {Marco A.}",
year = "2002",
month = "12",
day = "24",
doi = "10.1073/pnas.242603899",
language = "English",
volume = "99",
pages = "16899--16903",
journal = "Proceedings of the National Academy of Sciences of the United States of America",
issn = "0027-8424",
number = "26",

}

TY - JOUR

T1 - Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences

AU - Strausberg, Robert L.

AU - Feingold, Elise A.

AU - Grouse, Lynette H.

AU - Derge, Jeffery G.

AU - Klausner, Richard D.

AU - Collins, Francis S.

AU - Wagner, Lukas

AU - Shenmen, Carolyn M.

AU - Schuler, Gregory D.

AU - Altschul, Stephen F.

AU - Zeeberg, Barry

AU - Buetow, Kenneth H.

AU - Schaefer, Carl F.

AU - Bhat, Narayan K.

AU - Hopkins, Ralph F.

AU - Jordan, Heather

AU - Moore, Troy

AU - Max, Steve I.

AU - Wang, Jun

AU - Hsieh, Florence

AU - Diatchenko, Luda

AU - Marusina, Kate

AU - Farmer, Andrew A.

AU - Rubin, Gerald M.

AU - Hong, Ling

AU - Stapleton, Mark

AU - Soares, M. Bento

AU - Bonaldo, Maria F.

AU - Casavant, Tom L.

AU - Scheetz, Todd E.

AU - Brownstein, Michael J.

AU - Usdin, Ted B.

AU - Toshiyuki, Shiraki

AU - Carninci, Piero

AU - Prange, Christa

AU - Raha, Sam S.

AU - Loquellano, Naomi A.

AU - Peters, Garrick J.

AU - Abramson, Rick D.

AU - Mullahy, Sara J.

AU - Bosak, Stephanie A.

AU - McEwan, Paul J.

AU - McKernan, Kevin J.

AU - Malek, Joel

AU - Gunaratne, Preethi H.

AU - Richards, Stephen

AU - Worley, Kim C.

AU - Hale, Sarah

AU - Garcia, Angela M.

AU - Gay, Laura J.

AU - Hulyk, Stephen W.

AU - Villalon, Debbie K.

AU - Muzny, Donna M.

AU - Sodergren, Erica J.

AU - Lu, Xiuhua

AU - Gibbs, Richard A.

AU - Fahey, Jessica

AU - Helton, Erin

AU - Ketteman, Mark

AU - Madan, Anuradha

AU - Rodrigues, Stephanie

AU - Sanchez, Amy

AU - Whiting, Michelle

AU - Madan, Anup

AU - Young, Alice C.

AU - Shevchenko, Yuriy

AU - Bouffard, Gerard G.

AU - Blakesley, Robert W.

AU - Touchman, Jeffrey W.

AU - Green, Eric D.

AU - Dickson, Mark C.

AU - Rodriguez, Alex C.

AU - Grimwood, Jane

AU - Schmutzs, Jeremy

AU - Myers, Richard M.

AU - Butterfield, Yaron S.N.

AU - Krzywinski, Martin I.

AU - Skalska, Ursula

AU - Smailus, Duane E.

AU - Schnerch, Angelique

AU - Schein, Jacqueline E.

AU - Jones, Steven J.M.

AU - Marra, Marco A.

PY - 2002/12/24

Y1 - 2002/12/24

N2 - National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http://mgc.nci.nih.gov).

AB - National Institutes of Health Mammalian Gene Collection (MGC) Program is a multiinstitutional effort to identify and sequence a cDNA clone containing a complete ORF for each human and mouse gene. ESTs were generated from libraries enriched for full-length cDNAs and analyzed to identify candidate full-ORF clones, which then were sequenced to high accuracy. The MGC has currently sequenced and verified the full ORF for a nonredundant set of >9,000 human and >6,000 mouse genes. Candidate full-ORF clones for an additional 7,800 human and 3,500 mouse genes also have been identified. All MGC sequences and clones are available without restriction through public databases and clone distribution networks (see http://mgc.nci.nih.gov).

UR - http://www.scopus.com/inward/record.url?scp=0037168586&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0037168586&partnerID=8YFLogxK

U2 - 10.1073/pnas.242603899

DO - 10.1073/pnas.242603899

M3 - Article

VL - 99

SP - 16899

EP - 16903

JO - Proceedings of the National Academy of Sciences of the United States of America

JF - Proceedings of the National Academy of Sciences of the United States of America

SN - 0027-8424

IS - 26

ER -