XB-ART-34638
Genome Res
2004 Oct 01;14(10B):2121-7. doi: 10.1101/gr.2596504.
The status, quality, and expansion of the NIH full-length cDNA project: the Mammalian Gene Collection (MGC).
Gerhard DS, Wagner L, Feingold EA, Shenmen CM, Grouse LH, Schuler G, Klein SL, Old S, Rasooly R, Good P, Guyer M, Peck AM, Derge JG, Lipman D, Collins FS, Jang W, Sherry S, Feolo M, Misquitta L, Lee E, Lee E, Rotmistrovsky K, Greenhut SF, Schaefer CF, Buetow K, Bonner TI, Haussler D, Kent J, Kiekhaus M, Furey T, Brent M, Prange C, Schreiber K, Shapiro N, Bhat NK, Hopkins RF, Hsie F, Driscoll T, Soares MB, Casavant TL, Scheetz TE, Brownstein MJ, Usdin TB, Toshiyuki S, Carninci P, Piao Y, Dudekula DB, Ko MS, Kawakami K, Suzuki Y, Sugano S, Gruber CE, Smith MR, Simmons B, Moore T, Waterman R, Johnson SL, Ruan Y, Wei CL, Mathavan S, Gunaratne PH, Wu J, Garcia AM, Hulyk SW, Fuh E, Yuan Y, Sneed A, Kowis C, Hodgson A, Muzny DM, McPherson J, Gibbs RA, Fahey J, Helton E, Ketteman M, Madan A, Rodrigues S, Sanchez A, Whiting M, Madari A, Young AC, Wetherby KD, Granite SJ, Kwong PN, Brinkley CP, Pearson RL, Bouffard GG, Blakesly RW, Green ED, Dickson MC, Rodriguez AC, Grimwood J, Schmutz J, Myers RM, Butterfield YS, Griffith M, Griffith OL, Krzywinski MI, Liao N, Morin R, Morrin R, Palmquist D, Petrescu AS, Skalska U, Smailus DE, Stott JM, Schnerch A, Schein JE, Jones SJ, Holt RA, Baross A, Marra MA, Clifton S, Makowski KA, Bosak S, Malek J, MGC Project Team.
Abstract
The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5'-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline.
PubMed ID: 15489334
PMC ID: PMC528928
Article link: Genome Res
Grants:
N01-C0-12400 PHS HHS
References:
Adams, Complementary DNA sequencing: expressed sequence tags and human genome project. 1991, Pubmed
Anant, Molecular regulation, evolutionary, and functional adaptations associated with C to U editing of mammalian apolipoproteinB mRNA. 2003, Pubmed
Bamshad, Signatures of natural selection in the human genome. 2003, Pubmed
Baross, Systematic recovery and analysis of full-ORF human cDNA clones. 2004, Pubmed
Boguski, dbEST--database for "expressed sequence tags". 1993, Pubmed
Brentani, The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags. 2003, Pubmed
Gibbs, Genome sequence of the Brown Norway rat yields insights into mammalian evolution. 2004, Pubmed
Kent, The human genome browser at UCSC. 2002, Pubmed
Klein, Genetic and genomic tools for Xenopus research: The NIH Xenopus initiative. 2002, Pubmed , Xenbase
Lander, Initial sequencing and analysis of the human genome. 2001, Pubmed
Lennon, The I.M.A.G.E. Consortium: an integrated molecular analysis of genomes and their expression. 1996, Pubmed
Makalowski, Synonymous and nonsynonymous substitution distances are correlated in mouse and rat genes. 1998, Pubmed
Ohara, Construction and characterization of human brain cDNA libraries suitable for analysis of cDNA clones encoding relatively large proteins. 1997, Pubmed
Okazaki, Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. 2002, Pubmed
Ota, Complete sequencing and characterization of 21,243 full-length human cDNAs. 2004, Pubmed
Parks, The AMPA receptors of auditory neurons. 2000, Pubmed
Pruitt, NCBI Reference Sequence project: update and current status. 2003, Pubmed
Pruitt, RefSeq and LocusLink: NCBI gene-centered resources. 2001, Pubmed
Pruitt, Introducing RefSeq and LocusLink: curated human genome resources at the NCBI. 2000, Pubmed
Rasooly, Genetic and genomic tools for zebrafish research: the NIH zebrafish initiative. 2003, Pubmed
Schaefer, A new cancer genome anatomy project web resource for the community. 2001, Pubmed
Schaub, RNA editing by adenosine deaminases generates RNA and protein diversity. 2002, Pubmed
Sherry, dbSNP: the NCBI database of genetic variation. 2001, Pubmed
Strausberg, The Cancer Genome Anatomy Project: new resources for reading the molecular signatures of cancer. 2001, Pubmed
Strausberg, The cancer genome anatomy project: online resources to reveal the molecular signatures of cancer. 2002, Pubmed
Strausberg, The mammalian gene collection. 1999, Pubmed
Strausberg, Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. 2002, Pubmed
Venter, The sequence of the human genome. 2001, Pubmed
Waterston, Initial sequencing and comparative analysis of the mouse genome. 2002, Pubmed
Wiemann, Toward a catalog of human genes and proteins: sequencing and analysis of 500 novel complete protein coding human cDNAs. 2001, Pubmed
Williamson, The Merck Gene Index project. 1999, Pubmed
Wu, Large-scale RT-PCR recovery of full-length cDNA clones. 2004, Pubmed