Click here to close
Hello! We notice that you are using Internet Explorer, which is not supported by Xenbase and may cause the site to display incorrectly.
We suggest using a current version of Chrome,
FireFox, or Safari.
Proc Natl Acad Sci U S A
2005 Jan 11;1022:373-8. doi: 10.1073/pnas.0408810102.
Show Gene links
Show Anatomy links
Phylogeny determined by protein domain content.
Yang S
,
Doolittle RF
,
Bourne PE
.
???displayArticle.abstract???
A simple classification scheme that uses only the presence or absence of a protein domain architecture has been used to determine the phylogeny of 174 complete genomes. The method correctly divides the 174 taxa into Archaea, Bacteria, and Eukarya and satisfactorily sorts most of the major groups within these superkingdoms. The most challenging problem involved 119 Bacteria, many of which have reduced genomes. When a weighting factor was used that takes account of difference in genome size (number of considered folds), small-genome taxa were mostly grouped with their full-sized counterparts. Although not every organism appears exactly at its classical phylogenetic position in these trees, the agreement appears comparable with the efforts of others by using sophisticated sequence analysis and/or combinations of gene content and gene order. During the course of the study, it emerged that there is a core set of approximately 50 folds that is found in all 174 genomes and a single fold diagnostic of all Archaea.
Baldauf,
A kingdom-level phylogeny of eukaryotes based on combined protein data.
2000, Pubmed
Baldauf,
A kingdom-level phylogeny of eukaryotes based on combined protein data.
2000,
Pubmed
Bansal,
Evolutionary analysis by whole-genome comparisons.
2002,
Pubmed
Bapteste,
Phylogenetic reconstruction and lateral gene transfer.
2004,
Pubmed
Berman,
The Protein Data Bank.
2000,
Pubmed
Brown,
Universal trees based on large combined protein sequence data sets.
2001,
Pubmed
Caetano-Anollés,
An evolutionarily structured universe of protein architecture.
2003,
Pubmed
Clarke,
Inferring genome trees by using a filter to eliminate phylogenetically discordant sequences and a distance matrix based on mean normalized BLASTP scores.
2002,
Pubmed
Dandekar,
Conservation of gene order: a fingerprint of proteins that physically interact.
1998,
Pubmed
Deeds,
Proteomic traces of speciation.
2004,
Pubmed
Doolittle,
The multiplicity of domains in proteins.
1995,
Pubmed
Gerstein,
Patterns of protein-fold usage in eight microbial genomes: a comprehensive structural census.
1998,
Pubmed
Gerstein,
Comparing genomes in terms of protein structure: surveys of a finite parts list.
1998,
Pubmed
Gogarten,
Prokaryotic evolution in light of gene transfer.
2002,
Pubmed
Gough,
SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments.
2002,
Pubmed
Gough,
Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure.
2001,
Pubmed
House,
Using homolog groups to create a whole-genomic tree of free-living organisms: an update.
2002,
Pubmed
Ishitani,
Crystal structure of archaeosine tRNA-guanine transglycosylase.
2002,
Pubmed
Karplus,
Hidden Markov models for detecting remote protein homologies.
1998,
Pubmed
Koga,
Did archaeal and bacterial cells arise independently from noncellular precursors? A hypothesis stating that the advent of membrane phospholipid with enantiomeric glycerophosphate backbones caused the separation of the two lines of descent.
1998,
Pubmed
Korbel,
SHOT: a web server for the construction of genome phylogenies.
2002,
Pubmed
Krylov,
Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution.
2003,
Pubmed
Kunin,
The balance of driving forces during genome evolution in prokaryotes.
2003,
Pubmed
Lin,
Whole-genome trees based on the occurrence of folds and orthologs: implications for comparing genomes on different levels.
2000,
Pubmed
Lo Conte,
SCOP database in 2002: refinements accommodate structural genomics.
2002,
Pubmed
Murzin,
SCOP: a structural classification of proteins database for the investigation of sequences and structures.
1995,
Pubmed
Roelofs,
Genes lost during evolution.
2001,
Pubmed
Rokas,
Genome-scale approaches to resolving incongruence in molecular phylogenies.
2003,
Pubmed
Saitou,
The neighbor-joining method: a new method for reconstructing phylogenetic trees.
1987,
Pubmed
Snel,
Genome phylogeny based on gene content.
1999,
Pubmed
Snel,
Genomes in flux: the evolution of archaeal and proteobacterial gene content.
2002,
Pubmed
Tekaia,
The genomic tree as revealed from whole proteome comparisons.
1999,
Pubmed
Waters,
The genome of Nanoarchaeum equitans: insights into early archaeal evolution and derived parasitism.
2003,
Pubmed
Woese,
On the evolution of cells.
2002,
Pubmed
Woese,
The universal ancestor.
1998,
Pubmed
Wolf,
Genome trees and the tree of life.
2002,
Pubmed
Wolf,
Genome trees constructed using five different approaches suggest new major bacterial clades.
2001,
Pubmed
Wolf,
Distribution of protein folds in the three superkingdoms of life.
1999,
Pubmed