Click here to close
Hello! We notice that you are using Internet Explorer, which is not supported by Xenbase and may cause the site to display incorrectly.
We suggest using a current version of Chrome,
FireFox, or Safari.
???displayArticle.abstract???
Gene duplication provides a window of opportunity for biological variants to persist under the protection of a co-expressed copy with similar or redundant function. Duplication catalyzes innovation (neofunctionalization), subfunction degeneration (subfunctionalization), and genetic buffering (redundancy), and the genetic survival of each paralog is triggered by mechanisms that add, compromise, or do not alter protein function. We tested the applicability of three types of mechanisms for promoting the retained expression of duplicated genes in 290 expressed paralogs of the tetraploid clawed frog, Xenopus laevis. Tests were based on explicit expectations concerning the ka/ks ratio, and the number and location of nonsynonymous substitutions after duplication. Functional constraints on the majority of paralogs are not significantly different from a singleton ortholog. However, we recover strong support that some of them have an asymmetric rate of nonsynonymous substitution: 6% match predictions of the neofunctionalization hypothesis in that (1) each paralog accumulated nonsynonymous substitutions at a significantly different rate and (2) the one that evolves faster has a higher ka/ks ratio than the other paralog and than a singleton ortholog. Fewer paralogs (3%) exhibit a complementary pattern of substitution at the protein level that is predicted by enhancement or degradation of different functional domains, and the remaining 13% have a higher average ka/ks ratio in both paralogs that is consistent with altered functional constraints, diversifying selection, or activity-reducing mutations after duplication. We estimate that these paralogs have been retained since they originated by genome duplication between 21 and 41 million years ago. Multiple mechanisms operate to promote the retained expression of duplicates in the same genome, in genes in the same functional class, over the same period of time following duplication, and sometimes in the same pair of paralogs. None of these paralogs are superfluous; degradation or enhancement of different protein subfunctions and neofunctionalization are plausible hypotheses for the retained expression of some of them. Evolution of most X. laevis paralogs, however, is consistent with retained expression via mechanisms that do not radically alter functional constraints, such as selection to preserve post-duplication stoichiometry or temporal, quantitative, or spatial subfunctionalization.
Figure 1. A Non-Exhaustive Diagram Relating Various Models for the Fate of Duplicate GenesCitations that either propose mechanisms or discuss them: Clark 1994 [100]; Ferris and Whitt 1979 [101]; Force et al. 1999 [17]; Gibson and Spring 1998 [74]; Goodman et al. 1987 [102]; Gu et al. 2003 [6]; Hughes 1994 [42]; Jensen 1976 [103]; Kondrashov et al. 2002 [19]; Li 1980 [104];Li et al. 1982 [105]; Lynch and Conery 2000 [1]; Lynch and Conery 2003 [106]; Lynch and Force 2000 [2]; Ohno 1973 [107]; Ohta 1987 [108]; Piatigorsky and Wistow 1991 [109]; Rodin and Riggs 2003 [110]).; Sidow 1996 [111]; Stoltzfus 1999 [60]; Takahata and Maruyama 1979 [112]; Wagner 1999 [53]; Wagner 2000 [113]; and Zhang et al. 1998 [68].
Figure 2. Putative Allopolyploid Evolution of the Tetraploid X. laevisDaggers indicate extinct diploid ancestors or genes. Nodes 1 and 2 correspond with the divergence and union, respectively, of two diploid genomes, and Node 3 marks the diversification of Xenopus tetraploids.(A) A reticulate phylogeny with ploidy in parentheses.(B) Nuclear genealogy assuming no recombination and no gene conversion between alleles at different paralogous loci (α and β). The dashed portion of the paralogous lineages evolved independently in different diploid ancestors.
Figure 3. Assignment of Putative Retention Mechanisms Based on Molecular Changes in the Coding RegionWe assigned a retention mechanism to paralogs based on the results of three analyses. The first one compared a model with no change in the ka/ks ratio after duplication (Model A in which the ka/ks ratio on all branches is indicated by R0) to a model with a higher ka/ks ratio after duplication (Model B with ka/ks ratio R1 > R0). The second one compared a model with no difference in the nonsynonymous substitution rate (Model B, in which R0 and R1 are nonsynonymous rates on each branch) to a model with different rates of nonsynonymous substitution in each paralog (Model C in which R0, R1, and R2 are nonsynonymous rates on each branch), with the stipulation that the paralog with the higher nonsynonymous rate also have a higher ka/ks ratio than the slower paralog and a higher ka/ks ratio than the diploid lineage. The third analysis tested for complementarity of amino acid substitution in each paralog.In the table in the figure, a minus sign (â) indicates either no significant difference between the models or no significant complementarity of nonsynonymous substitutions. A plus sign (+) indicates a significant improvement in likelihood of the more parameterized model or significant complementarity of nonsynonymous substitution. An asterisk (*) denotes the caveat that an increased substitution ratio could stem from relaxed purifying selection and therefore be a consequence of rather than a cause for retention.
Figure 4. Nonsynonymous Substitutions in Each X. laevis Paralog (α and β) and the Diploid Lineage in Representative GenesSubstitutions in the diploid lineage (d) occurred on the thick branches in the rooted topologies to the left of each locus. (A) liver-type arginase, (B) fibroblast growth factor receptor (FGFR), (C) embryonic fibroblast growth factor (EFGF), and (D) FTZ-F1ârelated orphan receptor. In (A) a gap indicates a single amino acid deletion, an arrow above the paralog indicates a single amino acid insertion, and this paralog is shortened due to an early stop codon. In (B) three red boxes and a blue box indicate three immunoglobulin domains and a tyrosine kinase domain. In (C) arrows below the paralog indicate predicted cleavage sites in each paralog [79]. In (D) yellow, green, and two lighter blue boxes indicate the DNA-binding C-domain, FTZ-F1 box, and DNA binding domain regions II and III [80].
Figure 5. Probability Distribution of the Difference in the Number of Substitutions in Concatenated Paralogs (âSuperparalogsâ)Analysis was performed on concatenated data from (A) nonsynonymous substitutions of paralogs identified by the likelihood analysis as having asymmetric rates of evolution, (B) synonymous substitutions of these paralogs, (C) nonsynonymous substitutions of the other paralogs that were not identified as having asymmetric rates and (D) synonymous substitutions of these paralogs.Black circles are the expected Skellam distributions, gray dots are dSP distributions from ten example simulations (out of 1,000 total), and white circles are the observed distribution of superparalog differences.
Figure 6. The Observed Relationship between ka/ks and ksThe observed relationship between ka/ks and ks corresponds with simulations that predict a negative relationship under neutral or near-neutral evolution of synonymous substitutions because of stochastic sampling of synonymous substitutions at in slowly evolving or young genes [61]. The plot shows the average ka/ks ratio on each branch of 290 genealogies versus average ks of bins of 50 lineages ranked by ks of each one. The last bin has only 20 lineages. Bars indicate the standard deviation of each bin.
Figure 7. The ka/ks Ratio of Genes with No Significant Difference before and after TetraploidizationThe ka/ks ratio is often slightly higher in the paralogs (above the dashed line), even though this average is not significantly higher than the diploid lineage. Only ratios from genes with no significant difference are shown (226 out of 292 genes). A dashed line indicates an equal ka/ks ratio before and after duplication.
Ahn,
Comparative linkage maps of the rice and maize genomes.
1993, Pubmed
Ahn,
Comparative linkage maps of the rice and maize genomes.
1993,
Pubmed
Amores,
Zebrafish hox clusters and vertebrate genome evolution.
1998,
Pubmed
Bailey,
Recent segmental duplications in the human genome.
2002,
Pubmed
Bisbee,
Albumin phylogeny for clawed frogs (Xenopus).
1977,
Pubmed
,
Xenbase
Blanc,
Functional divergence of duplicated genes formed by polyploidy during Arabidopsis evolution.
2004,
Pubmed
Blanc,
Extensive duplication and reshuffling in the Arabidopsis genome.
2000,
Pubmed
Carroll,
Evolution at two levels: on genes and form.
2005,
Pubmed
Castillo-Davis,
GeneMerge--post-genomic analysis, data mining, and hypothesis testing.
2003,
Pubmed
Clark,
Invasion and maintenance of a gene duplication.
1994,
Pubmed
Conant,
Asymmetric sequence divergence of duplicate genes.
2003,
Pubmed
Davis,
Preferential duplication of conserved proteins in eukaryotic genomes.
2004,
Pubmed
Davis,
Do disparate mechanisms of duplication add similar genes to the genome?
2005,
Pubmed
Dermitzakis,
Differential selection after duplication in mammalian developmental genes.
2001,
Pubmed
D'Onofrio,
Correlations between the compositional properties of human genes, codon usage, and amino acid composition of proteins.
1991,
Pubmed
Drummond,
Why highly expressed proteins evolve slowly.
2005,
Pubmed
Duda,
Molecular genetics of ecological diversification: duplication and rapid evolution of toxin genes of the venomous gastropod Conus.
1999,
Pubmed
Duret,
Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate.
2000,
Pubmed
Ellinger-Ziegelbauer,
FTZ-F1-related orphan receptors in Xenopus laevis: transcriptional regulators differentially expressed during early embryogenesis.
1994,
Pubmed
,
Xenbase
Evans,
Evolution of RAG-1 in polyploid clawed frogs.
2005,
Pubmed
,
Xenbase
Evans,
A mitochondrial DNA phylogeny of African clawed frogs: phylogeography and implications for polyploid evolution.
2004,
Pubmed
,
Xenbase
Fay,
The neutral theory in the genomic era.
2001,
Pubmed
Fay,
Sequence divergence, functional constraint, and selection in protein evolution.
2003,
Pubmed
Ferris,
Loss of duplicate gene expression after polyploidisation.
1977,
Pubmed
Ferris,
Evolution of the differential regulation of duplicate genes after polyploidization.
1979,
Pubmed
Force,
Preservation of duplicate genes by complementary, degenerative mutations.
1999,
Pubmed
Fryxell,
CpG mutation rates in the human genome are highly dependent on local GC content.
2005,
Pubmed
Gibson,
Genetic redundancy in vertebrates: polyploidy and persistence of genes encoding multidomain proteins.
1998,
Pubmed
Golding,
The structural basis of molecular adaptation.
1998,
Pubmed
Goldman,
Likelihood-based tests of topologies in phylogenetics.
2000,
Pubmed
Goodman,
Globins: a case study in molecular phylogeny.
1987,
Pubmed
Goss,
Detecting heterogeneity of substitution along DNA and protein sequences.
1996,
Pubmed
Gu,
Role of duplicate genes in genetic robustness against null mutations.
2003,
Pubmed
Gu,
Duplicate genes increase gene expression diversity within and between species.
2004,
Pubmed
Hancock,
Gene factories, microfunctionalization and the evolution of gene families.
2005,
Pubmed
Hughes,
Positive Darwinian selection promotes charge profile diversity in the antigen-binding cleft of class I major-histocompatibility-complex molecules.
1990,
Pubmed
Hughes,
The evolution of functionally novel proteins after gene duplication.
1994,
Pubmed
,
Xenbase
Hughes,
Evolution of duplicate genes in a tetraploid animal, Xenopus laevis.
1993,
Pubmed
,
Xenbase
Isaacs,
Expression of a novel FGF in the Xenopus embryo. A new candidate inducing factor for mesoderm formation and anteroposterior specification.
1992,
Pubmed
,
Xenbase
Jensen,
Enzyme recruitment in evolution of new function.
1976,
Pubmed
Jordan,
Duplicated genes evolve slower than singletons despite the initial rate increase.
2004,
Pubmed
Katju,
The structure and early evolution of recently arisen gene duplicates in the Caenorhabditis elegans genome.
2003,
Pubmed
Katz,
Widespread selection for local RNA secondary structure in coding regions of bacterial genes.
2003,
Pubmed
Knöchel,
Globin evolution in the genus Xenopus: comparative analysis of cDNAs coding for adult globin polypeptides of Xenopus borealis and Xenopus tropicalis.
1986,
Pubmed
,
Xenbase
Kondrashov,
Selection in the evolution of gene duplications.
2002,
Pubmed
Kuepfer,
Metabolic functions of duplicate genes in Saccharomyces cerevisiae.
2005,
Pubmed
Lercher,
Genomic regionality in rates of evolution is not explained by clustering of genes of comparable expression profile.
2004,
Pubmed
Li,
Rate of gene silencing at duplicate loci: a theoretical study and interpretation of data from tetraploid fishes.
1980,
Pubmed
Li,
Evolutionary change of duplicate genes.
1982,
Pubmed
Li,
Expression divergence between duplicate genes.
2005,
Pubmed
Liu,
Epigenetic phenomena and the evolution of plant allopolyploids.
2003,
Pubmed
Long,
Gene duplication and evolution.
2001,
Pubmed
Lynch,
The probability of duplicate gene preservation by subfunctionalization.
2000,
Pubmed
Lynch,
The evolutionary fate and consequences of duplicate genes.
2000,
Pubmed
Lynch,
The altered evolutionary trajectories of gene duplicates.
2004,
Pubmed
Lynch,
The origins of genome complexity.
2003,
Pubmed
Nadeau,
Comparable rates of gene loss and functional divergence after genome duplications early in vertebrate evolution.
1997,
Pubmed
Nembaware,
Impact of the presence of paralogs on sequence divergence in a set of mouse-human orthologs.
2002,
Pubmed
Nowak,
Evolution of genetic redundancy.
1997,
Pubmed
Ohta,
Time for acquiring a new gene by duplication.
1988,
Pubmed
Ono,
Ancient linkage groups and frozen accidents.
1973,
Pubmed
Osborn,
Understanding mechanisms of novel gene expression in polyploids.
2003,
Pubmed
Pál,
Highly expressed genes in yeast evolve slowly.
2001,
Pubmed
Piatigorsky,
The recruitment of crystallins: new functions precede gene duplication.
1991,
Pubmed
Pond,
Site-to-site variation of synonymous substitution rates.
2005,
Pubmed
Posada,
MODELTEST: testing the model of DNA substitution.
1998,
Pubmed
Rastogi,
Subfunctionalization of duplicated genes as a transition state to neofunctionalization.
2005,
Pubmed
Rice,
ANALYZING TABLES OF STATISTICAL TESTS.
1989,
Pubmed
Robinson-Rechavi,
Evolutionary rates of duplicate genes in fish and mammals.
2001,
Pubmed
Rodin,
Epigenetic silencing may aid evolution by gene duplication.
2003,
Pubmed
Rokas,
Genome-scale approaches to resolving incongruence in molecular phylogenies.
2003,
Pubmed
Sanderson,
Estimating absolute rates of molecular evolution and divergence times: a penalized likelihood approach.
2002,
Pubmed
Seoighe,
Extent of genomic rearrangement after genome duplication in yeast.
1998,
Pubmed
Shields,
"Silent" sites in Drosophila genes are not neutral: evidence of selection among synonymous codons.
1988,
Pubmed
Shiu,
Role of positive selection in the retention of duplicate genes in mammalian genomes.
2006,
Pubmed
Sidow,
Gen(om)e duplications in the evolution of early vertebrates.
1996,
Pubmed
Soltis,
Polyploidy: recurrent formation and genome evolution.
1999,
Pubmed
Stoltzfus,
On the possibility of constructive neutral evolution.
1999,
Pubmed
Subramanian,
Gene expression intensity shapes evolutionary rates of the proteins encoded by the vertebrate genome.
2004,
Pubmed
Takahata,
Polymorphism and loss of duplicate gene expression: a theoretical study with application of tetraploid fish.
1979,
Pubmed
Tang,
Locating regions of differential variability in DNA and protein sequences.
1999,
Pubmed
Taylor,
Genome duplication, divergent resolution and speciation.
2001,
Pubmed
Taylor,
Comparative genomics provides evidence for an ancient genome duplication event in fish.
2001,
Pubmed
,
Xenbase
Van de Peer,
The ghost of selection past: rates of evolution and functional divergence of anciently duplicated genes.
2001,
Pubmed
,
Xenbase
Wagner,
The role of population size, pleiotropy and fitness effects of mutations in the evolution of overlapping gene functions.
2000,
Pubmed
Walsh,
How often do duplicated genes evolve new functions?
1995,
Pubmed
Wendel,
Genome evolution in polyploids.
2000,
Pubmed
Wu,
Estrogen receptors in Xenopus: duplicate genes, splice variants, and tissue-specific expression.
2003,
Pubmed
,
Xenbase
Wyckoff,
A highly unexpected strong correlation between fixation probability of nonsynonymous mutations and mutation rate.
2005,
Pubmed
Xu,
Developmental and hormonal regulation of the Xenopus liver-type arginase gene.
1993,
Pubmed
,
Xenbase
Yang,
Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution.
1998,
Pubmed
Yang,
PAML: a program package for phylogenetic analysis by maximum likelihood.
1997,
Pubmed
Zhang,
Adaptive evolution of a duplicated pancreatic ribonuclease gene in a leaf-eating monkey.
2002,
Pubmed
Zhang,
Positive Darwinian selection after gene duplication in primate ribonuclease genes.
1998,
Pubmed
Zhang,
Different evolutionary patterns between young duplicate genes in the human genome.
2003,
Pubmed