Click here to close
Hello! We notice that you are using Internet Explorer, which is not supported by Xenbase and may cause the site to display incorrectly.
We suggest using a current version of Chrome,
FireFox, or Safari.
???displayArticle.abstract???
The concept of germ layers has been one of the foremost organizing principles in developmental biology, classification, systematics and evolution for 150 years (refs 1 - 3). Of the three germ layers, the mesoderm is found in bilaterian animals but is absent in species in the phyla Cnidaria and Ctenophora, which has been taken as evidence that the mesoderm was the final germ layer to evolve. The origin of the ectoderm and endoderm germ layers, however, remains unclear, with models supporting the antecedence of each as well as a simultaneous origin. Here we determine the temporal and spatial components of gene expression spanning embryonic development for all Caenorhabditis elegans genes and use it to determine the evolutionary ages of the germ layers. The gene expression program of the mesoderm is induced after those of the ectoderm and endoderm, thus making it the last germ layer both to evolve and to develop. Strikingly, the C. elegans endoderm and ectoderm expression programs do not co-induce; rather the endoderm activates earlier, and this is also observed in the expression of endoderm orthologues during the embryology of the frog Xenopus tropicalis, the sea anemone Nematostella vectensis and the sponge Amphimedon queenslandica. Querying the phylogenetic ages of specifically expressed genes reveals that the endoderm comprises older genes. Taken together, we propose that the endoderm program dates back to the origin of multicellularity, whereas the ectoderm originated as a secondary germ layer freed from ancestral feeding functions.
Extended Data Figure 2. A transcriptomic survey of C. elegans embryonic founder cell lineagesa, Replicates of the embryonic blastomere time-courses. The heatmaps show the correlations among the replicates for each blastomere lineage at each of the eleven examined stages. For three blastomere/stages there were no replicates. The median correlation coefficient is 0.9. Samples were collected in triplicates. Only samples with at least 750,000 reads were used which has been previously shown to be of sufficient sequencing depth for CEL-Seq12. Supplementary Table 3 provides the sequencing statistics for each sample. b, Expression profiles of the 3,910 dynamic genes across the blastomere lineage time-courses. See methods for definition of dynamic genes. c, Correlation coefficients between samples of the whole embryo time-course. Each of the 50 samples comprises a single embryo, collected at the indicated minutes past the 4-cell stage. Again, only samples with at least 750,000 reads were used and Supplementary Table 3 provides the sequencing statistics for each sample. d, The expression profiles of the 1,664 genes with differentiated expression analyzed in Figure 1c. Each profile was âstandardizedâ by subtracting its mean and dividing by its standard deviation. e, Comparison of the blastomere time-courses to the EPIC dataset15. For 115 genes, we could compare gene expression to previously published embryonic expression profiles generated by microscopic lineaging until the ~300-cell stage15,39. Of these, 75% of our profiles had consistent localized expression (Supplementary Table 1). Of those, 54% matched completely, and 21% of the genes, expressed in all of the lineages in our dataset had some missing expression in the EPIC dataset because the lineaging was not carried out until the end of the developmental process. The remaining genes have some overlap in expression. Such differences in expression could be caused by the transgene in the EPIC dataset not recapitulating the profile of the endogenous gene, or missing signals between cells in the blastomere dataset, as is seen from the whole embryo/blastomeres expression level ratio (See Supplementary Table 1, ratios defined as equal, slightly higher/lower or much higher/lower). Expression profile compared to the EPIC dataset deviates more when expression in the blastomeres is low compared to the whole embryo, but the blastomere dataset has the advantage that all genes are assayed simultaneously, no transgenes are used, maternal transcripts are seen, and down-regulation of genes is observable.
Extended Data Figure 3. Lineage restricted gene expression identifies genes dependent upon coherence of the lineages and tissue specificitya, Expression profiles of genes involved in pharynx specification. The left and right panels correspond to the two Notch signaling events. The top and bottom images correspond to the expected regulatory patterns in the whole embryo and isolated blastomeres, respectively. tbx-37 is not shown since it is identical to tbx-38 in expression profile. b, Comparison of the overall sum of expression between the two time-courses, plotted on a log2 scale (black). Genes âmissingâ in the separated lineage time-course were manually added to the graph at â3. The additional plots indicate the same measure for dynamically expressed genes (blue) and constitutive genes (red). c, Idealized expression profiles used to identify gene expression clusters. d, The gene expression profiles for the temporally restricted gene expression profiles. Each profile was âstandardizedâ by subtracting its mean and dividing by its standard deviation. e, Average expression profiles of ten clusters of dynamically expressed genes determined based upon the whole embryo expression data (see Methods). f, The number of dynamic genes in each temporal period In each group, the genes not expressed in the lineage time-course (b) are marked in red.
Extended Data Figure 4. The first principal component correlates with developmental timePrincipal components analysis as described in Figure 2a. Color codes are same as in Figure 1. PC1, PC2 and PC3 capture 18%, 12% and 11%, respectively, of the variation in the expression, respectively, in the 1,320 dynamically expressed genes with no expression in the first stage (to exclude genes with maternal expression).
Extended Data Figure 5. Germ layer specific expressionExpression profiles of the germ layer specific genes in each of the lineages. The x- and y-axes are the eleven examined temporal stages and individual genes, respectively. Germ layer specific genes were identified by hierarchical clustering based upon correlation among dynamically expressed genes (see Methods).
Extended Data Figure 6. Robustness of gene age analysisa, Same format as Figure 4a but with the definition of old genes as those present in at least 25% of the examined Eukaryotes (see Methods) that are not Ophisthokonts. b, Same as Figure 4a with a definition of âoldâ as those present in 25% of the examined organisms that are not Eukaryotes (Eubacteria and Archaea).
Extended Data Figure 7. Truncated endoderm gene set controlTo exclude the possibility that general genes are included as âendoderm-specificâ since the endoderm program is induced earlier, we excluded temporal clusters 8, 9, 10 from the endoderm genes and repeated the relevant analyses. We found that there is no dramatic change in the results. The results are shown in the same format as Figure 3 and 4b-c.
Figure 1. Determining the expression profiles of the C. elegans embryonic founder cell lineagesa, Sample collections are indicated for the shown C. elegans blastomere lineages (circles, Extended Data Fig. 1 and Extended Data Table 2) and whole embryos (notches, in minutes). b, Heat map showing Pearsonâs correlation coefficients among the transcriptomes of the whole embryo and the sum of the individual blastomere lineages. White circles indicate pairs of blastomere stages and embryonic time-points expected to be most similar (Extended Data Table 2). c, P-values of enrichment across curated lists of genes for the indicated lineage-specific gene expression clusters (Extended Data Table 3 and Extended Data Fig. 2d). The white circles indicate the expected differentiation of each expression cluster (Extended Data Table 1). d, Spatial and temporal gene expression profiles.
Figure 2. Dynamics of germ layer gene expression throughout developmenta, PCA on dynamically expressed genes for the five lineage time-courses (See Extended Data Fig. 4 for PC1). Adjacent stages of the same lineage are connected by a line, the terminal stage is indicated by a circle. b, Heat map indicating Pearsonâs correlation coefficients between blastomere expression profile of dynamically expressed genes. The right-side bars indicate the time of expression (temporal clusters, as in Extended Data Fig. 3c-e) and the location of expression. c, Summary of location of expression for genes according to temporal clusters.
Figure 3. The endoderm expression program precedes the ectoderm program in diverse speciesExpression of germ layer genes in C. elegans, and their orthologs in X. tropicalis, N. vectensis, and A. queenslandica. The average is computed on the maximum-normalized gene profiles.
Figure 4. The germ layers exhibit distinct gene ages and functional category enrichmentsa, Fraction of âoldâ genes â defined as presence of orthologs in other opisthokont Eukaryotes â across the indicated temporal induction clusters and germ layers. Different gene age thresholds show similar results (Extended Data Fig. 6). b, For the shown functional categories, the bars indicate the fraction of genes in the endoderm gene set, ectoderm gene set, and other dynamic and zygotically expressed genes. Asterisks indicate significant endoderm (green) and ectoderm (blue) enrichments (P<0.01, Hypergeometric distribution). c, The fraction of orthologs in M. brevicollis is indicated for each functional category. d, A model for germ layer evolution.
Anavy,
BLIND ordering of large-scale transcriptomic developmental timecourses.
2014, Pubmed
Anavy,
BLIND ordering of large-scale transcriptomic developmental timecourses.
2014,
Pubmed
Baker,
The External RNA Controls Consortium: a progress report.
2005,
Pubmed
Cowan,
Mapping the distribution of differentiation potential for intestine, muscle, and hypodermis during early development in Caenorhabditis elegans.
1985,
Pubmed
Domazet-Lošo,
A phylogenetically based transcriptome age index mirrors ontogenetic divergence patterns.
2010,
Pubmed
Edgar,
Culture and manipulation of embryonic cells.
2012,
Pubmed
Fischer,
Using OrthoMCL to assign proteins to OrthoMCL-DB groups or to cluster proteomes into new ortholog groups.
2011,
Pubmed
Fox,
The embryonic muscle transcriptome of Caenorhabditis elegans.
2007,
Pubmed
Fukushige,
Defining the transcriptional redundancy of early bodywall muscle development in C. elegans: evidence for a unified theory of animal muscle development.
2006,
Pubmed
Goldstein,
Induction of gut in Caenorhabditis elegans embryos.
1992,
Pubmed
Goldstein,
An analysis of the response to gut induction in the C. elegans embryo.
1995,
Pubmed
Goldstein,
Establishment of gut fate in the E lineage of C. elegans: the roles of lineage-dependent mechanisms and cell interactions.
1993,
Pubmed
Good,
The T-box transcription factors TBX-37 and TBX-38 link GLP-1/Notch signaling to mesoderm induction in C. elegans embryos.
2004,
Pubmed
Guberman,
BioMart Central Portal: an open database network for the biological community.
2011,
Pubmed
Hashimshony,
CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification.
2012,
Pubmed
Helm,
Characterization of differential transcript abundance through time during Nematostella vectensis development.
2013,
Pubmed
Kalinka,
Gene expression divergence recapitulates the developmental hourglass model.
2010,
Pubmed
King,
The genome of the choanoflagellate Monosiga brevicollis and the origin of metazoans.
2008,
Pubmed
Laufer,
Segregation of developmental potential in early embryos of Caenorhabditis elegans.
1980,
Pubmed
Leininger,
Developmental gene expression provides clues to relationships between sponge and eumetazoan body plans.
2014,
Pubmed
Levin,
Developmental milestones punctuate gene expression in the Caenorhabditis embryo.
2012,
Pubmed
Leys,
Epithelia, an evolutionary novelty of metazoans.
2012,
Pubmed
Li,
Fast and accurate short read alignment with Burrows-Wheeler transform.
2009,
Pubmed
Martindale,
Investigating the origins of triploblasty: 'mesodermal' gene expression in a diploblastic animal, the sea anemone Nematostella vectensis (phylum, Cnidaria; class, Anthozoa).
2004,
Pubmed
McGhee,
ELT-2 is the predominant transcription factor controlling differentiation and function of the C. elegans intestine, from embryo to adult.
2009,
Pubmed
Murray,
Multidimensional regulation of gene expression in the C. elegans embryo.
2012,
Pubmed
Murray,
Automated analysis of embryonic gene expression with cellular resolution in C. elegans.
2008,
Pubmed
Nakanishi,
Evolutionary origin of gastrulation: insights from sponge development.
2014,
Pubmed
Neves,
The REF-1 family of bHLH transcription factors pattern C. elegans embryos through Notch-dependent and Notch-independent pathways.
2005,
Pubmed
Pryszcz,
MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score.
2011,
Pubmed
Ryan,
The genome of the ctenophore Mnemiopsis leidyi and its implications for cell type evolution.
2013,
Pubmed
Schierwater,
Concatenated analysis sheds light on early metazoan evolution and fuels a modern "urmetazoon" hypothesis.
2009,
Pubmed
Schwarz,
WormBase: better software, richer content.
2006,
Pubmed
Shalek,
Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells.
2013,
Pubmed
Shapiro,
Single-cell sequencing-based technologies will revolutionize whole-organism science.
2013,
Pubmed
Sulston,
The embryonic cell lineage of the nematode Caenorhabditis elegans.
1983,
Pubmed
Tatusov,
A genomic perspective on protein families.
1997,
Pubmed
Technau,
Origin and evolution of endoderm and mesoderm.
2003,
Pubmed
Yanai,
Mapping gene expression in two Xenopus species: evolutionary constraints and developmental flexibility.
2011,
Pubmed
,
Xenbase