Click here to close
Hello! We notice that you are using Internet Explorer, which is not supported by Xenbase and may cause the site to display incorrectly.
We suggest using a current version of Chrome,
FireFox, or Safari.
???displayArticle.abstract???
During embryogenesis, signaling molecules produced by one cell population direct gene regulatory changes in neighboring cells and influence their developmental fates and spatial organization. One of the earliest events in the development of the vertebrate embryo is the establishment of three germ layers, consisting of the ectoderm, mesoderm and endoderm. Attempts to measure gene expression in vivo in different germ layers and cell types are typically complicated by the heterogeneity of cell types within biological samples (i.e., embryos), as the responses of individual cell types are intermingled into an aggregate observation of heterogeneous cell types. Here, we propose a novel method to elucidate gene regulatory circuits from these aggregate measurements in embryos of the frog Xenopus tropicalis using gene network inference algorithms and then test the ability of the inferred networks to predict spatial gene expression patterns. We use two inference models with different underlying assumptions that incorporate existing network information, an ODE model for steady-state data and a Markov model for time series data, and contrast the performance of the two models. We apply our method to both control and knockdown embryos at multiple time points to reconstruct the core mesoderm and endoderm regulatory circuits. Those inferred networks are then used in combination with known dorsal-ventral spatial expression patterns of a subset of genes to predict spatial expression patterns for other genes. Both models are able to predict spatial expression patterns for some of the core mesoderm and endoderm genes, but interestingly of different gene subsets, suggesting that neither model is sufficient to recapitulate all of the spatial patterns, yet they are complementary for the patterns that they do capture. The presented methodology of gene network inference combined with spatial pattern prediction provides an additional layer of validation to elucidate the regulatory circuits controlling the spatial-temporal dynamics in embryonic development.
Figure 7. Spatial gene expression patterns in Xenopus embryos. (A) The ventx gene is expressed ventrally. (B) The gsc gene is expressed dorsally. (C) The bix1 gene is expressed in both the ventral and dorsal regions. (D) The gata4 gene is expressed in the vegetal region. The rectangular bar across the embryo indicates the portion of the image viewed for classification purposes, with the bar under the image showing the 1-dimensional representation of gene expression.
Figure 1. Inference error versus number of observations. The proportional error (i.e., inference error) denotes the minimal cross-validation error divided by the minimal least-squares error of the linear regression without any regularization terms and averaged over five random networks. The proportional errors decrease with more observations and stabilize when there are enough observations.
Figure 2. Inference error versus number of random valid edges provided. The proportional error (i.e., inference error) denotes the minimal cross-validation error divided by the minimal least-squares error of the linear regression without any regularization terms and averaged over five random networks and five random sets of valid edges. Zero to 25 (i.e., the number of all edges in each random network) prior edges are provided respectively. (A), (B) and (C) are the results of 4, 5 and 6 observations, respectively. The proportional errors decrease when more valid edges are provided. Prior connections appear to be more important when only a few observations are available.
Figure 3. Inference error versus number of random invalid edges (without valid edges) provided. The proportional error (i.e., inference error) denotes the minimal cross-validation error divided by the minimal least-squares error of the linear regression without any regularization terms and averaged over five random networks. The red, blue and green curves represent 4, 6 and 10 observations, respectively. Providing invalid network edges only has little effect on the errors, especially when many observations are available. The errors become smaller as the number of observations increases, which is consistent with the results in Figure 1.
Figure 4. Inference error versus number of random invalid edges. The proportional error (i.e., inference error) denotes the minimal cross-validation error divided by the minimal least-squares error of the linear regression without any regularization terms and averaged over five random networks. The blue, green and red curves represent 0, 5 and 20 random valid edges, respectively. 4 and 2 observations are considered in (A) and (B), respectively. When there are more valid edges (e.g., validâ=â20), the errors are generally smaller as a whole. When only a few observations are available, the valid edges appear to be even more important. The cross-validation errors are in a large scale (i.e., 105) in (B), because they are divided by the least-squares errors, while fewer observations are easier to be over fitted with small least-squares errors (e.g., 10-6â~â10-4).
Figure 5. Inferred network from the linear ODE model. Positive interactions are shown in blue and negative interactions are shown in red. The strength of connections is indicated via the thickness of lines connecting the genes. The interactions vary from â1.2921 to 1.4132 and a threshold 0.25 is applied, i.e., all the edges with the weights smaller than 0.25 are discarded.
Figure 6. Inferred network from the linear Markov model. Positive interactions are shown in blue and negative interactions are shown in red. The strength of connections is indicated via the thickness of lines connecting the genes. The interactions vary from â1.1317 to 2.5607 and a threshold 0.2 is applied, i.e., all the edges with the weights smaller than 0.2 are discarded.
Figure 8. Percentage of correctly predicted patterns in the ODE spatial prediction model. Given a fixed number (from 1 to 26) of pre-defined patterns, the percentages are averaged over 20 sets (Figure (A)) and 100 sets (Figure (B)) of randomly chosen genes with pre-defined patterns. As expected, the percentages of correctly predicted patterns are more stable when they are averaged over more sets of randomly chosen genes. Also, the prediction percentages show an increasing trend as more pre-defined patterns are provided.
Figure 9. Estimated probability density distribution of the spatial patterns prediction. 1000 random networks derived from rearranging the inferred network were used to calculate the fractions of the correctly predicted genes in the 28 genes with prior patterns. (A) 1000 random networks were derived from rearranging the inferred 28-gene ODE network. They are mostly about 12% correct (the p-value is 0.007, i.e., there are only 0.7% of the 1000 random networks obtaining not less prior patterns than the inferred ODE network), while the inferred ODE network is 39.3% correct. (B) 1000 random networks were derived from rearranging the inferred 28-gene Markov network. They are mostly about 14% correct (the p-value is 0.002, i.e., there are 0.2% of the 1000 random networks obtaining not less prior patterns than the inferred Markov network), while the inferred Markov network is 39.3% correct.
Bansal,
How to infer gene networks from expression profiles.
2007, Pubmed
Bansal,
How to infer gene networks from expression profiles.
2007,
Pubmed
Biemar,
Comprehensive identification of Drosophila dorsal-ventral patterning genes using a whole-genome tiling array.
2006,
Pubmed
Bonneau,
Learning biological networks: from modules to dynamics.
2008,
Pubmed
Bonneau,
The Inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo.
2006,
Pubmed
Botman,
Spatial gene expression quantification: a tool for analysis of in situ hybridizations in sea anemone Nematostella vectensis.
2012,
Pubmed
Bowes,
Xenbase: gene expression and improved integration.
2010,
Pubmed
,
Xenbase
Chomczynski,
Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction.
1987,
Pubmed
Christiaen,
The transcription/migration interface in heart precursors of Ciona intestinalis.
2008,
Pubmed
Christley,
Incorporating existing network information into gene network inference.
2009,
Pubmed
Clements,
Changes in embryonic cell fate produced by expression of an endodermal transcription factor, Xsox17.
2000,
Pubmed
,
Xenbase
Crombach,
Efficient reverse-engineering of a developmental gene regulatory network.
2012,
Pubmed
Davidson,
Properties of developmental gene regulatory networks.
2008,
Pubmed
de Jong,
Modeling and simulation of genetic regulatory systems: a literature review.
2002,
Pubmed
De Smet,
Advantages and limitations of current network inference methods.
2010,
Pubmed
D'haeseleer,
Linear modeling of mRNA expression levels during CNS development and injury.
1999,
Pubmed
Erwin,
The evolution of hierarchical gene regulatory networks.
2009,
Pubmed
Faith,
Large-scale mapping and validation of Escherichia coli transcriptional regulation from a compendium of expression profiles.
2007,
Pubmed
Friedman,
Sparse inverse covariance estimation with the graphical lasso.
2008,
Pubmed
Friedman,
Inferring cellular networks using probabilistic graphical models.
2004,
Pubmed
Friedman,
Regularization Paths for Generalized Linear Models via Coordinate Descent.
2010,
Pubmed
Friedman,
Using Bayesian networks to analyze expression data.
2000,
Pubmed
Frise,
Systematic image-driven analysis of the spatial Drosophila embryonic expression landscape.
2010,
Pubmed
Gardner,
Inferring genetic networks and identifying compound mode of action via expression profiling.
2003,
Pubmed
Geiss,
Direct multiplexed measurement of gene expression with color-coded probe pairs.
2008,
Pubmed
Gentsch,
In vivo T-box transcription factor profiling reveals joint regulation of embryonic neuromesodermal bipotency.
2013,
Pubmed
,
Xenbase
Gupta,
A computational framework for gene regulatory network inference that combines multiple methods and datasets.
2011,
Pubmed
Gustafsson,
Constructing and analyzing a large-scale gene-to-gene regulatory network--lasso-constrained inference and biological validation.
2005,
Pubmed
Guthke,
Dynamic network reconstruction from gene expression data applied to immune response during bacterial infection.
2005,
Pubmed
Haibe-Kains,
Predictive networks: a flexible, open source, web application for integration and analysis of human gene networks.
2012,
Pubmed
Hecker,
Integrative modeling of transcriptional regulation in response to antirheumatic therapy.
2009,
Pubmed
Holter,
Dynamic modeling of gene expression data.
2001,
Pubmed
Hurley,
Gene network inference and visualization tools for biologists: application to new human transcriptome datasets.
2012,
Pubmed
Jaeger,
Dynamic control of positional information in the early Drosophila embryo.
2004,
Pubmed
Kerwin,
The HUDSEN Atlas: a three-dimensional (3D) spatial framework for studying gene expression in the developing human brain.
2010,
Pubmed
Khokha,
Techniques and probes for the study of Xenopus tropicalis development.
2002,
Pubmed
,
Xenbase
Kimura,
Inference of S-system models of genetic networks using a cooperative coevolutionary algorithm.
2005,
Pubmed
Koide,
Xenopus as a model system to study transcriptional regulatory networks.
2005,
Pubmed
,
Xenbase
Li,
Large-scale dynamic gene regulatory network inference combining differential equation models with local dynamic Bayesian network analysis.
2011,
Pubmed
Linde,
Regulatory interactions for iron homeostasis in Aspergillus fumigatus inferred by a Systems Biology approach.
2012,
Pubmed
Linde,
Regulatory network modelling of iron acquisition by a fungal pathogen in contact with epithelial cells.
2010,
Pubmed
Loose,
A genetic regulatory network for Xenopus mesendoderm formation.
2004,
Pubmed
,
Xenbase
Maduro,
Endomesoderm specification in Caenorhabditis elegans and other nematodes.
2006,
Pubmed
Marbach,
Wisdom of crowds for robust gene network inference.
2012,
Pubmed
Margolin,
Reverse engineering cellular networks.
2006,
Pubmed
Mjolsness,
A connectionist model of development.
1991,
Pubmed
Pepperkok,
High-throughput fluorescence microscopy for systems biology.
2006,
Pubmed
Perkins,
Reverse engineering the gap gene network of Drosophila melanogaster.
2006,
Pubmed
Perrin,
Gene networks inference using dynamic Bayesian networks.
2003,
Pubmed
Sinner,
Global analysis of the transcriptional network controlling Xenopus endoderm formation.
2006,
Pubmed
,
Xenbase
Spencer,
A spatial and temporal map of C. elegans gene expression.
2011,
Pubmed
To,
Measurement variation determines the gene network topology reconstructed from experimental data: a case study of the yeast cyclin network.
2010,
Pubmed
Vermot,
Fast fluorescence microscopy for imaging the dynamics of embryonic development.
2008,
Pubmed
Vohradský,
Neural network model of gene expression.
2001,
Pubmed
Vu,
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae.
2007,
Pubmed
Yip,
Improved reconstruction of in silico gene regulatory networks by integrating knockout and perturbation data.
2010,
Pubmed