XB-ART-50058
PLoS Biol
2015 Jan 06;131:e1002033. doi: 10.1371/journal.pbio.1002033.
Show Gene links
Show Anatomy links
Finding Our Way through Phenotypes.
Deans AR
,
Lewis SE
,
Huala E
,
Anzaldo SS
,
Ashburner M
,
Balhoff JP
,
Blackburn DC
,
Blake JA
,
Burleigh JG
,
Chanet B
,
Cooper LD
,
Courtot M
,
Csösz S
,
Cui H
,
Dahdul W
,
Das S
,
Dececchi TA
,
Dettai A
,
Diogo R
,
Druzinsky RE
,
Dumontier M
,
Franz NM
,
Friedrich F
,
Gkoutos GV
,
Haendel M
,
Harmon LJ
,
Hayamizu TF
,
He Y
,
Hines HM
,
Ibrahim N
,
Jackson LM
,
Jaiswal P
,
James-Zorn C
,
Köhler S
,
Lecointre G
,
Lapp H
,
Lawrence CJ
,
Le Novère N
,
Lundberg JG
,
Macklin J
,
Mast AR
,
Midford PE
,
Mikó I
,
Mungall CJ
,
Oellrich A
,
Osumi-Sutherland D
,
Parkinson H
,
Ramírez MJ
,
Richter S
,
Robinson PN
,
Ruttenberg A
,
Schulz KS
,
Segerdell E
,
Seltmann KC
,
Sharkey MJ
,
Smith AD
,
Smith B
,
Specht CD
,
Squires RB
,
Thacker RW
,
Thessen A
,
Fernandez-Triana J
,
Vihinen M
,
Vize PD
,
Vogt L
,
Wall CE
,
Walls RL
,
Westerfeld M
,
Wharton RA
,
Wirkner CS
,
Woolley JB
,
Yoder MJ
,
Zorn AM
,
Mabee P
.
???displayArticle.abstract???
Despite a large and multifaceted effort to understand the vast landscape of phenotypic data, their current form inhibits productive data analysis. The lack of a community-wide, consensus-based, human- and machine-interpretable language for describing phenotypes and their genomic and environmental contexts is perhaps the most pressing scientific bottleneck to integration across many key fields in biology, including genomics, systems biology, development, medicine, evolution, ecology, and systematics. Here we survey the current phenomics landscape, including data resources and handling, and the progress that has been made to accurately capture relevant data descriptions for phenotypes. We present an example of the kind of integration across domains that computable phenotypes would enable, and we call upon the broader biology community, publishers, and relevant funding agencies to support efforts to surmount today's data barriers and facilitate analytical reproducibility.
???displayArticle.pubmedLink??? 25562316
???displayArticle.pmcLink??? PMC4285398
???displayArticle.link??? PLoS Biol
???displayArticle.grants??? [+]
P41 HD064556 NICHD NIH HHS
Species referenced: Xenopus
Genes referenced: actl6a ids myb plxnd1
???attribute.lit??? ???displayArticles.show???
Figure 2. Phenotypes shared across biology.Phenotype data are relevant to many different domains, but they are currently isolated in data âsilos.â Research from a broad array of seemingly disconnected domains, as outlined here, can be dramatically accelerated with a computable data store. (A) Domains: Diverse fields such as evolutionary biology, human disease and medicine, and climate change relate to phenotypes. (B) Phenotypes: insects, vertebrates, plants, and even forests all have features that are branched in some way, but they are described using different terms. For a computer to discover this, the phenotypes must be annotated with unique identifiers from ontologies that are logically linked. Under âshapeâ in the PATO quality ontology [106], âbranchinessâ is an encompassing parent term with subtypes âbranchedâ and âincreased branchiness.â From left to right, top layer, insects, vertebrates and plants have species that demonstrate phenotypes for which the genetic basis is not known. Often their companion model species, however, have experimental genetic work that is relevant to proposing candidate genes and gene networks. Insects (1): An evolutionary novelty in bees (top layer) is the presence of branched setae used for pollen collection. Nothing is known about the genetic basis of this feature. One clue to the origin of this evolutionary feature comes from studies of Drosophila (bottom layer), where Mical overexpression in unbranched wild-type bristles generates a branched morphology [119]. Mical directly links semaphorins and their plexin receptors to the precise control of actin filament dynamics [119]. Vertebrates (2): In humans, aberrant angiogenesis, including excessive blood vessel branching (top layer), is one of the six central hallmarks of cancer [121]. Candidate genes have been identified using data from model organisms. In zebrafish (middle layer), studies of the control of sprouting in blood vessel development show that signaling via semaphorins [122] and their plexin receptors is required for proper abundance and distribution [123]; disruption of plxnd1 results in increased branching [120],[124],[125]. In mouse (bottom layer), branching of salivary glands is dependent on semaphorin signaling [126], as is the branching of various other epithelial organs [127]. Plants (3): The uppermost canopy of trees of the rainforest (top layer) undergo a marked increase in branching associated with climate change [128]. Nothing is known about the genetic basis of this feature. The branching of plant trichomes (bottom layer), tiny outgrowths with a variety of functions including seed dispersal, has been studied in the model Arabidopsis thaliana. Branching occurs in association with many MYB-domain genes [129], transcription factors that are found in both plants and animals [130]. (C) Environment: Diverse input from the environment influences organismal phenotype. (D) Genes: At the genetic level, previously unknown associations with various types of âbranchinessâ between insects and vertebrates are here made to possibly a common core or network of genes (the semaphorin-plexin signaling network). No association between genes associated with plant branching (Myb transcription factors) and animal branching is obvious from the literature. Image credit: Anya Broverman-Wray. | |
Figure 1. How to discover branching phenotypes?(Bottom panel) Phenotype data exhibiting various forms of branchiness are not easily discerned from diverse natural language descriptions. (A) Bee hairs are different from most other insect hairs in that they are plumose, which facilitates pollen collection. (B) A mutant of Drosophila melanogaster exhibits forked bristles, due to a variation in mical. (C) In zebrafish larvae (Danio rerio), angiogenesis begins with vessels branching. (D) Plant trichomes take on many forms, including trifurcation. (Top) Phenotypes involving some type of âbranchedâ are easily recovered when they are represented with ontologies. In a semantic graph, free text descriptions are converted into phenotype statements involving an anatomy term from animal or plant ontologies [56],[118] and a quality term from a quality ontology [106], connected by a logical expression (âinheres_in someâ). Anatomy (purple) and quality (green) terms (ontology IDs beneath) relate phenotype statements from different species by virtue of the logic inherent in the ontologies, e.g., plumose, bifurcated, branched, and tripartite are all subtypes of âbranched.â Image credits: bumble bee with pollen by Thomas Bresson, seta with pollen by István Mikó, Arabidopsis plants with hair-like structures (trichomes) by Annkatrin Rose, Drosophila photo by John Tann, Drosophila bristles redrawn from [119], scanning electron micrograph of Arabidopsis trichome by István Mikó, zebrafish embryos by MichianaSTEM, zebrafish blood vessels from [120]. Figure assembled by Anya Broverman-Wray. |
References [+] :
Aboelkassem,
Selective pumping in a network: insect-style microscale flow transport.
2013, Pubmed
Aboelkassem, Selective pumping in a network: insect-style microscale flow transport. 2013, Pubmed
Alvarez, Selective inhibition of retinal angiogenesis by targeting PI3 kinase. 2009, Pubmed
Arighi, An overview of the BioCreative 2012 Workshop Track III: interactive text mining task. 2013, Pubmed
Balhoff, Phenex: ontological annotation of phenotypic diversity. 2010, Pubmed
Balhoff, A semantic model for species description applied to the ensign wasps (hymenoptera: evaniidae) of New Caledonia. 2013, Pubmed
Berquist, The Digital Fish Library: using MRI to digitize, database, and document the morphological diversity of fish. 2012, Pubmed
Brinkman, Modeling biomedical experimental processes with OBI. 2010, Pubmed
Burleigh, Next-generation phenomics for the Tree of Life. 2013, Pubmed
Buttigieg, The environment ontology: contextualising biological and biomedical entities. 2013, Pubmed
Chen, MouseFinder: Candidate disease genes from mouse phenotype data. 2012, Pubmed
Chung, Semaphorin signaling facilitates cleft formation in the developing salivary gland. 2007, Pubmed
Cook, Genetic architecture of maize kernel composition in the nested association mapping and inbred association panels. 2012, Pubmed
Cooper, The plant ontology as a tool for comparative plant anatomy and genomic analyses. 2013, Pubmed
Dahdul, Evolutionary characters, phenotypes and ontologies: curating data from the systematic biology literature. 2010, Pubmed
Deans, Time to change how we describe biodiversity. 2012, Pubmed
Dugan, Standardized metadata for human pathogen/vector genomic sequences. 2014, Pubmed
Gkoutos, Entity/quality-based logical definitions for the human skeletal phenome using PATO. 2009, Pubmed
Gkoutos, Using ontologies to describe mouse phenotypes. 2005, Pubmed
Gu, The role of semaphorins and their receptors in vascular development and cancer. 2013, Pubmed
Haendel, Unification of multi-species vertebrate anatomy ontologies for comparative biology in Uberon. 2014, Pubmed
Hamilton, The PhenX Toolkit: get the most from your measures. 2011, Pubmed
Hanahan, Hallmarks of cancer: the next generation. 2011, Pubmed
Hansen, High-resolution global maps of 21st-century forest cover change. 2013, Pubmed
Hiller, A "forward genomics" approach links genotype to phenotype using independent phenotypic losses among related species. 2012, Pubmed
Hoehndorf, PhenomeNET: a whole-phenome approach to disease gene discovery. 2011, Pubmed
Houle, Phenomics: the next challenge. 2010, Pubmed
Houle, Colloquium papers: Numbering the hairs on our heads: the shared challenge and promise of phenomics. 2010, Pubmed
Huang, Genome-wide association studies of 14 agronomic traits in rice landraces. 2010, Pubmed
Hung, Mical links semaphorins to F-actin disassembly. 2010, Pubmed
Köhler, The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data. 2014, Pubmed
Köhler, Construction and accessibility of a cross-species phenotype ontology along with gene annotations for biomedical research. 2013, Pubmed
Köhler, Construction and accessibility of a cross-species phenotype ontology along with gene annotations for biomedical research. 2013, Pubmed
Korostylev, A functional role for semaphorin 4D/plexin B1 interactions in epithelial branching morphogenesis during organogenesis. 2008, Pubmed
Landrum, ClinVar: public archive of relationships among sequence variation and human phenotype. 2014, Pubmed
Lloyd, A comprehensive dataset of genes with a loss-of-function mutant phenotype in Arabidopsis. 2012, Pubmed
Ma, Controlled flight of a biologically inspired, insect-scale robot. 2013, Pubmed
Mabee, 500,000 fish phenotypes: The new informatics landscape for evolutionary and developmental biology of the vertebrate skeleton. 2012, Pubmed , Xenbase
Markov, The "street light syndrome", or how protein taxonomy can bias experimental manipulations. 2008, Pubmed
Mattingly, Providing the missing link: the exposure science ontology ExO. 2012, Pubmed
McGary, Systematic discovery of nonobvious human disease models through orthologous phenotypes. 2010, Pubmed , Xenbase
Mikó, Folding wings like a cockroach: a review of transverse wing folding ensign wasps (Hymenoptera: Evaniidae: Afrevania and Trissevania). 2014, Pubmed
Mungall, Uberon, an integrative multi-species anatomy ontology. 2012, Pubmed , Xenbase
National Research Council (US) Committee on Research at the Intersection of the Physical and Life Sciences, 2010, Pubmed
Ni, Gramene QTL database: development, content and applications. 2009, Pubmed
Oellrich, Improving disease gene prioritization by comparing the semantic similarity of phenotypes in mice with those of human diseases. 2012, Pubmed
O'Leary, MorphoBank: phylophenomics in the "cloud". 2011, Pubmed
Parkinson, ArrayExpress update--from an archive of functional genomics experiments to the atlas of gene expression. 2009, Pubmed
Piwowar, Data archiving is a good investment. 2011, Pubmed
Pogodin, Biophysical model of bacterial cell interactions with nanopatterned cicada wing surfaces. 2013, Pubmed
Ramírez, Calculating structural complexity in phylogenies using ancestral ontologies. 2014, Pubmed
Ramírez, Linking of digital images to phylogenetic data matrices using a morphological ontology. 2007, Pubmed
Robinson, Improved exome prioritization of disease genes through cross-species phenotype comparison. 2014, Pubmed
Robinson, Phenotype ontologies and cross-species analysis for translational research. 2014, Pubmed
Robinson, The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease. 2008, Pubmed
Robinson, The human phenotype ontology. 2010, Pubmed
Rosinski, Molecular evolution of the Myb family of transcription factors: evidence for polyphyletic origin. 1998, Pubmed
Rowan, Developmental genetics and new sequencing technologies: the rise of nonmodel organisms. 2011, Pubmed
Ruttenberg, Life sciences on the Semantic Web: the Neurocommons and beyond. 2009, Pubmed
Ruttenberg, Advancing translational research with the Semantic Web. 2007, Pubmed
Salmon, The long lifespan of two bat species is correlated with resistance to protein oxidation and enhanced protein homeostasis. 2009, Pubmed
Sansone, Toward interoperable bioscience data. 2012, Pubmed
Schnable, Genes identified by visible mutant phenotypes show increased bias toward one of two subgenomes of maize. 2011, Pubmed
Serna, Trichomes: different regulatory networks lead to convergent structures. 2006, Pubmed
Smedley, PhenoDigm: analyzing curated annotations to associate animal models with human diseases. 2013, Pubmed
Smith, The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. 2007, Pubmed , Xenbase
Smith, The mammalian phenotype ontology: enabling robust annotation and comparative analysis. 2009, Pubmed
Thessen, Knowledge extraction and semantic annotation of text from the encyclopedia of life. 2014, Pubmed
Torres-Vázquez, Semaphorin-plexin signaling guides patterning of the developing vasculature. 2004, Pubmed
Trelease, Anatomical reasoning in the informatics age: Principles, ontologies, and agendas. 2006, Pubmed
Tryka, NCBI's Database of Genotypes and Phenotypes: dbGaP. 2014, Pubmed
Vogt, The need for data standards in zoomorphology. 2013, Pubmed
Wall, Overview of FEED, the feeding experiments end-user database. 2011, Pubmed
Washington, Linking human diseases to animal models using ontology-based phenotype annotation. 2009, Pubmed
Wirkner, Evolutionary morphology of the circulatory system in Peracarida (Malacostraca; Crustacea). 2010, Pubmed
Yazdani, The semaphorins. 2006, Pubmed
Yoder, A gross anatomy ontology for hymenoptera. 2010, Pubmed
Youens-Clark, Gramene database in 2010: updates and extensions. 2011, Pubmed
Zamir, Where have all the crop phenotypes gone? 2013, Pubmed
Zemojtel, Effective diagnosis of genetic disease by computational phenotype analysis of the disease-associated genome. 2014, Pubmed
Zygmunt, Semaphorin-PlexinD1 signaling limits angiogenic potential via the VEGF decoy receptor sFlt1. 2011, Pubmed