<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Visualization of gene expression and expression as a phenotype with the XPO, XAO and DO using a combination of experimental data sources</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Troy Pells</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Malcolm E. Fisher</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Erik Segerdell</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joshua D. Fortriede</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christina James-Zorn</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kevin A. Burns</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Virgilio Ponferrada</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Praneet Chaturvedi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vaneet Lotay</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mardi Nenni</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stanley Chu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ying Wang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>DongZhuo Wang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kamran Karimi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter D. Vize</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Aaron M. Zorn</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Biological Science, University of Calgary</institution>
          ,
          <addr-line>Calgary, Alberta</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Division of Developmental Biology, Cincinnati Children's Hospital Medical Center</institution>
          ,
          <addr-line>Cincinnati, Ohio</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <fpage>7</fpage>
      <lpage>10</lpage>
      <abstract>
        <p>-Xenbase (www.xenbase.org) is a knowledge base for researchers and biomedical scientists that employ Xenopus (X. laevis and X. tropicalis) as a model organism in gaining a deeper understanding of developmental and disease processes. Through expert curation and automated data provisioning from various sources this MOD (model organism database) strives to integrate the Xenopus body of knowledge together with the visualization of biologically significant interactions. We present a vision for the usage of various ontologies that facilitate the visualization of gene expression from a combination of experimental data sources and the linking thereof back to human disease modeling.</p>
      </abstract>
      <kwd-group>
        <kwd>xenopus phenotype ontology</kwd>
        <kwd>xenopus anatomy ontology</kwd>
        <kwd>model organism database</kwd>
        <kwd>gene expression</kwd>
        <kwd>genomics</kwd>
        <kwd>human disease modeling</kwd>
        <kwd>biocuration</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>If you use Xenbase resources in your research please consider
citing us, for example: Nucleic Acids Res. 2018 46(D1):D861-D868.</p>
      <p>Major funding for Xenbase is provided by the National Institute of Child
Health and Human Development, grant P41 HD064556.</p>
      <p>
        In the development of the XPO a mapping of EQ
statements for Xenopus phenotype to the MP was used to
identify common statements to base the first version of the
XPO on. In keeping with the design patterns and
interoperability defined for phenotype ontologies, by applying
the structure of anatomy and physiology ontologies (Gkoutos
et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]). This structure would persist in the XPO as inherited
from the MP and for new classes and relationships added there
to. Cross species interoperability is accomplished through
applying these rules for the structuring of the phenotype
ontology such that the ontologies for anatomy, phenotype and
disease being employed leverage the cross references to
UBERON (Uber Anatomy Ontology) or GO (Gene Ontology),
and PATO (phenotype and trait ontology). This in turn
integrates well with those species for which these are used
directly or are mapped to pre-composed phenotype ontologies.
      </p>
    </sec>
    <sec id="sec-2">
      <title>II. DATA PROVISIONING</title>
      <p>Prior to the development of a phenotype curation interface
in Xenbase. A Phenote version customized for the use of the
XAO provided the curators with a means to explore phenotype
annotations as applied to Xenopus related articles and images.
This early work provided a wealth of insight into, and working
with, pre/post composed statements. As an alternative to
manual curation, especially for EaP (Expression as a
Phenotype) the GEO (Gene Expression Omnibus) was
identified as a data source for Xenopus RNA-seq and ChIP-seq
raw data. A GEO metadata interface was developed to enable
the curators to work with the GSE/GSM metadata to review the
experiment and control samples together with the capturing of
their background, manipulation and assay information. Once
again utilizing ontologies including BFO, XAO and a subset of
the PubChem small molecules identified in the Xenopus small
molecules ontology (XSMO) as applicable to the experiments
being curated. Through the NCBI eUtilitize the MINiML xml
for the GEO metadata was sourced for Xenopus related
experiments (approx. 220). The SRA run tables provided the
link between the GSE, GSM and the raw data files. The
bioinformatics pipeline for processing the RNA-seq or
ChIPseq data was constructed in order to map to the current xenopus
genome and to generate TPM read counts and DE (Differential
Expression) reads based on the GEO metadata that had been
reviewed and annotated by the curators.</p>
      <p>The GEO loader is a java application developed to
consume the GEO metadata as well as provide the staging of
the TPM read counts and DE data. The population of a Star
schema table structure in Xenbase is slated for the storage of
this data in a Fact and Dimension table structure for subsequent
consumption by the gene expression visualization that is being
contemplated. The EaP statements derived from the DE reads
would be stored in tables linked to the GSE and in turn the
article and relevant image for the experiment.</p>
    </sec>
    <sec id="sec-3">
      <title>III. GENE EXPRESSION VISUALIZATION</title>
      <p>
        The Insitu gene expression data that exists in Xenbase
provided the basis upon which to develop a modified heat map
to identify developmental stage and tissue expression of a gene
on a gene page. The TPM read counts (Session et al. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ])
supported the identification of gene expression in select
developmental stages and adult tissue stages. For the
embryonic tissue stages (Briggs et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]) single cell
experiments provided UMI read counts suitable for identifying
gene expression for these tissues. By bringing together the
gene expression data and combining them in a heat map
visualization, meaningful identification of expression of gene’s
at various stages and tissues can be accomplished.
      </p>
    </sec>
    <sec id="sec-4">
      <title>IV. CONCLUSION</title>
      <p>Bridging the pre/post composition phenotype statements
represents one of the issues that remains to be fleshed out. The
application of a graph path to map one to the other and
determine the accuracy, remains to be explored. The handling
of copyright images (embargo) related to phenotype statements
and the timing of the curation effort is another process flow
topic that is under reviewed.</p>
      <p>As the development of the XPO progresses and additional
disease (DO) relationships are constructed, the benefits derived
from the combined availability of expression as a phenotype,
anatomical phenotype and gene ontology phenotype to
contribute towards human disease modeling is anticipated.
Additional data integration with Monarch via their dipper
pipeline is also contemplated.</p>
    </sec>
    <sec id="sec-5">
      <title>ACKNOWLEDGMENT Major funding for Xenbase is provided by the National Institute of Child Health and Human Development, grant P41 HD064556</title>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Gkoutos</surname>
          </string-name>
          et al. “
          <article-title>The anatomy of phenotype ontologies: principles, properties and applications” Brief Bioinform. 2017 Apr 6</article-title>
          . doi:
          <volume>10</volume>
          .1093/bib/bbx035.https://academic.oup.com/bib/advancearticle/doi/10.1093/bib/bbx035/3108819
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Session</surname>
          </string-name>
          et al. “
          <article-title>Genome evolution in the allotetraploid frog Xenopus laevis</article-title>
          .”
          <source>Nature 2016 Oct</source>
          <volume>20</volume>
          ;
          <volume>538</volume>
          (
          <issue>7625</issue>
          ):
          <fpage>336</fpage>
          -
          <lpage>343</lpage>
          . doi:
          <volume>10</volume>
          .1038/nature19840 https://www.ncbi.nlm.nih.gov/pubmed/27762356
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Briggs</surname>
          </string-name>
          , Peshkin et al. “
          <article-title>The dynamics of gene expression in vertebrate embryogenesis at single-cell resolution</article-title>
          .”
          <source>Science 2018 Jun</source>
          <volume>1</volume>
          ;
          <issue>360</issue>
          (
          <issue>6392</issue>
          ).
          <source>pii: eaar5780. doi: 10</source>
          .1126/science.aar5780. https://www.ncbi.nlm.nih.gov/pubmed/29700227
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Karimi</surname>
          </string-name>
          et al. “
          <article-title>Xenbase: a genomic, epigenomic and transcriptomic model organism database”</article-title>
          ,
          <source>Nucleic Acids Res</source>
          .
          <year>2018</year>
          46(
          <issue>D1</issue>
          ):
          <fpage>D861</fpage>
          -
          <lpage>D868</lpage>
          . https://www.ncbi.nlm.nih.gov/pubmed/29059324
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>James-Zorn</surname>
          </string-name>
          et al. “
          <article-title>Navigating Xenbase: An Integrated Xenopus Genomics and Gene Expression Database”</article-title>
          ,
          <source>Eukaryotic Genomic Databases: Methods and Protocols</source>
          ,
          <year>2018</year>
          , Volume
          <volume>1757</volume>
          , Chapter 10, pp.
          <fpage>251</fpage>
          -
          <lpage>305</lpage>
          . https://www.ncbi.nlm.nih.gov/pubmed/29761462
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>