<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Knowledge Representation of Protein PTMs and Complexes in the Protein Ontology: Application to Multi-Faceted Disease Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Karen E. Ross</string-name>
          <email>ross@dbi.udel.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Catalina O. Tudor</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gang Li</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ruoyao Ding</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Irem Celen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Julie Cowart</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cecilia N. Arighi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Darren A. Natale</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Cathy H. Wu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Bioinformatics and Computational Biology, University of Delaware</institution>
          ,
          <addr-line>Newark, DE</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Protein Information Resource, Georgetown University Medical Center</institution>
          ,
          <addr-line>Washington, DC</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2014</year>
      </pub-date>
      <fpage>43</fpage>
      <lpage>46</lpage>
      <abstract>
        <p>-Alterations in protein post-translational modification (PTM) and PTM cross-talk are increasingly being appreciated as driving mechanisms of human disease. The Protein Ontology (PRO) is a valuable resource to study the relation between PTM and disease because it represents individual proteins and protein complex subunits at the proteoform (e.g., isoform, PTM form, and sequence variant) level, with links to their functional properties. We constructed a multi-relation network that represents knowledge obtained from large scale text-mining for phosphorylation-dependent proteinprotein interactions (PPIs) and their disease associations, built on the PRO framework for representation of PTM forms, complexes, and protein families, as well as their attributes and relationships. We then conducted two case studies that demonstrate the use of PRO in disease analysis. (i) We performed cross-species comparisons of two glioma-associated phosphorylated proteoforms of the human DNMT1 methylase, which revealed that the forms are not strictly conserved in mouse, a frequently used glioma model system. (ii) We used PRO-defined proteoforms of the oncoprotein beta-catenin phosphorylated on various combinations of the N-terminal sites, Ser-33, Ser-37, Thr-41, and Ser-45, to interpret a hierarchical clustering analysis of cancer types based on their pattern of mutations in these sites. The cancers formed two major clusters: one with mutations in Ser-33/Ser-37/Thr-41 and the other with mutations in Thr-41/Ser-45. Proteoform-specific annotation in PRO suggests that stabilization of beta-catenin may play a role in oncogenesis in the first group, whereas alterations in beta-catenin transcriptional or cell adhesion activity may play a more important role in the second group. Together, these scenarios illustrate the general applicability of PRO to disease understanding. Future plans include the integration of PRO with other semantic resources to increase our ability to address these problems with computational reasoning.</p>
      </abstract>
      <kwd-group>
        <kwd>Protein Ontology</kwd>
        <kwd>phosphorylation</kwd>
        <kwd>text mining</kwd>
        <kwd>beta-catenin</kwd>
        <kwd>cancer</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>I. INTRODUCTION</title>
    </sec>
    <sec id="sec-2">
      <title>Aberrations in protein post-translational modification (PTM), resulting from genetic variations that affect individual PTM sites as well as alterations in PTM enzyme activity that</title>
      <p>
        have global effects on the balance of PTM forms in the cell,
have been implicated in many diseases [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. Protein
phosphorylation, in particular, has been recognized as a central
disease-driving mechanism, leading to the development of
kinase inhibitors as therapeutic agents [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. With state-of-the-art
text mining tools, it is now possible to extract detailed
information about proteins, PTMs, and diseases from the
literature on a large scale. Tools such as RLIMS-P [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and eFIP
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] have captured a wealth of information on phosphorylated
protein forms, their modifying enzymes, and the functional
impact of phosphorylation, with a minimum of manual curator
effort.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Despite these advances, representing this information in a</title>
      <p>form that is useful for both human interpretation and
computational reasoning is challenging. It is important not only
to link genetic variant information with its effects on protein
sequence and function but also to capture the impact of
imbalances of particular PTM forms and complexes on disease.</p>
    </sec>
    <sec id="sec-4">
      <title>Used in conjunction with other bioinformatic resources, the</title>
      <p>
        Protein Ontology (http://pir.georgetown.edu/pro/pro.shtml;
PRO [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]) enables the structured representation and
interpretation of this information. PRO, a member of the Open
Biomedical Ontologies (OBO) foundry, represents proteoforms
(e.g., isoforms, PTM forms, and sequence variants) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and
protein complexes and their relationships within and across
species. Once a proteoform or complex is defined in PRO it
can be annotated with functional and/or disease information
derived from the scientific literature or bioinformatic
databases. This framework can then support the analysis of
biological processes in health and disease.
      </p>
      <p>
        Mouse models have been critical for understanding human
diseases. These models rely on the high degree of conservation
of proteins and pathways between human and mouse. Although
protein phosphorylation sites in human are also highly
conserved in mouse (92% conserved in one study [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]),
conservation at the proteoform level, which can have
significant functional consequences in a disease model has not
been assessed. With its emphasis on representation of
proteoforms and cross-species relationships, PRO is a valuable
resource for this type of analysis.
      </p>
    </sec>
    <sec id="sec-5">
      <title>In this article, we leverage PRO to: (i) create a</title>
      <p>phosphorylation network based on large-scale text mining of
phosphorylation-dependent PPIs that impact disease; (ii)
facilitate cross-species comparison of proteoforms of the
DNMT1 methylase that are associated with glioblastoma in
humans; and (iii) interpret patterns of mutations in the
betacatenin oncoprotein observed in different cancer types. These
examples illustrate the variety of ways in which PRO can be
used to represent disease knowledge and gain insight into
disease mechanisms.</p>
    </sec>
    <sec id="sec-6">
      <title>II. METHODS</title>
    </sec>
    <sec id="sec-7">
      <title>To identify phosphorylation-dependent PPIs described in</title>
      <p>
        literature, all PubMed abstracts and PubMedCentral (PMC)
Open Access full-length articles were processed using eFIP,
which identifies mentions of phosphorylation-dependent PPIs.
The kinases, phosphorylated proteins (substrates), and
interacting partners (interactants) were mapped to UniProtKB
identifiers, when possible, using GeneNorm [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Disease
mentions in article titles or abstracts were computationally
detected by matching to a custom dictionary of disease terms
based on MeSH and MedlinePlus. PRO terms for proteoforms
and complexes were created as described in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Term
annotation such as binding partners and disease association is
stored in the PRO annotation file (PAF), and PTM enzyme
information is recorded in the comments section of the OBO
stanza using structured vocabulary. All terms and annotations
are available upon request and will be made public on the PRO
website and in downloadable files as part of PRO release 43
(September 2014). The network was constructed using
Cytoscape 3.0 [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Sequence alignment was performed using
Clustal Omega [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and visualized with Jalview 2.8.1 [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
Cancer-associated mutations in beta-catenin were obtained
from Catalog of Somatic Mutations in Cancer (COSMIC) [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
Tumor tissue types that had at least 10 mutations in the
betacatenin N-terminal phosphorylation sites Ser-33, Ser-37,
Thr41, and Ser-45 were used. For each tissue the proportion of
mutations at each site was calculated relative to the total
number of mutations at all four sites. The heatmap was
constructed using the heatmap.2 function of R (version 3.0.2;
http://www.r-project.org/) with default parameters.
      </p>
    </sec>
    <sec id="sec-8">
      <title>III. RESULTS AND DISCUSSION</title>
      <p>A. Network Representation of Disease-Associated
Phoshorylation-Dependent Protein-Protein Interactions
Text-mining of over 23 million PubMed abstracts and
800,000 PMC open access full-length articles using eFIP
identified sentences describing PPIs that were dependent on the
phosphorylation state of one of the interactants in over 13,000
articles. About 500 articles also had phosphorylation site
information, UniProtKB-mapped substrates and interactants,
and mention of disease in the title or abstract. Through manual
curation of the 109 most recent articles, we found 52
diseaseassociated phospho-dependent PPIs in 39 articles. (In the
remaining articles the disease mention was not causally related
to the PPI.) A multi-relation network based on some of these</p>
      <p>This work has been supported by the National Science Foundation
[ABI1062520] and the National Institutes of Health [5R01GM080646-08].
results, including phospho-dependent PPIs, PTM enzyme-PTM
form relationships, proteoform/complex-disease relationships,
and relations among PRO terms, is shown in Fig. 1.</p>
      <p>
        This network illustrates several ways in which the PRO
framework allows curators to precisely represent complex
biological information. First, because PRO treats each
proteoform as a separate entity, multiple forms of a protein can
be defined and individually annotated. For example, the
Tyr357-phosphorylated form (PR:000037508) and the Ser-127
phosphorylated form (PR:000037510) of YAP1 differ in their
ability to bind 14-3-3 proteins (PR:000003237) and the
apoptosis regulatory protein p73 (PR:O15350) and exhibit
different associations with cancer and Alzheimer’s Disease.
PRO can also represent forms with multiple types of
modification, facilitating the description of PTM cross-talk
(e.g., CCND1 Thr-286-phosphorylated and ubiquitinated form
(PR:000037512)). Second, PRO represents protein complexes
as distinct entities to which complex-specific annotation can be
attached. Moreover, complex subunits are defined using PRO
terms, enabling the specification of which particular
proteoforms (e.g., phosphorylated forms) are part of the
complex. For example, the proteoform of the DNMT1 DNA
methlyase that lacks phosphorylation on Ser-127 and Ser-143
(PR:000037504) forms a complex with the DNA-associated
factors PCNA (PR:P12004) and UHRF1 (PR:Q96T88). This
complex (PR:000037517) has been associated with tumor
suppression [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Third, protein terms in PRO are defined at
multiple levels of granularity from the family level down to the
isoform and/or modification level. Thus, when describing a
biological relationship involving a protein, the term that is
most appropriate given the current state of knowledge can be
used. For example, because 14-3-3 proteins are encoded by
several genes, and the protein products of these genes are not
always distinguishable in experimental assays, 14-3-3 proteins
are represented by the class PR:000003237 that encompasses
the protein products of all 14-3-3 family genes. Similarly,
when the protein is known to be the product of a particular
gene, but no isoform information is available, a gene-level
PRO term that encompasses all protein products of a gene is
used (e.g., TP73 (PR:O15350)). Integration of PRO terms into
a multi-relation network context further allows identification of
proteoforms sharing common PTM enzymes (e.g., AKT
(PR:000029189)) or interacting partners (e.g., 14-3-3
(PR:000003237)) or implicated in the same diseases.
B. Cross-Species Comparison of Proteoforms of the
Glioma
      </p>
      <p>
        Associated DNA Methylase, DNMT1
DNMT1 phosphorylation on Ser-127 or Ser-127/Ser-143 and
the concomitant reduction in binding to UHRF1 and PCNA
(Fig. 1) has been associated with glioma in humans [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
Because the mouse is often used as a model system for
studying glioma, we investigated whether the
gliomaassociated DNMT1 proteoforms are conserved in mouse as
well as in several other mammals (Fig. 2). PRO representation
of proteoforms enables the cross-species comparison of PTM
at the PTM-form level, which is more likely to reflect
functional conservation than comparisons of the individual
sites alone. While human Ser-143 is conserved across all
species, Ser-127 is found only in other primates (red/pink
residues). Thus, neither the Ser-127 phosphorylated form nor
the Ser-127/Ser-143 phosphorylated form of DNMT1 (Fig. 1)
is strictly conserved in mouse, rat, dog, or cow, suggesting that
non-primates might use a different mechanism for regulating
DNMT1 interaction with PCNA and UHRF1. However,
mouse, rat, and dog (but not cow) have a serine at the adjacent
position (blue/light blue residues), which could potentially
fulfill the same role as human Ser-127. This serine has been
shown to be phosphorylated in mouse in a high-throughput
phospho-proteomic study [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Further studies are needed to
clarify the role of DNMT1 phosphorylation in a mouse glioma
model system.
      </p>
      <p>C. Analysis of Cancer-Associated Mutations in beta-catenin
Phosphorylation Sites</p>
    </sec>
    <sec id="sec-9">
      <title>Beta-catenin is a multi-functional protein involved in cell</title>
      <p>
        cell adhesion and transcriptional regulation [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. Several of its
key transcriptional targets drive cell proliferation, and
excessive beta-catenin transcriptional activity is oncogenic.
Beta-catenin stability is regulated by phosphorylation of four
residues in the N-terminus. Casein kinase I phosphorylates
Ser45 of beta-catenin, which promotes the sequential
phosphorylation of Thr-41, Ser-37, and Ser-33 by GSK3-beta.
Phosphorylation at Ser-37 and Ser-33 enables recognition of
beta-catenin by the ubiquitin ligase beta-TrCP, which targets it
for degradation by the proteosome. Mutations in the four
phosphorylation sites stabilize the protein and have been
associated with cancer.
      </p>
      <p>Through text mining with RLIMS-P and eFIP, we defined
four beta-catenin proteoforms phosphorylated at different
combinations of the N-terminal sites, with distinct sub-cellular
localizations, binding partners, and activities (Fig. 3A). To gain
insight into the role of these proteoforms in cancer, we used
data from COSMIC on cancer-associated mutations in these
sites to perform hierarchical clustering of different cancer types
(Fig. 3B). The cancers fall into two major clusters with
different mutation patterns. Cluster 1 cancers (pink box) are
characterized by mutations at Ser-33 and Ser-37, with few
mutations at Ser-45. Conversely, Cluster 2 cancers (blue box)
are predominantly mutated at Ser-45. Both clusters show
intermediate levels of Thr-41 mutations. The Ser-33/Ser-37
mutation the pattern in Cluster 1 suggests that oncogenesis in
these cancer types is related to the lack of proteoforms 1 and 2.</p>
      <p>
        Both of these forms are unstable due to their association with
the ubiquitin ligase beta-TrCP. Thus, beta-catenin stabilization
may be playing an important role in these cancers. Kinases,
such as HIPK2, that can phosphorylate Ser-33 and Ser-37
without prior phosphorylation of Ser-45 may be regulating
beta-catenin stability in these tissues [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Cluster 2 cancers
have relatively few mutations in the residues that bind
betaTrCP; instead, these cancers are associated with lack of Ser-45
phosphorylated proteoform 3. Unlike other beta-catenin
proteoforms, proteoform 3 is found in the nucleus and may be
a key transcriptionally active form of beta-catenin [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]; this
proteoform can also bind to the adhesion molecule E-cadherin
[
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. Thus, alterations in beta-catenin transcriptional and cell
adhesion activity independent of beta-catenin levels may
contribute to Cluster 2 cancers. This example highlights the
value of integrating experimental disease information from
bioinformatic resources such as COSMIC with PRO
representation of proteoforms to gain new insight into disease.
      </p>
    </sec>
    <sec id="sec-10">
      <title>IV. CONCLUSIONS AND FUTURE WORK</title>
    </sec>
    <sec id="sec-11">
      <title>Through the structured representation of proteoforms and</title>
      <p>complexes PRO facilitates: (i) representation of
proteoformdisease relations identified by large-scale text mining; (ii)
cross-species comparisons at the proteoform level for
evaluation of the relevance of animal models of disease; and
(iii) interpretation of disease-associated mutation patterns.
Currently, the PRO terms curated in this project can be viewed
on the PRO website by biologists interested in PTM, PPI, and
disease relationships. We are working toward formalizing these
relationships and disseminating them in standard semantic web
format (e.g. RDF/XML) to enable computational reasoning and
hypothesis generation.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>L.M.</given-names>
            <surname>Graves</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.S.</given-names>
            <surname>Duncan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.C.</given-names>
            <surname>Whittle</surname>
          </string-name>
          , and
          <string-name>
            <given-names>G.L.</given-names>
            <surname>Johnson</surname>
          </string-name>
          ,
          <article-title>"The dynamic nature of the kinome,"</article-title>
          <source>Biochem J</source>
          , vol.
          <volume>450</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>B.M.</given-names>
            <surname>Kessler</surname>
          </string-name>
          ,
          <article-title>"Ubiquitin - omics reveals novel networks and associations with human disease,"</article-title>
          <source>Curr Opin Chem Biol</source>
          , vol.
          <volume>17</volume>
          , pp.
          <fpage>59</fpage>
          -
          <lpage>65</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>X.</given-names>
            <surname>Yuan</surname>
          </string-name>
          , et al.,
          <article-title>"An online literature mining tool for protein phosphorylation,"</article-title>
          <source>Bioinformatics</source>
          , vol.
          <volume>22</volume>
          , pp.
          <fpage>1668</fpage>
          -
          <lpage>1669</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.O.</given-names>
            <surname>Tudor</surname>
          </string-name>
          , et al.,
          <article-title>"The eFIP system for text mining of protein interaction networks of phosphorylated proteins,"</article-title>
          <source>Database (Oxford)</source>
          , vol.
          <year>2012</year>
          , pp.
          <fpage>bas044</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>D.A.</given-names>
            <surname>Natale</surname>
          </string-name>
          , et al.,
          <article-title>"Protein Ontology: a controlled structured network of protein entities,"</article-title>
          <source>Nucleic Acids Res</source>
          , vol.
          <volume>42</volume>
          , pp.
          <fpage>D415</fpage>
          -
          <lpage>421</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>L.M.</given-names>
            <surname>Smith</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.L.</given-names>
            <surname>Kelleher</surname>
          </string-name>
          , and
          <string-name>
            <surname>P.</surname>
          </string-name>
          <article-title>Consortium for Top Down, "Proteoform: a single term describing protein complexity,"</article-title>
          <source>Nat Methods</source>
          , vol.
          <volume>10</volume>
          , pp.
          <fpage>186</fpage>
          -
          <lpage>187</lpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Malik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.A.</given-names>
            <surname>Nigg</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Korner</surname>
          </string-name>
          ,
          <article-title>"Comparative conservation analysis of the human mitotic phosphoproteome,"</article-title>
          <source>Bioinformatics</source>
          , vol.
          <volume>24</volume>
          , pp.
          <fpage>1426</fpage>
          -
          <lpage>1432</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>C.H.</given-names>
            <surname>Wei</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.Y.</given-names>
            <surname>Kao</surname>
          </string-name>
          ,
          <article-title>"Cross-species gene normalization by species inference,"</article-title>
          <source>BMC Bioinformatics</source>
          , vol.
          <volume>12</volume>
          <issue>Suppl 8</issue>
          , pp.
          <fpage>S5</fpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>K.E.</given-names>
            <surname>Ross</surname>
          </string-name>
          , et al.,
          <article-title>"Construction of protein phosphorylation networks by data mining, text mining and ontology integration: analysis of the spindle checkpoint,"</article-title>
          <source>Database (Oxford)</source>
          , vol.
          <year>2013</year>
          , pp.
          <fpage>bat038</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.E.</given-names>
            <surname>Smoot</surname>
          </string-name>
          , et al.,
          <source>"Cytoscape 2</source>
          .
          <article-title>8: new features for data integration and network visualization,"</article-title>
          <source>Bioinformatics</source>
          , vol.
          <volume>27</volume>
          , pp.
          <fpage>431</fpage>
          -
          <lpage>432</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>F.</given-names>
            <surname>Sievers</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.G.</given-names>
            <surname>Higgins</surname>
          </string-name>
          ,
          <article-title>"Clustal Omega, accurate alignment of very large numbers of sequences,"</article-title>
          <source>Methods Mol Biol</source>
          , vol.
          <volume>1079</volume>
          , pp.
          <fpage>105</fpage>
          -
          <lpage>116</lpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>A.M. Waterhouse</surname>
          </string-name>
          , et al.,
          <article-title>"Jalview Version 2--a multiple sequence alignment editor and analysis workbench,"</article-title>
          <source>Bioinformatics</source>
          , vol.
          <volume>25</volume>
          , pp.
          <fpage>1189</fpage>
          -
          <lpage>1191</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>S.A.</given-names>
            <surname>Forbes</surname>
          </string-name>
          , et al.,
          <article-title>"COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer,"</article-title>
          <source>Nucleic Acids Res</source>
          , vol.
          <volume>39</volume>
          , pp.
          <fpage>D945</fpage>
          -
          <lpage>950</lpage>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>E.</given-names>
            <surname>Hervouet</surname>
          </string-name>
          , et al.,
          <article-title>"Disruption of Dnmt1/PCNA/UHRF1 interactions promotes tumorigenesis from human and mice glial cells,"</article-title>
          <source>PLoS One</source>
          , vol.
          <volume>5</volume>
          , pp.
          <fpage>e11333</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>M.</given-names>
            <surname>Trost</surname>
          </string-name>
          , et al.,
          <article-title>"The phagosomal proteome in interferon-gammaactivated macrophages,"</article-title>
          <source>Immunity</source>
          , vol.
          <volume>30</volume>
          , pp.
          <fpage>143</fpage>
          -
          <lpage>154</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>T.</given-names>
            <surname>Valenta</surname>
          </string-name>
          , G. Hausmann, and
          <string-name>
            <given-names>K.</given-names>
            <surname>Basler</surname>
          </string-name>
          ,
          <article-title>"The many faces and functions of beta-catenin,"</article-title>
          <source>EMBO J</source>
          , vol.
          <volume>31</volume>
          , pp.
          <fpage>2714</fpage>
          -
          <lpage>2736</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>E.A.</given-names>
            <surname>Kim</surname>
          </string-name>
          , et al.,
          <article-title>"Homeodomain-interacting protein kinase 2 (HIPK2) targets beta-catenin for phosphorylation and proteasomal degradation,"</article-title>
          <source>Biochem Biophys Res Commun</source>
          , vol.
          <volume>394</volume>
          , pp.
          <fpage>966</fpage>
          -
          <lpage>971</lpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>M.T.</given-names>
            <surname>Maher</surname>
          </string-name>
          , et al.,
          <article-title>"Beta-catenin phosphorylated at serine 45 is spatially uncoupled from beta-catenin phosphorylated in the GSK3 domain: implications for signaling,"</article-title>
          <source>PLoS One</source>
          , vol.
          <volume>5</volume>
          , pp.
          <fpage>e10184</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <surname>M.C. Faux</surname>
          </string-name>
          , et al.,
          <article-title>"Independent interactions of phosphorylated betacatenin with E-cadherin at cell-cell contacts and APC at cell protrusions,"</article-title>
          <source>PLoS One</source>
          , vol.
          <volume>5</volume>
          , pp.
          <fpage>e14127</fpage>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>