<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>wKinMut: An integrated tool for the analysis and interpretation of mutations in human protein kinases</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jose MG Izarzugaza</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Miguel Vazquez</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Angela del Pozo</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alfonso Valencia</string-name>
          <email>avalencia@cnio.es</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Biological Sequence Analysis, Department of Systems Biolology, Technical University of Denmark</institution>
          ,
          <addr-line>2800 Lyngby</addr-line>
          ,
          <country country="DK">Denmark</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Spanish National Cancer Research Centre (CNIO). Structural Biology and BioComputing Programme. C/Melchor Fernandez Almagro</institution>
          ,
          <addr-line>3. E-28029 Madrid</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>   Protein kinases are involved in relevant physiological functions and a broad number of mutations in  this  superfamily  have  been  reported  in  the  literature  to  affect  protein  function  and  stability.  Unfortunately, the exploration of the consequences on the phenotypes of each individual mutation  remains  a  considerable  challenge.  wKinMut  facilitates  the  exploration  of  the  information  available  about  individual  mutations  by  integrating  prediction  approaches  with  the  automatic  extraction  of  information from the literature (text mining) and several state‐of‐the‐art databases.  The  wKinMut  web‐server  offers  direct  prediction  of  the  potential  pathogenicity  of  the  mutations  from  a  number  of  methods,  including  our  recently  developed  prediction  method  based  on  the  combination  of  information  from  a  range  of  diverse  sources,  including  physicochemical  properties  and functional annotations from FireDB and Swissprot and kinase‐specific characteristics such as the  membership  to  specific  kinase  groups,  the  annotation  with  disease‐associated  GO  terms  or  the  occurrence  of  the  mutation  in  PFAM  domains,  and  the  relevance  of  the  residues  in  determining  kinase  subfamily  specificity  from  S3Det.  This  predictor  yields  interesting  results  that  compare  favourably with other methods in the field when applied to protein kinases.   Together  with  the  predictions,  wKinMut  offers  a  number  of  integrated  services  for  the  analysis  of  mutations.  These  include:  the  classification  of  the  kinase,  information  about  associations  of  the  kinase with other proteins extracted from iHop, the mapping of the mutations onto PDB structures,  pathogenicity records from a number of databases and the classification of mutations in large‐scale  cancer studies. Importantly, wKinMut is connected with the SNP2L system that extracts mentions of  mutations directly from the literature, and therefore increases the possibilities of finding interesting  functional information associated to the studied mutations. </p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        wKinMut has been used during the last year for the analysis of the consequences of mutations in the 
context  of  a  number  of  cancer  genome  projects,  including  the  recent  analysis  of  Chronic 
Lymphocytic Leukemia cases  and is publicly available at http://wkinmut.bioinfo.cnio.es. 
Current high-throughput resequencing screenings (
        <xref ref-type="bibr" rid="ref1 ref2">1, 2, 3</xref>
        ) represent a powerful set of techniques to
discover large numbers of mutations. Of these, only a small fraction are causally implicated in disease
onset and therefore, separating the wheat from the chaff is still a major challenge (4). The
interpretation of the overwhelming wealth of data also represents an issue in other fields, such as
protein function prediction (
        <xref ref-type="bibr" rid="ref3">5</xref>
        ). For a small subset of the new mutations discovered, experimental
information regarding the relationship between the mutation and the underlying biochemical
mechanism is known. However, there is no information for the remaining mutations. The intensive
requirement of resources makes it unfeasible to experimentally test the association of all these
mutations to disease, and to characterize their functional effects. Nevertheless, this problem is very
amenable to in silico predictors (
        <xref ref-type="bibr" rid="ref4 ref5">4, 6, 7</xref>
        ). Different approaches are currently available to predict the
probability of a newly discovered mutation being implicated in disease. Some methods identify crucial
positions in a given protein and derive generalized rules to predict the pathogenicity of mutations.
Other methods assume that evolutionarily conserved protein residues are important for protein
structure, folding and function, whereby mutations in these residues are considered deleterious (
        <xref ref-type="bibr" rid="ref6">8</xref>
        ).
Variations on this principle lead to methods that predict deleterious mutations by evaluating changes
in evolutionarily conserved PFAM motifs (
        <xref ref-type="bibr" rid="ref7">9</xref>
        ). A number of systems use protein structures to
characterize substitutions that significantly destabilize the folded state. There are also methods that
integrate prior knowledge in the form of both sequence and structure-related features from a set of
experimentally characterized mutations to train automatic machine-learning systems. These systems
can infer the pathogenicity of new mutations based on the cases evaluated. Albeit similar in purpose,
very different machine-learning methods can be implemented. Among them, probably the most
popular ones are: rule-based systems (
        <xref ref-type="bibr" rid="ref10 ref8 ref9">10, 11, 12</xref>
        ), decision trees (
        <xref ref-type="bibr" rid="ref11">13</xref>
        ), random forests (
        <xref ref-type="bibr" rid="ref12 ref13">14, 15</xref>
        ),
neural networks (
        <xref ref-type="bibr" rid="ref14 ref15">16, 17</xref>
        ), Bayesian methods (
        <xref ref-type="bibr" rid="ref16">18</xref>
        ) and SVMs (
        <xref ref-type="bibr" rid="ref17 ref18 ref19 ref20 ref21">19, 20, 21, 22, 23</xref>
        ). Recently, some
meta approaches that combine different methodologies have been implemented. For example,
Condel (
        <xref ref-type="bibr" rid="ref22">24</xref>
        ) integrates five of the most widely employed computational tools for detecting pathogenic
single nucleotide variations. Predictors can also be classified according to their scope. Most of the
predictors are generally applicable to amino acid sequences from any protein family, while a few of
them include properties that apply only to a given protein family of interest; i.e. protein kinase specific
predictors (
        <xref ref-type="bibr" rid="ref18 ref21">20, 23</xref>
        ). These family-related features bring discriminative information that justifies the
development of specialized predictors.
      </p>
      <p>
        A broad number of mutations in the protein kinase superfamily have been reported in the literature (
        <xref ref-type="bibr" rid="ref23">25</xref>
        )
and a subset of them is known to disrupt protein structure and function (
        <xref ref-type="bibr" rid="ref24">26</xref>
        ). For some cases, since
human protein kinases are involved in a plethora of physiological functions, this disruption can be
causally associated to disease (
        <xref ref-type="bibr" rid="ref25">27</xref>
        ). Still, the majority of protein kinase mutations are tolerated without
apparent significant effects (
        <xref ref-type="bibr" rid="ref26 ref27">28, 29</xref>
        ).
      </p>
      <p>
        In previous publications, we have discussed the preferential distribution of germline pathogenic
deviations (
        <xref ref-type="bibr" rid="ref28">30</xref>
        ) and driver somatic mutations (
        <xref ref-type="bibr" rid="ref29">31</xref>
        ) with respect to regions of functional and structural
importance. Here we present, wKinMut, an integrated web-service for the collection of information
from multiple sources and for the prediction of the pathogenicity of mutations by combining several
prediction approaches. The objective of wKinMut is to provide a one-stop resource for the analysis
and interpretation of the consequences of mutations in the protein kinase superfamily.
      </p>
    </sec>
    <sec id="sec-2">
      <title>DESCRIPTION AND IMPLEMENTATION</title>
      <p>
        wKinMut represents the first resource to provide an integrated tool for the analysis and interpretation
of the consequences of mutations in the protein kinase superfamily. The main objective of wKinMut is
to aid computational biologists and clinicians to prioritize pathogenic mutations and to understand the
mechanisms by which some mutations lead to disease, and particularly, to cancer.
The tool presented here, incorporates information retrieval and prediction approaches and displays
information from diverse sources. First, it simplifies the collection of information about the mutations,
such as the classification, domain architecture, functional annotations and plausible interaction
partners of the kinase. Furthermore, kinase mutations are analyzed in their structural context and
mentions in dedicated databases, genotyping studies and the literature suggesting an implication in
disease are also presented. Second, wKinMut estimates the theoretical pathogenicity of kinase
mutations with three different approaches, including our newly developed kinase-specific method,
KinMut (
        <xref ref-type="bibr" rid="ref21">23</xref>
        ), based on the evaluation of a wide set of sequence-derived features that describe each
independent mutation. The affected domain and kinase group, diverse functional annotations, residue
physicochemical properties and relevance of the mutated residues in determining subfamily specificity
are considered.
      </p>
      <p>Although wKinMut’s web interface was designed to be simple and user-friendly, we have included a
tutorial and the corresponding help page that will guide the reader through the different steps in the
process, including the submission of mutations and the interpretation of the different result views.
wKinMut has been implemented mostly using the Ruby programming language. The functionality is
implemented as a workflow accessible through a REST interface that can render the results either in
JSON format or HTML. The later constitutes the interface described in this document. Some of data
resources that support this system, such as gene descriptions or iHOP interactions, are queried
remotely through the internet as demanded; but are then cached to improve subsequent access. The
server incorporates some additional caching schemes to improve performance in the back-end, by
persisting the job results, and in the web interface, by caching the HTML.
 </p>
    </sec>
    <sec id="sec-3">
      <title>Step 1: Submission of mutations for analysis</title>
      <p>The input to wKinMut are non-synonymous mutations in the protein kinase superfamily. The
webservice accepts several input formats, the simplest of which encodes the Uniprot/Swissprot accession
number, the wild type residue, the position and the mutated residue. It is interesting to highlight at this
point that non-standard amino acids and truncating mutations will be excluded from the analysis. An
example of this format would be a mutation from Glycine to Alanine in position 719 of the human
epidermal growth factor receptor, that will be encoded as P00533_G719A. In the following sections,
we will use this example to guide the reader through the different result views (Figure 1). Multiple
mutations can be submitted at a time, either as a plain text file or directly via the applications form, the
sample dataset provided as part of wKinMut’s documentation can be used as a formatting guide.
 </p>
    </sec>
    <sec id="sec-4">
      <title>Step 2: Interpretation of the consequences of the mutations.</title>
      <p>
        The first output the user will get right after submitting the mutations is a summary page with useful
information about the requested mutations (Figure 1, panel a). It includes a description of the proteins
in Uniprot, the membership to kinase groups in the classification in KinBase (
        <xref ref-type="bibr" rid="ref30 ref31">32,33</xref>
        ) and the
estimation of the pathogenicity of mutations attending to our kinase-specific predictor of pathogenicity,
KinMut (
        <xref ref-type="bibr" rid="ref21">23</xref>
        ). The prediction of the pathogenicity will be discussed in detail in a forthcoming section,
nevertheless we decided to include this information at this step as a guide to prioritize mutations. It
might be interesting to point out here that users interested only in the results from KinMut, can find a
link to the predictions in this summary page that can be accessed programmatically. The scope of
wKinMut goes beyond providing raw prediction of pathogenicity from KinMut, the web-service’s main
goal is to aid computational biologists and clinicians to understand and to interpret the consequences
of kinase mutations. Hence, information complementary to KinMut predictions, is provided. In the
summary table, the ‘View’ link in the right-most ‘Details’ column (Figure 1, panel b) will redirect the
user to another page containing this complementary information, which includes: the values of the
features used for classification, PFAM domains affected by the mutation, protein-protein interaction
information extracted from the literature with iHop (
        <xref ref-type="bibr" rid="ref32">34</xref>
        ), mentions of the mutations in the literature
automatically mined with SNP2L (
        <xref ref-type="bibr" rid="ref23 ref33">25,35</xref>
        ), and existing records of the mutations in other dedicated
databases. This additional information is intended to provide the basic background to help to
understand and interpret the consequences of the mutations. Each individual piece of information will
be discussed thoroughly in the following sections.
      </p>
      <p>
        General information about the protein/gene: Information under the ‘gene/protein’ tab (Figure 1, panel
c) focuses on information shared by all mutations in the same kinase. Background information such
as the gene name, the formal description in Uniprot and the classification in KinBase (
        <xref ref-type="bibr" rid="ref30 ref31">32,33</xref>
        ) of the
kinase is provided. In addition, the system provides the Gene Ontology terms with which the kinase
has been annotated in each of the independent sub-ontologies (namely Molecular Function, Cellular
Compartment and Biological Process). This information provides clues to unveil the function of the
kinase and it is used by KinMut to calculate the likeness of the protein (and subsequently the mutation)
to play a role in disease.
      </p>
      <p>
        PFAM domains: In a previous publication (
        <xref ref-type="bibr" rid="ref21">23</xref>
        ) we demonstrated that mutations occurring in certain
domains such as the Tyrosine kinase domain (PKinase Tyr, according to PFAM) are more likely to
cause disease. This is coherent with the assumption that the function of some domains is more
important that the function of others. In wKinMut, this information is contained in the ‘PFAM domains’
tab (Figure 1, panel d), which displays the domain (or domains, in some cases) where the mutation is
occurring and the alignment used by PFAM as seed to generate the domain family. The alignment is
evaluated in terms of sequence conservation. Under the assumption that conserved regions have
been preserved by evolution, this information can help the user to identify important regions in the
structure of the domain.
      </p>
      <p>
        Mapping the mutations onto structures: To understand the consequences of mutations might have in
protein stability and function it is sometimes useful to study the mutations in their structural contexts.
However, mapping mutations from sequences to structures is not always trivial (
        <xref ref-type="bibr" rid="ref34">36</xref>
        ). Under the
‘Structures‘ tab, wKinMut enables the visualization of the mutation mapped to all available structures.
(Figure 1, panel e). In addition, the versatility of the Jmol applet implemented in wKinMut allows
advanced users to improve the visualization and to design their own experiments.
      </p>
      <p>
        Prediction of the pathogenicity: In wKinMut the theoretical pathogenicity of mutations is assessed by
three independent methods, namely SIFT (
        <xref ref-type="bibr" rid="ref6">8</xref>
        ), MutationAssessor (
        <xref ref-type="bibr" rid="ref8">10</xref>
        ) and KinMut (
        <xref ref-type="bibr" rid="ref21">23</xref>
        ). This
information is displayed in the ‘Pathogenicity’ tab (Figure 1, panel f). SIFT (
        <xref ref-type="bibr" rid="ref6">8</xref>
        ) predicts whether
nonsynonymous mutations are prone to affect protein function. This prediction is based on the degree of
conservation of the residues in sequence alignments derived from closely related sequences. A
threshold value of 0.05 is used to determine that mutations are likely to be pathogenic.
MutationAssessor (
        <xref ref-type="bibr" rid="ref8">10</xref>
        ) is a predictor of the functional impact of amino acid mutations in proteins. The
evolutionary conservation of the affected residues is evaluated in a set of homologous sequences.
KinMut (
        <xref ref-type="bibr" rid="ref21">23</xref>
        ) is a kinase-specific predictor of the pathogenicity of mutations. It relies in a
machinelearning approach (SVM) to evaluate a number of sequence-derived features that describe kinase
mutations from different perspectives, including: a) at the gene level, the membership to a Kinbase
group and Gene Ontology terms. b) at the domain level, the occurrence of the mutation inside a
PFAM domain, and c) at the residue level, several properties including amino acid type, functional
annotations from Swissprot and FireDB (
        <xref ref-type="bibr" rid="ref35">37</xref>
        ), specificity-determining positions, etc. SVM scores
greater than -0.5 indicate that the mutation is very likely pathogenic. The values of these features are
also displayed in this section of the web-service to aid to interpret the predictions. Please, refer to the
original publications for information on the individual characteristics, capabilities and validation of
each predictor.
      </p>
      <p>
        Mutations in databases: The wealth of knowledge provided by current research is usually stored in
databases. A number of them store information about mutations from diverse perspectives. In
wKinMut (Figure 1, panel g) we collect information from four different sources (namely the Uniprot
Variant Pages (
        <xref ref-type="bibr" rid="ref36">38</xref>
        ), KinMutBase (
        <xref ref-type="bibr" rid="ref37">39</xref>
        ), SAAPdb (
        <xref ref-type="bibr" rid="ref24">26</xref>
        ) and COSMIC (
        <xref ref-type="bibr" rid="ref38">40</xref>
        )) in an attempt to cover all
aspects of protein kinase mutation. The information displayed includes information about the
structural consequences of mutations, experiments associating mutations with a certain disease, or
the proof that a mutation has been observed in a cancer sample.
      </p>
      <p>
        Automatic extraction of mutations from the literature: Unfortunately, the databases referred in the
previous section do not contain all current knowledge about mutations. Even in the cases where a
database record exists, the knowledgebase cannot always store all contextual information. The
context is sometimes very important for the correct interpretation of the predictions: experimental
conditions, patients’ habits and clinical histories, etcetera. wKinMut provides pointers to mentions of
the mutations in the literature under the ‘Literature’ tab (Figure 1, panel h). We extract this information
automatically using our in-house text mining approach,SNP2L (
        <xref ref-type="bibr" rid="ref23">25</xref>
        ). In brief, SNP2L is a literature
mining pipeline for the automatic extraction and disambiguation of singlepoint mutation mentions from
both abstracts as well as full text articles, followed by a sequence validation check to link mutations to
their corresponding kinase protein sequences.
      </p>
      <p>Automatic determination of interaction partners: wKinMut integrates Protein-Protein Interactions (PPI)
gathered from iHOP in the homonymous tab (Figure 1, panel i). Briefly, iHOP is a powerful text mining
system to automatically extract protein protein interactions from PubMed abstracts. To relate the
interaction information with its context, the sentences including the interaction mentions are also
provided.</p>
    </sec>
    <sec id="sec-5">
      <title>OVERVIEW AND FUTURE PERSPECTIVES</title>
      <p>
        wKinMut facilitates the exploration of the information available about individual mutations by
integrating prediction approaches with the automatic extraction of information from the literature (text
mining) and several currently available databases. wKinMut works as an open accessible web server.
The system offers direct prediction of the potential pathogenicity of the mutations from a number of
methods, including our recently developed prediction method based on the combination of information
from a range of diverse sources with a machine learning system (
        <xref ref-type="bibr" rid="ref21">23</xref>
        ). The features used by our new
prediction system include: general physicochemical properties, annotations of known functional sites
from FireDB and Swissprot and kinase-specific characteristics such as membership to a specific
group of kinases, annotations of disease associations extracted from GO terms and mapping of
PFAM domains, and relevance of the residues for the differences between groups of kinases. In
addition to the predictions, wKinMut offers a number of integrated complementary services that help
to understand the consequences and the mechanism of the mutations. These services include the
classification of the kinase, information about associations of the kinase with other proteins extracted
directly extracted from the Medline abstracts, the mutations on the corresponding protein structures,
and possible relations with pathogenicity recorded in disease-variation databases and from
largescale cancer studies. An important component of wKinMut is the access to information about the
mutations extracted directly from the literature. This information is important for the contextualization
of the consequences of the mutations. wKinMut uses our previously developed SNP2L (
        <xref ref-type="bibr" rid="ref23">25</xref>
        ), that has
been shown to provide a substantial addition to the information provided by public databases and
repositories..
      </p>
      <p>
        In summary, we think that wKinMut constitutes a powerful one-stop shop for the study of the potential
pathogenic potential of mutations in protein kinases. As such, wKinMut will be of interest for
bioinformaticians and computational biologists that can use the information provided by the server
programmatically as part of their own analysis pipelines, and it can be also useful to biologists and
clinicians who can browse and explore punctual information easily from the provided interface. We
have used wKinMut during the past year for the analysis of the consequences of mutations in the
context of a number of personalized cancer genome projects (see 41), including the recent analysis of
Chronic Lymphocytic Leukemia cases (
        <xref ref-type="bibr" rid="ref40">42, 43</xref>
        ).
      </p>
      <p>A further development of the presented system would consider the analysis of the downstream
consequences of mutations in relation to potential and known post-translational modifications and
their interelations (see 44,45). We are interested in extending wKinMut capabilities to the analysis of
the combined effect of mutations in pathways and signalling networks in where kinases are essential
components..
wKinMut is publicly available at http://wkinmut.bioinfo.cnio.es</p>
    </sec>
    <sec id="sec-6">
      <title>ACKNOWLEDGEMENTS</title>
      <p>
        The  authors  thank  the  members  of  the  Structural  Biology  and  Biocomputing  Programme  (CNIO), 
especially A. Rausell, D. Juan, I. Ezkurdia and T. Pons, for interesting discussion and comments on
this manuscript. This research was supported by OpenPhacts European project (115191-2) and
Spanish Ministry of Science and Innovation project BIO2007-6685  
Papadopoulos N, Buckhaults P, Markowitz SD, Parmigiani G, Kinzler KW, Velculescu VE, Vogelstein
B: The genomic landscapes of human breast and colorectal cancers. Science 2007, 318(5853):1108–
13.
Tubío JM, Royo R, Gelpí JL, Orozco M, Pisano DG, Zamora J, Vázquez M, Valencia A, Himmelbauer
H, Bayés M, Heath S, Gut M, Gut I, Estivill X, López-Guillermo A, Puente XS, Campo E, López-Otín
C.. Exome sequencing identifies recurrent mutations of the splicing factor SF3B1 gene in chronic
lymphocytic leukemia. Nat Genet. 2011 Dec 11;44(
        <xref ref-type="bibr" rid="ref1">1</xref>
        ):47-52
44) Minguez P, Parca L, Diella F, Mende DR, Kumar R, Helmer-Citterich M, Gavin AC, van Noort V
and Bork P. Deciphering a global network of functionally associated post-translational modifications.
Molecular Systems Biology 8; Article number 599
45) Beltrao P, Albanese V, Kenner L, Swaney DL, Burlingame A, Villen J, Lim WA, Fraser JS,
Frydman J, Krogan NJ. Systematic Functional Prioritization of Protein Posttranslational Modifications.
Cell 150, 413–425, July 20, 2012
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Sjöblom</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jones</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wood</surname>
            <given-names>LD</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsons</surname>
            <given-names>DW</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barber</surname>
            <given-names>TD</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mandelker</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leary</surname>
            <given-names>RJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ptak</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silliman</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Szabo</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buckhaults</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Farrell</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meeh</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markowitz</surname>
            <given-names>SD</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Willis</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dawson</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Willson</surname>
            <given-names>JKV</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gazdar</surname>
            <given-names>AF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hartigan</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wu</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parmigiani</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Park</surname>
            <given-names>BH</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bachman</surname>
            <given-names>KE</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Papadopoulos</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vogelstein</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kinzler</surname>
            <given-names>KW</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Velculescu</surname>
            <given-names>VE</given-names>
          </string-name>
          :
          <article-title>The consensus coding sequences of human breast and colorectal cancers</article-title>
          .
          <source>Science</source>
          <year>2006</year>
          ,
          <volume>314</volume>
          (
          <issue>5797</issue>
          ):
          <fpage>268</fpage>
          -
          <lpage>74</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Wood</surname>
            <given-names>LD</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsons</surname>
            <given-names>DW</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jones</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lin</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sjöblom</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leary</surname>
            <given-names>RJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shen</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boca</surname>
            <given-names>SM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barber</surname>
            <given-names>TD</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ptak</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silliman</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Szabo</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dezso</surname>
            <given-names>Z</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ustyanksky</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikolskaya</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikolsky</surname>
            <given-names>Y</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karchin</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wilson</surname>
            <given-names>PA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaminker</surname>
            <given-names>JS</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            <given-names>Z</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Croshaw</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Willis</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dawson</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shipitsin</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Willson</surname>
            <given-names>JKV</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sukumar</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polyak</surname>
            <given-names>K</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Park</surname>
            <given-names>BH</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pethiyagoda</surname>
            <given-names>CL</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pant</surname>
            <given-names>PVK</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ballinger</surname>
            <given-names>DG</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sparks</surname>
            <given-names>AB</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hartigan</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            <given-names>DR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suh</surname>
            <given-names>E</given-names>
          </string-name>
          , 4.
          <string-name>
            <surname>Baudot</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Real</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Izarzugaza</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            <given-names>A</given-names>
          </string-name>
          :
          <article-title>From cancer genomes to cancer models: bridging the gaps</article-title>
          .
          <source>EMBO Rep</source>
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          5.
          <string-name>
            <surname>Friedberg</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jambon</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Godzik</surname>
            <given-names>A</given-names>
          </string-name>
          .
          <article-title>New avenues in protein function prediction</article-title>
          .
          <source>Protein Sci</source>
          (
          <year>2006</year>
          ) vol.
          <volume>15</volume>
          (
          <issue>6</issue>
          ) pp.
          <fpage>1527</fpage>
          -
          <lpage>9</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          6.
          <string-name>
            <surname>Karchin</surname>
            <given-names>R</given-names>
          </string-name>
          :
          <article-title>Next generation tools for the annotation of human SNPs</article-title>
          .
          <source>Brief Bioinformatics</source>
          <year>2009</year>
          ,
          <volume>10</volume>
          :
          <fpage>35</fpage>
          -
          <lpage>52</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          7.
          <string-name>
            <surname>Cline</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karchin</surname>
            <given-names>R</given-names>
          </string-name>
          :
          <article-title>Using bioinformatics to predict the functional impact of SNVs</article-title>
          .
          <source>Bioinformatics</source>
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          8.
          <string-name>
            <surname>Ng</surname>
            <given-names>PC</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Henikoff</surname>
            <given-names>S</given-names>
          </string-name>
          :
          <article-title>Predicting deleterious amino acid substitutions</article-title>
          .
          <source>Genome Res</source>
          <year>2001</year>
          ,
          <volume>11</volume>
          (
          <issue>5</issue>
          ):
          <fpage>863</fpage>
          -
          <lpage>74</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          9.
          <string-name>
            <surname>Clifford</surname>
            <given-names>RJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Edmonson</surname>
            <given-names>MN</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nguyen</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buetow</surname>
            <given-names>KH</given-names>
          </string-name>
          :
          <article-title>Large scale analysis of non-synonymous coding region single nucleotide polymorphisms</article-title>
          .
          <source>Bioinformatics</source>
          <year>2004</year>
          ,
          <volume>20</volume>
          (
          <issue>7</issue>
          ):
          <fpage>1006</fpage>
          -
          <lpage>14</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          10.
          <string-name>
            <surname>Reva</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Antipin</surname>
            <given-names>Y</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sander</surname>
            <given-names>C</given-names>
          </string-name>
          :
          <article-title>Predicting the functional impact of protein mutations: application to cancer genomics</article-title>
          .
          <source>Nucleic Acids Res</source>
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          11.
          <string-name>
            <surname>Ramensky</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bork</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sunyaev</surname>
            <given-names>S</given-names>
          </string-name>
          :
          <article-title>Human non-synonymous SNPs: server and survey</article-title>
          .
          <source>Nucleic Acids Res</source>
          <year>2002</year>
          ,
          <volume>30</volume>
          (
          <issue>17</issue>
          ):
          <fpage>3894</fpage>
          -
          <lpage>900</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          12.
          <string-name>
            <surname>Wang</surname>
            <given-names>Z</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moult</surname>
            <given-names>J</given-names>
          </string-name>
          :
          <article-title>SNPs, protein structure, and disease</article-title>
          .
          <source>Hum Mutat</source>
          <year>2001</year>
          ,
          <volume>17</volume>
          (
          <issue>4</issue>
          ):
          <fpage>263</fpage>
          -
          <lpage>70</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          13.
          <string-name>
            <surname>Krishnan</surname>
            <given-names>VG</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Westhead</surname>
            <given-names>DR</given-names>
          </string-name>
          :
          <article-title>A comparative study of machine-learning methods to predict the effects of single nucleotide polymorphisms on protein function</article-title>
          .
          <source>Bioinformatics</source>
          <year>2003</year>
          ,
          <volume>19</volume>
          (
          <issue>17</issue>
          ):
          <fpage>2199</fpage>
          -
          <lpage>209</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          14.
          <string-name>
            <surname>Kaminker</surname>
            <given-names>JS</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            <given-names>Y</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Waugh</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haverty</surname>
            <given-names>PM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peters</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sebisanovic</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stinson</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Forrest</surname>
            <given-names>WF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bazan</surname>
            <given-names>JF</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Seshagiri</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhang</surname>
            <given-names>Z</given-names>
          </string-name>
          :
          <article-title>Distinguishing cancer-associated missense mutations from common polymorphisms</article-title>
          .
          <source>Cancer Res</source>
          <year>2007</year>
          ,
          <volume>67</volume>
          (
          <issue>2</issue>
          ):
          <fpage>465</fpage>
          -
          <lpage>73</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          15.
          <string-name>
            <surname>Wainreb</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ashkenazy</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bromberg</surname>
            <given-names>Y</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Starovolsky-Shitrit</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haliloglu</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruppin</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Avraham</surname>
            <given-names>KB</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rost</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ben-Tal</surname>
            <given-names>N</given-names>
          </string-name>
          :
          <article-title>MuD: an interactive web server for the prediction of non-neutral substitutions using protein structural data</article-title>
          .
          <source>Nucleic Acids Res</source>
          <year>2010</year>
          ,
          <volume>38</volume>
          Suppl:
          <fpage>W523</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          16.
          <string-name>
            <surname>Ferrer-Costa</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orozco</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de la Cruz</surname>
            <given-names>X</given-names>
          </string-name>
          :
          <article-title>Characterization of disease associated single amino acid polymorphisms in terms of sequence and structure properties</article-title>
          .
          <source>J Mol Biol</source>
          <year>2002</year>
          ,
          <volume>315</volume>
          (
          <issue>4</issue>
          ):
          <fpage>771</fpage>
          -
          <lpage>86</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          17.
          <string-name>
            <surname>Bromberg</surname>
            <given-names>Y</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rost</surname>
            <given-names>B</given-names>
          </string-name>
          :
          <article-title>SNAP: predict effect of non-synonymous polymorphisms on function</article-title>
          .
          <source>Nucleic Acids Res</source>
          <year>2007</year>
          ,
          <volume>35</volume>
          (
          <issue>11</issue>
          ):
          <fpage>3823</fpage>
          -
          <lpage>35</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          18.
          <string-name>
            <surname>Adzhubei</surname>
            <given-names>IA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidt</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peshkin</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramensky</surname>
            <given-names>VE</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gerasimova</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bork</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kondrashov</surname>
            <given-names>AS</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sunyaev</surname>
            <given-names>SR</given-names>
          </string-name>
          :
          <article-title>A method and server for predicting damaging missense mutations</article-title>
          .
          <source>Nat Methods</source>
          <year>2010</year>
          ,
          <volume>7</volume>
          (
          <issue>4</issue>
          ):
          <fpage>248</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          19.
          <string-name>
            <surname>Calabrese</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Capriotti</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fariselli</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martelli</surname>
            <given-names>PL</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Casadio</surname>
            <given-names>R</given-names>
          </string-name>
          :
          <article-title>Functional annotations improve the predictive score of human disease-related mutations in proteins</article-title>
          .
          <source>Hum Mutat</source>
          <year>2009</year>
          ,
          <volume>30</volume>
          (
          <issue>8</issue>
          ):
          <fpage>1237</fpage>
          -
          <lpage>44</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          20.
          <string-name>
            <surname>Torkamani</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schork</surname>
            <given-names>NJ</given-names>
          </string-name>
          :
          <article-title>Accurate prediction of deleterious protein kinase polymorphisms</article-title>
          .
          <source>Bioinformatics</source>
          <year>2007</year>
          ,
          <volume>23</volume>
          (
          <issue>21</issue>
          ):
          <fpage>2918</fpage>
          -
          <lpage>25</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          21.
          <string-name>
            <surname>Yue</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            <given-names>Z</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moult</surname>
            <given-names>J</given-names>
          </string-name>
          :
          <article-title>Loss of protein structure stability as a major causative factor in monogenic disease</article-title>
          .
          <source>J Mol Biol</source>
          <year>2005</year>
          ,
          <volume>353</volume>
          (
          <issue>2</issue>
          ):
          <fpage>459</fpage>
          -
          <lpage>73</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          22.
          <string-name>
            <surname>Karchin</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Diekhans</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kelly</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thomas</surname>
            <given-names>DJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pieper</surname>
            <given-names>U</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eswar</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haussler</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sali</surname>
            <given-names>A</given-names>
          </string-name>
          :
          <article-title>LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources</article-title>
          .
          <source>Bioinformatics</source>
          <year>2005</year>
          ,
          <volume>21</volume>
          (
          <issue>12</issue>
          ):
          <fpage>2814</fpage>
          -
          <lpage>20</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          23.
          <string-name>
            <surname>Izarzugaza</surname>
            <given-names>JM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pozo</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vazquez</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            <given-names>A</given-names>
          </string-name>
          :
          <article-title>Prioritization of pathogenic mutations in the protein kinase superfamily</article-title>
          .
          <source>BMC Genomics</source>
          <year>2012</year>
          ,
          <volume>13</volume>
          (
          <issue>Suppl 4</issue>
          ):
          <fpage>S3</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          24.
          <string-name>
            <surname>Gonzalez-Perez</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lopez-Bigas</surname>
            <given-names>N</given-names>
          </string-name>
          :
          <article-title>Improving the assessment of the outcome of non synonymous SNVs with a consensus deleteriousness score</article-title>
          ,
          <source>Condel. Am J Hum Genet</source>
          <year>2011</year>
          ,
          <volume>88</volume>
          (
          <issue>4</issue>
          ):
          <fpage>440</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          25.
          <string-name>
            <surname>Krallinger</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Izarzugaza</surname>
            <given-names>JMG</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodriguez-Penagos</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            <given-names>A</given-names>
          </string-name>
          :
          <article-title>Extraction of human kinase mutations from literature, databases and genotyping studies</article-title>
          .
          <source>BMC Bioinformatics</source>
          <year>2009</year>
          , 10
          <issue>Suppl 8</issue>
          :
          <fpage>S1</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          26.
          <string-name>
            <surname>Hurst</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McMillan</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Porter</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Allen</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fakorede</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martin</surname>
            <given-names>A</given-names>
          </string-name>
          :
          <article-title>The SAAPdb web resource: A largescale structural analysis of mutant proteins</article-title>
          .
          <source>Hum Mutat</source>
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          27.
          <string-name>
            <surname>Lahiry</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Torkamani</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schork</surname>
            <given-names>NJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hegele</surname>
            <given-names>RA</given-names>
          </string-name>
          :
          <article-title>Kinase mutations in human disease: interpreting genotype-phenotype relationships</article-title>
          .
          <source>Nat Rev Genet</source>
          <year>2010</year>
          ,
          <volume>11</volume>
          :
          <fpage>60</fpage>
          -
          <lpage>74</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          28.
          <string-name>
            <surname>Greenman</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wooster</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Futreal</surname>
            <given-names>PA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stratton</surname>
            <given-names>MR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Easton</surname>
            <given-names>DF</given-names>
          </string-name>
          :
          <article-title>Statistical analysis of pathogenicity of somatic mutations in cancer</article-title>
          .
          <source>Genetics</source>
          <year>2006</year>
          ,
          <volume>173</volume>
          (
          <issue>4</issue>
          ):
          <fpage>2187</fpage>
          -
          <lpage>98</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          29.
          <string-name>
            <surname>Stratton</surname>
            <given-names>MR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Campbell</surname>
            <given-names>PJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Futreal</surname>
            <given-names>PA</given-names>
          </string-name>
          :
          <article-title>The cancer genome</article-title>
          .
          <source>Nature</source>
          <year>2009</year>
          ,
          <volume>458</volume>
          (
          <issue>7239</issue>
          ):
          <fpage>719</fpage>
          -
          <lpage>24</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          30.
          <string-name>
            <surname>Izarzugaza</surname>
            <given-names>JMG</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McMillan</surname>
            <given-names>LEM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baresic</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orengo</surname>
            <given-names>CA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martin</surname>
            <given-names>ACR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            <given-names>A</given-names>
          </string-name>
          :
          <article-title>Characterization of pathogenic germline mutations in human Protein Kinases</article-title>
          .
          <source>BMC Bioinformatics</source>
          <year>2011</year>
          , 12
          <issue>Suppl 4</issue>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          31.
          <string-name>
            <surname>Izarzugaza</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Redfern</surname>
            <given-names>O</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orengo</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            <given-names>A</given-names>
          </string-name>
          :
          <article-title>Cancer-associated mutations are preferentially distributed in protein kinase functional sites</article-title>
          .
          <source>Proteins</source>
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          32.
          <string-name>
            <surname>Manning</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>White D B</surname>
          </string-name>
          ,
          <string-name>
            <surname>Martinez</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hunter</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sudarsanam</surname>
            <given-names>S.</given-names>
          </string-name>
          <article-title>The protein kinase complement of the human genome</article-title>
          .
          <source>Science</source>
          (
          <year>2002</year>
          ) vol.
          <volume>298</volume>
          (
          <issue>5600</issue>
          ) pp.
          <fpage>1912</fpage>
          -
          <lpage>34</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          33
          <string-name>
            <surname>Miranda-Saavedra</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barton</surname>
            <given-names>G</given-names>
          </string-name>
          :.
          <article-title>Classification and functional annotation of eukaryotic protein kinases</article-title>
          .
          <source>Proteins</source>
          (
          <year>2007</year>
          ) vol.
          <volume>68</volume>
          (
          <issue>4</issue>
          ) pp.
          <fpage>893</fpage>
          -
          <lpage>914</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          34.
          <string-name>
            <surname>Hoffmann</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            <given-names>A</given-names>
          </string-name>
          :
          <article-title>Implementing the iHOP concept for navigation of biomedical literature</article-title>
          .
          <source>Bioinformatics</source>
          <year>2005</year>
          , 21
          <issue>Suppl 2</issue>
          :
          <fpage>ii252</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          35.
          <string-name>
            <surname>Krallinger</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirschman</surname>
            <given-names>L</given-names>
          </string-name>
          :
          <article-title>Linking genes to literature: text mining, information extraction, and retrieval applications for biology</article-title>
          .
          <source>Genome Biol</source>
          <year>2008</year>
          ,
          <volume>9</volume>
          <issue>Suppl 2</issue>
          :
          <fpage>S8</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          36.
          <string-name>
            <surname>Izarzugaza</surname>
            <given-names>JMG</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baresic</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McMillan</surname>
            <given-names>LEM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yeats</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clegg</surname>
            <given-names>AB</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Orengo</surname>
            <given-names>CA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martin</surname>
            <given-names>ACR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            <given-names>A</given-names>
          </string-name>
          :
          <article-title>An integrated approach to the interpretation of single amino acid polymorphisms within the framework of CATH and Gene3D</article-title>
          .
          <source>BMC Bioinformatics</source>
          <year>2009</year>
          , 10
          <issue>Suppl 8</issue>
          :
          <fpage>S5</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          37.
          <string-name>
            <surname>Lopez</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valencia</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tress</surname>
            <given-names>ML</given-names>
          </string-name>
          :
          <article-title>FireDB-a database of functionally important residues from proteins of known structure</article-title>
          .
          <source>Nucleic Acids Res</source>
          <year>2007</year>
          ,
          <volume>35</volume>
          (Database issue):
          <fpage>D219</fpage>
          -
          <lpage>23</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          38.
          <string-name>
            <surname>Yip</surname>
            <given-names>YL</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lachenal</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pillet</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Veuthey</surname>
            <given-names>AL</given-names>
          </string-name>
          :
          <article-title>Retrieving mutationspecific information for human proteins in UniProt/Swiss-Prot Knowledgebase</article-title>
          .
          <source>J Bioinform Comput Biol</source>
          <year>2007</year>
          ,
          <volume>5</volume>
          (
          <issue>6</issue>
          ):
          <fpage>1215</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          39.
          <string-name>
            <surname>Ortutay</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Valiaho</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stenberg</surname>
            <given-names>K</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vihinen</surname>
            <given-names>M</given-names>
          </string-name>
          :
          <article-title>KinMutBase: a registry of disease-causing mutations in protein kinase domains</article-title>
          .
          <source>Hum Mutat</source>
          <year>2005</year>
          ,
          <volume>25</volume>
          (
          <issue>5</issue>
          ):
          <fpage>435</fpage>
          -
          <lpage>42</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          40.
          <string-name>
            <surname>Bamford</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dawson</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Forbes</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clements</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pettett</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dogan</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Flanagan</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teague</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Futreal</surname>
            <given-names>PA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stratton</surname>
            <given-names>MR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wooster</surname>
            <given-names>R</given-names>
          </string-name>
          :
          <article-title>The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website</article-title>
          .
          <source>Br J Cancer</source>
          <year>2004</year>
          ,
          <volume>91</volume>
          (
          <issue>2</issue>
          ):
          <fpage>355</fpage>
          -
          <lpage>8</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          41.
          <string-name>
            <surname>Valencia</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hidalgo</surname>
            <given-names>M.</given-names>
          </string-name>
          <article-title>Getting personalized cancer genome analysis into the clinic: the challenges in bioinformatics</article-title>
          .
          <source>Genome Medicine</source>
          <year>2012</year>
          ,
          <volume>4</volume>
          :
          <fpage>61</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          42.
          <string-name>
            <surname>Quesada</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Conde</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villamor</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ordóñez</surname>
            <given-names>GR</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jares</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bassaganyas</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramsay</surname>
            <given-names>AJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beà</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pinyol</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martínez-Trillos</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>López-Guerra</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Colomer</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Navarro</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baumann</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aymerich</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rozman</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Delgado</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giné</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hernández</surname>
            <given-names>JM</given-names>
          </string-name>
          ,
          <string-name>
            <surname>González-Díaz</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Puente</surname>
            <given-names>DA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Velasco</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Freije</surname>
            <given-names>JM</given-names>
          </string-name>
          ,
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>