<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>R. B. Russell, M. A. Saqi, R. A. Sayle, P. A. Bates, and M. J. Sternberg.
Recognition of analogous and homologous protein folds: analysis of sequence
and structure conservation. J Mol Biol</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>ASSP; the Antibody Secondary Structure Profile search tool</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dimitrios Vlachakis</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alexandros Armaos</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kasampalidis I</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Arianna Filntisi</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sophia Kossida⇤</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Bioinformatics &amp; Medical Informatics Team, Biomedical Research Foundation Academy of Athens</institution>
          ,
          <addr-line>Soranou Efessiou 4, Athens 11527</addr-line>
          ,
          <country country="GR">Greece ⇤</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>1997</year>
      </pub-date>
      <volume>269</volume>
      <issue>3</issue>
      <fpage>423</fpage>
      <lpage>439</lpage>
      <abstract>
        <p>Antibodies constitute the first line of defense against harmful invaders. In the post genomics era the sheer size of antibody related NGS information is a major bottleneck in the quest of understanding and tackling complex genetic diseases and immunological disorders. Bioinformatics is becoming hugely involved in the processing of this data with the development of new, more accurate and ecient algorithms. However, one of the major drawbacks of modern bioinformatics is the fact that protein similarity and blast searches are still based on primary amino acid sequence rather than structural data. Primary sequence searches are inadequate, as they fail to provide a realistic fingerprint for the query protein. Antibody function is much more related to its 3D structure and physicochemical profile rather than its primary amino acid sequence. After all, structure is much more conserved than sequence in nature. In this direction, a novel platform has been developed, which is capable of performing a customized hydropathy blast using traditional sequence blast filtering and an integrated fast similarity search algorithm that uses protein secondary structure information. The Antibody Secondary Structure Profile (ASSP) tool will use secondary structural information from the PDB database when available, whereas if the query antibody is not indexed in the RCSB PDB database, it will automatically determine the secondary elements of the given antibody by performing an “on the fly” secondary structure prediction. All query antibodies are then blasted against the RCSB PDB secondary elements database. Hits are scored, ranked and returned to the user via a well-organized and user friendly graphical interface.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>1</p>
      <p>
        Introduction
The hydrophobic e↵ect is the tendency of non-polar substances to avoid contact
with water. The hydropathy of an amino acid, which is derived from the
physicochemical properties of its side chains, determines in part the orientation of its side
chains in the three-dimensional protein structure. In particular, when a protein
folds into a three-dimensional structure, the majority of the hydrophobic
sidechains cluster together within the core of the protein. This removal of the
hydrophobic side-chains out of contact with water generates sucient free energy to
maintain the folded structure of the protein. The determination of the hydrophobic
or hydrophilic inclinations of a given amino acid side-chain has been approached
in a number of ways. Measuring the partition coecient of a given amino acid
side-chain between water and a non-interacting, isotropic phase as well as
calculating a transfer free energy from that coecient is one such approach. Another
way to calculate the hydropathy of a given side-chain is the tabulation of residue
accessibilities from the atomic co-ordinates of twelve globular proteins, taking into
consideration that the ensemble average of the actual locations of a side-chain
should be a direct evaluation of its hydropathy. An additional approach combined
those previously mentioned methods, resulting in the construction of a hydropathy
scale, according to which each amino acid has been given a value reflective of its
relative hydrophilicity and hydrophobicity [
        <xref ref-type="bibr" rid="ref15 ref4">4, 17</xref>
        ].
      </p>
      <p>
        The twenty amino acids found in nature have been categorized in hydropathy
classes based on the previously mentioned amino acid hydropathy index developed
by Kyte and Doolittle (1982). Specifically, the amino acids with a hydropathy index
equal to or more than 1.8 were defined as hydrophobic. The amino acids with a
hydropathy index equal to or less than 3.3 were defined as hydrophilic, while the
amino acids with a hydropathy index less than 1.8 and more than -3.3 were defined
as neutral. Three classes were thus defined: the hydrophobic class (I, V, L, F, C, M,
A, W), the neutral class (G, T, S, Y, P, H) and the hydrophilic class (D, N, E, Q, K,
R). Tryptophan (W) was included in the hydrophobic class, its hydropathy index
varying from 0.9 to 1.9, depending on the study. As a general observation, amino
acids with large, nonpolar or largely nonpolar side-chains tend to be hydrophobic,
while the least hydrophobic amino acids are the ones that are charged and largely
polar, such as asparagine. Statistical analysis, in particular correspondence analysis
(COA) and hierarchic classification (CAH), has been conducted according to those
three hydropathy classes upon 2474 sequences of antibody variable regions, which
were extracted from human productively rearranged sequences [
        <xref ref-type="bibr" rid="ref15 ref23 ref8">17, 25, 8</xref>
        ]. There
is a significant variation among the hydrophobicities of the amino acids. Some
are strongly hydrophobic, others are strongly hydrophilic, while others include
both hydrophobic and hydrophilic parts and are called amphiphilic. For such
amphiphilic molecules it is sometimes useful to define a hydrophobic moment,
which is analogous to a dipole moment. For a single amino acid, the hydrophobic
moment can be defined as a line that points from the Ca atom to the middle of
the side-chain, and whose length is proportional to the hydrophobicity of the
sidechain. The dipole moment of a protein, or a part of it, is obtained by summing the
individual vectors (in magnitude and direction) corresponding to the amino acids
the protein is composed of. For example, an ↵ helix located on the surface of a
protein will have one side of the helix exposed to solvent and the other side facing
the interior of the protein. The amino acids that comprise the buried side of the ↵
helix will, in general, be much more hydrophobic than those on the solvent-exposed
side of the helix. This asymmetry results in the ↵ helix having a large hydrophobic
moment directed towards the center of the protein [
        <xref ref-type="bibr" rid="ref19">21</xref>
        ].
      </p>
      <p>
        The three-dimensional structure of a protein is determined by the balance
between a number of destabilizing and stabilizing forces, such as conformational
entropy, electrostatic interactions, hydrogen bonds, van der Waals interactions and
hydrophobic interactions. However, hydropathy is considered to be the most
prominent driving force responsible for the folding of proteins. Protein folding occurs
in the presence of water, the properties of which are dominated by its inclination
to form hydrogen bonds. Polar compounds can share hydrogen bonds with water
and, for this reason, are readily soluble. In contrast, when a hydrophobic nonpolar
surface is introduced into an aqueous environment, it prevents hydrogen bonding
from occurring, which forces the water molecules to adopt alternative arrangements
that permit hydrogen bonding to other water molecules. This inflicted restriction
on the alignment of the water molecules has an energetic cost and is the physical
basis of the hydrophobic e↵ect. It has been calculated that when a protein folds,
81% of the nonpolar side-chains (Ala, Val, Ile, Leu, Met, Phe, Trp, Cys), 70% of
the peptide groups, 63% of the polar side chains (Asn, Gln, Ser, Thr, Tyr) and
54% of the charged side chains (Arg, Lys, His, Asp, Glu) are buried in the interior
of the protein, out of contact with water [
        <xref ref-type="bibr" rid="ref19 ref21">21, 23</xref>
        ].
2
      </p>
      <p>Description of ASSP
Hydropathy is a physicochemical property known to be well conserved among
antibodies, which can be explained to a large extent by the significant contribution
of the hydrophobic residues to the folding of antibodies. Numerous studies on
proteins and antibodies have demonstrated that the information necessary to
produce a given three dimensional protein structure can be encoded by many di↵erent
amino acids. In contrast, it has been demonstrated that the periodicity of
polar and nonpolar amino acids is the major determinant of secondary structure in
self-assembling oligomeric peptides. In fact, the choice between ↵ -helical and
sheet secondary structure is influenced by the sequence periodicity of polar and
nonpolar amino acids. Even though amino acid residues may di↵er in their
intrinsic preferences for one secondary structure versus another, these preferences can
be overwhelmed by the drive to form amphiphilic structures capable of burying
hydrophobic surface area. It can be observed that structural similarity among
antibodies is reflected on the distribution of hydropathicity along their amino acid
sequences, since the hydrophobicity patterns of residues match the periodicity of
secondary structures.</p>
      <p>
        Homologous antibodies and proteins within a antibody/protein family as well
as proteins with related structures appear to have similarities in their hydropathy
distributions, even when sequence similarities could not be detected [
        <xref ref-type="bibr" rid="ref2 ref22 ref28">14, 2, 32, 24</xref>
        ].
      </p>
      <p>Since the hydropathy distribution along the antibody sequence has been
recognized as a feature useful for the characterization of protein structure in the form of
hydropathy profiles, a number of methods based on hydropathy have been
developed in order to explain the folding and the structural features of antibodies. The
realization that protein sequence contains hydropathy patterns led to the
development of reduced amino acid alphabets based on hydropathy for the prediction of
secondary structure.</p>
      <p>
        Hydropathy has also been utilized for the detection of analogous and distantly
related proteins and the classification of new protein sequence data. The use of
hydropathy profile analysis has made possible the identification of more distantly
related antibodies than could be done by sequence comparison. In addition,
antibody sequence databases have been analyzed using hydropathy patterns with the
goal of identifying new members of functional classes [
        <xref ref-type="bibr" rid="ref15 ref17 ref18 ref22 ref29 ref5 ref6 ref7">17, 24, 7, 26, 5, 33, 19, 20, 6</xref>
        ].
      </p>
      <p>
        Many homologous proteins share very low primary sequence identity and
similarity scores amongst them. The most characteristic example of such proteins is
viral enzymes. Helicases, proteases and polymerases are just few of the many
examples of protein families that are structurally conserved but may share no more
than 10% sequence identity with each other. Consequently, looking for homologues
of a certain viral enzyme or even for a suitable template structure during
homology modelling using traditional amino acid based blast searches is futile. However,
careful structural analysis of any of the above enzymes reveals that those proteins
are actually highly conserved in their secondary, tertiary and quaternary
structures. Moreover, all evolutionary protein relationships as well as protein function
analysis should also be based on searches that utilize structural information.
Overall, it has been established that homologous proteins are much more conserved in
their structures than in their amino acid primary sequences [
        <xref ref-type="bibr" rid="ref15 ref4">4, 17</xref>
        ].
      </p>
      <p>
        Herein, the ASSP tool takes advantage of the full RCSB PDB secondary
structure database in order to perform blast-like searches in the secondary element level
amongst proteins. To date, even though long studies have been conducted in many
fields of structural biology and modern bioinformatics this problem not been yet
satisfactorily addressed [
        <xref ref-type="bibr" rid="ref1 ref13 ref20">15, 1, 22</xref>
        ]. This is a fact that necessitates the need to the
development of such a platform.
      </p>
      <p>
        ASSPs main-window is a menu-driven interface as well as a tab step-by-step
layout. Initially the user has an option regarding the query input type that will
be used. ASSP will handle both primary amino acid sequence as well as
DSSPformatted secondary element protein sequence [13]. The user can follow two main
routes for the ASSP run: Firstly, the user may input either raw primary amino
acid sequence for a conventional blast search or opt for a quick secondary
structure prediction of the amino acid sequence using the built-in STRAP module [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
STRAP will perform a very fast, over the internet secondary structure prediction,
which will eventually return the predicted secondary element composition of the
query protein [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Eventually a DSSP compatible secondary structure
determination code will have been obtained for the actual secondary structure similarity
search [13]. Secondly, an existing DSSP compatible secondary structure
determination code may be used as input from the user straightaway, which will then be
automatically blasted against the secondary structural index database of the RCSB
Protein Databank. If a secondary element antibody sequence description is used,
then ASSP will move swiftly to the actual similarity blast search.
      </p>
      <p>The screening process of ASSP is broken down in two steps. First a conventional
primary sequence-based blast is performed with a threshold value of 30% identity,
when temporary file is created with all sequences sharing more than 30% identity
with the query antibody sequence (using the blosum62 substitution matrix). Then
the custom made hydropathy substitution matrix is engaged and the previously
filtered entries are ranked according to their identity/similarity scores based on
their hydropathy profiles.</p>
      <p>
        The hydropathy matrix has been created using the antibody hydropathy index
from IMGT [
        <xref ref-type="bibr" rid="ref16">18</xref>
        ]. Results in the form of alignments and similarity percentages are
calculated, scored, ranked and returned to the user through the same graphical
interface that has been specifically designed to simplify the task for the user and
to eliminate the possibility of a user-inflicted error (see Figure 1). The output
window is color-coded in accordance to the IMGT coloring scheme for hydropathy.
The identical residues are colored green, the hydrophobic residues (4, 5 to 0, 9)
are colored blue, the neutral residues ( 0, 4 to 3, 2) are colored red and the
hydrophilic residues ( 3, 5 to 4, 5) are colored yellow. The results are outputted
and saved in easy to manipulate text-based text/ascii files for future analysis.
      </p>
      <p>
        The secondary description code that ASSP has adopted is the same with the
one DSSP has been using for many years now [13]. This was intentionally done
for ease of use and backward compatibility issues. More specifically an eight-letter
description code is used. Using just eight letters, instead of the traditional twenty
amino acid letters, makes similarity searches ever more ecient and faster than
ever before. The eight letter secondary element code comprises of the following
letters: H for ↵ helix conformation, B for residues in isolated beta-bridge, E for
extended strands that participate in beta ladders, G for 3/10 helices, I for pi helices,
T for hydrogen bonded turns and S for bends. C is used for the blank space in
the DSSP secondary structure, which represents a loop or an irregular element.
Other major suites, such as the PDBFINDER suite, also adopt this convention
with unstructured protein regions [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Same WHATIF uses C, as many times
leaving a blank may be confusing, misleading and inconvenient [
        <xref ref-type="bibr" rid="ref10 ref12 ref27">31, 10, 12</xref>
        ]. A
batch execultion mode has also been prepared for the ASSP suite. A simple text
file is required with as many sequences as the user wishes, each one stored in a
di↵erent line. The ASSP algorithm will then automatically read that file line by
line and execute the antibody similarity search for as many times as the lines of the
input batch file. This comes quite handy for those who wish to perform secondary
structure similarity searches on large databases of protein or peptide sequences
[
        <xref ref-type="bibr" rid="ref14 ref24 ref25 ref26 ref3">28, 30, 29, 3, 27, 16</xref>
        ]. Finally, an extensive manual and use-case based examples
for the use of ASSP, will pop-up through the Help button, using the operating
systems HTML browser application.
3
      </p>
    </sec>
    <sec id="sec-2">
      <title>Conclusions</title>
      <p>In conclusion, the ASSP toolkit provides a novel, quick and reliable tool for in
silico antibody similarity searches in one pipelined platform under a user friendly
graphical user interface. We therefore, propose that our structural similarities
application described here would yield results of great interest to many
antibodyrelated scientific disciplines. The ASSP platform is distributed as freeware under
a GNU license.
4</p>
    </sec>
    <sec id="sec-3">
      <title>Availability</title>
      <p>Availability: ASSP can be freely downloaded via our dedicated server system at
http://www.bioacademy.gr/bioinformatics/assp/index.html
ASSP is an open source, cross platform application available freely to all users
under a GNU license basis. The full package, including installation scripts, figures,
a full description, a detailed manual, complete tutorials as hands-on use cases,
software prerequisites and various examples can be downloaded at: http://www.
bioacademy.gr/bioinformatics/assp/. Prior to download; check the provided
information on the website about software prerequisites. Please email comments
and bug reports at dvlachakis@bioacademy.gr.</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgements</title>
      <p>This work was partially supported by:
1. The BIOEXPLORE research project. BIOEXPLORE research project falls
under the Operational Program “Education and Lifelong Learning” and is co-financed
by the European Social Fund (ESF) and National Resources.
2. European Union (European Social Fund - ESF) and Greek national funds
through the Operational Program “Education and Lifelong Learning” of the
National Strategic Reference Framework (NSRF) - Research Funding Program:
Thales. Investing in knowledge society through the European Social Fund.</p>
      <p>The authors would like to thank Prof. Marie-Paule LeFranc and the IMGT
institute for their constructive comments and support.
[13] W. Kabsch and C. Sander. Dictionary of protein secondary structure:
pattern recognition of hydrogen-bonded and geometrical features. Biopolymers,
22(12):2577–2637, Dec 1983.
[14] S. Kamtekar, J. M. Schi↵er, H. Xiong, J. M. Babik, and M. H. Hecht. Protein
design by binary patterning of polar and nonpolar amino acids. Science,
262(5140):1680–1685, 1993.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C.</given-names>
            <surname>Berbalk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Schwaiger</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Lackner</surname>
          </string-name>
          .
          <article-title>Accuracy analysis of multiple structure alignments</article-title>
          .
          <source>Protein Sci.</source>
          ,
          <volume>18</volume>
          (
          <issue>10</issue>
          ):
          <fpage>2027</fpage>
          -
          <lpage>2035</lpage>
          ,
          <year>Oct 2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J. U.</given-names>
            <surname>Bowie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. F.</given-names>
            <surname>Reidhaar-Olson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. A.</given-names>
            <surname>Lim</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R. T.</given-names>
            <surname>Sauer</surname>
          </string-name>
          .
          <article-title>Deciphering the message in protein sequences: tolerance to amino acid substitutions</article-title>
          .
          <source>Science</source>
          (New York, N.Y.),
          <volume>247</volume>
          (
          <issue>4948</issue>
          ):
          <fpage>1306</fpage>
          -
          <lpage>1310</lpage>
          , Mar.
          <year>1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Carvalho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Vlachakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Tsiliki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Megalooikonomou</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Kossida</surname>
          </string-name>
          .
          <article-title>Protein signatures using electrostatic molecular surfaces in harmonic space</article-title>
          .
          <source>PeerJ</source>
          ,
          <volume>1</volume>
          :
          <fpage>e185</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>C.</given-names>
            <surname>Chothia</surname>
          </string-name>
          .
          <article-title>The nature of the accessible and buried surfaces in proteins</article-title>
          .
          <source>Journal of Molecular Biology</source>
          ,
          <volume>105</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          ,
          <year>July 1976</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>S. Y.</given-names>
            <surname>Chung</surname>
          </string-name>
          and
          <string-name>
            <given-names>S.</given-names>
            <surname>Subbiah</surname>
          </string-name>
          .
          <article-title>A structural explanation for the twilight zone of protein sequence homology</article-title>
          .
          <source>Structure</source>
          ,
          <volume>4</volume>
          (
          <issue>10</issue>
          ):
          <fpage>1123</fpage>
          -
          <lpage>1127</lpage>
          , Oct.
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Clements</surname>
          </string-name>
          and
          <string-name>
            <given-names>R. E.</given-names>
            <surname>Martin</surname>
          </string-name>
          .
          <article-title>Identification of novel membrane proteins by searching for patterns in hydropathy profiles</article-title>
          .
          <source>Eur. J. Biochem.</source>
          ,
          <volume>269</volume>
          (
          <issue>8</issue>
          ):
          <fpage>2101</fpage>
          -
          <lpage>2107</lpage>
          ,
          <year>Apr 2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>D.</given-names>
            <surname>Eisenberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. M.</given-names>
            <surname>Weiss</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T. C.</given-names>
            <surname>Terwilliger</surname>
          </string-name>
          .
          <article-title>The hydrophobic moment detects periodicity in protein hydrophobicity</article-title>
          .
          <source>Proc Natl Acad Sci U S A</source>
          ,
          <volume>81</volume>
          (
          <issue>1</issue>
          ):
          <fpage>140</fpage>
          -
          <lpage>144</lpage>
          , Jan.
          <year>1984</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Engelman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. A.</given-names>
            <surname>Steitz</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Goldman</surname>
          </string-name>
          .
          <article-title>Identifying nonpolar transbilayer helices in amino acid sequences of membrane proteins</article-title>
          .
          <source>Annual review of biophysics and biophysical chemistry</source>
          ,
          <volume>15</volume>
          :
          <fpage>321</fpage>
          -
          <lpage>353</lpage>
          ,
          <year>1986</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>C.</given-names>
            <surname>Gille</surname>
          </string-name>
          . STRAP:
          <article-title>Structure based sequences alignment program</article-title>
          . http:// www.bioinformatics.org/strap/index2.html.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>M. L. Hekkelman</surname>
            ,
            <given-names>T. A.</given-names>
          </string-name>
          <string-name>
            <surname>Te Beek</surname>
            ,
            <given-names>S. R.</given-names>
          </string-name>
          <string-name>
            <surname>Pettifer</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Thorne</surname>
            ,
            <given-names>T. K.</given-names>
          </string-name>
          <string-name>
            <surname>Attwood</surname>
            , and
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Vriend. WIWS:</surname>
          </string-name>
          <article-title>a protein structure bioinformatics Web service collection</article-title>
          .
          <source>Nucleic Acids Res</source>
          .,
          <volume>38</volume>
          (
          <issue>Web Server issue</issue>
          ):
          <fpage>W719</fpage>
          -
          <lpage>723</lpage>
          ,
          <year>Jul 2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>R.</given-names>
            <surname>Hooft</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Sander</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Scharf</surname>
          </string-name>
          , and
          <string-name>
            <surname>G. Vriend.</surname>
          </string-name>
          <article-title>The pdbfinder database: a summary of pdb, dssp and hssp information with added value</article-title>
          .
          <source>Computer applications in the biosciences : CABIOS</source>
          ,
          <volume>12</volume>
          (
          <issue>6</issue>
          ):
          <fpage>525</fpage>
          -
          <lpage>529</lpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R. W. W.</given-names>
            <surname>Hooft</surname>
          </string-name>
          , G. Vriend,
          <string-name>
            <given-names>C.</given-names>
            <surname>Sander</surname>
          </string-name>
          , and
          <string-name>
            <surname>E. E. Abola.</surname>
          </string-name>
          <article-title>Errors in protein structures</article-title>
          .
          <source>Nature</source>
          ,
          <volume>381</volume>
          (
          <issue>6580</issue>
          ):
          <fpage>272</fpage>
          , May
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>R.</given-names>
            <surname>Kolodny</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Koehl</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Levitt</surname>
          </string-name>
          .
          <article-title>Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures</article-title>
          .
          <source>J. Mol. Biol</source>
          .,
          <volume>346</volume>
          (
          <issue>4</issue>
          ):
          <fpage>1173</fpage>
          -
          <lpage>1188</lpage>
          ,
          <year>Mar 2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>E.</given-names>
            <surname>Krissinel</surname>
          </string-name>
          .
          <article-title>Enhanced fold recognition using ecient short fragment clustering</article-title>
          .
          <source>Journal of Molecular Biochemistry</source>
          ,
          <volume>1</volume>
          (
          <issue>2</issue>
          ),
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>J.</given-names>
            <surname>Kyte</surname>
          </string-name>
          and
          <string-name>
            <given-names>R. F.</given-names>
            <surname>Doolittle</surname>
          </string-name>
          .
          <article-title>A simple method for displaying the hydropathic character of a protein</article-title>
          .
          <source>Journal of Molecular Biology</source>
          ,
          <volume>157</volume>
          (
          <issue>1</issue>
          ):
          <fpage>105</fpage>
          -
          <lpage>132</lpage>
          , May
          <year>1982</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>M. P.</given-names>
            <surname>Lefranc</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Giudicelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Ginestoux</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bodmer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Muller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Bontrop</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Lemaitre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Malik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Barbie</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Chaume</surname>
          </string-name>
          .
          <article-title>IMGT, the international ImMunoGeneTics database</article-title>
          .
          <source>Nucleic Acids Res</source>
          .,
          <volume>27</volume>
          (
          <issue>1</issue>
          ):
          <fpage>209</fpage>
          -
          <lpage>212</lpage>
          ,
          <year>Jan 1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Lolkema</surname>
          </string-name>
          and
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Slotboom</surname>
          </string-name>
          .
          <article-title>Estimation of structural similarity of membrane proteins by hydropathy profile alignment</article-title>
          .
          <source>Mol. Membr. Biol</source>
          .,
          <volume>15</volume>
          (
          <issue>1</issue>
          ):
          <fpage>33</fpage>
          -
          <lpage>42</lpage>
          ,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>J. S.</given-names>
            <surname>Lolkema</surname>
          </string-name>
          and
          <string-name>
            <given-names>D. J.</given-names>
            <surname>Slotboom</surname>
          </string-name>
          .
          <article-title>Hydropathy profile alignment: a tool to search for structural homologues of membrane proteins</article-title>
          .
          <source>FEMS Microbiol</source>
          . Rev.,
          <volume>22</volume>
          (
          <issue>4</issue>
          ):
          <fpage>305</fpage>
          -
          <lpage>322</lpage>
          ,
          <year>Oct 1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>B. W.</given-names>
            <surname>Matthews</surname>
          </string-name>
          . Hydrophobic Interactions in Proteins. John Wiley &amp; Sons, Ltd,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>G.</given-names>
            <surname>Mayr</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F. S.</given-names>
            <surname>Domingues</surname>
          </string-name>
          , and
          <string-name>
            <given-names>P.</given-names>
            <surname>Lackner</surname>
          </string-name>
          .
          <article-title>Comparative analysis of protein structure alignments</article-title>
          .
          <source>BMC Struct. Biol</source>
          .,
          <volume>7</volume>
          :
          <fpage>50</fpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>C. N.</given-names>
            <surname>Pace</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. A.</given-names>
            <surname>Shirley</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <article-title>McNutt, and</article-title>
          K. Gajiwala.
          <article-title>Forces contributing to the conformational stability of proteins</article-title>
          .
          <source>FASEB journal</source>
          ,
          <volume>10</volume>
          (
          <issue>1</issue>
          ):
          <fpage>75</fpage>
          -
          <lpage>83</lpage>
          , Jan.
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [24]
          <string-name>
            <surname>J. P</surname>
          </string-name>
          <article-title>´anek, I. Eidhammer, and</article-title>
          <string-name>
            <given-names>R.</given-names>
            <surname>Aasland</surname>
          </string-name>
          .
          <article-title>A new method for identification of protein (sub)families in a set of proteins based on hydropathy distribution in proteins</article-title>
          .
          <source>Proteins</source>
          ,
          <volume>58</volume>
          (
          <issue>4</issue>
          ):
          <fpage>923</fpage>
          -
          <lpage>934</lpage>
          ,
          <year>03 2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>C.</given-names>
            <surname>Pommi</surname>
          </string-name>
          ´e, S. Levadoux,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sabatier</surname>
          </string-name>
          , G. Lefranc, and
          <string-name>
            <given-names>M.-P.</given-names>
            <surname>Lefranc</surname>
          </string-name>
          .
          <article-title>Imgt standardized criteria for statistical analysis of immunoglobulin v-region amino acid properties</article-title>
          .
          <source>Journal of Molecular Recognition</source>
          ,
          <volume>17</volume>
          (
          <issue>1</issue>
          ):
          <fpage>17</fpage>
          -
          <lpage>32</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [28]
          <string-name>
            <given-names>D.</given-names>
            <surname>Vlachakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tsagkrasoulis</surname>
          </string-name>
          , G. Tsiliki, and
          <string-name>
            <given-names>S.</given-names>
            <surname>Kossida</surname>
          </string-name>
          .
          <article-title>The future of structural bioinformatics in the post-genomic era</article-title>
          .
          <source>EMBnet. journal</source>
          ,
          <volume>18</volume>
          (
          <issue>1</issue>
          ):pp3,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>D.</given-names>
            <surname>Vlachakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. C.</given-names>
            <surname>Tsaniras</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Feidakis</surname>
          </string-name>
          , and
          <string-name>
            <surname>S. Kossida.</surname>
          </string-name>
          <article-title>An in silico 3D study of the biglycan core protein, using homology modelling techniques</article-title>
          .
          <source>Journal of Molecular Biochemistry</source>
          ,
          <volume>2</volume>
          (
          <issue>2</issue>
          ),
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>D.</given-names>
            <surname>Vlachakis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Tsiliki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Tsagkrasoulis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. S.</given-names>
            <surname>Carvalho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Megalooikonomou</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Kossida</surname>
          </string-name>
          .
          <article-title>Speeding up the drug discovery process: structural similarity searches using molecular surfaces</article-title>
          .
          <source>EMBnet.journal</source>
          ,
          <volume>18</volume>
          (
          <issue>1</issue>
          ),
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>G.</given-names>
            <surname>Vriend. WHAT IF</surname>
          </string-name>
          <article-title>: a molecular modeling and drug design program</article-title>
          .
          <source>J Mol Graph</source>
          ,
          <volume>8</volume>
          (
          <issue>1</issue>
          ):
          <fpage>52</fpage>
          -
          <lpage>56</lpage>
          ,
          <year>Mar 1990</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>H.</given-names>
            <surname>Xiong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B. L.</given-names>
            <surname>Buckwalter</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. M.</given-names>
            <surname>Shieh</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. H.</given-names>
            <surname>Hecht</surname>
          </string-name>
          .
          <article-title>Periodicity of polar and nonpolar amino acids is the major determinant of secondary structure in self-assembling oligomeric peptides</article-title>
          .
          <source>Proceedings of the National Academy of Sciences</source>
          ,
          <volume>92</volume>
          (
          <issue>14</issue>
          ):
          <fpage>6349</fpage>
          -
          <lpage>6353</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>X. J.</given-names>
            <surname>Yu</surname>
          </string-name>
          and
          <string-name>
            <given-names>D. H.</given-names>
            <surname>Walker</surname>
          </string-name>
          .
          <article-title>Sequence and characterization of an Ehrlichia cha↵eensis gene encoding 314 amino acids highly homologous to the NAD A enzyme</article-title>
          .
          <source>FEMS Microbiol</source>
          . Lett.,
          <volume>154</volume>
          (
          <issue>1</issue>
          ):
          <fpage>53</fpage>
          -
          <lpage>58</lpage>
          ,
          <year>Sep 1997</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>