<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Vec2SPARQL: integrating SPARQL queries and knowledge graph embeddings</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Centre for Computational Biology, University of Birmingham</institution>
          ,
          <addr-line>Birmingham</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Computer, Electrical and Mathematical Science and Engineering Division, Computational Bioscience Research Center, King Abdullah University of Science and Technology</institution>
          ,
          <addr-line>Thuwal 23955</addr-line>
          ,
          <country country="SA">Saudi Arabia</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Institute of Data Science, Maastricht University</institution>
          ,
          <addr-line>Maastricht</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>1820</year>
      </pub-date>
      <abstract>
        <p>Recent developments in machine learning have led to a rise of large number of methods for extracting features from structured data. The features are represented as vectors and may encode for some semantic aspects of data. They can be used in a machine learning models for different tasks or to compute similarities between the entities of the data. SPARQL is a query language for structured data originally developed for querying Resource Description Framework (RDF) data. It has been in use for over a decade as a standardized NoSQL query language. Many different tools have been developed to enable data sharing with SPARQL. For example, SPARQL endpoints make your data interoperable and available to the world. SPARQL queries can be executed across multiple endpoints. We have developed a Vec2SPARQL, which is a general framework for integrating structured data and their vector space representations. Vec2SPARQL allows jointly querying vector functions such as computing similarities (cosine, correlations) or classifications with machine learning models within a single SPARQL query. We demonstrate applications of our approach for biomedical and clinical use cases. Our source code is freely available at https://github.com/bio-ontology-research-group/vec2sparql and we make a Vec2SPARQL endpoint available at http://sparql.bio2vec.net/.</p>
      </abstract>
      <kwd-group>
        <kwd>SPARQL</kwd>
        <kwd>vector space</kwd>
        <kwd>knowledge graph</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        SPARQL is a standardized NoSQL query language originally developed for data
represented in the Resource Description Framework (RDF) [
        <xref ref-type="bibr" rid="ref36">36</xref>
        ] format. It supports
basic inference on graphs and can be used to query multiple endpoint simultaneously.
The flexibility of SPARQL has led the development of a large number of adapters and
wrappers that enable querying data formats and storage technologies beyond RDF,
including SQL databases [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], text files and large tables [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], or any other kind of
structured data.
      </p>
      <p>
        The flexibility and wide applicability of SPARQL makes it well suited for
managing large heterogeneous data infrastructures such as found in many clinics and
hospitals [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ], or complex research data infrastructures such as UniProt [
        <xref ref-type="bibr" rid="ref42">42</xref>
        ] and other
genomic databases [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>SPARQL is well-suited for querying structured data, including metadata.
However, it is not well suited to query the “content” of unprocessed and unstructured
data such as images, videos, or large text corpora. For example, querying all chest
xray images in a database that show a cardiomegaly in male patients is possible using
SPARQL based on the metadata attached to the images but cannot be done based
on the content of the images alone. Methods that could determine whether an x-ray
image shows such a phenotype will commonly rely on extracting features from an
image and using a machine learning algorithm.</p>
      <p>
        Recently, several machine learning methods have been developed that extract
features from unstructured data. These features are represented within a vector space
and may encode some aspects of semantics. Deep learning techniques have been
applied to images [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ], audio [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], video [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ], but also to domain-specific data types
such as protein sequences [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] or DNA [
        <xref ref-type="bibr" rid="ref45">45</xref>
        ]. Deep Learning models can also be
applied to structured datasets such as RDF itself [
        <xref ref-type="bibr" rid="ref2 ref32">2, 32</xref>
        ], or to formal knowledge bases
and ontologies [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ]. To collect domain-specific deep learning models, domain
specific model repositories have been developed in which these models are shared and
made reusable and interoperable [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        The vector space feature representations generated from deep learning models
can also be queried using functions that perform vector operations and may have
some defined semantics. For example, similarity measures such as cosine similarity,
correlation coefficients between vectors, or other similarity measures are sometimes
used to determine relatedness between entities from which features were extracted
[
        <xref ref-type="bibr" rid="ref24 ref25">24, 25</xref>
        ], and certain vector transformation may be used for analogical reasoning and
inference [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ].
      </p>
      <p>While the vector space representations resulting from machine learning systems
enable a type of query on a dataset, the kinds of queries that can be asked about
vector space representations are largely disconnected from SPARQL. SPARQL is
applicable to querying of structured data and meta-data, while vector operations can
identify semantic relatedness based on features extracted from an item. It can be
useful to combine queries involving semantic relatedness (with respect to a particular
feature extraction model) and structured information represented as meta-data. For
example, once feature vectors are extracted from images, meta-data that is
associated with the images (such as geo-locations, image types, author, or similar) could be
queried using SPARQL and combined with the semantic queries over the feature
vectors extracted from the images themselves. Such a combination would, for example,
allow to identify the images authored by person a that are most similar to an image
of author b; it can enable similarity- or analogy-based search and retrieval in
precisely delineated subsets; or, when feature learning is applied to structured datasets,
can combine similarity search and link prediction based on knowledge graph
embeddings with structured queries based on SPARQL.</p>
      <p>Here, we present a general framework to integrate vector space representations
of data together with their metadata, and query both within a joint framework. We
implement a prototype of this framework using (a) a repository of feature vector
representations associated with entities in a knowledge graph, (b) a mapping between
the entities in the repository and the knowledge graph, (c) a method to retrieve
entities from the repository based on vector space operations, among a specified set of
entities, and (d) a set of function extensions for SPARQL that make this search
accessible from within a SPARQL query and semantically integrate these operations with
the SPARQL syntax and semantics. We make our prototype implementation as well
as a demo freely available on Github 1. Furthermore, we demonstrate using
biomedical, clinical, and bioinformatics use cases how our approach can enable new kinds
of queries and applications that combine symbolic processing and retrieval of
information through sub-symbolic semantic queries within vector spaces.</p>
    </sec>
    <sec id="sec-2">
      <title>2 Vector space projections and operations</title>
      <p>
        A large number of machine learning models have been developed that can take data
of various types as input and project them onto vectors that capture or represent
some aspects of the semantics within a vector space. A large number of models are
available for text [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], knowledge graphs [
        <xref ref-type="bibr" rid="ref2 ref32 ref7">2, 7, 32</xref>
        ], images (x-ray, dermascope, etc.)
[
        <xref ref-type="bibr" rid="ref12 ref30">30, 12</xref>
        ], and audio [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
      </p>
      <p>
        In the case of RDF and OWL data, the vector space projection can be done either
trivially by representing classes as binary vectors or through machine learning. As an
example for the first case, a type of genes could be represented as a binary vector
based on its associated GO classes to be further used in a similarity measurement
such as a wide range of semantic similarity measures [
        <xref ref-type="bibr" rid="ref13 ref28">13, 28</xref>
        ] or vector similarity
measures, to determine functionally similar genes. For more complex data, however,
machine learning algorithm can now be used to extract relevant features and
generate vector representations of nodes and relations in knowledge graphs. As these
vector aim to encode information about nodes that represent the local structure in
which a node is embedded, these vectors are called knowledge graph embeddings.
      </p>
      <p>
        For example, these embeddings can be generated based on a random walk
approach in combination with word2vec [
        <xref ref-type="bibr" rid="ref2 ref26">26, 2</xref>
        ] to generate embeddings for diseases,
genes, or drugs [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], and which have shown utility in predicting gene-disease
associations. Other approaches rely on constrained optimization where certain
invariances that exist in a knowledge graph are preserved with respect to certain vector
space operations. For example, Translational Embeddings (TransE) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] which
represent knowledge graph relations as translations in a vector space. A number of
translational embedding methods have been proposed which are based on TransE [
        <xref ref-type="bibr" rid="ref16 ref17 ref44">44, 16,
17</xref>
        ]
      </p>
      <p>Once the biomedical entities are represented as vectors, different methods can be
applied to measure the similarity between the entities. The most widely used
simi1 https://github.com/bio-ontology-research-group/vec2sparql
larity measures for vectors generated as word or knowledge graph embeddings are
cosine similarity, and, for many types of feature vectors, also correlation coefficients
(e.g., Pearson and Spearman correlation) which are often used to compare features
and identify similarity between entities.</p>
      <p>Cosine similarity measures the orientation of two n-dimensional vectors. It is
calculated by the dot product of two numeric vectors, and it is normalized by the dot
product of the vectors’ magnitudes. Output values close to 1 indicate high similarity.
Cosine similarity between two vectors x and y is formally defined in equation 1 as
follows:
cos(x , y ) Æ</p>
      <p>x ¢ y
jjx jj ¢ jjy jj</p>
      <p>The Pearson correlation coefficient is used to measure linear relationships
between two variables. An output value of 1 represents a perfect positive correlation,
¡1 indicates a perfect negative relationship, and 0 indicates the absence of a
relationship between variables. Peason’s correlation coefficient of X and Y is formulated
as:
½ Æ
cov(X , Y )
¾x ¾y
(1)
(2)</p>
      <p>
        However, in general, any function f : Rn £ Rn 7! R that takes two vectors as input
and outputs a real number can be used to determine similarity between vectors and
potentially measure a form of “semantic” similarity between the entities represented
by the vectors. Specifically, artificial neural networks (ANNs) can implement (or
approximate) any function [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] and can be trained to output specific types of similarity,
for example using a sigmoid classification function as output.
      </p>
      <p>
        Computing similarity is not the only operation that can be performed on
vector spaces. Some vector space models are built to preserve certain semantic
invariances under other operations, such as addition and subtraction of vectors. For
example, analogical reasoning can be performed using addition and subtraction based
on word embeddings generated by Word2Vec [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], and in translational embedding
models it is possible to add relation vectors to vectors representing nodes in the
graph to perform multi-relational link prediction [
        <xref ref-type="bibr" rid="ref16 ref17 ref44 ref7">7, 44, 16, 17</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3 Vec2SPARQL: jointly querying structured data and vector space representations</title>
      <p>Vec2SPARQL bridges vector space embeddings of entities and the structured data
about the entities that are accessible through SPARQL in a single framework. Vec2SPARQL
assumes on the vector space side the existence of a repository (or a set) of vectors
and the ability to perform certain vector space operations (such as computing
similarity between two vectors). Each vector representation must be identified through
an Internationalized Resource Identifier (IRI). The vectors, their IRIs, and the
vector space operations should be made available through an API. On the SPARQL side,
Vec2SPARQL assumes the existence of a SPARQL end-point linked to either
structured data (e.g. knowledge graph) or semi-structured data (e.g., text documents). In
SPARQL, entities and relations are identified through an IRI. Vec2SPARQL assumes
that there is an overlap between the IRIs accessible through the SPARQL endpoint
and the IRIs that are used to identify the vectors. The underlying idea is that the
structured and semi-structured information about an entity can be queried through
SPARQL, and the unstructured information about the entity can be queried through
the vector space representations. For example, we can identify metadata about chest
x-ray images in SPARQL, including an IRI to identify the image, the date the image
was taken, the anatomical location, the diagnosis, and other meta-data relating to
a patient; then, using a feature extractor for x-ray images, we can generate a vector
representation and identify the vector with the same IRI as the image, and use vector
space similarity (e.g., Pearson correlation coefficients) to identify similar or related
images. Vec2SPARQL enables a bi-directional information flow between both types
of information and combinations of these queries.</p>
      <p>Specifically, in Vec2SPARQL, we extend the SPARQL query syntax by adding two
custom functions with a particular semantics. The first function is similarity(?x, ?y)
where x and y are entities that are identified by an IRI in SPARQL. The function
computes similarity of the corresponding embedding vectors. similarity is an expression
function and can be used in FILTER, BIND, and SELECT statements.</p>
      <p>The second function in Vec2SPARQL is called mostSimilar(?x, n) where x is an IRI
of an entity accessible through SPARQL and for which a vector representation exists,
and n is an integer. most Si mi l ar is a property function that allows to create new
matches within a query using a similarity function define on the vector
representations of entities. As a result of this function we will get n entities which are the most
similar to x and can further be processed with query operators.</p>
      <p>
        Vec2SPARQL is implemented using ARQ, which is a query engine for Apache Jena
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. It currently implements only a single similarity function (cosine similarity) but
can easily be extended to other functions. When multiple different similarity
functions are to be used, we will extend the Vec2SPARQL functions with an additional
argument that specifies which function to use.
      </p>
      <p>
        Vec2SPARQL can easily be run using a Docker [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ]. Docker is an open-source
container software which allows to distribute, deploy and run a software tools in a
virtual environment. We have configured a docker image which installs all required
dependencies and builds the Vec2SPARQL executable in order to start the SPARQL
endpoint.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4 Remote querying of vector similarity</title>
      <p>
        To build a more flexible infrastructure we do not maintain the repository of
vector representations or compute the similarity functions directly in Vec2SPARQL but
rather use an API that consists of a repository of such vectors and a set of functions
(mainly search by similarity) to execute on them. For this purpose, we rely on the
prototypical Bio2Vec platform. Bio2Vec is a platform for representing, sharing,
integrating, and querying vector space embeddings. Its current content covers
embeddings from text, knowledge graphs, and biological interaction networks, and has
vector representations of several types of biological and biomedical entities, including
gene functions (from Gene Ontology [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]), genes, drugs, and diseases. Bio2Vec has an
API through which the vector space can be searched by similarity, currently using
only the cosine similarity measure.
      </p>
      <p>Vec2SPARQL utilizes Bio2Vec and its similarity functions within SPARQL queries.
We implemented a first version of Vec2SPARQL to work with Bio2Vec API. However,
Vec2SPARQL can be integrated with other external APIs which provide similar kinds
of vector space operations.</p>
      <p>Figure 1 illustrates the general picture of the Vec2SPARQL approach. Vec2SPARQL
provides a single SPARQL endpoint in which queries can be performed over the Bio2Vec
API together with data currently stored in the Vec2SPARQL endpoint. We use the
Bio2Vec REST API, which is based on an ElasticSearch index with a vector scoring
plugin, to store and search for embedding vectors. When a Vec2SPARQL custom
function is used in a query, Vec2SPARQL retrieves embeddings or similarity values from
Bio2Vec. On the other side, the structured data is represented as RDF and is queried
using Apache Jena’s ARQ query engine.</p>
    </sec>
    <sec id="sec-5">
      <title>5 Use case: phenotype-driven disease gene prioritization</title>
      <p>
        One widely used application of similarity-based search in biomedical applications
is the comparison of phenotypes using a similarity measure. Recently, several vector
space models have been developed that show competitive performance when using
phenotype similarity within and across species to predict gene–disease associations
[
        <xref ref-type="bibr" rid="ref1 ref39">1, 39</xref>
        ]. In these models, data is prepared in a custom manner and vector
similarity (or, in some cases, a neural network) is used to exhaustively compute similarity
between genes and diseases. We can use Vec2SPARQL to perform queries of a
knowledge graph of mouse genes, diseases and phenotypes and incorporate Vec2SPARQL
similarity functions.
      </p>
      <p>
        We created a knowledge graph of mouse genes and their phenotype annotations
obtained from the Mouse Genome Informatics (MGI) [
        <xref ref-type="bibr" rid="ref40">40</xref>
        ] database (accessed on
06.08.2018) and human diseases in the Online Mendelian Inheritance in Men (OMIM)
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] database, and their phenotype associations from the Human Phenotype Ontology
(HPO) database (accessed on 27.07.2018). Our aim in this use case is to find mouse
gene associations with human diseases by prioritizing them using their phenotypic
similarity, and simultaneously restrict the similarity comparisons to genes and
diseases with specific properties (such as being associated with a particular phenotype).
      </p>
      <p>
        The phenotypic similarity can be computed in different ways. One way is to use
ontology based semantic similarity measure such as simGIC [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] or Resnik similarity
measure [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ]. However, there are hundreds of similarity measures and it is difficult to
choose one. Also, semantic similarity measures can be biased towards well studied
genes when the variance of annotation size is very high [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
      </p>
      <p>Another way of computing similarity is to use representation learning methods
which can provide an embedding vector for the entities of a dataset. The embeddings
can further be used as features for machine learning methods or compute
similarities such as cosine or correlation coefficients depending on type of the embeddings.</p>
      <p>
        Here, we employ a knowledge graph representation learning method by [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. To
include semantics of phenotype annotations both from Mammalian Phenotype
Ontology (MP) and Human Phenotype Ontology (HPO) we add an integrated phenotype
ontology PhenomeNET [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ] to our knowledge graph. First, the method generates a
corpus by randomly walking the knowledge graph including edge information. The
corpus captures a neighborhood of each entity in the knowledge graph and can
further be used in representation learning methods. Then, we use Word2Vec [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ] and
extract embeddings vectors for each entity in our knowledge graph. The Word2Vec
embeddings are optimized for a cosine similarity. In other words, we can get similar
entities of our knowledge graph by computing the cosine similarity of their
embedding vectors.
      </p>
      <p>Our hypothesis is that associated genes and diseases should have similar
phenotype associations, therefore they should have more similar embeddings than not
associated genes and diseases. Using Vec2SPARQL, we can answer this question with
a simple SPARQL query. For example, the following query extracts disease
associations for a mouse gene Pax6 (MGI:97490):
PREFIX b2v: &lt;http://bio2vec.net/graph_embeddings/function#&gt;
PREFIX MGI: &lt;http://www.informatics.jax.org/gene/MGI_&gt;
PREFIX obo: &lt;http://purl.obolibrary.org/&gt;
PREFIX rdf: &lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt;</p>
      <sec id="sec-5-1">
        <title>PREFIX rdfs: &lt;http://www.w3.org/2000/01/rdf-schema#&gt; SELECT ?sim ?dis (b2v:similarity(?sim, MGI:97490) as ?val) { }</title>
        <p>?sim b2v:mostSimilar(MGI:97490 10000) .
?sim a obo:disease .</p>
        <p>?sim rdfs:label ?dis
This query will return all diseases in OMIM among first 10,000 similar entities in the
knowledge graph ordered by the cosine similarity value.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6 Use case: image repositories in health care environments</title>
      <p>
        During decision making in the medical and clinical domains, physicians and
consultants are often required to not only base their diagnosis on their medical expert
knowledge, but furthermore to relate this diagnosis to past patients. To extract and
model medical and clinical expert knowledge, a number of approaches based on
machine learning and artificial intelligence have been proposed in the last decades. In
recent years, this trend has been even more accelerated with the rise of deep
convolutional networks. Some examples include the segmentation of organs and tissues
in medical images or the prediction of diagnoses [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ] and referral urgency using 3D
Optical coherence tomography (OCT) images [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Other approaches aim to identify
similar case histories to enhance the cased-based retrieval of similar patients based
on the available medical information, often captured using images [
        <xref ref-type="bibr" rid="ref37 ref4">4, 37</xref>
        ].
      </p>
      <p>
        Here, we employ an approach inspired by these last approaches in order to
demonstrate the general applicability of Vec2SPARQL within the medical and clinical
domains. We employed a publicly available dataset of chest x-ray images [
        <xref ref-type="bibr" rid="ref43">43</xref>
        ] which is
also available from Kaggle for use in machine learning and analytics challenges. It
consist of more than 112,000 chest x-ray images including basic annotations such
as gender, age, and diagnosis of the patients. We downloaded the images and their
annotations (accessed on 12.08.2018)) and scaled the original images to a resolution
of 256 £ 256 pixels. We then used the BVLC GoogLeNet [
        <xref ref-type="bibr" rid="ref41">41</xref>
        ] from the Caffe Model
Zoo to extract feature vectors from these images. Specifically, we presented each
resized image to the network and extracted the last layer before the final softmax layer
(pool5/7x7_s1) as vector for use in Vec2SPARQL. Within our system, we have
employed a set of approximately randomly chosen 15,881 images from a variety of
medical diagnoses, patient ages, and gender.
      </p>
      <p>Our hypothesis is that similar x-ray images should have similar clinical
diagnoses. Using Vec2SPARQL, we can evaluate this with a SPARQL query. The
following query for example extracts similar chest x-ray images to a patient with patient
ID 9890, represented by the image 00009890_001.png (patient ID 9890, diagnosis:
Atelectasis, age: 55, gender: male):
PREFIX b2v: &lt;http://bio2vec.net/patient_embeddings/function#&gt;</p>
      <sec id="sec-6-1">
        <title>PREFIX BVP: &lt;http://bio2vec.net/patients/BVP_&gt;</title>
        <p>PREFIX IMG: &lt;http://bio2vec.net/patients/IMG_&gt;
PREFIX BV: &lt;http://bio2vec.net/patients/&gt;
SELECT ?sim (b2v:similarity(?sim, IMG:00009890_001.png) as ?val) ?p ?f
{</p>
        <p>The query selects 10 most similar images, of which two of the five most
similar ones possess the medical diagnosis Atelectasis (patient ID: 8791, Age: 68,
Gender: male, similarity: 0.870 and patient ID: 9488, Age: 58, Gender: male, similarity:
0.859). These results are somewhat surprising as the vector extraction employed by
the BVLC GoogLeNet is not aimed for any kind of medical image retrieval or analysis,
yet nevertheless appears to yield results that indicate the biological relationship.</p>
        <p>Furthermore, these results can be enhanced by further constraining the
similarity based image retrieval by additional, structured information such as the selection
of gender or age ranges within SPARQL. Such query capabilities should allow
medical consultants an easier access to relevant cases and facilitate improved diagnosis. It
can also enable a faceted exploration of these images and their similarity, using RDF
properties as facets to filter on the similarity space. Overall, we also envision our
system to be used for other un-structured and semi-structured data in the medical and
clinical domain, such as for electrocardiograms (ECGs) and clinical notes.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7 Conclusion</title>
      <p>Here we presented a first prototype of Vec2SPARQL, a framework which bridges
vector space and structured queries in a single endpoint. We have provided a proof of
concept for our system using only a single similarity function (cosine similarity)
between vectors that can be compared using this function. However, Vec2SPARQL is
generic and can be extended with other similarity functions and even functions that
have been learned in a supervised manner.</p>
      <p>We illustrate the utility of Vec2SPARQL on two use cases: phenotype-driven
prioritization of gene–disease associations and retrieval of clinical images based on
comparing image feature vectors extracted through deep learning models. Both types of
similarity search are combined with structured queries in SPARQL and demonstrate
the flexibility and strength of our approach. We believe that Vec2SPARQL serves as
a useful framework that can fill the gap between the vector space operations and
SPARQL for performing semantic queries on structured and unstructured data.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgements</title>
      <p>This work was supported by funding from King Abdullah University of Science and
Technology (KAUST) Office of Sponsored Research (OSR) under Award No.
URF/1/345401-01, FCC/1/1976-08-01, and FCS/1/3657-02-01.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Alshahrani</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoehndorf</surname>
          </string-name>
          , R.:
          <article-title>Semantic disease gene embeddings (smudge): phenotypebased disease gene prioritization without phenotypes</article-title>
          .
          <source>Bioinformatics</source>
          <volume>34</volume>
          (
          <issue>17</issue>
          ),
          <fpage>i901</fpage>
          -
          <lpage>i907</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Alshahrani</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al.:
          <article-title>Neuro-symbolic representation learning on biological knowledge graphs</article-title>
          .
          <source>Bioinformatics</source>
          <volume>33</volume>
          (
          <issue>17</issue>
          ),
          <fpage>2723</fpage>
          -
          <lpage>2730</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Amberger</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al.:
          <article-title>A new face and new challenges for Online Mendelian Inheritance in Man (OMIM)</article-title>
          .
          <source>Hum Mutat</source>
          <volume>32</volume>
          ,
          <fpage>564</fpage>
          -
          <lpage>567</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Anavi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          , et al.:
          <article-title>Visualizing and enhancing a deep learning framework using patients age and gender for chest x-ray image retrieval</article-title>
          .
          <source>In: Medical Imaging</source>
          <year>2016</year>
          :
          <string-name>
            <surname>Computer-Aided</surname>
            <given-names>Diagnosis</given-names>
          </string-name>
          , San Diego, California, United States,
          <volume>27</volume>
          <fpage>February</fpage>
          - 3
          <source>March</source>
          <year>2016</year>
          . vol.
          <volume>9785</volume>
          , pp.
          <fpage>9785</fpage>
          -
          <lpage>9785</lpage>
          -
          <fpage>6</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Ashburner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al.:
          <article-title>Gene ontology: tool for the unification of biology</article-title>
          .
          <source>Nature Genetics</source>
          <volume>25</volume>
          (
          <issue>1</issue>
          ),
          <fpage>25</fpage>
          -
          <lpage>29</lpage>
          (May
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Avsec</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          , et al.:
          <article-title>Kipoi: accelerating the community exchange and reuse of predictive models for genomics</article-title>
          .
          <source>bioRxiv</source>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Bordes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al.:
          <article-title>Translating embeddings for modeling multi-relational data</article-title>
          . In: Burges,
          <string-name>
            <surname>C.J.C.</surname>
          </string-name>
          , et al. (eds.)
          <source>Advances in Neural Information Processing Systems</source>
          <volume>26</volume>
          , pp.
          <fpage>2787</fpage>
          -
          <lpage>2795</lpage>
          . Curran Associates, Inc. (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Bowman</surname>
            ,
            <given-names>S.R.</given-names>
          </string-name>
          , et al.:
          <article-title>Learning distributed word representations for natural logic reasoning</article-title>
          .
          <source>In: AAAI Spring Symposium on Knowledge Representation and Reasoning</source>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Carroll</surname>
            ,
            <given-names>J.J.</given-names>
          </string-name>
          , et al.:
          <article-title>Jena: Implementing the Semantic Web recommendations</article-title>
          .
          <source>Technical Report HPL-2003-146</source>
          ,
          <string-name>
            <surname>Hewlett</surname>
            <given-names>Packard</given-names>
          </string-name>
          , Bristol, UK (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Cyganiak</surname>
          </string-name>
          , R.:
          <article-title>Tarql:sparql for tables</article-title>
          . https://github.com/tarql/tarql (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>De Fauw</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , et al.:
          <article-title>Clinically applicable deep learning for diagnosis and referral in retinal disease</article-title>
          .
          <source>Nature Medicine</source>
          <volume>24</volume>
          (
          <issue>9</issue>
          ),
          <fpage>1342</fpage>
          -
          <lpage>1350</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Esteva</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al.:
          <article-title>Dermatologist-level classification of skin cancer with deep neural networks</article-title>
          .
          <source>Nature</source>
          <volume>542</volume>
          ,
          <string-name>
            <surname>115</surname>
            <given-names>EP</given-names>
          </string-name>
          - (
          <year>Jan 2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Harispe</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , et al.:
          <article-title>The semantic measures library and toolkit: fast computation of semantic similarity and relatedness using biomedical ontologies</article-title>
          .
          <source>Bioinformatics</source>
          <volume>30</volume>
          (
          <issue>5</issue>
          ),
          <fpage>740</fpage>
          -
          <lpage>742</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Hinton</surname>
          </string-name>
          , G., et al.:
          <article-title>Deep neural networks for acoustic modeling in speech recognition</article-title>
          .
          <source>Signal Processing Magazine</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Hornik</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Approximation capabilities of multilayer feedforward networks</article-title>
          .
          <source>Neural Networks</source>
          <volume>4</volume>
          (
          <issue>2</issue>
          ),
          <fpage>251</fpage>
          -
          <lpage>257</lpage>
          (
          <year>1991</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Ji</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , et al.:
          <article-title>Knowledge graph embedding via dynamic mapping matrix</article-title>
          .
          <source>In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)</source>
          . pp.
          <fpage>687</fpage>
          -
          <lpage>696</lpage>
          . Association for Computational Linguistics (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Ji</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , et al.:
          <article-title>Knowledge graph completion with adaptive sparse transfer matrix (</article-title>
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Jupp</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , et al.:
          <article-title>The EBI RDF platform: linked open data for the life sciences</article-title>
          .
          <source>Bioinformatics</source>
          <volume>30</volume>
          (
          <issue>9</issue>
          ),
          <fpage>1338</fpage>
          -
          <lpage>1339</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Karpathy</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al.:
          <article-title>Large-scale video classification with convolutional neural networks</article-title>
          .
          <source>In: 2014 IEEE Conference on Computer Vision and Pattern Recognition</source>
          . pp.
          <fpage>1725</fpage>
          -
          <lpage>1732</lpage>
          (
          <year>June 2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Kulmanov</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoehndorf</surname>
          </string-name>
          , R.:
          <article-title>Evaluating the effect of annotation size on measures of semantic similarity</article-title>
          .
          <source>Journal of Biomedical Semantics</source>
          <volume>8</volume>
          (
          <issue>1</issue>
          ), 7 (
          <year>January 2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Kulmanov</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , et al.:
          <article-title>Deepgo: predicting protein functions from sequence and interactions using a deep ontology-aware classifier</article-title>
          .
          <source>Bioinformatics</source>
          <volume>34</volume>
          (
          <issue>4</issue>
          ),
          <fpage>660</fpage>
          -
          <lpage>668</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Litjens</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , et al.:
          <article-title>A survey on deep learning in medical image analysis</article-title>
          .
          <source>Medical Image Analysis</source>
          <volume>42</volume>
          ,
          <fpage>60</fpage>
          -
          <lpage>88</lpage>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Merkel</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          : Docker:
          <article-title>Lightweight linux containers for consistent development and deployment</article-title>
          .
          <source>Linux J</source>
          .
          <year>2014</year>
          (
          <volume>239</volume>
          ) (
          <year>Mar 2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , et al.:
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In: Proceedings of the 26th International Conference on Neural Information Processing Systems - Volume 2</source>
          . pp.
          <fpage>3111</fpage>
          -
          <lpage>3119</lpage>
          . NIPS'
          <volume>13</volume>
          , Curran Associates Inc.,
          <source>USA</source>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Morcos</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          , et al.:
          <article-title>Insights on representational similarity in neural networks with canonical correlation</article-title>
          . CoRR abs/
          <year>1806</year>
          .05759v2 (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Perozzi</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          , et al.:
          <article-title>Deepwalk: Online learning of social representations</article-title>
          .
          <source>In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          . pp.
          <fpage>701</fpage>
          -
          <lpage>710</lpage>
          . KDD '14,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Pesquita</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , et al.:
          <article-title>Metrics for GO based protein semantic similarity: a systematic evaluation</article-title>
          .
          <source>BMC Bioinformatics</source>
          <volume>9</volume>
          (
          <issue>Suppl 5</issue>
          ),
          <source>S4</source>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Pesquita</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , et al.:
          <article-title>Semantic similarity in biomedical ontologies</article-title>
          .
          <source>PLoS Comput Biol</source>
          <volume>5</volume>
          (
          <issue>7</issue>
          ),
          <source>e1000443 (07</source>
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Prud'hommeaux</surname>
          </string-name>
          , E.,
          <string-name>
            <surname>Bertails</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>A mapping of sparql onto conventional sql</article-title>
          .
          <source>Tech. rep. (</source>
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Rajpurkar</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , et al.:
          <article-title>Chexnet: Radiologist-level pneumonia detection on chest x-rays with deep learning</article-title>
          .
          <source>CoRR abs/1711</source>
          .05225 (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Resnik</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Semantic similarity in a taxonomy: An Information-Based measure and its application to problems of ambiguity in natural language</article-title>
          .
          <source>Journal of Artificial Intelligence Research</source>
          <volume>11</volume>
          ,
          <fpage>95</fpage>
          -
          <lpage>130</lpage>
          (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Ristoski</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paulheim</surname>
          </string-name>
          , H.:
          <article-title>Rdf2vec: Rdf graph embeddings for data mining</article-title>
          . In: International Semantic Web Conference (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Rodríguez-García</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Á.</surname>
          </string-name>
          , et al.:
          <article-title>Integrating phenotype ontologies with phenomenet</article-title>
          .
          <source>Journal of Biomedical Semantics</source>
          <volume>8</volume>
          (
          <issue>1</issue>
          ),
          <volume>58</volume>
          (Dec
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34.
          <string-name>
            <surname>Russakovsky</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          , et al.:
          <article-title>ImageNet Large Scale Visual Recognition Challenge</article-title>
          .
          <source>International Journal of Computer Vision</source>
          (IJCV)
          <volume>115</volume>
          (
          <issue>3</issue>
          ),
          <fpage>211</fpage>
          -
          <lpage>252</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          35.
          <string-name>
            <surname>Ruttenberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al.:
          <article-title>Advancing translational research with the semantic web</article-title>
          .
          <source>BMC Bioinformatics</source>
          <volume>8</volume>
          (
          <issue>Suppl 3</issue>
          ), S2+ (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          36.
          <string-name>
            <surname>Seaborne</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prud</surname>
          </string-name>
          'hommeaux, E.:
          <article-title>SPARQL query language for RDF. W3C recommendation</article-title>
          ,
          <source>W3C (January</source>
          <year>2008</year>
          ), http://www.w3.org/TR/2008/REC-rdf
          <string-name>
            <surname>-</surname>
          </string-name>
          sparql-query-
          <volume>20080115</volume>
          /
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          37.
          <string-name>
            <surname>Shah</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al.:
          <article-title>Deeply learnt hashing forests for content based image retrieval in prostate mr images</article-title>
          .
          <source>In: Medical Imaging</source>
          <year>2016</year>
          :
          <string-name>
            <given-names>Image</given-names>
            <surname>Processing</surname>
          </string-name>
          , San Diego, California, USA, February
          <volume>27</volume>
          ,
          <year>2016</year>
          . vol.
          <volume>9784</volume>
          , pp.
          <fpage>9784</fpage>
          -
          <lpage>9784</lpage>
          -
          <fpage>6</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          38.
          <string-name>
            <surname>Smaili</surname>
            ,
            <given-names>F.Z.</given-names>
          </string-name>
          , et al.:
          <article-title>Onto2vec: joint vector-based representation of biological entities and their ontology-based annotations</article-title>
          .
          <source>Bioinformatics</source>
          <volume>34</volume>
          (
          <issue>13</issue>
          ),
          <fpage>i52</fpage>
          -
          <lpage>i60</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          39.
          <string-name>
            <surname>Smaili</surname>
            ,
            <given-names>F.Z.</given-names>
          </string-name>
          , et al.:
          <article-title>Opa2vec: combining formal and informal content of biomedical ontologies to improve similarity-based prediction</article-title>
          . Bioinformatics p.
          <year>bty933</year>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          40.
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>C.L.</given-names>
          </string-name>
          , et al.:
          <article-title>Mouse genome database (mgd)-2018: knowledgebase for the laboratory mouse</article-title>
          .
          <source>Nucleic Acids Research</source>
          <volume>46</volume>
          (
          <issue>D1</issue>
          ),
          <fpage>D836</fpage>
          -
          <lpage>D842</lpage>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          41.
          <string-name>
            <surname>Szegedy</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , et al.:
          <article-title>Going deeper with convolutions</article-title>
          .
          <source>In: CVPR</source>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          . IEEE Computer Society (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          42.
          <article-title>The UniProt Consortium: Uniprot: the universal protein knowledgebase</article-title>
          .
          <source>Nucleic Acids Research</source>
          <volume>46</volume>
          (
          <issue>5</issue>
          ),
          <volume>2699</volume>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          43.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          , et al.:
          <article-title>Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases</article-title>
          .
          <source>In: CVPR</source>
          . pp.
          <fpage>3462</fpage>
          -
          <lpage>3471</lpage>
          . IEEE Computer Society (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          44.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          , et al.:
          <article-title>Knowledge graph embedding by translating on hyperplanes (</article-title>
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          45.
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Troyanskaya</surname>
            ,
            <given-names>O.G.</given-names>
          </string-name>
          :
          <article-title>Predicting effects of noncoding variants with deep learningbased sequence model</article-title>
          .
          <source>Nature Methods</source>
          <volume>12</volume>
          ,
          <string-name>
            <surname>931</surname>
            <given-names>EP</given-names>
          </string-name>
          - (
          <year>08 2015</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>