<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Getting the best of Linked Data and Property Graphs: rdf2neo and the KnetMiner Use Case</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Rothamsted Research, Computational and Analytical Sciences Department</institution>
          ,
          <addr-line>Harpenden, AL5 2JQ</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Graph-based modelling is becoming more popular, in the sciences and elsewhere, as a flexible and powerful way to exploit data to power world-changing digital applications. Compared to the initial vision of the Semantic Web, knowledge graphs and graph databases are becoming a practical and computationally less formal way to manage graph data. On the other hand, linked data based on Semantic Web standards are a complementary, rather than alternative, approach to deal with these data, since they still provide a common way to represent and exchange information. In this paper we introduce rdf2neo, a tool to populate Neo4j databases starting from RDF data sets, based on a configurable mapping between the two. By employing agrigenomicsrelated real use cases, we show how such mapping can allow for a hybrid approach to the management of networked knowledge, based on taking advantage of the best of both RDF and property graphs.</p>
      </abstract>
      <kwd-group>
        <kwd>semantic web</kwd>
        <kwd>property graphs</kwd>
        <kwd>Neo4j</kwd>
        <kwd>knowledge networks</kwd>
        <kwd>plant biology</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Sharing data, information and knowledge is ever more important in life sciences [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]
and other human activities [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. As recently highlighted by the well-known paper about
the FAIR principles [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], we can achieve maximum benefits from data by sharing them
according to the principles of Findability, Accessibility, Interoperability and
Reusability. In the life science and other sciences the linked data approach [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and the Semantic
Web technologies [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] allow for the integration of heterogeneous knowledge and the
study of phenomena involving interdependencies at multiple scales [
        <xref ref-type="bibr" rid="ref6 ref7 ref8">6–8</xref>
        ]. Over the
years, science data practitioners have developed high quality data models, mostly based
on ontology engineering and other highly formal modelling techniques [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. While this
gives advantages in many applications, for instance by allowing for advanced automatic
reasoning and data validation [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], it is difficult for both software developers and
domain experts to deal with these models and curate data that comply with them [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ],
especially when data are shared on a large scale [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. For these reasons, knowledge
graphs have appeared, especially in commercial contexts [
        <xref ref-type="bibr" rid="ref13 ref14">13,14</xref>
        ], where data are
modelled in a more semi-formal way and logical inference is based on statistics and heuristic
techniques, not only on formal logics [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. At the infrastructural level, a number of data
storage solutions have emerged in the last decade (or even conceived before the
Semantic Web), including distributed relational-like databases, the so-called NoSQL
databases [
        <xref ref-type="bibr" rid="ref16 ref17">16,17</xref>
        ] and graph databases based on the idea of property graphs (PG, [
        <xref ref-type="bibr" rid="ref18 ref19 ref20">18–
20</xref>
        ]). Property graphs are different from triple stores based on the RDF’s Semantic Web
standard, in three key ways [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Firstly, the PG data model is more coarse-grained than
RDF, allowing the model to group all the datatype properties of an entity in a single
node, whereas they require multiple triples in RDF. In a PG this is extended to the graph
edges, i.e., binary relations linking entities, which can be enriched with attributes
describing aspects like the provenance of a relation or its confidence score. Doing the
same in RDF is known to be more complicated, since it requires graph patterns like
reification [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. On the other hand, the finer granularity of RDF makes it easier to do
operations like merging attribute lists about an entity coming from different data
sources. A second difference is in the underlying philosophy of PG projects like Neo4j
(and, more in general, of NoSQL databases [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]), in the sense that many use cases are
focused on providing stores for single applications and single organisations, with less
commitment to, and support for standardisation, interoperability and sharing. This
contrasts with the priorities in the Semantic Web World, where sharing and interoperability
are primary motivations. A third difference is that triple stores are best at graph pattern
searches, rather than more local searches based on graph-traversal algorithms, for
which several PG databases are optimised [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
1.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Motivating Use Case</title>
      <p>
        Our group has been involved in biological data integration and discovery for many
years. In particular, we maintain the KnetMiner software suite [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], primarily based on
web components, which can be used to explore large scale and semantically-rich
knowledge networks, named Genome-Scale Knowledge Networks (GSKN [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]).
Starting from keyword-based search, the user can explore networked knowledge that include
genes, encoded proteins, phenotypes, disease, biological pathways and PUBMED
citations. We use KnetMiner mostly to support the analysis of plant biology and
agrigenomics-related organisms. Our published data sets include Arabidopsis, wheat, rice
and maize. Our data types include ENSEMBL genes [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ], AraCyc pathways [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ] and
Plant Ontology [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ]. Recently we have started to redesign the backend architecture that
KnetMiner is based on (see Figure 2 in [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ]). In so doing, we aim to address three
different objectives: i) better separation between data access services and data client
applications, namely adopting web APIs and common graph-oriented query languages
ii) management of data through common graph-oriented database servers, rather than
ad-hoc solutions iii) Publicly sharing our data by means of well-known data standards.
Based on a preliminary analysis of several options available [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] and further
investigations presented in this paper, we propose that a “traditional” linked data approach can
usefully be combined with the adoption of more recent property graph systems. In
particular, we have developed BioKNO [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], a lightweight OWL-based ontology, falling
under the category of application ontologies [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ], which we are using as the reference
data model to integrate biological data from external sources, available in different
formats and data models. In order to facilitate data integration tasks as well as the
development of Semantic Web-oriented applications, we have made a SPARQL endpoint
available, based on the Virtuoso triple store [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ]. Moreover, we have established a data
pipeline to convert our RDF-based data (hence, after initial extraction, loading,
transformation, or ELT) into Cypher instructions, to be used to populate instances of the
Neo4j property graph database [
        <xref ref-type="bibr" rid="ref33">33</xref>
        ]. We have decided to add this additional data access
channel, because our bioinformaticians consider Cypher and Neo4j useful tools for a
number of tasks. For instance, we are using Cypher as a query language for what we
call semantic motif searches [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], which are graph pattern-based searches tracking
wellknown relation paths between biological entities (e.g. Gene-&gt;
expresses-&gt;Protein&gt;published_in-&gt;Publication). We use semantic motifs for searching plant biology data,
to implement term-based gene searches and to rank results based on scoring the
relevance of a gene to the searches.
      </p>
      <p>
        In order to support our use case, we have developed rdf2neo [
        <xref ref-type="bibr" rid="ref34">34</xref>
        ], a tool that can be
flexibly used to map any RDF schema to a desired PG schema, so that it is possible to
rely on RDF/OWL for data modelling and acquisition purposes and then deliver data
through a number of useful formats and technologies. In particular, the configurable
mapping available in our tool eases the definition of PG schemas aligned to the
RDFbased model, which favours a data-centric application development.
2
2.1
      </p>
    </sec>
    <sec id="sec-3">
      <title>Methods &amp; Implementation</title>
      <p>The rdf2neo Approach
rdf2neo is implemented as a Java library, which can be used either programmatically
or by means of a command-line package. As shown in Fig. 1, its main function is
mapping an RDF-based data model to a Property Graph data model, which is eventually
created by a set of Cypher commands. SPARQL is used to select groups of values that
make up the entities of a target property graph (nodes or relations). This works in two
steps. Considering the case of PG nodes, first a query selects URIs1 of RDF resources
that correspond to PG nodes (e.g., bkr:tob1 in the figure). Then, each of those URIs is
used to instantiate two other SPARQL queries, both parameterised on a
conventionallynamed variable (?iri). One query is used to extract node labels (usually analogous to
RDF/RDF-S/OWL classes), the other is used to extract pairs of property name + value
for that node (e.g., dcterms:title, bka:YEAR). These node-related data are then used to
issue Cypher instructions to a Neo4j server. Relationship mapping is similar to the node
case: one SPARQL query, lists all the PG desired relationships, together with the URIs
of the relationship origin and destination nodes, as well as a string about the relationship
type (contrary to node labels, a relationship can have one type only in Cypher/Neo4j).
Another SPARQL query, instantiated with the relationship’s URI, lists the relationship
property/value pairs, if any (bka:Score in our figure). In both node and relationship
creation, the original node/relationship URI is always stored on the Neo4j side, by
means of the property named ‘iri’. This, in addition to being useful for future reference,
is used for our operations, to refer nodes during the relationship creation stage.
The process above is implemented in the architecture shown in Fig. 2. This comprises
a component that selects node URIs (same for relation pointers) and dispatches them
to parallel threads, each fetching node (or relation) details, to eventually populate Neo4j
via Cypher.</p>
      <p>
        The SPARQL files we mentioned above are part of rdf2neo configuration, which is
managed through Spring Beans configuration files [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ]. This way, the task of
translating a new RDF data set into a Neo4j database consists of defining new SPARQL
mapping queries in a set of files and passing them to rdf2neo through a Spring configuration.
This makes configuration flexible and convenient for the many developers who are
familiar to Spring.
1 In the code we use the term IRI in order to mean the general case, where practically IRIs can be
considered as URIs with international support. Hereby, we call them with the better known
URIs word and, for what matters us, the two acronyms can be considered synonyms.
In order to ensure the correctness of the RDF/PG process described above, as well as
study relevant properties, we have developed a formalisation of it (see Supplementary
Document 1). One result we achieved from this analysis is that the computational
complexity of our conversion procedures is mainly affected by the SPARQL mapping, and
less by the overhead of Neo4j/Cypher interaction. Thus, the key factor for improving
the performance of rdf2neo is by optimising the specific SPARQL queries used to
realise such a mapping in a given use case [
        <xref ref-type="bibr" rid="ref36 ref37">36,37</xref>
        ].
3
      </p>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>In order to assess the performance of the two types of graph databases we use, we have
created a preliminary simple benchmark of specific aspects of interest to us. Namely,
we compared Virtuoso to Neo4j. On the one hand, we made a qualitative evaluation of
how easy it is to write queries in both SPARQL and Cypher languages. On the other
hand, we assessed the speed of the respective query engines, when used with the typical
data that we deal with in the KnetMiner project. This included an evaluation of
scalability issues, both in the case of data loading and data querying. Results are summarised
below.
Fig 3. Query performance of Virtuoso and Neo4j. Top: typical graph in tested data sets (other
queries are about Gene Ontology and other ontology trees). Charts: Three test data sets, of
increasing size, first two from the plant species Arabidopsis (A. thaliana), third from wheat (T.
eestivum). Every data set was tested against Virtuoso and Neo4j. All execution times are in ms.
3.1</p>
    </sec>
    <sec id="sec-5">
      <title>Test Settings</title>
      <p>
        As shown in [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ] in detail, we have made separate tests for Virtuoso and Neo4j. For
each database, we tested three different data sets, having different sizes and degrees of
data complexity in terms of types of entities and relations represented. The data sources
are the same we use for powering the KnetMiner application (see section 1.1). In our
tests, we focused mainly on biological pathways and gene ontology annotations from
the Arabidopsis and wheat plant species. Fig. 3 shows a typical graph structure that
occurs in all the three data sets, which is explored by many of the queries we used for
the tests. In writing the queries, we considered aspects like: a) basic functionality (e.g.,
‘cnt*, ‘sel’ queries in the cited document) b) typical data retrieval occurring in our
applications, as said, these are mainly navigating biological pathways (e.g., ‘join*’,
‘*union*’) and ontologies (‘varPath*’) c) complex queries, known to be challenging for
most query engines (‘pway’, ‘grp*’, ‘*exist*’, ‘*Ag’), including queries inspired by
other benchmarks available in literature (‘nestAg’). For each of these query types, we
wrote both a Cypher and SPARQL version. Writing two similar versions for the same
data retrieval semantics was fairly easy in most cases, thanks to the use of a common
data model.
3.2
      </p>
    </sec>
    <sec id="sec-6">
      <title>Performance of Data Loading and Querying</title>
      <p>
        Figure 1 in [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ] shows the loading times we obtained for the three tested data sets and
the two graph databases. The higher loading times in the case of Neo4j are due to the
fact that these include the rdf2neo conversion described above, which spends time
loading BioKNO/RDF data into its embedded TDB instance, as well as in querying them
using the SPARQL mapping queries. While we plan to improve the performance of our
tool, its conversion operations cannot be made as fast as loading plain RDF into
efficient triple stores. That said, these times seem reasonable for a batch operation, which
is not needed very often (a few times per year, when we update our data sets).
Moreover, as expected, rdf2neo scales well enough with the input size.
      </p>
      <p>Results from the query performance are reported in Fig. 3. As expected, Neo4j performs
well in graph traversal queries (up to 17x), where initial nodes are matched against the
query and then the graphs connected to them are expanded. Property graph databases
like Neo4j are known to be optimised for this kind of task. On the other hand, Virtuoso
showed performance better than, or comparable to Neo4j for queries involving the
union of graph branches (eg, ‘2union’, ‘nestAg’, ‘exist’), for queries having mostly
‘tuplelike’ patterns (eg, 3x for ‘joinRe’, 8x for ‘joinReif’), where graph patterns have to be
matched by searching several triple patterns and graph traversal does not bring many
advantages, and, probably for similar reasons, for queries involving aggregation
(‘*grp*’ and ‘*Ag’ queries). Most queries scale well with the data set size. We were
surprised to see significant differences (up to 14s versus less than 1ms) in queries like
‘count all nodes’ or ‘select all instances of a given type’. Presumably, Neo4j is much
faster at running them because it exploits indices of coarse-grained node structures,
while Virtuoso indices cannot rely on node-level aggregation.
3.3</p>
    </sec>
    <sec id="sec-7">
      <title>Qualitative Considerations</title>
      <p>
        An area where the Semantic Web technologies still appear to be more suitable than
property graphs is in data integration. For instance, in RDF systems the merging of data
about the same entities happens almost for free, by ensuring that URIs generated in
different data sets are the same when they represent the same real-world entities. A set
of techniques for doing so are known (e.g., [
        <xref ref-type="bibr" rid="ref39">39</xref>
        ]) and tools like TARQL [
        <xref ref-type="bibr" rid="ref40">40</xref>
        ] exist, to
translate from format like CSV to RDF. Such tools often rely on the same Semantic
Web standards they target (e.g., TARQL uses SPARQL). Schema mapping and
alignment is another area where Semantic Web technologies are useful. As a small example
of that, see the federated search example mentioned in [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ], which was made possible
by mapping our BioKNO ontology to standards like BioPAX. On the other hand, a
system like Neo4j appears to be particularly good in the more classical scenario where
a data repository is accessed by a limited set of applications and users, who can rely on
simple and well-known schemas, without having the wider Semantic Web ambitions of
data integration on a very large scale. As explained above, the semantic motif case is
an example of that. In fact, the typical semantic motif query that we need to study
agrigenomics-related knowledge networks is a graph pattern representing a chain of given
node and relation types, a kind of pattern that is a first-class citizen in Cypher, while
the same requires a more verbose syntax in the triple-oriented SPARQL. These query
language considerations vary significantly when other use cases are considered. For
instance, the query 2union1Nest from our benchmarking queries shows that matching
multiple branches in a single SPARQL query is relatively straightforward, mostly
thanks to the possibility to nest UNION constructs and define an overall graph pattern
that reflects the queried data branching structure quite closely (e.g., see red and blue
paths in the Fig. 3 graph). Doing something similar in Cypher can be challenging, even
for simple cases, although this might improve in future [
        <xref ref-type="bibr" rid="ref41 ref42">41,42</xref>
        ].
3.4
      </p>
    </sec>
    <sec id="sec-8">
      <title>Limits</title>
      <p>
        The tests described here are not intended to be comprehensive or general, as in the case
of works like [
        <xref ref-type="bibr" rid="ref43 ref44">43,44</xref>
        ]. Our evaluation is limited to the typical use cases we deal with in
the KnetMiner project, that is, knowledge networks about molecular biology, mostly
biological pathways and ontologies. Furthermore, we have only considered two
popular, knowingly performant graph databases for RDF and property graphs, although there
are many alternatives [
        <xref ref-type="bibr" rid="ref45 ref46">45,46</xref>
        ]. Finally, we have run the tests sending one query at a time
only, without considering load issues (again, both Virtuoso and Neo4j are known for
having good parallel performance).
4
4.1
      </p>
    </sec>
    <sec id="sec-9">
      <title>Discussion and Conclusions</title>
    </sec>
    <sec id="sec-10">
      <title>Related Work</title>
      <p>
        Populating a property graph database like Neo4j from RDF-encoded data has been
gaining interest in recent years, due to the opportunities offered by combining Semantic
Web data standards, including the expressivity of ontologies, with the ease of use and
efficiency of property graph technology [
        <xref ref-type="bibr" rid="ref47">47</xref>
        ]. A well-known tool to import RDF data
into Neo4j has been implemented as a Neo4j plug-in [
        <xref ref-type="bibr" rid="ref48">48</xref>
        ]. While this defines a fixed
‘natural’ mapping between RDF graphs and property graphs, we allow for arbitrary
correspondences between the two. This might be a valuable advantage, if one wants to
rename ontology-oriented entities on the RDF side (e.g., ‘continuant’) in favour of more
domain-specific terminologies, or to simplify reified property patterns. Ad-hoc
mapping from RDF to PG/Neo4j has been employed in other life science projects, including
protein structure exploration [
        <xref ref-type="bibr" rid="ref49">49</xref>
        ], heterogeneous data integration [
        <xref ref-type="bibr" rid="ref50 ref51">50,51</xref>
        ] and ontology
management [
        <xref ref-type="bibr" rid="ref52">52</xref>
        ]. [
        <xref ref-type="bibr" rid="ref53">53</xref>
        ] is an interesting alternative way to combine linked data and
property graphs through query languages, which we are interested in exploring in
future. Works like [
        <xref ref-type="bibr" rid="ref54 ref55">54,55</xref>
        ] show the benefit of reducing the granularity of the RDF data
models through mapping them onto more coarse-grained structures.
4.2
      </p>
    </sec>
    <sec id="sec-11">
      <title>Discussion</title>
      <p>
        Our approach increases the opportunities for knowledge sharing by offering diversified
interfaces and means to access the data, which are all based on a common model. In
our recent work [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ] we describe in detail how rdf2neo is part of a new infrastructure
to support plant biology and agriculture-related data, and how this contributes to the
realisation of FAIR sharing principles. To summarise this aspect, anchoring multiple
formats and technologies to a common model helps with data Accessibility and
Interoperability. Aligning such models to standard bio-ontologies contributes to data
Interoperability. Supporting multiple technologies and query languages ease data Reuse and
Accessibility. Adopting services like SPARQL endpoints and resolvable URIs helps
with Findability.
4.3
      </p>
    </sec>
    <sec id="sec-12">
      <title>Conclusion</title>
      <p>
        Considering the analysis presented in this paper, adopting a hybrid, linked data and
property graph approach significantly eases the collaboration and connections between
different information types, enabling diversified access to knowledge networks based
on common data models. By so doing, one can usefully combine the best of the linked
data world (data modelling standards like ontologies, technologies aimed at data
integration and interoperability) with those provided by the Property Graphs (simple query
languages, particularly suitable for knowledge navigation use cases, data models and
implementations optimised for graph traversal). While such hybrid architecture is not
without disadvantages, e.g., having to maintain a more complex infrastructure and to
have multiple IT skills, adopting it is recommendable in situations similar to our use
case. Automating tasks like data conversions and alignment is a relevant activity in
maintaining such a hybrid architecture and rdf2neo can contribute significantly to it.
In future, we aim at improving the performance of rdf2neo, in particular by issuing
fewer SPARQL queries during node/relation property fetching. Moreover, we plan to
improve data interoperability by better integration with bioschemas [
        <xref ref-type="bibr" rid="ref56">56</xref>
        ]. Another
objective we have is to extend our benchmarking to alternative triple stores and graph
databases such as Gremlin [
        <xref ref-type="bibr" rid="ref57">57</xref>
        ], as well as a wider set of scientific data.
5
6
      </p>
    </sec>
    <sec id="sec-13">
      <title>Acknowledgements References</title>
      <p>This work forms part of the Designing Future Wheat (DFW) strategic programme
(BB/P016855/1) funded by the Biotechnology and Biological Sciences Research
Council (BBSRC).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Mons</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>van Haagen</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chichester</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoen P-B '</surname>
          </string-name>
          t, den Dunnen JT,
          <string-name>
            <surname>van Ommen</surname>
            <given-names>G</given-names>
          </string-name>
          , et al.
          <article-title>The value of data</article-title>
          .
          <source>Nat Genet</source>
          .
          <year>2011</year>
          ;
          <volume>43</volume>
          :
          <fpage>281</fpage>
          -
          <lpage>3</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <article-title>The data deluge [Internet]</article-title>
          .
          <source>The Economist; 2010</source>
          . Available from: https://www.economist.com/node/15579717
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Wilkinson</surname>
            <given-names>MD</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumontier</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aalbersberg</surname>
            <given-names>IjJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Appleton</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Axton</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baak</surname>
            <given-names>A</given-names>
          </string-name>
          , et al.
          <article-title>The FAIR Guiding Principles for scientific data management and stewardship</article-title>
          .
          <source>Scientific data</source>
          .
          <year>2016</year>
          ;
          <volume>3</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Ruttenberg</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clark</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bug</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Samwald</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bodenreider</surname>
            <given-names>O</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            <given-names>H</given-names>
          </string-name>
          , et al.
          <article-title>Advancing translational research with the Semantic Web</article-title>
          .
          <source>BMC Bioinformatics</source>
          .
          <year>2007</year>
          ;
          <volume>8</volume>
          :
          <fpage>S2</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Wang</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gorlitsky</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Almeida</surname>
            <given-names>JS</given-names>
          </string-name>
          .
          <article-title>From XML to RDF: how semantic web technologies will change the design of 'omic' standards</article-title>
          .
          <source>Nature Biotechnology</source>
          .
          <year>2005</year>
          ;
          <volume>23</volume>
          :
          <fpage>1099</fpage>
          -
          <lpage>103</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Belleau</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nolin</surname>
            <given-names>M-A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tourigny</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rigault</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morissette</surname>
            <given-names>J</given-names>
          </string-name>
          . Bio2RDF:
          <article-title>Towards a mashup to build bioinformatics knowledge systems</article-title>
          .
          <source>Journal of Biomedical Informatics</source>
          .
          <year>2008</year>
          ;
          <volume>41</volume>
          :
          <fpage>706</fpage>
          -
          <lpage>16</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Smith</surname>
            <given-names>B</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ashburner</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosse</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bard</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bug</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ceusters</surname>
            <given-names>W</given-names>
          </string-name>
          , et al.
          <article-title>The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration</article-title>
          .
          <source>Nature Biotechnology</source>
          .
          <year>2007</year>
          ;
          <volume>25</volume>
          :
          <fpage>1251</fpage>
          -
          <lpage>5</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Jupp</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Malone</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bolleman</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brandizi</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davies</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia</surname>
            <given-names>L</given-names>
          </string-name>
          , et al.
          <article-title>The EBI RDF platform: linked open data for the life sciences</article-title>
          .
          <source>Bioinformatics</source>
          .
          <year>2014</year>
          ;
          <volume>30</volume>
          :
          <fpage>1338</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Staab</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Studer</surname>
            <given-names>R</given-names>
          </string-name>
          , editors.
          <source>Handbook on ontologies. 2</source>
          . ed. Berlin: Springer;
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Baader</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            <given-names>U</given-names>
          </string-name>
          .
          <article-title>Chapter 3 Description Logics</article-title>
          . In:
          <string-name>
            <surname>van Harmelen</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lifschitz</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Porter</surname>
            <given-names>B</given-names>
          </string-name>
          , editors.
          <source>Foundations of Artificial Intelligence [Internet]. Elsevier; 2008 [cited 2018 Mar</source>
          <volume>7</volume>
          ]. p.
          <fpage>135</fpage>
          -
          <lpage>79</lpage>
          . Available from: http://www.sciencedirect.com/science/article/pii/S1574652607030039
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>James</surname>
            <given-names>Malone</given-names>
          </string-name>
          ,
          <string-name>
            <surname>ICBO</surname>
          </string-name>
          <year>2017</year>
          keynote [Internet]. Available from: https://www.slideshare.net/JamesMalone5/malone-icbo2017-keynote
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Cai</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            <given-names>Y.</given-names>
          </string-name>
          <article-title>The Challenges of Data Quality and Data Quality Assessment in the Big Data Era</article-title>
          . Data
          <source>Science Journal [Internet]</source>
          .
          <source>2015 [cited 2018 Mar</source>
          <volume>7</volume>
          ];14. Available from: http://datascience.codata.org/articles/10.5334/dsj-2015-002/
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Ehrlinger</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wöß</surname>
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Towards</surname>
          </string-name>
          <article-title>a Definition of Knowledge Graphs. SEMANTiCS (Posters</article-title>
          , Demos, SuCCESS).
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Jesse</surname>
            <given-names>W</given-names>
          </string-name>
          , Paul T.
          <article-title>Facebook Linked Data via the Graph API</article-title>
          .
          <source>Semantic Web</source>
          .
          <year>2013</year>
          ;
          <fpage>245</fpage>
          -
          <lpage>250</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Dong</surname>
            <given-names>X</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gabrilovich</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heitz</surname>
            <given-names>G</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horn</surname>
            <given-names>W</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lao</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Murphy</surname>
            <given-names>K</given-names>
          </string-name>
          , et al.
          <article-title>Knowledge vault: a web-scale approach to probabilistic knowledge fusion</article-title>
          .
          <source>ACM Press; 2014 [cited 2018 Feb</source>
          <volume>22</volume>
          ]. p.
          <fpage>601</fpage>
          -
          <lpage>10</lpage>
          . Available from: http://dl.acm.org/citation.cfm?doid=
          <volume>2623330</volume>
          .
          <fpage>2623623</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16. Jing Han,
          <string-name>
            <surname>Haihong</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Guan</surname>
            <given-names>Le</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Jian</given-names>
            <surname>Du</surname>
          </string-name>
          .
          <source>Survey on NoSQL database</source>
          .
          <source>2011 6th International Conference on Pervasive Computing and Applications</source>
          [Internet].
          <source>Port Elizabeth, South Africa: IEEE; 2011 [cited 2018 Sep</source>
          <volume>26</volume>
          ]. p.
          <fpage>363</fpage>
          -
          <lpage>6</lpage>
          . Available from: http://ieeexplore.ieee.org/document/6106531/
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Hecht</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jablonski</surname>
            <given-names>S.</given-names>
          </string-name>
          <article-title>NoSQL evaluation: A use case oriented survey</article-title>
          .
          <source>2011 International Conference on Cloud and Service Computing [Internet]. Hong Kong, China: IEEE; 2011 [cited 2018 Sep</source>
          <volume>26</volume>
          ]. p.
          <fpage>336</fpage>
          -
          <lpage>41</lpage>
          . Available from: http://ieeexplore.ieee.org/document/6138544/
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Robinson</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Webber</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Eifrem</surname>
            <given-names>E</given-names>
          </string-name>
          . Graph databases.
          <source>Second edition</source>
          . Beijing: O'Reilly;
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Buerli M. The</surname>
          </string-name>
          Current State of Graph Databases.
          <year>2012</year>
          . Available from: https://www.semanticscholar.org/paper/The-Current-
          <article-title>State-of-Graph-DatabasesBuerli/5b5b6b80badccd291e3437460222e24326c65979?tab=abstract</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Angles R. A</surname>
          </string-name>
          <article-title>Comparison of Current Graph Database Models</article-title>
          .
          <source>2012 IEEE 28th International Conference on Data Engineering Workshops [Internet]. Arlington</source>
          ,
          <string-name>
            <surname>VA</surname>
          </string-name>
          , USA: IEEE; 2012
          <source>[cited 2018 Sep</source>
          <volume>26</volume>
          ]. p.
          <fpage>171</fpage>
          -
          <lpage>7</lpage>
          . Available from: http://ieeexplore.ieee.org/document/6313676/
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Malhotra</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nair</surname>
            <given-names>TR</given-names>
          </string-name>
          .
          <article-title>Evolution of Knowledge Representation and Retrieval Techniques</article-title>
          .
          <source>International Journal of Intelligent Systems and Applications</source>
          .
          <year>2015</year>
          ;
          <volume>7</volume>
          :
          <fpage>18</fpage>
          -
          <lpage>28</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Defining</surname>
          </string-name>
          N
          <article-title>-ary Relations on the Semantic Web [Internet]</article-title>
          . Available from: https://www.w3.org/TR/swbp-n-aryRelations/
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Holzschuher</surname>
            <given-names>F</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peinl</surname>
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Querying</surname>
          </string-name>
          <article-title>a graph database - language selection and performance considerations</article-title>
          .
          <source>Journal of Computer and System Sciences</source>
          .
          <year>2016</year>
          ;
          <volume>82</volume>
          :
          <fpage>45</fpage>
          -
          <lpage>68</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Hassani-Pak K. KnetMiner -</surname>
          </string-name>
          <article-title>An integrated data platform for gene mining and biological knowledge discovery [PhD Thesis]</article-title>
          . Universität Bielefeld;
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Hassani-Pak</surname>
            <given-names>K</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rawlings</surname>
            <given-names>C</given-names>
          </string-name>
          .
          <article-title>Knowledge discovery in biological databases for revealing candidate genes linked to complex phenotypes</article-title>
          .
          <source>Journal of integrative bioinformatics</source>
          .
          <year>2017</year>
          ;
          <volume>14</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <given-names>O</given-names>
            <surname>'Leary</surname>
          </string-name>
          <string-name>
            <given-names>NA</given-names>
            ,
            <surname>Wright</surname>
          </string-name>
          <string-name>
            <given-names>MW</given-names>
            ,
            <surname>Brister</surname>
          </string-name>
          <string-name>
            <given-names>JR</given-names>
            ,
            <surname>Ciufo</surname>
          </string-name>
          <string-name>
            <given-names>S</given-names>
            ,
            <surname>Haddad</surname>
          </string-name>
          <string-name>
            <given-names>D</given-names>
            ,
            <surname>McVeigh</surname>
          </string-name>
          <string-name>
            <surname>R</surname>
          </string-name>
          , et al.
          <article-title>Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation</article-title>
          .
          <source>Nucleic Acids Res</source>
          .
          <year>2016</year>
          ;
          <volume>44</volume>
          :
          <fpage>D733</fpage>
          -
          <lpage>745</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Mueller LA. AraCyc: A Biochemical Pathway</surname>
          </string-name>
          <article-title>Database for Arabidopsis</article-title>
          .
          <source>PLANT PHYSIOLOGY</source>
          .
          <year>2003</year>
          ;
          <volume>132</volume>
          :
          <fpage>453</fpage>
          -
          <lpage>60</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Cooper</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walls</surname>
            <given-names>RL</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Elser</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gandolfo</surname>
            <given-names>MA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stevenson</surname>
            <given-names>DW</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            <given-names>B</given-names>
          </string-name>
          , et al.
          <article-title>The Plant Ontology as a Tool for Comparative Plant Anatomy</article-title>
          and
          <string-name>
            <given-names>Genomic</given-names>
            <surname>Analyses</surname>
          </string-name>
          .
          <source>Plant and Cell Physiology</source>
          .
          <year>2013</year>
          ;
          <volume>54</volume>
          :
          <fpage>e1</fpage>
          -
          <lpage>e1</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Brandizi</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Singh</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rawlings</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hassani-Pak K. Towards FAIRer Biological Knowledge</surname>
          </string-name>
          <article-title>Networks Using a Hybrid Linked Data and Graph Database Approach</article-title>
          . Journal of Integrative Bioinformatics [Internet].
          <source>2018 [cited 2018 Sep</source>
          <volume>26</volume>
          ];
          <fpage>15</fpage>
          . Available from: http://www.degruyter.com/view/j/jib.
          <year>2018</year>
          .
          <volume>15</volume>
          .issue-3/jib-2018-0023/jib-2018-0023.xml
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <given-names>Marco</given-names>
            <surname>Brandizi</surname>
          </string-name>
          .
          <article-title>A Preliminary survey of RDF/Neo4j as backends for KnetMiner [Internet]</article-title>
          .
          <year>2017</year>
          . Available from: https://www.slideshare.net/mbrandizi/a
          <article-title>-preliminary-survey-ofrdfneo4j-as-backends-for-knetminer</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Menzel C. Reference</surname>
          </string-name>
          Ontologies - Application
          <source>Ontologies: Either/Or or Both/And? Proceedings of the KI2003 Workshop on Reference Ontologies and Application Ontologies [Internet]. Hamburg</source>
          , Germany;
          <year>2003</year>
          . Available from: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.
          <issue>1</issue>
          .1.
          <fpage>2490</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Erling</surname>
            <given-names>O</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikhailov</surname>
            <given-names>I. RDF</given-names>
          </string-name>
          <article-title>Support in the Virtuoso DBMS</article-title>
          . In:
          <string-name>
            <surname>Pellegrini</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tochtermann</surname>
            <given-names>K</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schaffert</surname>
            <given-names>S</given-names>
          </string-name>
          , editors.
          <source>Networked Knowledge - Networked Media [Internet]</source>
          . Berlin, Heidelberg: Springer Berlin Heidelberg; 2009
          <source>[cited 2018 Sep</source>
          <volume>26</volume>
          ]. p.
          <fpage>7</fpage>
          -
          <lpage>24</lpage>
          . Available from: http://link.springer.com/10.1007/978-3-
          <fpage>642</fpage>
          -02184-
          <issue>8</issue>
          _
          <fpage>2</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Vukotic</surname>
            <given-names>A. Neo4j</given-names>
          </string-name>
          <article-title>in action</article-title>
          . Shelter Island, NY: Manning Publications Co;
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          34. rdf2neo Tool, Code Repository [Internet].
          <source>2018 [cited 2018 Sep</source>
          <volume>26</volume>
          ]. Available from: https://github.com/Rothamsted/rdf2neo
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>35. Spring Beans Core Technologies [Internet]. Available from: https://docs.spring.io/spring/docs/current/spring-framework-reference/core.html</mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          36.
          <string-name>
            <surname>Loizou</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Angles</surname>
            <given-names>R</given-names>
          </string-name>
          , Groth P.
          <article-title>On the formulation of performant SPARQL queries</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          .
          <year>2015</year>
          ;
          <volume>31</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>26</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          37. 7 Steps to Fast SPARQL Queries - Stardog [Internet]. Available from: https://www.stardog.com/blog/7
          <article-title>-steps-to-fast-sparql-queries/</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          38.
          <article-title>Benchmarking kNetMiner data, Neo4j vs Virtuoso [Internet]</article-title>
          . Available from: https://github.com/Rothamsted/graphdb-benchmarks/blob/master/README.md
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          39.
          <string-name>
            <surname>Davidson P. Designing</surname>
            <given-names>URI</given-names>
          </string-name>
          <article-title>sets for the UK public sector [Internet]</article-title>
          .
          <source>Chief Technology Officer Council; 2009</source>
          . Available from: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/60975/designing-URI-
          <article-title>sets-uk-publicsector</article-title>
          .pdf
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          40.
          <article-title>Tarql: SPARQL for Tables [Internet]</article-title>
          . Available from: http://tarql.github.io/
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          41. openCypher/CIP2016-06-22
          <article-title>-nested-updating-and-chained-subqueries.adoc at CIP-nestedsubqueries · petraselmer/openCypher [Internet]</article-title>
          . Available from: https://github.com/petraselmer/openCypher/blob/CIP-nested-subqueries/cip/1.accepted/CIP2016-06-22
          <article-title>-nested-updating-and-chained-subqueries</article-title>
          .adoc
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          42. Neo4j :
          <string-name>
            <surname>Post-UNION processing - Neo4j Graph Database Platform</surname>
          </string-name>
          [Internet]. Available from: https://neo4j.com/developer/kb/post-union-processing/
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          43.
          <string-name>
            <surname>Wu</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fujiwara</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamamoto</surname>
            <given-names>Y</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bolleman</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yamaguchi</surname>
            <given-names>A</given-names>
          </string-name>
          .
          <source>BioBenchmark Toyama</source>
          <year>2012</year>
          :
          <article-title>an evaluation of the performance of triple stores on biological data</article-title>
          .
          <source>Journal of Biomedical Semantics</source>
          .
          <year>2014</year>
          ;
          <volume>5</volume>
          :
          <fpage>32</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          44.
          <string-name>
            <surname>Guo</surname>
            <given-names>Y</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            <given-names>Z</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heflin</surname>
            <given-names>J. LUBM</given-names>
          </string-name>
          :
          <article-title>A benchmark for OWL knowledge base systems</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          .
          <year>2005</year>
          ;
          <volume>3</volume>
          :
          <fpage>158</fpage>
          -
          <lpage>82</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          45.
          <string-name>
            <surname>Gubichev</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Then</surname>
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Graph Pattern</surname>
          </string-name>
          <article-title>Matching: Do We Have to Reinvent the Wheel?</article-title>
          <source>Proceedings of Workshop on GRAph Data management Experiences and Systems - GRADES'14 [Internet]. Snowbird</source>
          ,
          <string-name>
            <surname>UT</surname>
          </string-name>
          , USA: ACM Press;
          <year>2014</year>
          [
          <article-title>cited 2018 Sep 27]</article-title>
          . p.
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
          . Available from: http://dl.acm.org/citation.cfm?doid=
          <volume>2621934</volume>
          .
          <fpage>2621944</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          46.
          <string-name>
            <surname>Pobiedina</surname>
            <given-names>N</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rümmele</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Skritek</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Werthner</surname>
            <given-names>H</given-names>
          </string-name>
          .
          <article-title>Benchmarking Database Systems for Graph Pattern Matching</article-title>
          . In:
          <string-name>
            <surname>Decker</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lhotská</surname>
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Link</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spies</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wagner</surname>
            <given-names>RR</given-names>
          </string-name>
          , editors.
          <source>Database and Expert Systems Applications [Internet]. Cham: Springer International Publishing; 2014 [cited 2018 Sep</source>
          <volume>27</volume>
          ]. p.
          <fpage>226</fpage>
          -
          <lpage>41</lpage>
          . Available from: http://link.springer.com/10.1007/978-3-
          <fpage>319</fpage>
          -10073-9_
          <fpage>18</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          47.
          <string-name>
            <surname>Liebig</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vialard</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Opitz</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Metzl</surname>
            <given-names>S.</given-names>
          </string-name>
          <article-title>GraphScale: Adding Expressive Reasoning to Semantic Data Stores</article-title>
          .
          <source>International Semantic Web Conference (Posters &amp; Demos)</source>
          .
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          48.
          <string-name>
            <given-names>J.</given-names>
            <surname>Barrasa</surname>
          </string-name>
          .
          <article-title>Importing RDF data into Neo4j [Internet]</article-title>
          .
          <source>Jesús Barrasa</source>
          .
          <source>2016 [cited 2018 Mar</source>
          <volume>9</volume>
          ]. Available from: https://jesusbarrasa.wordpress.com/
          <year>2016</year>
          /06/07/importing-rdf
          <string-name>
            <surname>-</surname>
          </string-name>
          datainto-neo4j/
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          49.
          <string-name>
            <surname>Alocci</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mariethoz</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horlacher</surname>
            <given-names>O</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bolleman</surname>
            <given-names>JT</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Campbell</surname>
            <given-names>MP</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lisacek</surname>
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Property</surname>
          </string-name>
          <article-title>Graph vs RDF Triple Store: A Comparison on Glycan Substructure Search</article-title>
          .
          <string-name>
            <surname>Helmer-Citterich</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <article-title>editor</article-title>
          .
          <source>PLOS ONE</source>
          .
          <year>2015</year>
          ;
          <volume>10</volume>
          :
          <fpage>e0144578</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          50.
          <string-name>
            <surname>Lysenko</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Roznovăţ</surname>
            <given-names>IA</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saqi</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mazein</surname>
            <given-names>A</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rawlings</surname>
            <given-names>CJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auffray</surname>
            <given-names>C</given-names>
          </string-name>
          .
          <article-title>Representing and querying disease networks using graph databases</article-title>
          .
          <source>BioData Min [Internet]</source>
          .
          <source>2016 [cited 2018 Mar</source>
          <volume>15</volume>
          ];9. Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4960687/
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          51.
          <string-name>
            <surname>Pareja-Tobes</surname>
            <given-names>P</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tobes</surname>
            <given-names>R</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manrique</surname>
            <given-names>M</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pareja</surname>
            <given-names>E</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pareja-Tobes E. Bio4j</surname>
          </string-name>
          <article-title>: a high-performance cloud-enabled graph-based data platform</article-title>
          .
          <source>2015 [cited 2018 Oct</source>
          <volume>31</volume>
          ]; Available from: http://biorxiv.org/lookup/doi/10.1101/016758
        </mixed-citation>
      </ref>
      <ref id="ref52">
        <mixed-citation>
          52.
          <string-name>
            <surname>Jupp</surname>
            <given-names>S</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Burdett</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leroy</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parkinson HE</surname>
          </string-name>
          .
          <article-title>A new Ontology Lookup Service at EMBLEBI</article-title>
          .
          <year>SWAT4LS</year>
          .
          <year>2015</year>
          . p.
          <fpage>118</fpage>
          -
          <lpage>119</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref53">
        <mixed-citation>
          53.
          <string-name>
            <surname>Thakkar</surname>
            <given-names>H</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Punjani</surname>
            <given-names>D</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lehmann</surname>
            <given-names>J</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Auer</surname>
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Killing Two</surname>
          </string-name>
          <article-title>Birds with One Stone-Querying Property Graphs using SPARQL via GREMLINATOR</article-title>
          .
          <source>arXiv preprint arXiv:180109556</source>
          .
          <year>2018</year>
          ;
        </mixed-citation>
      </ref>
      <ref id="ref54">
        <mixed-citation>
          54.
          <string-name>
            <surname>Pham M-D</surname>
            , Passing
            <given-names>L</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Erling</surname>
            <given-names>O</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boncz</surname>
            <given-names>P</given-names>
          </string-name>
          .
          <article-title>Deriving an Emergent Relational Schema from RDF Data</article-title>
          .
          <source>Proceedings of the 24th International Conference on World Wide Web - WWW '15 [Internet]. Florence</source>
          , Italy: ACM Press;
          <year>2015</year>
          [cited 2018 Oct 31]. p.
          <fpage>864</fpage>
          -
          <lpage>74</lpage>
          . Available from: http://dl.acm.org/citation.cfm?doid=
          <volume>2736277</volume>
          .
          <fpage>2741121</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref55">
        <mixed-citation>
          55.
          <string-name>
            <surname>Virgilio</surname>
            <given-names>RD</given-names>
          </string-name>
          .
          <source>Smart RDF Data Storage in Graph Databases</source>
          .
          <year>2017</year>
          17th IEEE/ACM International Symposium on Cluster,
          <article-title>Cloud and Grid Computing (CCGRID) [Internet]</article-title>
          . Madrid, Spain: IEEE; 2017
          <source>[cited 2018 Oct</source>
          <volume>31</volume>
          ]. p.
          <fpage>872</fpage>
          -
          <lpage>81</lpage>
          . Available from: http://ieeexplore.ieee.org/document/7973793/
        </mixed-citation>
      </ref>
      <ref id="ref56">
        <mixed-citation>
          56.
          <string-name>
            <surname>Gray</surname>
            <given-names>AJ</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goble</surname>
            <given-names>C</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jimenez</surname>
            <given-names>RC</given-names>
          </string-name>
          . Bioschemas: From Potato Salad to Protein Annotation.
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref57">
        <mixed-citation>
          57.
          <string-name>
            <surname>Rodriguez</surname>
            <given-names>MA</given-names>
          </string-name>
          .
          <article-title>The Gremlin Graph Traversal Machine and Language (Invited Talk)</article-title>
          .
          <source>Proceedings of the 15th Symposium on Database Programming Languages [Internet]</source>
          . New York, NY, USA: ACM;
          <year>2015</year>
          . p.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          . Available from: http://doi.acm.
          <source>org/10</source>
          .1145/2815072.2815073
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>