<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Validata: An online tool for testing RDF data conformance</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jacob Baungard Hansen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andrew Beveridge</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Roisin Farmer</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Leif Gehrmann</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alasdair J G Gray</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Sunil Khutan</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tomas Robertson</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Johnny Val</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, Heriot-Watt University</institution>
          ,
          <addr-line>Edinburgh</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Validata is an online web application for validating an RDF document against a set of constraints. This is useful for data exchange applications or ensuring conformance of an RDF dataset against a community agreed standard. Constraints are expressed as a Shape Expression (ShEx) schema. Validata extends the ShEx functionality to support multiple requirement levels. Validata can be repurposed for di erent deployments by providing it with a new ShEx schema. The Validata code is available from https://github.com/HW-SWeL/Validata.</p>
      </abstract>
      <kwd-group>
        <kwd>Validation</kwd>
        <kwd>Conformance</kwd>
        <kwd>Metadata</kwd>
        <kwd>Dataset Descriptions</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Validation { checking the conformance of a dataset against a schema { has been
an integral part of XML and relational data generation, use, and exchange. It
provides guarantees that the data conforms to some structure which enables tools
to consume and process that data. This is essential to support data exchange.
However there is no standard for expressing constraints for RDF data.</p>
      <p>
        There has been a growing interest in validating RDF data beyond checking
consistency with OWL ontologies [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. For example, it may be desirable to
constrain that a title for a resource is given using the dcterm:title property and
that only one title is supplied. The W3C RDF Data Shapes Working Group1
are de ning a recommendation for capturing and testing such validation
requirements. One existing language for de ning such constraints for RDF data is the
Shape Expression language (ShEx) [
        <xref ref-type="bibr" rid="ref2 ref3">2,3</xref>
        ] which provides a means to declaratively
and concisely describe the shape of the graphs that are permitted. However
applications are required that will validate an RDF dataset against a schema
declared in ShEx.
      </p>
      <p>
        The contribution of this paper is to present the Validata tool: a web
application that enables users to easily check the conformance of an RDF document
against a ShEx schema description. Validata provides client-side, browser-based
validation of RDF documents, although it is also possible to use it through
an API so that it can be included as part of a data publishing pipeline. The
validation provides support for multiple RDF serialisations as well as di erent
1 http://www.w3.org/2014/data-shapes/ accessed November 2015
requirement levels, i.e. stating whether a property is required, desirable or
entirely optional. Validata was developed to check dataset descriptions (RDF
documents) against the W3C Health Care and Life Sciences Community (HCLS)
Pro le for Dataset Descriptions [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Due to its use of ShEx, it is capable of
being re-purposed for other metadata standards (e.g. DCAT [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]) or validation use
cases that occur in data exchange scenarios.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Motivating Use Case</title>
      <p>
        The W3C HCLS Community Pro le [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] speci es a common format for dataset
descriptions in the HCLS domain using RDF. The pro le consists of three
different types of description capturing di erent notions of a dataset (Figure 1): (i)
the abstract idea of a dataset, (ii) speci c versions of the dataset, and (iii) the
le distributions of a speci c version. That is, for a given dataset, say ChEMBL,
there is an abstract idea of its existence that can be described providing
properties that are unlikely to change over time (e.g. the name and description of the
dataset); then there are the speci c release versions of ChEMBL (20 at the time
of writing) for which the properties speci c to the release can be captured (e.g.
version number and release date); and nally there are distribution les for the
version, typically available in multiple le formats (e.g. database dump, RDF,
or SD le), which can also be described.
      </p>
      <p>
        For each description type, the metadata properties that need to be supplied
are identi ed and provided with a requirement level drawn from [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], viz. MUST,
SHOULD, or MAY. In total 61 metadata properties are identi ed in the HCLS
Community Pro le using terms drawn from 18 vocabularies. As such, it is di
cult to verify when a dataset description conforms to the community pro le. To
encourage uptake, it was recognised that there would be a need to automatically
validate the RDF descriptions of datasets against the HCLS Community Pro le.
      </p>
      <p>
        The HCLS Community Pro le is not the only dataset description speci
cation that exists. Others include BioDBCore [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] (a bioinformatics community
initiative), Open PHACTS dataset descriptions [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] (a project speci c pro le),
and DCAT [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] (a cross-domain initiative). It is desirable that a validation tool
is not made for a speci c standard but could be employed for several of these
standards through changing the con guration.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Requirements</title>
      <p>We aimed to develop a tool that would meet the needs for validating RDF
descriptions of datasets against di erent conformance schemas. The tool should
be able to accept RDF in commonly used RDF serialisations, e.g. Turtle, TriG,
and N-Triples, and the descriptions may be supplied by either copying and
pasting into the web tool, or via le upload. The tool should also support common
features of RDF such as string literals being provided with language tags.</p>
      <p>
        It should be possible to specify di erent requirement levels such as those
speci ed in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. The validator should consider the requirement level when returning
conformance reports to the user.
      </p>
      <p>The main motivating use case for our tool comes from the HCLS community
pro le. Thus the validation tool should be deployable on the W3C HCLS pages,
i.e. it can be served from a standard web server without any server side code. The
tool should support recent versions of the major browsers, i.e. Chrome, Firefox,
Internet Explorer, and Safari. However, as validation is an important part of
publishing data, it is desirable that the tool can also be used as part of a data
publishing pipeline.</p>
      <p>A typical interaction with the tool will be to re ne a dataset description to
ensure conformance with the chosen schema. To support this the tool should
provide informative error and warning messages to the user. It should also be
possible to edit the RDF data and easily recheck the validation.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Constraint De nitions</title>
      <p>
        The Shape Expression Language (ShEx) was chosen for specifying the schema
constraints on the RDF dataset descriptions [
        <xref ref-type="bibr" rid="ref2 ref3">2,3</xref>
        ], due to its concise notation
(based on regular expressions), expressive coverage, and the existence of
support libraries in Javascript. The concise notation enables a manual veri cation
between the written ShEx schema for a conformance standard and the written
documents which often include a checklist of properties, e.g. the table provided
in Section 5 of [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] or the bullet lists in Section 4.1 of [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>Figure 2 shows an excerpt of the ShEx schema for the HCLS community
pro le. The shape de nes the properties and objects for a resource to match.
In this case it de nes a &lt;SummaryLevelShape&gt; that consists of four constraints.
The shape must have a rdf:type declaration of dctypes:Dataset, and a
title provided using dct:title with a language tagged string value. The shape
may have an alternative title provided as a language tagged string using the
dct:alternative property. The nal constraint is that the dct:created
property must not be used at all { the negation is stated with the `!' and the match
1 &lt;SummaryLevelShape&gt; {
2 `MUST` rdf:type (dctypes:Dataset),
3 `MUST` dct:title rdf:langString,
4 `MAY` dct:alternative rdf:langString+,
5 `MUST` !dct:created .</p>
      <p>6 }
all with the `.'. Multiplicity of properties can be stated using standard regular
expression syntax, e.g. '+' for one or more occurrences.</p>
      <p>
        The full ShEx schema2 contains a shape declaration for each of the
description types: SummaryLevelShape, VersionLevelShape, DistributionLevelShape,
and RDFDistributionLevelShape. Two distribution level shapes are de ned
since several of the properties only make sense when describing an RDF dataset,
viz. the VoID properties [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
    </sec>
    <sec id="sec-5">
      <title>5 Implementation</title>
      <p>The Validata tool is a web application that is composed of two modules { a user
interface over a standalone validator. Both the user interface and the validator
are written in Javascript to enable the web application to be deployed on the
W3C web server, and run entirely on the client side. The code for the Validata
user interface is available from https://github.com/HW-SWeL/Validata while
the code for the validation backend is available from https://github.com/
HW-SWeL/ShEx-validator.</p>
      <p>The user interface has been developed using various existing libraries. jQuery3
is used for DOM manipulation with cross-browser functionality while Twitter
Bootstrap4 is used for cross-browser responsive pages. Browserify5 was used to
manage dependencies, and CodeMirror6 was used to provide text editor panes
which have line numbers. This enables informative error messages that reference
the line in the original RDF document that was at fault.</p>
      <p>The ShEx-Validator has been developed as a node.js module7 which provides
a standalone Javascript runtime environment. The core of the validator is the
ShEx Javascript library8. The ShEx-Validator separates the tasks of validation
2 https://github.com/HW-SWeL/ShEx-validator/blob/
043b396d9652deaf178628ba6f0b648464e4a8da/samples/HCLS%20Deployment/
hcls-chembl17-example-description.ttl accessed November 2015
3 https://jquery.com accessed November 2015
4 http://getbootstrap.com/ accessed November 2015
5 http://browserify.org/ accessed November 2015
6 https://codemirror.net/ accessed November 2015
7 https://nodejs.org/ accessed November 2015
8 Developed by Eric Prud'hommeaux https://github.com/ericprud/ShExDemo/
blob/96c7fb04827254af4618bdaeed49b3032fd002d3/RDF.js accessed Nov 2015
and reporting errors; the latter being handled by the user interface. To enable
support for di erent requirement levels, the ShEx language was extended to
enable arbitrary tags prior to each path expression (shown in Figure 2). peg.js9
was used to parse the extended grammar to generate the ShEx schema parser.
For parsing RDF documents the N3.js library10 was used which supports Turtle,
TriG, N-Triples and N-Quads. Instead of using asynchronous callbacks in the
code, the promises library11 was used to enable an immediate response which
would resolve to its actual value at some point in the future. Support for language
tags has also been added to the ShEx validator. This means that multiplicity
constraints are not violated if the property is given in multiple languages. A
library of unit tests have been developed for the ShEx-Validator backend. These
are run using the Mocha framework12 and ensure that the new features added
to the validator do not break existing features.</p>
      <p>
        The separation of the UI and the ShEx-Validator means that the validator
functionality can be deployed independently of the UI. For example, it could be
incorporated into a data publishing pipeline meaning that dataset descriptions
are validated as part of the data creation process. Alternatively it could be used
as a library in other tools such as a dataset description editor tool [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Usage
examples are available in the code documentation and in the Validata code itself.
6
      </p>
    </sec>
    <sec id="sec-6">
      <title>Deployment</title>
      <p>A screenshot of the W3C HCLS community pro le deployment of the Validata
tool is given in Figure 3. This is shown with the ChEMBL 2015 Demo deployed
and the validation requirement level set to `MAY' { meaning that the user would
like error reports generated for properties declared at any of the conformance
levels. Essentially the toggle enables the user to select the strictness of the validator
which is desirable in di erent scenarios.
6.1</p>
      <sec id="sec-6-1">
        <title>Information Panel</title>
        <p>
          The left side of the UI is used to provide summary information to the user. At
the top are the typical series of steps for using the tool, viz. (1) Select Schema, (2)
Input Data, (3) Con gure Options, and (4) Validation Results. These navigation
aids guide the user through the main panel without breaking the information
into multiple pages; thus allowing the user to scroll up and down and adjust
things as they require. Immediately below these instructions is the option to
load a precon gured demo. In this case it corresponds to the example used in
[
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] that describes the ChEMBL dataset [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ].
9 http://pegjs.org/ version 0.8.0; accessed November 2015
10 Developed by Ruben Verborgh https://github.com/RubenVerborgh/N3.js version
0.4.1; accessed November 2015
11 https://www.promisejs.org/ version 6.1.0; accessed November 2015
12 http://mochajs.org accessed November 2015
        </p>
        <p>At the bottom of the left panel is a Status Summary box that indicates the
validation progress. In this case it shows that the schema and the data are both
syntactically valid but that the data does not conform to the constraints speci ed
in the schema at the `MAY' level.
6.2</p>
      </sec>
      <sec id="sec-6-2">
        <title>Main Panel</title>
        <p>The main part of the UI is used for providing the RDF data to validate, and
displaying the results of the validation.</p>
        <p>Select Schema Panel. The Select Schema panel (collapsed in the screenshot)
is used for selecting a ShEx schema from a drop down menu; in the HCLS
deployment there is only a single schema to choose from. The ShEx speci cation
of the schema can be viewed by clicking on the Show Source button (not shown in
the screenshot due to the collapsed panel). The motivation for the Select Schema
panel is to allow users to validate their descriptions against multiple schemes in
a single tool.</p>
        <p>Input Data Panel. RDF data can be supplied for validation in one of several
serialisations, with the validator automatically detecting the syntax. The user
can either drag and drop an RDF le onto the panel, select the Browse button
to upload a le, or copy and paste the RDF into the editor box. The RDF is
parsed as it is entered to ensure that it is syntactically valid, resulting in the
data row of the Status Summary turning green.</p>
        <p>Con gure Options Panel. The user may con gure various options that are
applied during validation. Initially the ShEx-Validator attempts to automatically
identify the resources in the supplied RDF, and the shape that each resource
should validate against. These are shown at the top of the Con gure Options
panel. These often need correcting, either resources are not identi ed or they
are not being validated against the appropriate shape. The user can use the
drop down menus to correct these. For example if a resource is identi ed but is
being validated against the wrong shape then the correct shape can be selected.
Additional resources from the supplied RDF can be added using the Add Resource
button, and correspondingly removed using the Remove Resource button. The
interaction between the user and the add and remove resource buttons needs
additional work to improve the user experience as only the bottom resource can
currently be removed. Instead the user should be able to directly select the rows
that are to be removed.</p>
        <p>
          The next con guration option allows the user to select whether to generate
an error if additional properties are found, i.e. a resource matches a shape but
has additional properties. Following the nomenclature of [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] the option is labelled
Closed World Validation.
        </p>
        <p>The nal con guration option informs the ShEx-Validator at which
requirement level the validation should take place. Any properties that are de ned in
the ShEx schema with the selected requirement level and missing from the
resource in the RDF will be reported as errors, those with a lower requirement
level will be reported as warnings. This panel is only shown for schema that
have requirement levels declared. The e ect of choosing di erent requirement
levels is that the UI will return validation results at the di erent levels as either
errors or warnings. Users have found this desirable as a means of rst developing
a minimal description by ensuring that it matches the `MUST' properties and
then iteratively validating at the lower levels.</p>
        <p>Validation Results Panel. The bottom panel of the tool provides the
validation results. For each of the resources shown in the Con gure Options panel, an
entry is shown in the Validation Results panel. If the resource conforms to the
shape description, then it is shown under the green Matches panel. This allows
the user to see which parts of the RDF document match to a ShEx constraint.
Clicking on an individual match will jump the focus of the UI up to the
corresponding line of the supplied RDF data. If all the properties match then the
validation row of the Status Summary will turn green.</p>
        <p>Two severity levels of reports can be generated when properties are not
present: (i) errors shown with red highlighting, and (ii) warnings shown with
amber highlighting. The categorisation is controlled by the chosen requirement
level. This is desirable as several properties de ned at the SHOULD and MAY
requirement level are seen as optional since they do not make sense for every
situation. If ShEx optional properties were used, such reports would not be
generated.</p>
        <p>Additionally, error reports are generated for invalid syntax or when the
incorrect object is supplied for a property, e.g. a string is provided instead of an
IRI. Through the use of the codemirror library, when these errors are clicked-on
the corresponding line in the RDF is highlighted in the editor pane allowing
the user to correct the error. Due to the nature of parsing errors, these are not
always accurate, but are generally in the correct region of the document.
6.3</p>
      </sec>
      <sec id="sec-6-3">
        <title>Additional Deployments</title>
        <p>Validata is a generic tool for validating RDF documents against a set of
constraints captured as a ShEx schema. The tool can easily be deployed in di
erent settings by editing the con guration le containing the ShEx schema. To
demonstrate this exibility, we have deployed Validata at two other locations
for di erent dataset description schemas.</p>
        <p>
          An instance capable of validating DCAT records [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] is available at http:
//www.macs.hw.ac.uk/~ajg33/validata/. This instance does not use
requirement levels as these are not de ned in the DCAT recommendation. Nor does
the DCAT speci cation specify which properties are optional or mandatory, as
such all properties have been speci ed as optional. The demonstration example
uses the example in the DCAT speci cation. This example does not use closed
world validation as it includes rdfs:label which is not speci ed anywhere else
in the DCAT recommendation.
        </p>
        <p>
          We have also deployed a version of Validata for validating dataset descriptions
against the Open PHACTS speci cation [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. This instance is available from
http://openphacts.cs.man.ac.uk/validata/ and uses a description for the
DrugBank dataset for its example. Again requirement levels are used.
7
        </p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Related Work</title>
      <p>
        Various approaches for describing constraints on an RDF document have been
considered [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] and some tools have been developed for enabling users to validate
their RDF documents against these constraints.
      </p>
      <p>The Fancy ShEx Demo developed by Eric Prud'hommeaux13 is a proof of
concept system to showcase the capabilities of the ShEx language. It provides a
Javascript implementation for validating RDF against a ShEx schema and was
chosen as the basis for our own implementation. It is limited to only accepting
RDF in the turtle serialisation and does not support the use of language tags.
The focus of the tool is not on the user experience when validating an RDF
document. Instead the purpose of the tool is on demonstrating the capabilities
of the ShEx language. Thus it has several features that are not required for
the validation use case, e.g. the ability to generate SPARQL queries to perform
the validation. The error messages generated are terse and require knowledge of
ShEx. We have aimed to provide more user oriented error messages.</p>
      <p>ShExScala14 developed by Jose Emilo Labra Gayo is a Scala implementation
of a ShEx validator. It supports multiple RDF serialisations and can be used to
validate an RDF dataset from a given URL or SPARQL endpoint. It was not
possible to use our implementation as it requires a server-side deployment.</p>
      <p>
        The most similar work to our own is the DCAT validator developed as part
of the Google Summer of Code 201515. This is a web application for validating
DCAT dataset descriptions [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] written entirely in Javascript. Users can enter
their description by le upload, by URI, or direct entry. As with our user
interface, the reporting of validation results are split into errors and warnings.
The validation constraints are captured in a JSON array as key value pairs,
rather than reusing one of the approaches being developed in the W3C RDF
Data Shapes Working Group. For the DCAT deployment, it is unclear where
the choice of mandatory and optional constraints has come from. Entering the
examples from the DCAT recommendation result in validation errors.
8
      </p>
    </sec>
    <sec id="sec-8">
      <title>Conclusions</title>
      <p>Validata provides a recon gurable, online tool for validating RDF documents
against a set of constraints de ned in a ShEx schema. Informative error and
warning messages are presented to the user to enable them to re ne their RDF to
conform with the schema. The validation implementation extends the standard
ShEx de nition to support di erent requirement levels; with the levels being
de ned in the schema de nition rather than prescribed by the ShEx language.</p>
      <p>The tool has been deployed by the W3C HCLS Interest Group to validate
dataset descriptions against their community pro le. Additional deployments
have been made for the DCAT speci cation and the Open PHACTS project.</p>
      <p>Future work on the Validata tool includes improving the user experience, e.g.
allowing users to save edited RDF les and improving the selection of resources
to validate. Once these changes to the UI have been made a user evaluation
should be conducted.</p>
    </sec>
    <sec id="sec-9">
      <title>Acknowledgements</title>
      <p>We thank Eric Prud'hommeaux for the support and advice received throughout
the development of Validata.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Hors</surname>
            ,
            <given-names>A.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solbrig</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , Prud'hommeaux, E.:
          <source>Rdf validation workshop report: Practical assurances for quality rdf data. Report, W3C (December</source>
          <year>2012</year>
          ) http: //www.w3.org/
          <year>2012</year>
          /12/rdf-val
          <source>/report.</source>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Prud'hommeaux</surname>
          </string-name>
          , E.,
          <string-name>
            <surname>Labra</surname>
            <given-names>Gayo</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>J.E.</given-names>
            ,
            <surname>Solbrig</surname>
          </string-name>
          , H.:
          <article-title>Shape expressions: An RDF validation and transformation language</article-title>
          .
          <source>In: 10th International Conference on Semantic Systems</source>
          . (
          <year>2014</year>
          )
          <volume>32</volume>
          {40 doi:10.1145/2660517.2660523.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Gayo</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Prudhommeaux</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Solbrig</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <article-title>Rodr guez</article-title>
          , J.A.:
          <article-title>Validating and describing linked data portals using RDF Shape Expressions</article-title>
          .
          <source>In: Workshop on Linked Data Quality</source>
          . (
          <year>2014</year>
          ) http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1215</volume>
          /paper-06.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Gray</surname>
            ,
            <given-names>A.J.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baran</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Marshall</surname>
            ,
            <given-names>M.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumontier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Dataset descriptions: HCLS community pro le</article-title>
          . Interest group note,
          <source>W3C (May</source>
          <year>2015</year>
          ) http://www.w3. org/TR/hcls-dataset/.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Maali</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Erickson</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Data catalog vocabulary (DCAT)</article-title>
          .
          <source>Recommendation, W3C (January</source>
          <year>2014</year>
          ) http://www.w3.org/TR/vocab-dcat/.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Bradner</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Key words for use in RFCs to indicate requirement levels</article-title>
          .
          <source>Best current practice</source>
          , Network Working Group, Internet Engineering Task Force (
          <year>March 1997</year>
          ) http://www.ietf.org/rfc/rfc2119.txt.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Gaudet</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bairoch</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Field</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , et al.:
          <article-title>Towards BioDBcore: a community-de ned information speci cation for biological databases</article-title>
          .
          <source>Database: The Journal of Biological Databases and Curation</source>
          <year>2011</year>
          (
          <year>January 2011</year>
          ) doi:10.1093/database/baq027.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Gray</surname>
            ,
            <given-names>A.J.G.</given-names>
          </string-name>
          :
          <article-title>Dataset descriptions for the open pharmacological space</article-title>
          . Working draft,
          <string-name>
            <surname>Open</surname>
            <given-names>PHACTS</given-names>
          </string-name>
          (
          <year>September 2012</year>
          ) http://www.openphacts.org/specs/ datadesc/.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Alexander</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hausenblas</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Describing Linked Datasets with the VoID Vocabulary</article-title>
          . Note, W3C (
          <year>2011</year>
          ) http://www.w3.org/TR/void/.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Goble</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gray</surname>
            ,
            <given-names>A.J.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tatakis</surname>
          </string-name>
          , E.:
          <article-title>Help me describe my data : A demonstration of the Open PHACTS VoID Editor</article-title>
          .
          <source>In: ISWC 2014 Poster and Demos</source>
          . (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Bento</surname>
            ,
            <given-names>A.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gaulton</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hersey</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          , et al:
          <article-title>The ChEMBL bioactivity database: an update</article-title>
          .
          <source>Nucleic acids research</source>
          42(Database issue) (
          <year>January 2014</year>
          )
          <volume>D1083</volume>
          {
          <fpage>1090</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>