<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Design and generation of Linked Clinical Data Cubes</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Laurent Lefort</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Hugo Leroux</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>CSIRO ICT Centre</institution>
          ,
          <addr-line>Canberra, ACT</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>The Australian E-Health Research Centre, CSIRO</institution>
          ,
          <addr-line>Brisbane, Queensland</addr-line>
          ,
          <country country="AU">Australia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Clinical Study Data Exchange technologies, based on XML, have improved the data capture phase of clinical data and enabled larger and more diverse longitudinal clinical research studies. There is now a growing interest in this community for solutions based on Semantic Web standards. Healthcare and life sciences metadata resources such as medication classifications are now shared via linked data platforms. The increasing pressure to make clinical trial data more open is another strong incentive for the adoption of linked open data technologies. This paper describes the application of semantic statistics vocabularies to deliver clinical data as linked data in a form that is easy to consume by statisticians and easy to enrich with links to complementary data sources. We combine the strengths of the RDF Data Cube and DDI-RDF vocabularies to propose a Linked Clinical Data Cube (LCDC), a set of modular data cubes that helps us manage the multi-disciplinary nature of the source data. We validate our approach on the Australian, Imaging, Biomarker and Lifestyle study of Ageing (AIBL). This dataset, comprising more than 1600 variables clustered in 25 different sub-domains, has been fully converted into RDF with one general data cube and one specialised data cube for each sub-domain. This implementation demonstrates the effectiveness of the association of the RDF Data Cube and DDI-RDF vocabularies for the publication of large and diverse clinical datasets as linked data. We also show that the structure of the LCDC overcomes the monolithic nature of clinical data exchange standards and expedites the navigation and querying of the data from multiple views.</p>
      </abstract>
      <kwd-group>
        <kwd>linked data</kwd>
        <kwd>clinical study</kwd>
        <kwd>data cube</kwd>
        <kwd>semantic statistics</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The Australian, Imaging, Biomarker and Lifestyle study of Ageing1 (AIBL) [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] is a
longitudinal clinical study of more than 1100 Australians aged over 65 years focusing
on early pathological indicators of Alzheimer’s disease. The AIBL dataset contains 25
sub-domains encompassing more than 1600 variables. AIBL uses a Clinical Data
Management System to collect and manage the study data. The tool used [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ],
OpenClinica2, supports the creation of customisable studies and the design of user-defined
Case Report Forms (CRFs) using an Excel spreadsheet. It adheres to the Clinical Data
Interchange Standards Consortium 3 (CDISC) Operational Data Model (ODM) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]
XML-based standard. ODM-compliant files contain the study data and the associated
descriptions of the data items, their groupings into CRFs and the associated questions
and code lists.
      </p>
      <p>
        Our main motivation for the publication of AIBL data as linked data is to make the
data seamlessly available to researchers and to enrich it when possible with other data
sources. Medication information collected in the study can be mapped [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] to the
Australian Medicines Terminology4 (AMT) and SNOMED-CT5. The rapid growth of the
healthcare and life sciences Linked Open Data cloud (OPEN-PHACTS6, Bio2RDF7)
opens new opportunities to add value to the AIBL data. Linking to DrugBank8, for
example, can bring extra information on drug interaction, targets and pathways.
      </p>
      <p>
        In this paper, we explain why we have opted to use semantic statistics vocabularies
and how they help us overcome the monolithic nature of the ODM data model and its
limitations outside the data capture phase. The RDF Data Cube vocabulary [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] is a
proven solution [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] for the construction of multi-dimensional data cubes offering
multiple access points to the data via thematic slices. The DDI-RDF Discovery
Vocabulary [
        <xref ref-type="bibr" rid="ref12 ref7 ref8">7, 8, 12</xref>
        ] helps us manage the links between the data cube variables and their
definitions supplied via the study-specific data dictionary embedded in the ODM
standard. We describe how we have split the AIBL dataset into a set of data cubes to
increase its modularity and designed its URI scheme to ensure that access to it is not
constrained by the original data model. The slicing strategy supports the grouping of
data into times series with various temporal granularity (phase of the study, day of the
observation), and into cross-sections offering different options to group patient
together, such as the membership of patients to specific cohorts or the gender.
      </p>
      <p>
        We validate our approach on the AIBL dataset. We use this example to show how
we can add additional slices to enrich the Medication sub-cube with external linked
data sources and how we can consume the linked data with a generic off the shelf
mashup tool, Visual Box [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
      <p>
        The discussion is focused on the compliance of the Linked Clinical Data Cube
with the RDF Data Cube vocabulary. Our goal is to have a solution that is compatible
with visualisation tools based on this specification. We have reviewed the
applicability of the proposed integrity constraints to our use case and found that clinical
research data is a category of data that is patchier than other categories of statistical
data. We conclude that semantic statistics vocabularies can and should serve a more
diverse range of use cases than the ones already documented in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
2 https://openclinica.com/
3 http://www.cdisc.org/
4 http://www.nehta.gov.au/our-work/clinical-terminology/australian-medicines-terminology
5 http://www.ihtsdo.org/snomed-ct/
6 http://www.openphacts.org/
7 http://bio2rdf.org/
8 http://www.drugbank.ca/
      </p>
      <p>The rest of this paper is structured as follows. Section 2 details the coverage of
CDISC ODM features by the RDF Data Cube and DDI-RDF Discovery vocabularies.
Section 3 introduces the design of the Linked Clinical Data Cube and of its URI
scheme. Section 4 presents our work on the AIBL linked clinical dataset. Section 5
contains the discussion on the alignment and compliance issues and reviews the
linked data management requirements which are specific to clinical studies.
2
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Coverage of CDISC ODM by Semantic Statistics vocabularies</title>
      <sec id="sec-2-1">
        <title>CDISC ODM</title>
        <p>
          The CDISC ODM standard [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] defines an XML-based format that facilitates the
capture of clinical data during a clinical study. The tree structure of the ODM XML
Schema is shown in Figure 1. For the data sub-tree, the top level element is study,
followed by subject and study event (phase of the study). The next three elements
match the structure of the electronic forms used for data capture. The ODM format
also contains the variable definitions (items) and their associated codelists. Each
“data” element is linked to a “def” element in the metadata sub-tree.
The W3C RDF Data Cube vocabulary [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] is a vocabulary for the publication of
statistical data in RDF [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] which is derived and compatible with the cube model that
underlies SDMX 10 (Statistical Data and Metadata eXchange), a statistical data and
metadata standard. This cube model (Figure 2) allows users to group subsets of
observations within a dataset into slices where all but one (or a small subset) of the
dimensions are fixed. The dimensions, measures and attributes of the data cube and
their usage in slices and observations are specified via a Data Structure Definition (or
DSD) object. The guidelines for DSDs published by SDMX [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] define the
method9 http://xml.coverpages.org/CDISC-ODMOverviewV1-1.pdf
10 http://sdmx.org/
ology for slicing with detailed advice on how to design the data cube structure
according to the nature of the data.
The Data Documentation Initiative (DDI) is an alliance developing XML-based
standards for information describing statistical and social science data. The key
motivation for DDI is the need to share highly-detailed metadata to ensure the correct
analysis and use of the data collected during surveys.
The DDI-RDF Discovery vocabulary is a RDF version of a subset of the DDI
standard created [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] by members of the Linked Open Data community. It is published as an
unofficial draft [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] by DDI and reuses or extends several linked data vocabularies,
including the RDF Data Cube. Figure 3 shows the relationships between the main
classes defined by the Disco vocabulary [
          <xref ref-type="bibr" rid="ref12 ref7">7, 12</xref>
          ]. DDI-RDF also contains definitions
for statistics based on quantitative and qualitative data, which can also be useful.
2.4
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Coverage of ODM features by QB and Disco</title>
        <p>
          We have outlined in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] our approach to map the ODM data to the RDF Data Cube
Vocabulary and the rationale behind our decision to split the AIBL dataset into one
main data cube containing the common data and multiple specialised data cubes
adapted to each sub-domain. The strength of the Data Cube, at the level of the main
cube, is that the original structure of the ODM data model
(Study-SubjectStudyEvent-Form-ItemGroup-Item) can be replicated in the generated cube if needed.
Furthermore, it supports alternative methods of accessing the data, in particular,
methods where the data is aggregated along other dimensions or along the same
dimension in different order.
        </p>
        <p>The correct use of the data recorded during surveys is also important for the
producers of clinical trial data. We have opted to reuse the DDI-RDF Discovery
Vocabulary to consistently manage the study-specific data dictionary exported from the
OpenClinica tool via the ODM format and the CDISC metadata resources (STDM,
CDASH). disco:Universe defines the domain at multiple levels of the data cube.
disco:Variable corresponds to the property used to store the data and
disco:VariableDefinition is used to link to the definition of this property
(metadata).</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Design of the Linked Clinical Data Cube</title>
      <p>
        The design of the Linked Clinical Data Cube is done in three steps. The first step,
also discussed in [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] is to split our dataset into a number of smaller, specialised,
cubes. The second step is to define several slice hierarchies to offer multiple access
options to individual data records. The third step is to define a URI scheme that
supports access at all the levels of the slice hierarchy.
      </p>
      <p>
        We have used the SDMX guidelines [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] to define the dimensions and attributes
for our time series and cross-sectional slices. The time-series slices address the
longitudinal nature of the study and organise the data into time intervals and dated and
non-dated time points. The cross-section slices adopt a subject-centric approach into
abstracting the data set along some important concepts such as gender, genotype and
neurological classification. The Theme slices categorise the data into the study
domains and sub-domains (disco:Universe in DDI-RDF) and help to link the main
and specialised cubes. The navigation and querying of the data in the LCDC is easier
because we provide three direct links to the node containing the data instead of one:
the Phase series (at the level of Study Event Data in ODM), the Subject section
(Subject Data) and the Sub-theme slice (Item Group Data).
      </p>
      <p>The RDF Data Cube (QB) specification restricts the use of the
qb:observation property to cases where the range class is a qb:Observation
and does not allow qb:observation o qb:observation property chains
between qb:Slice and qb:Observation via qb:ObservationGroup. We
use void:subset11 for the dataset/slice and slice-sub-slice links shown in Figure 5.</p>
      <p>The use of QB properties is shown in Figure 6 which presents only the LCDC
slices which subsume qb:Slice. We use qb:observation and
qb:observationGroup for the slice-observation links and slice-observation
group links. The specialisedSeries and specialisedSection properties
are for the links between the slices in the main and specialised cubes. The
11 http://www.w3.org/TR/void/
specialisedObservation property manages the links between the observation
groups in the main cube and the corresponding observations in the specialised cubes
and is a sub-property of qb:observation. Finally, the mainDataSet property
is defined to link the observation groups back to the dataset.</p>
      <p>
        The LCDC URI scheme (Table 1 and Table 2) follows the convention adopted by
projects [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] which use the Linked Data API12. This convention uses URIs finishing
with an identifier to give access to a single instance (Item endpoint) and URIs
finishing with a keyword to give access to a list of instances (List endpoint).
12 https://code.google.com/p/linked-data-api/
      </p>
      <p>Specialised cube URI scheme
Sub-theme series ROOT/{dataset}/ts/pr/{pr}/th/{th}/st/{st}/ph/{ph}
Sub-theme section ROOT/{dataset}/ts/pr/{pr}/th/{th}/st/{st}/nd/{nd}/su/{su}
Observations</p>
      <p>ROOT/{dataset}/{pr/{pr}/th/{th}/st/{st}/ph/{ph}/su/{su}</p>
      <p>The URI patterns listed above are shortened to fit in more compact tables. The
longer version of the pattern listed in last row of Table 2 is:</p>
      <p>ROOT/{dataset}/product/{product}/theme/{theme}/subtheme
/{subtheme}/phase/{phase}/subject/{subject}</p>
      <p>Using alternate keywords and identifiers user-friendly URIs:
ROOT/lcdc/product/odm/theme/cognitive/subtheme/neuropsych
/phase/72months/subject/ss_1175.</p>
      <p>The LCDC ontologies are available via the URIs included in Table 3. A majority
of the core classes (DataFile, Phase, Product, Question, Questionnaire, Study,
StudyGroup, SubTheme, Subject, SupplVariableDefinition,Theme, Variable,
VariableDefinition) and properties are based on Disco. The Observation, Time Series,
Cross-Section, Domain Slice and Cube ontologies contain the classes corresponding
to the different aspects of the Linked Clinical Data Cube described above and the
associated properties. The ODM and AIBL ontologies define datatype properties for
the identifiers present in the ODM file to capture this information as provenance data.</p>
      <p>Ontology URI
Core http://purl.org/sstats/lcdc/def/core#
Observations http://purl.org/sstats/lcdc/def/obs#
Time Series http://purl.org/sstats/lcdc/def/time-series#
Cross-section http://purl.org/sstats/lcdc/def/cross-section#
Domain Slice http://purl.org/sstats/lcdc/def/domain-slice#
Cube http://purl.org/sstats/lcdc/def/cube#
ODM http://purl.org/sstats/lcdc/def/odm#
AIBL http://purl.org/sstats/lcdc/def/aibl#</p>
      <p>To create them, we have defined an Excel spreadsheet template to capture all the
information required to generate the LCDC OWL ontologies with XSL
transformations. Our template supports the definition of classes and properties
definitions, their alignment to QB, VoID and Disco, the URI prefixes and patterns and
the DSD levels. We can also generate SPARQL queries for the retrieval of instance
data for each class and for the detection of traversal links between LCDC-named
entities. We plan to further extend this template to automate the creation of the Linked
Data API configuration files as much as possible.</p>
    </sec>
    <sec id="sec-4">
      <title>Our application</title>
      <sec id="sec-4-1">
        <title>The AIBL study</title>
        <p>
          AIBL has been designed to support investigations of the predictive utility of
various biomarkers, cognitive parameters and lifestyle factors as indicators of
Alzheimer’s disease (AD) with a cohort of over one thousand participants residing in two
Australian cities, Perth and Melbourne. Each recruited participant completed blood
and neurological testing and some underwent brain imaging testing. The AIBL study
data was successfully migrated to the OpenClinica platform in 2011 [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] and has been
live since August 2011.
4.2
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Conversion of the AIBL data</title>
        <p>We have converted an ODM file containing the data from AIBL study. This dataset
uses more than 1600 variables clustered in 25 different sub-domains. The AIBL study
has been split into five themes: Study, Clinical, Lifestyle, Imaging and Cognitive. The
‘Study’ category comprises administrative information, most of which will not be
shared in the cube. Table 5 gives the total number of instances per theme for different
LCDC classes.</p>
        <p>Theme
Clinical
Cognitive
Imaging
Lifestyle
Study</p>
        <p>
          The LCDC design has been extended to support our plan [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] to use the AMT and
SNOMED CT-AU taxonomies to enrich the medication data with other medication
resources, in particular the ones that are already available as linked open data. We
have implemented specific types of slices for the Concomitant Medication sub-cube
to serve observations which contain links to external resources like AMT, SNOMED
and the World Health Organization Anatomical Therapeutic Chemical Defined Daily
Dose classification (ATC DDD). The CM (Concomitant Medication) ontologies
(Ta13 The total number of variables is smaller than 1600 because the generation to RDF
suppresses duplicates.
ble 5) contain sub-classes of the observation and cross-section classes defined in the
core ontology.
        </p>
        <p>Ontology URI
CM http://purl.org/sstats/lcdc/cm/def/cm#
CMATC http://purl.org/sstats/lcdc/cm/def/cm-atc#
CMAMT http://purl.org/sstats/lcdc/cm/def/cm-amt#
CMSNOMEDhttp://purl.org/sstats/lcdc/cm/def/cm-snomed#</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.4 Visualisation of the AIBL data</title>
        <p>
          An example of visualisation of the LCDC data developed is presented in Figure 8.
We have used the Visual Box14 [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] tool to build visualisations of SPARQL query
results to support data verification activities.
        </p>
        <p>Fig.7.ClassificationofAIBLsubjectsat18months</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5 Discussion</title>
      <sec id="sec-5-1">
        <title>5.1 Implementation report</title>
        <p>
          The RDF Data Vocabulary specification W3C Candidate Recommendation [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] has
reached the stage used by W3C to gather implementation experience prior to the final
decisions on “at risk” features. We can provide feedback on the usefulness of optional
14 http://visualbox.org
terms and on the applicability of integrity constraints. We use two optional terms: the
qb:ObservationGroup class and qb:observationGroup property.
        </p>
        <p>Some specialised data cubes do not satisfy the integrity constraints, specifying that
every qb:DataStructureDefinition must include at least one declared
measure (IC-3), that only attributes may be optional (IC-6) and that each individual
qb:Observation must have a value for every declared measure (IC-14). These
constraints are too restrictive for our Nutrition data cube where the presence or
absence of a value for a particular category of food varies according to the subject’s
diet. This is a concern for survey questionnaires using previously entered values to
determine if a field on a form should be mandatory filled.
5.2</p>
      </sec>
      <sec id="sec-5-2">
        <title>Coverage of our use case by Semantic Statistics vocabularies</title>
        <p>
          We recommend that the semantic statistics vocabularies under development cover a
broader set of use cases than the ones currently outlined in [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] that correspond to
collections of "regular" CSV files, spreadsheets and OLAP data cubes. The LCDC use
of the RDF Data Cube vocabulary is different from the more common use cases [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]
primarily because of the unreliable, disparate and longitudinal nature of clinical data.
This, however, should still allow us to reuse visualisation tools based on the RDF
Data Cube specification and especially RDF Data Cube browsers such as CubeViz15.
        </p>
        <p>On the other hand, we have found that the DDI-RDF vocabulary is well suited to
addressing the needs of the ODM community in standardising access to the clinical
data and explicitly linking the clinical data with its associated metadata.
6</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusions</title>
      <p>This paper has outlined an approach to integrate clinical study data exchange
standards with semantic statistics standards to make the clinical data available as linked
data. In particular, we have outlined the design of a Linked Clinical Data Cube, which
integrates a general and several specialised data cubes to expedite the navigation and
querying of clinical data. The Linked Clinical Data Cube combines the strength of the
RDF Data Cube in defining multi-dimensional data cubes and the DDI-RDF
vocabulary to encode the study-specific data dictionary as linked data. Our approach was
validated on a large and diverse clinical dataset with features that differ from other
types of statistical datasets. The sheer volume of variables has necessitated a split of
the clinical data into a set of modular data cubes to improve their manageability
during the generation process and facilitate their discovery and usability by end users.
We have observed that the patchy nature of clinical data is also more pronounced than
for other types of statistical datasets. We are convinced that the integration of clinical
study data exchange technologies and semantic statistics vocabularies will expedite
the deployment of cross-study analysis and evidence-based medicines by facilitating
the integration of clinical trials from disparate sources. We conclude that the
associa15 http://aksw.org/Projects/CubeViz.html
tion of the RDF Data Cube and DDI-RDF vocabularies is very effective in facilitating
the publication of large and diverse data set and hope that this will provide the
catalyst for increased coordination between the two initiatives.
7</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Ellis</surname>
            ,
            <given-names>K. A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Bush</surname>
            ,
            <given-names>A. I.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Darby</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ; De Fazio,
          <string-name>
            <given-names>D.</given-names>
            ;
            <surname>Foster</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          ; Hudson,
          <string-name>
            <given-names>P.</given-names>
            ;
            <surname>Lautenschlager</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. T.</given-names>
            ;
            <surname>Lenzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            ;
            <surname>Martins</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. N.</given-names>
            ;
            <surname>Maruff</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ;
            <surname>Masters</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ;
            <surname>Milner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ;
            <surname>Pike</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ;
            <surname>Rowe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ;
            <surname>Savage</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          ; Szoeke,
          <string-name>
            <given-names>C.</given-names>
            ;
            <surname>Taddei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ;
            <surname>Villemagne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ;
            <surname>Woodward</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            &amp;
            <surname>Ames</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          :
          <article-title>The Australian Imaging, Biomarkers and Lifestyle (AIBL) study of aging: methodology and baseline characteristics of 1112 individuals recruited for a longitudinal study of Alzheimer's disease</article-title>
          ,
          <source>International Psychogeriatrics</source>
          <volume>21</volume>
          (
          <issue>04</issue>
          ), pp.
          <fpage>672</fpage>
          -
          <lpage>687</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Leroux</surname>
          </string-name>
          , H.;
          <string-name>
            <surname>McBride</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Gibson</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>On selecting a clinical trial management system for large scale, multi-centre, multi-modal clinical research study</article-title>
          ,
          <source>Stud Health Technol Inform</source>
          <volume>168</volume>
          , pp.
          <fpage>89</fpage>
          -
          <lpage>95</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>CDISC</given-names>
            <surname>Operational Data</surname>
          </string-name>
          <article-title>Model (ODM) Version 1</article-title>
          .3.
          <issue>1</issue>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>McBride</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lawley</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leroux</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Gibson</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Using Australian Medicines Terminology (AMT) and SNOMED CT-AU to better support clinical research</article-title>
          ,
          <source>Studies in Health Technology and Informatics</source>
          ,
          <volume>178</volume>
          , pp.
          <fpage>144</fpage>
          -
          <lpage>149</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Reynolds</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Tennison</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>The RDF Data Cube Vocabulary, W3C Candidate Recommendation 25 June 2013</article-title>
          . World Wide Web Consortium (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Lefort</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bobruk</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haller</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Woolf</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>A Linked Sensor Data Cube for a 100 Year Homogenised Daily Temperature Dataset</article-title>
          .
          <source>In Proc. Semantic Sensor Networks Workshop (SSN</source>
          <year>2012</year>
          ),
          <source>CEUR Workshop Proceedings</source>
          , vol.
          <volume>904</volume>
          , ceur-ws.org pp.
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Bosch</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zapilko</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cotton</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gregory</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kämpgen</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Olsson</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paulheim</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wackerow</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>DDI-RDF Discovery Vocabulary A vocabulary for publishing metadata about data sets (research and survey data) into the Web of Linked Data Unofficial Draft 20 June 2013 Technical report</article-title>
          , DDI Alliance (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Bosch</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Gregory</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Wackerow DDI-RDF Discovery</surname>
          </string-name>
          <article-title>Vocabulary: A Metadata Vocabulary for Documenting Research</article-title>
          and
          <source>Survey Data Proc. 6th Workshop on Linked Data on the Web (LDOW2013) CEUR-WS Workshop Proceedings</source>
          . vol.
          <volume>996</volume>
          ceurws.org (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Graves</surname>
            ,
            <given-names>A</given-names>
          </string-name>
          :
          <article-title>Creation of visualizations based on linked data</article-title>
          .
          <source>In Proc. of the 3rd International Conference on Web Intelligence, Mining and Semantics (WIMS '13)</source>
          . ACM (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Kämpgen</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          &amp;
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ;
          <source>Use Cases and Lessons for the Data Cube Vocabulary W3C Working Group Note 27 February</source>
          <year>2013</year>
          , World Wide Web Consortium (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <article-title>SDMX SDMX guidelines for DSDs Statistical Data</article-title>
          and Metadata
          <string-name>
            <surname>Exchange</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Bosch</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Cyganiak</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ; Wackerow,
          <string-name>
            <given-names>J.</given-names>
            &amp;
            <surname>Zapilko</surname>
          </string-name>
          ,
          <string-name>
            <surname>B.</surname>
          </string-name>
          (
          <year>2012</year>
          ),
          <article-title>Leveraging the DDI Model for Linked Statistical Data in the Social, Behavioural, and Economic Sciences</article-title>
          ,
          <source>in Proc. of the 12th International Conference on Dublin Core and Metadata Applications (DC</source>
          <year>2012</year>
          )
          <volume>46</volume>
          -
          <fpage>55</fpage>
          , Dublin Core Metadata Initiative (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Leroux</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Lefort</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Using CDISC ODM and the RDF Data Cube for the Semantic Enrichment of Longitudinal Clinical Trial Data</article-title>
          .
          <source>In Proc. Semantic Web Applications and Tools for Life Sciences (SWAT4LS</source>
          <year>2012</year>
          ) CEUR Workshop Proceedings vol.
          <volume>954</volume>
          , ceurws.org (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>