<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>An Ontology Blueprint for Constructing Qualitative and Quantitative Scienti c Variables</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>Institute of Arctic and Alpine Research University of Colorado</institution>
          ,
          <addr-line>Boulder 80309</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This work presents an ongoing e ort to develop simple ontological design patterns for describing scienti c variables with a high level of speci city in resource description format (RDF). The application of the ontology design patterns discussed here were used to create a variables ontology for the geosciences. The long-term aim of this work is to develop an ontological blueprint for automated ontology generation from a corpus. Such ontologies can be used for semantic mediation in automated scienti c work ows and semantic alignment of content in heterogeneous resources.</p>
      </abstract>
      <kwd-group>
        <kwd>ontology design pattern scienti c variables semantic mediation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        The Ontology for Constructing Scienti c Variables (OSV) is a mechanism for
storing conceptual information necessary for identifying, disambiguating, and
assembling scienti c variables. OSV is a successor to the Geoscience Ontology
(GSN)[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and extends the principles introduced in the CF standard names [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]
and the CSDMS standard names (CSN) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]; whereas the aforementioned naming
e orts relied on encoding scienti c variables using controlled vocabularies and
one-dimensional strings, the OSV is terminology-agnostic and encodes relational
and contextual information via the Resource Description Format (RDF),
resulting in a richer representation with more degrees of freedom. OSV is a critical tool
for semantic mediation, providing the language to link unstructured information
contained in large corpora to structured information captured in data sets and
used by computational models. Along with other interpretative tools, OSV is
designed to enable automated alignment and integration of distributed scienti c
information.
      </p>
      <p>
        There are a wide range of scienti c ontologies available, see e.g., [
        <xref ref-type="bibr" rid="ref5 ref6 ref7">5,6,7</xref>
        ].
However, although these ontologies are useful for speci c applications, there is, to the
authors' knowledge, no available ontology that (a) provides the desired
specicity for distinguishing variables at a highly granular level within a domain,
(b) comprises patterns that are readily extensible to other domains, and (c)
denes mandatory components of a variable. The ontology we present in this work
aims to decompose and modularize the construction of scienti c variables,
explicitly labeling required elements that must be provided in order to completely
and unambiguously identify the concepts represented by a scienti c variable|
namely an object of observation, a corresponding property, and a quantity with
units. We start by identifying the core ontology building blocks in Section 2 and
then describing how the building blocks are combined to build complex concept
representations.
2
2.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Concept Class De nitions</title>
      <p>
        Physical Concepts
A Phenomenon is a fact or situation that is observed or could be observed
to exist or happen in the physical world. A phenomenon that is observed to
exist is at equilibrium, whether dynamic, chemical or static, and one that is
observed to happen is removed from equilibrium, experiencing a change of state
as a result of certain processes. A phenomenon consists of the substance of which
it is made (Matter), a Form that de nes its occupation of space, and possibly,
one or more Processes. Phenomena are de ned recursively[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], where any given
phenomenon can be decomposed into smaller phenomena and can be combined
with other phenomena to build larger, more complex phenomena. A Body is a
phenomenon at equilibrium that is identi ed by its Matter and Form. A Process
is a set of actions that may occur in parallel or sequentially.
2.2
A System is the abstracted, diagrammatic representation of a phenomenon, and
includes any applied, human-contrived physical or mathematical abstractions or
models. In OSV, a system that has a relatively unchanging state is static, while
a changing system is dynamic. A static system may comprise multiple dynamic
systems which together are at equilibrium. Like Phenomena, Systems are de ned
recursively.
      </p>
      <p>A Property is a characteristic or feature of a system. A Value may be
numerical or categorical and represents a system state, evaluated either objectively
or subjectively; it is associated with a property. A Quantity is a numerical value
with associated units. An Attribute is a property-value pair. It is important to
note that some properties may be observable but may not be able to be
measured directly and may be assessed through manipulation of other attributes;
examples include severity and resilience.</p>
      <p>A Variable is a phenomenon-property pair. It must comprise an object of
measurement|one or more Phenomena|as well as a Property. As an example,
`precipitation' is not a complete variable, as it only identi es a process, and
neither is `rainfall', as it only identi es a phenomenon|the precipitation of water
from clouds. In order to properly identify a variable, a property (such as `volume
ux' or `duration' in the case of rainfall) must also be identi ed.</p>
    </sec>
    <sec id="sec-3">
      <title>Building a Variable</title>
      <p>The steps for identifying the components of a scienti c variable are:
1. Select a phenomenon of interest for study{this is called the object of
observation and will be the object of the variable.
2. Select one or more properties of that phenomenon to evaluate.
3. Diagram that phenomenon for the desired analysis, and if necessary,
identify any applied abstractions, such as approximate mathematical or physical
models (e.g., surface, ellipsoid, etc.).</p>
      <p>A system is de ned recursively in the ontology and comprises one or more
participants, the role of each participant, and accompanying processes.
Participants are recursively de ned as distinct subsystems of the larger whole to provide
the desired level of granularity. The granularity of any system may be further
re ned by identifying system attributes (system state) that are constant for the
scope of measurement.</p>
      <p>Figure 1 provides an overview of the di erent systems that can be modeled.
Static systems involve processes that are at equilibrium while dynamic systems
are removed from equilibrium. The single-body, static system is equivalent to
the Body class. Matter is a type of multiple-body, static system. When enclosed
with a boundary, a multiple-body, static system may be turned into a static,
single-body system. When a Form is applied to Matter, a Body system results.</p>
      <sec id="sec-3-1">
        <title>Single Body</title>
      </sec>
      <sec id="sec-3-2">
        <title>Multiple Body</title>
        <p>c
i
t
a
t
S
c
i
m
a
n
y</p>
        <p>D</p>
        <p>A variable is assembled by linking the system of interest to the desired
property. If applicable, a variable may also include a reference frame for the evaluation
of the property, as well as context phenomena. Figure 2 shows an example of
how the building blocks are used to build a variable.</p>
        <p>fuel
attribute:
gaseous
participant role:
consumed
consumption</p>
        <p>_:?
participant role:
consumer
participant role: source
emission
carbondioxide
participant role: main
mass
World Development Indicator: CO2 emissions from gaseous fuel consumption (kt)</p>
        <p>GSN construction: carbon-dioxide~emitted-from-fuel~gaseous-consumption_mass</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4 Implementation</title>
      <p>
        The Geoscience Ontology[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] is an example of a domain-speci c OSV application
which expresses a wide range of scienti c variables. The linked website provides
a web interface to a SPARQL endpoint to query a beta version of the ontology.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Hooft</surname>
          </string-name>
          , G.:
          <article-title>In Search of the Ultimate Building Blocks</article-title>
          . Cambridge University Press, Cambridge, UK (
          <year>1997</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>2. Guidelines for Construction of CF Standard Names</article-title>
          , http://cfconventions.org/ Data/cf-standard-names/docs/guidelines.html.
          <source>Last accessed 1</source>
          June 2018
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>CSDMS</given-names>
            <surname>Standard Names</surname>
          </string-name>
          , https://csdms.colorado.edu/wiki/CSDMS_Standard_ Names.
          <source>Last accessed 1</source>
          June 2018
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Geoscience</given-names>
            <surname>Ontology</surname>
          </string-name>
          , http://www.geoscienceontology.org.
          <source>Last accessed 1</source>
          June 2018
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>SWEET</given-names>
            <surname>Ontology</surname>
          </string-name>
          , https://sweet.jpl.nasa.gov/.
          <source>Last accessed 24 July 2018</source>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>QUDT</given-names>
            <surname>Ontology</surname>
          </string-name>
          , http://www.qudt.org/.
          <source>Last accessed 24 July 2018</source>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>GCOS</given-names>
            <surname>Ontology</surname>
          </string-name>
          , http://vocab-test.ceda.ac.uk/ontology/gcos/ gcos-content/.
          <source>Last accessed 24 July 2018</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>