<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>H. Rijgersberg, M. Van Assem, J. Top, Ontology of units of measure and related concepts,
Semantic Web</journal-title>
      </journal-title-group>
      <issn pub-type="ppub">1613-0073</issn>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.1136/jamia.1999.0060151</article-id>
      <title-group>
        <article-title>An ontology for units of measures across history, standards, and scientific and technology domains</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Oskar B. Andersson</string-name>
          <email>oskar.andersson@liu.se</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Huanyu Li</string-name>
          <email>huanyu.li@liu.se</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Patrick Lambrix</string-name>
          <email>patrick.lambrix@liu.se</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rickard Armiento</string-name>
          <email>rickard.armiento@liu.se</email>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer and Information Science, Linköping University</institution>
          ,
          <addr-line>581 83 Linköping</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Physics, Chemistry and Biology, Linköping University</institution>
          ,
          <addr-line>581 83 Linköping</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Swedish e-Science Research Centre</institution>
          ,
          <addr-line>Linköping</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2018</year>
      </pub-date>
      <volume>4</volume>
      <issue>2013</issue>
      <fpage>0000</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>Units of measure are central in all areas of science and technology. There are several ontological frameworks aiming to improve interoperability and precision in digital data exchange of quantities involving units. We introduce an ontology that specifically targets challenges for handling units across databases of computational and experimental data from various sources. The ontology is created using definition files from the community-driven OPTIMADE standard for a common API for materials databases. The resulting ontology allows addressing data integration challenges encountered in that efort, including ( i) referencing both specific and more general instances of units that have changed over time; (ii) the use of unit systems to define short domain-relevant identifiers for a collection of units that make sense within a specific subdomain, rather than having to adopt globally standardized naming schemes; (iii) specifications of relationships between units that enables tools to convert between them; and (iv) units not part of the International System of Units (SI) can be represented without defining them in SI units or using SI system conventions. This paper provides a brief survey of existing ontologies for units of measure and then presents the design and discuss features of an ontology based on the OPTIMADE unit definitions.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CEUR
ceur-ws.org</p>
    </sec>
    <sec id="sec-2">
      <title>1. Introduction</title>
      <p>
        [
        <xref ref-type="bibr" rid="ref4 ref5 ref6">4, 5, 6</xref>
        ], several ontologies have emerged to describe units and their relationships [
        <xref ref-type="bibr" rid="ref10 ref7 ref8 ref9">7, 8, 9, 10</xref>
        ].
An incident frequently cited to highlight the importance of standardized ways to handle units in
digital systems is the loss of the NASA Mars Climate Orbiter attributed to incorrect assumptions
about units between two software components [ 11].
      </p>
      <p>A key application for ontologies is database integration. Databases can include semantic
information through ontologies to help ensure that data from diferent providers are
systematically and correctly interpreted [12, 13]. Reasoning based on these semantics can be used to, e.g.,
map a database query expressed in data fields available in one database on those available in
another. A good example of an application area undergoing an increasing adoption of rigorous
data standards is materials science [14, 15]. The Open Databases Integration for Materials
Design (OPTIMADE) consortium, which includes the present authors, develops a standard for a
common API for materials data1 [16]. The recently released version v1.2 of the standard
introduces machine-readable property definitions for database fields using an extended subset of the
JSON schema standard2 [17]. These property definitions include definitions of units of measure
to address interoperability challenges met across the databases included in the consortium.
Turning these unit definitions into an ontology expressed using semantic languages, e.g., OWL
and RDF, enables aligning these definitions with other unit ontologies. An ontology of these
community-agreed definitions may also be useful in semantically guided systems beyond the
OPTIMADE API, e.g., for automated GraphQL server generation [18].</p>
      <p>A feature of the SI system relevant in the present context is that the units are occasionally
redefined to utilize progress in measurement methodology and technology that allows more
accurate measurements. For example, when the name SI was established in 1960, the metre
was defined in terms of wavelengths of radiation from the krypton 86 atom, whereas in 1983,
it was redefined in terms of the speed of light. Likewise, in 1960, the second was defined
to use a particular astronomical definition of the year, and in 1967 it was redefined based on
oscillations between levels of the cesium 133 atom. Yet another redefinition of the second based
on modern atomic clocks is being prepared for 2030 [19]. These redefinitions of the base SI
units also implicitly redefine all related derived units. For example, the tesla is a magnetic
lfux unit equal to kg/s2A which consequently has been redefined four times since 1960. The
implicit redefinitions also reach outside the SI system, e.g., the ångström length unit, which
is in ubiquitous use in materials science and spectroscopy, is usually defined as 10−10 m. This
definition is only unambiguous when combined with a particular definition of the metre. Hence,
there are at least two versions of this unit, one before and one after 1983, when the SI metre
was redefined.</p>
      <p>The diferences between the historical definitions of the SI units may seem inconsequential,
given how they are meant to only make the unit definitions more precise, i.e., a new definition is
meant to be within the uncertainty of the most accurate measurements of the quantity used for
the prior definition. Nevertheless, in particular in the context of databases containing numerical
results of computational simulations, values can be computed and recorded to virtually any
precision. A redefined unit then fundamentally alters the interpretation of the stored quantities.
Hence, especially in this context, it is useful to allow references to particular historical unit
1https://www.optimade.org
2https://json-schema.org/specification
definitions. Furthermore, another useful feature is to include less precise representations that
refer to any historical definition. The authors are unaware of any unit ontology currently in
use that allows separately handling and referencing diferent historical SI definitions and more
broadly generalized definitions. However, a key feature of the OPTIMADE property definitions
is that they are designed to handle multiple units and unit systems with definitions that change
over time. The definition files included with version v1.2 of the standard cover the changes
in the SI system since 1960. Hence, bringing these definitions and relations into a standard
ontology format will allow this feature to be used also in other semantic systems.</p>
    </sec>
    <sec id="sec-3">
      <title>2. Related work</title>
      <p>
        In the process of designing an ontology based on the unit definitions provided as part of the
OPTIMADE standard, we have surveyed several existing ontologies and related eforts that
provide formal representations of units of measure in various ways. We comment on them
briefly below. Several more in-depth reviews of ontologies for units are also available in the
literature [
        <xref ref-type="bibr" rid="ref10 ref7 ref8 ref9">7, 8, 9, 10</xref>
        ].
      </p>
      <p>• Quantities, Units, Dimensions and Data Types Ontologies (QUDT)3 [20] provides a
collection of linked ontologies that is described as an “architecture for the conceptual
representation of quantities, quantity kinds, units, dimensions, and data types.” It is largely based
on the SI standard, ISO standards on Units and Quantities (e.g., ISO 80000 [21]), and the
NIST Guide for the use of the International System of Units [22]. QUDT is extensible and
has a submission system to add units to existing or new vocabularies. However, we have
not found information that suggests how to accommodate changes in units over time and
the definitions in the current version 2.1.41 appear to be based on the pre-2019 version of
the SI system (e.g., the kilogram entry refers to the international kilogram prototype).
• Unified Code for Units of Measure (UCUM) 4 is not presented as an ontology, but rather
a “code system intended to include all units of measures being contemporarily used in
international science, engineering, and business.”. It was originally created for clinical
information systems [23]. An RDF datatype has been defined to represent physical
quantities using this code system [24]. There does not appear to be a clear way to use the
UCUM code system to refer to diferent versions of, e.g., the SI base units.
• Ontology of units of Measure (OM)5 [25] models concepts and relations for the
formulation of quantitative knowledge with a strong focus on units, quantities, measures, and
dimensions. The ontology does not appear to have any explicit definitions or mechanisms
for changes of unit definitions over time, and also the latest version 2.0 refers to the
pre-2019 SI definition of the kilogram.
• Semantic Web for Earth and Environmental Terminology (SWEET)6 [26] is a highly
modular ontology suite covering Earth system science which also covers scientific units.
3https://qudt.org/
4https://ucum.org
5https://github.com/HajoRijgersberg/OM
6https://github.com/ESIPFed/sweet
• Extensible Observation Ontology (OBOE)7 [27, 28] is a formal ontology for capturing the
semantics of scientific observation and measurement oriented around the concepts of
observations, measurements, entity, characteristic, standard, and protocol.
• Measurement Units Ontology (MUO)8 was created to represent units in a software
development framework for mobile devices. It provided definitions of unit instances automatically
extracted from UCUM.
• Quantity Kinds and Units (QU)9 is based on the conceptual framework (or model) Quantity,</p>
      <p>Unit, Dimension, and Value (QUDV), a part of OMG SysML [29]).
• Units of Measurement Ontology (UO)10 [30] targets the standardization of units of
measurement in the biomedical domain.
• The Wikidata project [31] provides a range of unit definitions used for structured data to
be referenced from Wikimedia websites, including Wikipedia.11
• The GNU units software package distributes a range of unit definitions in a database
ifle. 12 This collection of units has been adopted to express formalized property definitions
in the OpenKIM project [32].</p>
      <p>
        Several of the surveyed ontologies include representations of in-depth concepts in the domain
of quantities and units. In particular, many include the concept of unit dimensions [
        <xref ref-type="bibr" rid="ref2 ref7">7, 2</xref>
        ].
While dimensions and related concepts are critical to describe all aspects of how quantities,
units, and unit systems relate, they can be dificult to model in a unit system-independent way.
However, these concepts are not strictly necessary to represent units used for quantities and to
aid database integration, and are therefore not currently part of the ontology presented here.
      </p>
      <p>The primary motivation for the ontology presented in this work is to encode the
communityagreed unit definitions part of the OPTIMADE standard. We do not aim for this work to fully
evaluate the advantages and limitations in the resulting design compared to existing ontologies
for units of measure.</p>
    </sec>
    <sec id="sec-4">
      <title>3. Definition of the ontology</title>
      <sec id="sec-4-1">
        <title>3.1. Requirements</title>
        <sec id="sec-4-1-1">
          <title>We present an ontology designed around the following principles:</title>
          <p>• Allow multiple historical definitions of the same unit to co-exist within the same version
of the ontology (i.e., not via the versioning of the ontology itself).</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>7https://github.com/NCEAS/oboe</title>
          <p>8While many works reference the MUO, we have been unable to locate a detailed description online or as a
published work. An archived web page with some details can be accessed via the Internet Archive Wayback
Machine at https://web.archive.org/web/20130723220432/http://forge.morfeo-project.org/wiki_en/index.php/Units_
of_measurement_ontology
9https://www.w3.org/2005/Incubator/ssn/ssnx/qu/qu and https://www.w3.org/2005/Incubator/ssn/ssnx/qu/qu-rec20.
html
10http://obofoundry.org/ontology/uo
11https://www.wikidata.org/wiki/Wikidata:Units
12https://www.gnu.org/software/units/
• Standards-agnostic design (i.e., not designed around the SI standard).
• Freedom to represent units using relevant domain-specific short symbols.
• Support compact mathematical expressions to express compound units and relationships
between units.
• Provide relevant identifiers for various uses: display symbols for mathematical typesetting
and latin symbols and identifiers for referencing in computer code and data contexts.
• Entities separated into physical units, constants, and prefixes.
• Support of referencing unit systems as a collection of units with associated short symbols.
• IRIs that refer to the exact same entities as the OPTIMADE unit definitions are retained
from those definitions.
• A versioning scheme that handles changes in the ontology and in the referenced unit
definitions provided via OPTIMADE.
3.1.1. Use cases
A couple of example use cases that can be addressed by the ontology follows:
1. Specification (via IRI) of the unit of measure that applies to a stored numerical value (e.g.,
for a database field) at the level of specificity that applies to that quantity. For example,
one IRI refers to the SI unit second defined in 1967, and another IRI more broadly to any
SI definition of the second (either the current one or any of the historical SI definitions).
2. Specification (via IRI) of the precise unit used in a computer programming interface, e.g.,
to avoid ambiguity for values returned by an API.
3. Determination if the numerical values of two quantities can be directly compared based
on reasoning to determine if their units are the same or if one can be generalized into the
other.
4. Determine an appropriate typographical symbol to use for a unit in a user interface.
5. Reference a set of domain-relevant short identifiers referencing specific unit definitions
with a single IRI.
6. Discover mathematical relationships between units (either as defining relationships or
approximate, such as for the dalton unit of atomic mass and the kilogram), which can be
used by a tool for unit conversions.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>3.2. Development method</title>
        <p>The ontology has been developed by re-engineering the community-agreed OPTIMADE unit
definitions into an ontological representation. The OPTIMADE unit definitions are provided as
machine-readable files using a JSON-based format specified by a JSON Schema meta-schema
which is part of the OPTIMADE standard. In practice, the JSON files are generated from more
user-friendly YAML-formatted source files. These files are also used to generate human-readable
HTML and Markdown (MD) files made available for browsing via one of the websites maintained
by OPTIMADE.13 The OPTIMADE standard v1.2 includes unit definition files for all current
constant
Is-a</p>
        <p>Is-a</p>
        <p>Is-a</p>
        <p>Is-a
Is-a
unitsystem</p>
        <p>resource
physical_unit</p>
        <p>Is-a
mega_prefix_si
pi_constant_math_basic
metre_si_1983_base
Is-a</p>
        <p>Is-a</p>
        <p>
          Is-a
M [mega_prefix_si]
pi [pi_constant_math_basic]
m [metre_si_1983_base]
and historical SI unit definitions since 1960, and most other, non-SI, units referenced from the
nine editions of the SI brochure [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. In particular, this includes all units referred to as “non-SI
units accepted to use with the SI system”. A few other non-SI units are also included, e.g., the
digital information units bit and byte.
        </p>
        <p>In this work, the source format is the JSON unit definitions provided as part of the OPTIMADE
standard, which are converted by a special-purpose tool into the ontological representation
RDF using turtle syntax [33]. The ontology is intentionally designed to, as closely as possible,
reflect the intent and content of the OPTIMADE unit definitions. One materials science domain
expert and two ontology knowledge engineers have reviewed the ontology design.</p>
        <p>A pre-release version v0.9.0 of the ontology is available via our GitHub repository: https:
//github.com/LiUSemWeb/units-of-measure.</p>
      </sec>
      <sec id="sec-4-3">
        <title>3.3. Ontology design</title>
        <p>We have chosen a class-based design approach consisting of hierarchical levels of concepts
where all unit-related data is purely expressed through object properties and data properties
in class definitions. The ontology consists of the following top level concepts as illustrated in
Fig. 1:
• unitsystem ⊑ ⊤
• resources ⊑ ⊤
• unit_entity ⊑ ⊤
• physical_unit ⊑ unit_entity
• constant ⊑ unit_entity
• prefix ⊑ unit_entity</p>
        <p>Here, ⊤ is the top concept (owl:Thing in OWL). Specific units of measure, e.g., the metre
unit as defined by SI in 1983, are sub-concepts of physical_unit. Sub-concepts of constant are
numerical and scientific constants, e.g., the mathematical constant  and the elemental charge
physical constant. The inclusion of constants is arguably not essential for an ontology for units
of measure, but is useful by allowing unit definitions to describe relationships to other units
using these constants. Sub-concepts of prefix are unitless scaling factors used in unit systems
to reference units scaled to larger and smaller magnitudes, e.g., the SI prefix mega with symbol
M denotes a factor of one million.</p>
        <p>The concepts physical_unit, constant, prefix are sub-concepts of the more general concept
unit_entity. Unit entities have an annotation property description for expressing the definition
of the unit entity with a human-readable string and a set of data and object properties. One
of these object properties are to specific individuals of the resource concept to express various
types of relationships to external resources, e.g., the oficial standards document that is the
source of the definition, or an informational link to the description of the unit in Wikipedia.
The sub-concepts of unit_entity are specific unit entity concepts . Each of these specific unit
entity concepts has another level of sub-concepts, symbol concepts, for a specific unit entity
represented by a specific symbol. The symbol concepts provides a form of reification for the
combination of a specific unit entity and a symbol, which allows concepts that are, e.g., the
same specific unit entity, but with several diferent symbols to co-exist in the ontology.</p>
        <p>
          The sub-concepts of unitsystem are collections of unit entities, e.g., the concept for the SI
2019 unit system has relationships to all the physical_unit and prefix symbol concepts with
their respective standard SI symbols, defined as part of the SI system of units in the 9th edition
of the SI brochure [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]. It is possible to extend the ontology using sub-concepts of unitsystem to
define other standardized units systems and domain-specific collections of physical units and
prefixes with specific choices of symbols.
3.3.1. Labels
All direct sub-concepts of physical_unit, constant, and prefix are given an rdfs:label
annotation. These labels are taken from the x-optimade-definition/label field in the
OPTIMADE definition files. They are on the format &lt;name&gt;_&lt;standard&gt;_&lt;year&gt;_&lt;category&gt;, e.g.,
metre_si_1960_base. The first segment &lt;name&gt; is a short name describing the unit. The
second segment &lt;standard&gt; is the name of the standards organization from which the definition
originates, or the word “independent” if the definition cannot be referenced to such an
organization (e.g., the definition of the bit unit references an early paper on digital information by
Shannon). The third segment &lt;year&gt; is the year that the definition was introduced. The fourth
segment &lt;category&gt; is a relevant categorization either made by the standards organization,
or if no such categorization exists, just an informal categorization as a means to disambiguate
cases of similarly or identically named units. As stated in a previous example, minute is often
used to refer to both a time unit and an angular unit, thus it is helpful to know at a glance which
kind of minute a label applies to.
        </p>
        <p>Furthermore, the symbol concepts (i.e., the second level sub-concepts) of
physical_unit, constant, and prefix are given labels on the format &lt;symbol&gt;
[&lt;name&gt;_&lt;standard&gt;_&lt;year&gt;_&lt;category&gt;] where &lt;symbol&gt; is the symbol, e.g., the SI
1960 definition of the metre unit referred to using the symbol m is given the label m
[metre_si_1960_base].
3.3.2. IRIs
The ontology base IRI is &lt;https://github.com/LiUSemWeb/units-of-measure#&gt; for the
prerelease of the ontology. However, the concepts physical_unit, constant, prefix reference entities
already assigned IRIs by the OPTIMADE standard, which then are retained in the ontology. These
IRIs reside under &lt;https://schemas.optimade.org/defs/v1.2/&gt;, e.g., metre_si_1960_base
has IRI &lt;https://schemas.optimade.org/defs/v1.2/units/si/1960/base/metre&gt;. The
symbol sub-concepts of physical_unit, constant, and prefix for, e.g., a specific unit represented
by a specific symbol are assigned the OPTIMADE IRI with an additional anchor, #symbol.
For example, metre_si_1960_base has a sub-concept labeled m [metre_si_1960_base] with
IRI &lt;https://schemas.optimade.org/defs/v1.2/units/si/1960/base/metre#m&gt;. Finally,
concepts in the ontology that do not have IRIs assigned by OPTIMADE are assigned IRIs under
the ontology base IRI.
3.3.3. Symbols and other identifiers
A key aspect in the use of units is the practice of representing them by symbols which are
compact domain-specific identifiers. In practical use, diferent symbols are often used for the
same units in diferent contexts and domains, e.g., the nautical mile navigational length unit
has multiple common symbols: M, NM, nmi, and Nm. As outlined above, the primary way
symbols are implemented in the ontology is via symbol concepts as sub-concepts of a specific
physical unit, constant or prefix.</p>
        <p>The symbols used for symbol concepts are taken to be short identifiers composed of Latin
characters.14 The symbol is often just one or a few characters, e.g., “T” for the tesla unit and
“Bq” for the becquerel unit. The symbol can be longer if no clear short Latin symbol exists,
e.g. the ångström unit uses ”angstrom” rather than the more standard non-Latin character “Å”.
When possible, this is the symbol defined in the SI standard for SI units. However, when the SI
symbol requires Unicode or mathematical notation, a Latin version is chosen, e.g., “mc” is used
for the SI prefix micro, rather than “ ”. These limitations gives a symbol that is useful, e.g., for
referencing the unit in data and programming contexts, where non-Latin characters may not be
available.</p>
        <p>The symbol limited to Latin characters is complemented with:
• title: A data property for each unit entity concept for the name as a Unicode string with
white space and capitalization where relevant, e.g., “nautical mile”, “ångström”, “degree
Celsius”.
• display_symbol: A data property for symbol entity concepts for an appropriate
representation of the unit symbol in Unicode, e.g., “Å” for the ångström unit and “Ω” for the
ohm unit; and, when needed, mathematical expressions using MathJax notation, e.g.,
“\(\mu_B\)” for the Bohr magneton unit.
14More precisely: lowercase a−z, uppercase A−Z, and the underscore character “_”, but not white-space characters.
3.3.5. Compact mathematical expressions for compound units
The ontology supports expressing compound unit expressions using a domain-specific compact
mathematical string language defined in the OPTIMADE specification [ 16]. Similar languages
are found in the H5MD specification UCUM [ 23] and [35]. The compound unit expressions
are also used within the ontology to express the defining_relation_base_units_expression and
approximate_relations_base_units_expression data properties. Each symbol in the base units
in the compound unit expression is defined by referencing the symbol concept through the
defining_relation_base_unit and approximate_relation_base_unit object property. This allows
the freedom to use any desired symbol in the compound unit expression, as long as the symbol
concept is defined to its unit super-concept in the ontology. A defining relation is used strictly
for units defined in terms of other units represented in the ontology to describe that relationship.
An approximate relation describes a non-defining relationship that can be used to convert the
value of a quantity to express it with a diferent unit.
3.3.6. Versioning
There are three types of changes that afect the ontology: ( i) units are occasionally redefined
by standard organizations such as SI; (ii) the OPTIMADE consortium may change the
definition files the ontology is based on; ( iii) we may make changes to the ontology based on the
same OPTIMADE definition files. As an example, consider the 1960 definition of the SI metre
unit with IRI &lt;https://schemas.optimade.org/defs/v1.2/units/si/1960/base/metre&gt;.
Changes in the SI unit definitions are handled by introducing the new ones concurrent with
the old ones. The 1960 SI metre thus coexist with the redefined 1983 SI metre with IRI
&lt;https://schemas.optimade.org/defs/v1.2/units/si/1983/base/metre&gt;. For changes
made by the OPTIMADE consortium, the released version of the OPTIMADE standard is also
included in the IRIs naming scheme as the v1.2 in the second path component, which thus will
change when the ontology is modified to refer to the updated definitions (which also incurs a
version change of the ontology itself).</p>
        <p>All changes to the ontology, both in general or in response to changes of the kinds described
above, will be reflected by updating the version number in the ontology version IRI, which we
intend to assign using the pattern MAJOR.MINOR.PATCH, based on the semantic versioning
scheme.15 The semantic versioning scheme is interpreted according to the following: we will
increment the PATCH number for changes that only address unintended errors or omissions,
and the MINOR number when expanding the ontology with additional definitions, concepts,
and attributes that do not afect the use of those existing in prior releases of the ontology.
Major changes that alter the overall design and/or the meaning of concepts, attributes, or their
relationships, in ways that may be incompatible with the use or imports of prior versions will
increment the MAJOR version number.
3.3.7. Conversion from OPTIMADE definitions
As explained above, the ontology is created directly from the machine-readable JSON files
maintained by OPTIMADE. We have implemented a conversion tool in Python that reads files
on the format documented in version 1.2 of the OPTIMADE standard [16] and derives concept
definitions and their relationships for specific units as described in the sections above. A more
in-depth technical description of this conversion tool will be presented in a separate paper.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>4. Applications of the ontology</title>
      <p>The ontology allows reference to units used in OPTIMADE property definitions using standard
semantic languages. This enables the alignment of units in databases that use OPTIMADE
property definitions with existing unit ontologies. Furthermore, ontology reasoners can be
applied to these uses of units, and the OPTIMADE unit definitions can be integrated with other
semantical frameworks. Section 3.1.1 lists a number of such specific use case examples for the
ontology.</p>
      <p>More generally, databases need to represent units of measure to provide data about physical
quantities. This information is essential to aggregate data from more than one source or query
multiple databases with related data in a meaningful way. In the following we propose such a
use case in more detail: consider two databases that provide formation energies for materials
obtained from diferent computational software. Both software packages use the non-SI unit
electron volt, which implicitly depends on the derived SI unit volt. If the databases declare
their units using the ontology provided in this work, a client accessing the data will be able
to make an informed decision on the meaning of a comparison of these data fields. If both
databases declare the use of the exact same historical definition of the volt unit, it is meaningful
to directly compare the numerical values to full precision. If they declare diferent historical
definitions, the client must either perform a unit conversion or utilize the fact that both the
specific definitions generalize into volt_si_general, the higher level concept for the current, or
any one of the historical definitions of volt. A comparison on the level of volt_si_general yields
less precise information, but at least it is clear to the client that this is the case.</p>
      <p>As mentioned above, during construction, the ontology was reviewed by one materials science
domain expert and two ontology knowledge engineers. We also emphasize that the construction
is based on unit definitions in OPTIMADE that have been discussed by that consortium of
domain experts. The ontology has not yet undergone validation or performance measurements
beyond these considerations.</p>
    </sec>
    <sec id="sec-6">
      <title>5. Conclusion</title>
      <p>
        In this paper we have presented an ontology that covers units of measure for physical units,
constants, and prefixes. The ontology have been created from a set of community-agreed
property definitions in the OPTIMADE materials database API standard. Our ontology engineering
has strived to preserve the original design idea of the source definition files as much as possible
while finding a way to express that design using standard semantic languages. The result is an
ontology that aims to allow the expression of units the way they are used in databases.A few
key design decisions are (i) a rich framework for representation that can accomodate multiple
historical definitions and diferent domain-relevant symbols for the same unit; ( ii) a unit system
agnostic design to allow representation of unit definitions across multiple standards; (iii) unit
definitions provided by domain experts that covers domain-relevant definitions, symbols, and
other identifiers. The ontology is populated by the definitions provided with the OPTIMADE
release v1.2, which includes the SI units in their current and past definitions since 1960, most
other definitions provided in the nine editions of the SI brochure [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and a few additional units
in practical use in the fields of materials science and spectroscopy.
      </p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>We acknowledge all contributors to the OPTIMADE property and unit definitions on which
this work is based. R.A. and O.B.A acknowledge support from the Swedish Research Council
(VR) Grant No. 2020-05402; R.A., O.B.A and P.L. from the Swedish e-Science Research Centre
(SeRC); H.L. from the Swedish National Graduate School in Computer Science (CUGS); and P.L.
and H.L. from the EU Horizon project Onto-DESIDE (Grant Agreement 101058682).
and Directions - Reasoner Evaluation, Springer International Publishing, Cham, 2017, pp.
40–54. doi:10.1007/978- 3- 319- 54627- 8_4.
[11] Mars Climate Orbiter Mishap Investigation Board - Phase I Report, Mars Climate Orbiter
Mishap Investigation Board - Phase I Report, Technical Report 611, NASA, 1999. URL:
https://llis.nasa.gov/lesson/641.
[12] J.-L. Hippolyte, M. Romanchikova, M. Bevilacqua, P. Duncan, S. E. Hunt, F. Grasso Toro,
A.-S. Piette, J. Neumann, Using Ontologies to Create Machine-Actionable Datasets: Two
Case Studies, Metrology 3 (2023) 65–80. doi:10.3390/metrology3010003.
[13] B. G. Pelkie, L. D. Pozzo, The laboratory of Babel: highlighting community needs for
integrated materials data management, Digital Discovery 2 (2023) 544–556. doi:10.1039/
D3DD00022B, publisher: Royal Society of Chemistry.
[14] P. Lambrix, R. Armiento, A. Delin, H. Li, FAIR Big Data in the Materials Design Domain,
in: A. Zomaya, J. Taheri, S. Sakr (Eds.), Encyclopedia of Big Data Technologies, Springer
International Publishing, Cham, 2020, pp. 1–8. doi:10.1007/978- 3- 319- 63962- 8_293- 2.
[15] H. Li, R. Armiento, P. Lambrix, A Method for Extending Ontologies with Application to the</p>
      <p>Materials Science Domain, Data Science Journal 18 (2019) 50. doi:10.5334/dsj- 2019- 050.
[16] M. L. Evans, J. Bergsma, A. Merkys, C. W. Andersen, O. B. Andersson, D. BeltrÃ¡n,
E. Blokhin, T. M. Boland, R. C. Balderas, K. Choudhary, A. D. DÃaz, R. D. GarcÃa, H. Eckert,
K. Eimre, M. E. F. Montero, A. M. Krajewski, J. J. Mortensen, J. M. N. Duarte, J. Pietryga, J. Qi,
F. d. J. T. Carrillo, A. Vaitkus, J. Yu, A. Zettel, P. B. de Castro, J. Carlsson, T. F. T. Cerqueira,
S. Divilov, H. Hajiyani, F. Hanke, K. Jose, C. Oses, J. Riebesell, J. Schmidt, D. Winston,
C. Xie, X. Yang, S. Bonella, S. Botti, S. Curtarolo, C. Draxl, L. E. F. Cobas, A. Hospital,
Z.-K. Liu, M. A. L. Marques, N. Marzari, A. J. Morris, S. P. Ong, M. Orozco, K. A. Persson,
K. S. Thygesen, C. Wolverton, M. Scheidgen, C. Toher, G. J. Conduit, G. Pizzi, S. Grazulis,
G.-M. Rignanese, R. Armiento, Developments and applications of the OPTIMADE API for
materials discovery, design, and data exchange, 2024. URL: http://arxiv.org/abs/2402.00572,
arXiv:2402.00572 [cond-mat].
[17] F. Pezoa, J. L. Reutter, F. Suarez, M. Ugarte, D. Vrgoč, Foundations of json schema, in:
Proceedings of the 25th International Conference on World Wide Web, International World
Wide Web Conferences Steering Committee, 2016, pp. 263–273.
[18] H. Li, O. Hartig, R. Armiento, P. Lambrix, Ontology-based GraphQL server generation for
data access and data integration, Semantic Web (2024). doi:10.3233/SW- 233550, publisher:
IOS Press.
[19] N. Dimarcq, M. Gertsvolf, G. Mileti, S. Bize, C. W. Oates, E. Peik, D. Calonico, T. Ido,
P. Tavella, F. Meynadier, G. Petit, G. Panfilo, J. Bartholomew, P. Defraigne, E. A. Donley,
P. O. Hedekvist, I. Sesia, M. Wouters, P. Dubé, F. Fang, F. Levi, J. Lodewyck, H. S. Margolis,
D. Newell, S. Slyusarev, S. Weyers, J.-P. Uzan, M. Yasuda, D.-H. Yu, C. Rieck, H. Schnatz,
Y. Hanado, M. Fujieda, P.-E. Pottie, J. Hanssen, A. Malimon, N. Ashby, Roadmap towards the
redefinition of the second, Metrologia 61 (2024) 012001. doi: 10.1088/1681- 7575/ad17d2,
publisher: IOP Publishing.
[20] FAIRsharing Team, FAIRsharing.org: QUDT; Quantities, Units, Dimensions and Types,
2015. doi:10.25504/FAIRSHARING.D3PQW7, last Edited: Friday, May 6th 2022, 11:03, Last
Accessed: Sunday, June 23rd 2024, 10:43,Last Reviewed: Monday, May 2nd 2022, 12:22.
[21] ISO 80000-1, ISO 80000-1:2022: Quantities and units, 2022. URL: https://www.iso.org/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>G.</given-names>
            <surname>Price</surname>
          </string-name>
          ,
          <article-title>On the communication of measurement results</article-title>
          ,
          <source>Measurement</source>
          <volume>29</volume>
          (
          <year>2001</year>
          )
          <fpage>293</fpage>
          -
          <lpage>305</lpage>
          . doi:
          <volume>10</volume>
          .1016/S0263-
          <volume>2241</volume>
          (
          <issue>00</issue>
          )
          <fpage>00053</fpage>
          -
          <lpage>1</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. P.</given-names>
            <surname>Foster</surname>
          </string-name>
          , Quantities, units and computing,
          <source>Computer Standards &amp; Interfaces</source>
          <volume>35</volume>
          (
          <year>2013</year>
          )
          <fpage>529</fpage>
          -
          <lpage>535</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.csi.
          <year>2013</year>
          .
          <volume>02</volume>
          .001.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <article-title>[3] BIPM, Le Système international d'unités / The International System of Units ('The SI Brochure'</article-title>
          ), ninth ed.,
          <source>Bureau international des poids et mesures</source>
          ,
          <year>2019</year>
          . URL: http://www. bipm.org/en/si/si_brochure/.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>F.</given-names>
            <surname>Probst</surname>
          </string-name>
          , Observations, measurements and semantic reference spaces,
          <source>Applied Ontology</source>
          <volume>3</volume>
          (
          <year>2008</year>
          )
          <fpage>63</fpage>
          -
          <lpage>89</lpage>
          . doi:
          <volume>10</volume>
          .3233/AO- 2008- 0046, publisher: IOS Press.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Keil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Schindler</surname>
          </string-name>
          ,
          <article-title>Comparison and evaluation of ontologies for units of measurement</article-title>
          ,
          <source>Semantic Web</source>
          <volume>10</volume>
          (
          <year>2018</year>
          )
          <fpage>33</fpage>
          -
          <lpage>51</lpage>
          . doi:
          <volume>10</volume>
          .3233/SW- 180310.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Schindler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Keil</surname>
          </string-name>
          ,
          <article-title>Building Ontologies for Reuse: Lessons Learned from Unit Ontologies</article-title>
          ,
          <source>in: Proceedings of the Joint Ontology Workshops</source>
          <year>2019</year>
          , volume
          <volume>2518</volume>
          <source>of CEUR Workshop Proceedings, CEUR-WS.org</source>
          ,
          <year>2019</year>
          . URL: https://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>2518</volume>
          /paper-BOG4.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>I. Johansson</surname>
          </string-name>
          ,
          <article-title>Metrological thinking needs the notions of parametric quantities, units and dimensions</article-title>
          ,
          <source>Metrologia</source>
          <volume>47</volume>
          (
          <year>2010</year>
          )
          <article-title>219</article-title>
          . doi:
          <volume>10</volume>
          .1088/
          <fpage>0026</fpage>
          - 1394/47/3/012.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>H.</given-names>
            <surname>Rijgersberg</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Wigham</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Top</surname>
          </string-name>
          ,
          <article-title>How semantics can improve engineering processes: A case of units of measure and quantities</article-title>
          ,
          <source>Advanced Engineering Informatics</source>
          <volume>25</volume>
          (
          <year>2011</year>
          )
          <fpage>276</fpage>
          -
          <lpage>287</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.aei.
          <year>2010</year>
          .
          <volume>07</volume>
          .008.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <article-title>A survey on units ontologies: architecture, comparison</article-title>
          and reuse,
          <source>Program</source>
          <volume>51</volume>
          (
          <year>2017</year>
          )
          <fpage>193</fpage>
          -
          <lpage>213</lpage>
          . doi:
          <volume>10</volume>
          .1108/PROG- 08- 2015- 0056.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>M. D. Steinberg</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Schindler</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          <string-name>
            <surname>Keil</surname>
          </string-name>
          ,
          <article-title>Use Cases and Suitability Metrics for Unit Ontologies</article-title>
          , in: M.
          <string-name>
            <surname>Dragoni</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Poveda-Villalon</surname>
          </string-name>
          , E. Jimenez-Ruiz (Eds.), OWL: Experiences
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>