<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Implementation of Materials Data Integration using Ontology</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Toshihiro Ashino</string-name>
          <email>ashino@acm.org</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nobutaka Nishikawa</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Takuya Kadohira</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Masahiko Demura</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Mizuho Res. &amp; Technol., Ltd.</institution>
          <addr-line>2-1 Kanda-Nishikicho, Chiyoda-ku, Tokyo, 101-8443</addr-line>
          <country country="JP">JAPAN</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>National Institute for Materials Science</institution>
          ,
          <addr-line>1-1 Namiki, Tsukuba, Ibaraki 305-0044</addr-line>
          ,
          <country country="JP">JAPAN</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Toyo University</institution>
          ,
          <addr-line>5-28-20 Hakusan, Bunkyo-ku, Tokyo 112-8606</addr-line>
          ,
          <country country="JP">JAPAN</country>
        </aff>
      </contrib-group>
      <fpage>235</fpage>
      <lpage>239</lpage>
      <abstract>
        <p>For materials design, it is necessary to refer a variety of heterogeneous information resources, such as databases, formulae, and computational simulations. Ontology-based data integration is one of the approaches to combine data from multiple heterogeneous data resources. In this paper, an implementation of ontology-based data integration with Semantic Web standards and its application for materials data integration are presented.</p>
      </abstract>
      <kwd-group>
        <kwd>Ontology-based data integration</kwd>
        <kwd>Materials integration</kwd>
        <kwd>Semantic Web</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        In the development of research into data driven materials design, the importance of data
integration has been recognized. The materials design process requires utilization of a
wide variety of information, including traditional data sheets, data from new automated
experimental devices, knowledge of materials science and engineering, and simulations
using physical models. In order to support researchers, it is necessary to develop a
platform to access these data, process workflows, and visualize them. Such platform
requires to handle wide variety of information resources [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        There are several approaches to data integration [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and ontology-based integration,
utilizing ontology to overcome semantic heterogeneity [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], is one of them. One of the
implementations of ontology-based data integration is to create a single ontology, called
global ontology, and map different data to it [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ][
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In our implementation, the extended
version of Materials Ontology [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] is used as a global ontology to map material
information such as datasheets and formulas for creep, fatigue, and other properties. Fig.1
illustrates the concept of data integration for creep properties, where heterogeneous data
resources such as experimental data and computational data are linked into a single
ontology. We developed a web application using this as a back end and realized data
navigation using an ontology which is based on the knowledge of experts.
Copyright © 2021 for this paper by its authors. Use permitted under Creative Commons License
Attribution 4.0 International (CC BY 4.0).
M a teria ls Ontolog y
      </p>
      <sec id="sec-1-1">
        <title>M aterila_ Inform ation</title>
      </sec>
      <sec id="sec-1-2">
        <title>M aterial</title>
      </sec>
      <sec id="sec-1-3">
        <title>M aterial_ Process</title>
      </sec>
      <sec id="sec-1-4">
        <title>M aterial_ Property</title>
      </sec>
      <sec id="sec-1-5">
        <title>Source</title>
        <sec id="sec-1-5-1">
          <title>Ontology of M etallurgy • Description of M icrostructure • Relationships of M icrostructure and Perform ance • Relationships of M icrostructure and Process</title>
        </sec>
      </sec>
      <sec id="sec-1-6">
        <title>Substance</title>
      </sec>
      <sec id="sec-1-7">
        <title>Process</title>
      </sec>
      <sec id="sec-1-8">
        <title>Test</title>
      </sec>
      <sec id="sec-1-9">
        <title>Property</title>
        <sec id="sec-1-9-1">
          <title>M apping corresponding</title>
        </sec>
        <sec id="sec-1-9-2">
          <title>Concept and Data Item s</title>
          <p>Measurement
Computation</p>
        </sec>
        <sec id="sec-1-9-3">
          <title>Specim en</title>
        </sec>
        <sec id="sec-1-9-4">
          <title>Tensile Test</title>
        </sec>
        <sec id="sec-1-9-5">
          <title>Creep Test</title>
        </sec>
        <sec id="sec-1-9-6">
          <title>Creep Curve</title>
        </sec>
        <sec id="sec-1-9-7">
          <title>N orton’s Law FEM</title>
          <p>Heterogeneous data resources
The RDF files play key role to map the global ontology and the data resources. Th refer
classes defined in ontology and describe the corresponding information resource, such
as file location, name of XML element, names of data fields and access methods of data
entity.</p>
          <p>Fig. 2 shows a typical data navigation flow. When navigating a database, the user
first traces the conceptual structure of the ontology and find the target concept. The
system generates a SPARQL query to search the selected OWL class and issues it to
the SPARQL endpoint which stores the RDF metadata files. Each RDF file describes
metadata about a single information resource.
Materials Ontology</p>
          <p>OWL
RDF</p>
          <p>RDF
(MetadataR) DF
(Metadata)</p>
          <p>(Metadata)
SPARQL Endpoint</p>
          <p>Data
RInefIsonorfomurramcteiosn</p>
          <p>ation
Resource</p>
          <p>Resource</p>
          <p>File System/Server
Refer conceptual
structure</p>
          <p>SPARQL query
d
n
E
t
n
ro Extract metadata of
F Information Resources</p>
          <p>Select resource
and access</p>
          <p>Obtain data</p>
          <p>Materials Ontology (OWL)</p>
          <p>OWL classes
&lt;?xml version="1.0"?&gt;
&lt;rdf:RDF xmlns:dc=“http://purl.org/dc/elements/1.1/”…&gt;
..
&lt;dc:description rdf:about="tensile_reduction_of_area"&gt;
&lt;owl:class rdf:about="mat-ontology.nims.cds.owl#Reduction_of_Area"&gt;
&lt;rdfs:subClassOf rdf:resource="mat-ontology.nims.cds.owl#Tensile_Property"/&gt;
&lt;xsd:element name="property" rdf:resource="nims.cds.tensile.xsd"&gt;
…
&lt;xsd:selector xpath="property/i[@name='reduction_of_area']"/&gt;
&lt;/owl:class&gt;
&lt;/dc:description&gt;
&lt;?xml version="1.0"?&gt;
&lt;rdf:RDF xmlns:dc=“http://purl.org/dc/elements/1.1/”…&gt;
…
&lt;dc:description rdf:about=“creep_reduction_of_area"&gt;
&lt;owl:class rdf:about="mat-ontology.cds.owl#Reduction_of_Area"&gt;
&lt;rdfs:subClassOf rdf:resource="mat-ontology.cds.owl#Creep_Property"/&gt;
…
&lt;xsd:element name="property" rdf:resource="nims.cds.creep.xsd"&gt;
&lt;xsd:selector xpath="property/i[@name='reduction_of_area']"/&gt;
&lt;/xsd:element&gt;
&lt;/dc:description&gt;
XML Schema definition
for Creep datasheet
&lt;?xml version="1.0"?&gt;
&lt;data&gt;
&lt;material&gt;
…
&lt;/material&gt;
&lt;process/&gt;
&lt;structure/&gt;
&lt;test&gt;
&lt;i name="standard"&gt;JIS G 0567&lt;/i&gt;
&lt;i name="temperature" unit="C"/&gt;
&lt;/test&gt;
&lt;property&gt;
…
&lt;i name="reduction_of_area" temperature="23" unit="%"&gt;74&lt;/i&gt;
…
&lt;/property&gt;
&lt;/data&gt;</p>
          <p>XML formatted data file
Metadata files (RDF)</p>
          <p>Data Resources</p>
          <p>Since in current implementation, the target data resources are digitized version of
NIMS (National Institute for Materials Science) datasheets of creep, fatigue and others,
they are conforming XML Schema definition, we are considering RDF format only in
such data resources. Appropriate metadata tags used in RDF is under consideration for
other forms. It may be possible to support data stored in RDBMSs, SQL queries may
be used; for computational results, we can use the name of the software for the specified
computation, the parameters to be set, etc.</p>
          <p>Also in this system, it is inefficient to retrieve data by its value and some common
data items are distributed several data files. To find data by their values, load XML data
files into frontend and scan all values in these files. In order to improve this flaw, we
are considering creating RDF files which consist frequently used data item, such as
material type, and list of its values, pointers to data file. These RDF files are stored in
the same SPARQL endpoint, automatically generated, and updated by scanning all
XML data files, and retrievable by SPARQL engine.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Conclusion</title>
      <p>Data integration plays an important role in material data platforms that handle many
types of data and heterogeneous data resources. In this study, we implemented
ontology-based data integration in a three layer structure of Ontology, RDF for metadata and
Data Resources, using “Materials Ontology” as a global view. Currently, it
demonstrated by XML schema conforming data files as the data resource, which have been
developed for NIMS datasheets such as creep and fatigue.</p>
      <p>Acknowledgements. This work was supported by Council for Science, Technology
and Innovation(CSTI), Cross-ministerial Strategic Innovation Promotion Program
(SIP), “Materials Integration” for Revolutionary Design System of Structural Materials
(Funding Agency: JST).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Ashino</surname>
            , Toshihiro,
            <given-names>Nobutaka</given-names>
          </string-name>
          <string-name>
            <surname>Nishikawa</surname>
            , and
            <given-names>Takuya</given-names>
          </string-name>
          <string-name>
            <surname>Kadohira</surname>
          </string-name>
          .
          <article-title>"Data Analysis Environment for Materials Science and Engineering Integrating Heterogeneous Data Resources." In Data Analytics and Management in Data Intensive Domains:</article-title>
          ХХI International Conference DAМDID/RCDL'2019
          <source>(October 15-18</source>
          ,
          <year>2019</year>
          , Kazan, Russia):
          <source>Conference Proceedings. Edited bу Alexander Elizarov</source>
          , Boris Novikov,
          <string-name>
            <given-names>Sergey</given-names>
            <surname>Stupnikov</surname>
          </string-name>
          . Kazan Federal University,
          <year>2019</year>
          , p.
          <fpage>420</fpage>
          -
          <lpage>425</lpage>
          .
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Lenzerini</surname>
          </string-name>
          , Maurizio.
          <article-title>"Data integration: A theoretical perspective." In Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems</article-title>
          , pp.
          <fpage>233</fpage>
          -
          <lpage>246</lpage>
          .
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Uschold</surname>
            ,
            <given-names>Michael</given-names>
            , and Michael
          </string-name>
          <string-name>
            <surname>Gruninger</surname>
          </string-name>
          .
          <article-title>"Ontologies: Principles, methods and applications</article-title>
          .
          <source>" TECHNICAL REPORT-UNIVERSITY OF EDINBURGH ARTIFICIAL INTELLIGENCE APPLICATIONS</source>
          INSTITUTE AIAI TR (
          <year>1996</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Wache</surname>
            , Holger, Thomas Voegele, Ubbo Visser, Heiner Stuckenschmidt, Gerhard Schuster, Holger Neumann, and
            <given-names>Sebastian</given-names>
          </string-name>
          <string-name>
            <surname>Hübner</surname>
          </string-name>
          .
          <article-title>"Ontology-based integration of information-a survey of existing approaches." In Ois@ ijcai</article-title>
          .
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Gusenkov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Bukharaev</surname>
          </string-name>
          , and
          <string-name>
            <surname>E. Birialtsev.</surname>
          </string-name>
          <article-title>"On ontology based data integration: problems and solutions."</article-title>
          <source>In Journal of Physics: Conference Series</source>
          , vol.
          <volume>1203</volume>
          , no.
          <issue>1</issue>
          , p.
          <fpage>012059</fpage>
          . IOP Publishing,
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Ashino</surname>
          </string-name>
          , Toshihiro.
          <article-title>"Materials ontology: An infrastructure for exchanging materials information and knowledge</article-title>
          .
          <source>" Data Science Journal</source>
          <volume>9</volume>
          (
          <year>2010</year>
          ):
          <fpage>54</fpage>
          -
          <lpage>61</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>W3C</given-names>
            <surname>Semantic</surname>
          </string-name>
          <article-title>Web page</article-title>
          , http://w3.org//standards/semanticweb/,
          <source>last accessed</source>
          <year>2021</year>
          /5/17.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>