<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Patent Analysis Using an Ontology of Qualities of Inorganic Materials Based on Context-Dependency</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yoshinobu Kitamura</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Koki Taniguchi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Shinichi Kato</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Murata Manufacturing Co., Ltd</institution>
          ,
          <addr-line>617-8555</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Ritsumeikan University</institution>
          ,
          <addr-line>2-150 Iwakura-cho, Ibaraki, Osaka, 567-8570</addr-line>
          ,
          <country country="JP">Japan</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper presents the building of an ontology focused on the qualities of inorganic materials aimed at facilitating the analysis of patent documents. Acknowledging the contextual nature of qualities like “particle diameter” in inorganic materials, concepts are defined within specific contexts, such as objects. The paper discusses the application of this ontology in patent analysis, followed by the development of an ontology-based patent analysis system. This system enables the extraction of object-quality-value triples and the restoration of missing information. Through testing, the paper demonstrates the utility of this system in enhancing patent analysis processes.</p>
      </abstract>
      <kwd-group>
        <kwd>engineering</kwd>
        <kwd>patent analysis</kwd>
        <kwd>quality</kwd>
        <kwd>role1</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Efficiently analyzing patent documents is paramount for driving technological advancement. In
the domain of specific electronic products within the realm of inorganic materials, on which
this study primarily focuses, the number of relevant patents has exceeded 2000 annually in
recent years. Despite that finding out useful information from vast quantity of patents requires
efficient analysis of quality descriptions embedded in the text, existing technology does not
work satisfactorily. For example, conventional statistical natural language analysis, less
attention is paid to extracting numerical values written in text compared to other research fields
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], and methods using machine learning have problems with accuracy [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ][
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Moreover, a common issue arises from the omission of elements, as observed in phrases like
“particles of 100 nm”. In such cases, the omitted “a particle diameter” must be
contextdependently inferred with “particle”. Conversely, in phrases like “film of 2.0 μm”, the “length”
should be interpreted as "a thickness of 20 μm". These suggest the necessity of distinguishing
between basic generic qualities like length and context-dependent ones like diameter or
thickness as discussed in Section 2. While existing ontologies like [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ][
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] mainly define the
former as discussed in Section 3, this paper explores an ontology specifically designed to define
context-dependent qualities and the merits obtained by their utilization in patent analysis.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Research issues</title>
      <p>This study addresses the following issues of qualities and terminology used to describe
inorganic materials, emphasizing the need for context-dependent recognition and
identification.</p>
      <p>The first issue is to recognize relationship between synonyms; for example, between
synonyms for “粒子径” “particle diameter” and “粒径” (abbreviation for “particle size.”)</p>
      <p>The second issue involves identifying basic qualities (called generic quality) and
contextdependent qualities, particularly in relation to objects. For instance, the generic quality “length”
may be referred to as “diameter” when representing the width of a circular or spherical object.
Moreover, the expression of “diameter” varies depending on the object type; for instance,
“particle diameter” carries the same essence as diameter, but its expression varies based on the
context in which “particle” is utilized. On the other hand, “length” may be termed “thickness”
for “filmy materials.” Despite sharing a common “length value,” the terminology changes
depending on the context, whether it’s referred to as diameter or thickness.</p>
      <p>The third issue involves the utilization of the same term across different contexts. For
instance, the term 粒径 (abbreviation for “particle size”), mentioned earlier, not only denotes
the diameter of an individual “particle” but also refers to the average value of the particle size
of an assemblage of particles in a handful of “powder”. Sometimes “particle size” (粒径) refers
to the “mean of particle diameter” of “powder” or the “crystallite size” of a “crystal” resulting
from the baking and hardening of particles, with potential for omitted or implied meanings. It
is necessary to make these omitted meanings explicit and clearly distinguish them.</p>
      <p>The fourth issue pertains to quality values reliant intrinsically on measurement context. For
instance, “permittivity” varies based on specific parameters like frequency, temperature, and
voltage during measurement. Such qualities, termed “reaction-relational qualities,” are
contingent on “inputs” such as voltage applied to objects.</p>
      <p>The research issues found in this research also include the context-dependency on
manufacturing process. Although we have built an ontology of process-dependent qualities, we
omit it in this paper due to space limitation.</p>
      <p>Addressing these challenges necessitates the construction of an ontology systematically
defining concepts related to properties and their representative terms (aka labels). Moreover,
by explicitly delineating the contextual basis of each concept, it becomes feasible to
appropriately capture the relationship between terms whose expression varies with context.
Analysis suggests the need to accommodate a broad range of contexts, including object types
like “particle” and “crystal,” as well as input parameters such as measurement frequency.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Related work and approach</title>
      <sec id="sec-3-1">
        <title>3.1. Treatment of qualities and properties</title>
        <p>
          The so-called “qualities” and “attributes” of entities (expressed as quality, property, attribute,
etc.) are defined in various upper ontologies [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ][
          <xref ref-type="bibr" rid="ref7">7</xref>
          ][
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], albeit with significant differences [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
However, delving into the intricacies of these disparities exceeds the scope of this paper; thus,
readers are directed to the literature [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] for comprehensive insights. In this study, we use the
YAMATO ontology [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], as also mentioned in [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], for two main reasons. Firstly, it effectively
handles the distinction between quality and quantity, which is important for engineering
problems. YAMATO separates quality instances from their values, allowing for clear treatment
of quality like John's weight and its value (quantity. e.g., 65 kg). This differentiation ensures
identical instances of this weight quality despite it denotes changes in its value over time.
        </p>
        <p>
          The second reason is the treatment of “role concepts” and “quality roles” [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. In YAMATO,
roles are anti-rigid, dynamic, and externally grounded [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. A key principle is that a potential
player for a role is a role-holder when it actually plays the role. A role is an entity to be
played, a potential player is an entity that can play a role, and a potential player becomes a
role-holder in playing a role. In the school example, when a person (a potential role player,
the class constraint of the slot expression) enrolls in a school (a context), the person plays a role
of “student” in the school and becomes a student (a role-holder, role-playing entity).
        </p>
        <p>
          In YAMATO, quality concepts are defined through a role pattern where a generic quality
(a potential player) plays a quality role (a role) with respect to a specific measured object (a
context), making it a quality role-holder [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. A generic quality is the most general kind of
quality and it represents basic physical parameters (e.g., length, mass, and temperature). A
quality role is a role played by a generic quality, it includes height, weight, and body
temperature. “Height” and “width” qualities of a rectangular object are both quality roles played
by the “length” as the generic quality. Their representations can change based on the orientation
of the object, showcasing their dynamic and anti-rigid nature, suggesting they be treated as
roles. This framework, treating qualities as roles, allows for flexible definitions, particularly
useful for inorganic materials. Thus, we choose to adopt YAMATO [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] and Hozo [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] for
ontology development due to their shared treatment of roles.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Definitions of “generic quality” in the scientific domain</title>
        <p>
          Generic quantity such as “length,” “mass,” and “time” in the scientific domain are rigorously
defined by standards such as ISO and are specified in many ontologies such as [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ][
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], as well as
being compared in survey papers [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ][
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. In[
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], an Information Model (IM) including unit
conversion is established based on OM, QUDT, international standards, and other sources.
        </p>
        <p>While the ontology in this study also deals with such generic qualities and units, the primary
emphasis is not on their definition. Instead, the focal point centers on the context-dependency
of qualities, as expounded in Section 2.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Definitions of context-dependent qualities such as diameter</title>
        <p>
          Within the glossary of international standard organizations [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ], generic qualities such as
“length” are referred to as “quantity,” while concepts like “diameter” are termed “kind of
quantity” and explained as an “aspect common to mutually comparable quantities.” However,
the treatment of these concepts varies across various ontologies [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ][
          <xref ref-type="bibr" rid="ref16">16</xref>
          ][17]. I-ADOPT2 aims at
2 InteroperAble Descriptions of Obervable Property Terminology WG (I-ADOPT WG), https://i-adopt.github.io/
interoperability among them. For instance, ontologies such as NCIT3, PATO4, and SIO5 provide
examples of descriptions for both generic qualities like “length” and context-dependent qualities
like “diameter,” defined in parallel at the same level of the “is-a” hierarchy.
        </p>
        <p>
          Concerning the differentiation between “diameter” and generic qualities, pioneering
ontology research such as EngMath [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] has addressed this discrepancy. While OM6 [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] defines
“diameter” as a subclass of “length,” it does not explicitly articulate its relationship with objects
like circles or spheres. Similarly, QUDT7 conceptualizes “diameter” as a narrower concept of
“length,” akin to “width” and “height,” without specifying its association with circles or spheres.
QUDV [17] defines “width,” “diameter,” etc., as “SpecializedQuantityKind” related to the
“SimpleQuantityKind” “length,” yet lacks the contexts in which they might be called differently.
        </p>
        <p>
          In any case, qualities concepts dependent on entities like “diameter” are not clearly defined
as a distinct concept type from generic qualities like “length,” while being related to both
generic qualities and the types of objects (e.g., spheres or circles). In this study, we aim to define
objects as contexts to address such context-dependent issue. In addition, considering the
existence of non-quantitative quality such as way of birth, quality should not be modeled as
kind of quantity. Following DOLCE [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] and YAMATO [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], we define quality as dependent entity
rather than kind of quantity with a clear distinction between quality and quantity.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. An ontology of qualities of inorganic materials</title>
      <p>This section discusses a part of an ontology of qualities of inorganic materials, using concepts
such as “particle” and “particle diameter,” as example and explaining related concepts.</p>
      <sec id="sec-4-1">
        <title>4.1. Particle and particle diameter</title>
        <p>
          The concept of “particle” is defined in Figure 1. Specialization (is-a relation) begins from the top
level and progresses downward through “particular”, “independent entity”, “physical”,
“continuant” (physical object), “unitary object8” (in accordance with YAMATO definitions),
“solid object,” and “spherical object”, culminating in the subclass “particle.” The superclass
“physical object” typically possesses certain qualities, thus featuring a “quality role” slot. By
specifying the “diameter” slot in its subclass “spherical object” and designating its
corresponding subclass “length” as a subtype of “quality,” we establish the concept that a
“spherical object” possesses the quality of “length” referred to as “diameter.” In essence, within
the context where “length” serves as a quality of “spherical object,” it plays the role of
“diameter” (a concept of quality role. Refer to Section 3.1 and [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]), with “length” transitioning
into a role-holder concept (role-playing entity) termed “diameter.” Such definitions as “quality
role” based on YAMATO have advantage of uniform representation of qualities. As mentioned
in Section 3.1, “height” and “width” are anti-rigid and dynamic [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] and thus quality roles.
3 National Cancer Institute Thesaurus, https://bioportal.bioontology.org/ontologies/NCIT
4 Phenotypic Quality Ontology, https://bioportal.bioontology.org/ontologies/PATO.
5 Semanticscience Integrated Ontology, https://bioportal.bioontology.org/ontologies/SIO.
6 The OM - Ontology of units of Measure, 2.0, https://github.com/HajoRijgersberg/OM.
7 QUDT–quantities, units, dimensions and data types ontologies, https://www.qudt.org/
8 Engineering processes often involve modeling objects from two inconsistent perspectives: (i) as a unitary entity,
like a particle, which becomes two different particles when it is cut into half, and (ii) as amount of matter. This paper
adopts the former perspective, modeling objects accordingly.
        </p>
        <p>Furthermore, the specialized concept “particle” inherits the “diameter” slot, yet its role name
is designated as “particle diameter.” Consequently, although both terms denote the diameter of
the same spherical object, the terminology (surface vocabulary) varies between particles and
spherical objects. However, it is clearly defined that they essentially represent the same
diameter. Additionally, using a green-colored alias node, 粒径 “particle size” is described as a
synonym for “particle diameter.”</p>
        <p>Moreover, as depicted in Figure 1 under “filmy object,” it is indicated that the term “length”
may also be referred to as “thickness,” while demonstrating that “crystal” may use the same
“length” term to denote “crystallite size.” This may also be referred to as “particle size.”</p>
        <p>Furthermore, as depicted in Figure 2, “length” is defined as something capable of holding a
quantity value. These quantity values can be expressed either quantitatively (quantitative
values) or qualitatively (qualitative values). Thus, we delineate them as "length quantity" and
"length qualitative value," respectively, as the class constraints of slots of “length.”</p>
        <p>In Figure 3, we defined “length quantity” as subclasses of “quality value.” “Length quantity”
represents the value attributed to “length,” with its unit confined to “unit of length.” Subclasses
of “unit of length” are defined to encompass units associated with length, such as “m” and “nm.”
The “#” symbol preceding the class constraint “unit of length” indicates that what goes into this
slot is not an instance but the class (type such as “m” and “nm”) itself. If values are measured
in different unit systems, two quantities, say, “10” as the number and “μm” as the unit and
“0.000393” and “inch” are equivalent9.
9 Strictly speaking, these values are results of observations/measurement like in SOSA/SSN [18]. This is out of scope
of this paper, though YAMATO’s treatment as a representation is discussed in [19].</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Reaction-relational quality</title>
        <p>We delineate qualities such as “length” and “weight,” intrinsic to objects themselves, as distinct
from qualities that manifest in response to external stimuli10, like “permittivity” and “electrical
resistance”, classified as subclasses of “reaction-relational quality”. Values of such quality
“realize (manifest)” as the response to the external stimuli unlike generic qualities like “length”.
To represent such reaction-relational qualities, it is imperative to reference input attribute
values. We designate this as a role concept termed “excite-reaction quality”. For instance, the
reaction-relational qualities “electric characteristics”, “permittivity” and “electrical resistance”
depicted in Figure 4 realizes in response to specific frequencies, temperatures, and voltages.
This value lacks standalone significance and necessitates contextual reference to these inputs.
We define “frequency quantity” as a class constraint and designate the role concept
“measurement frequency” as an excite-reaction quality of “electric characteristics.” Likewise,
“measurement temperature” and “measurement voltage” are defined as excite-reaction
qualities.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Qualities of a collection</title>
        <p>
          The “particle” defined in Figure 1 are often treated as a collection of particles so-called
“assemblage of particles”. It is often referred to as “powder”. As a collection, it is characterized
by qualities such as variance, mean, median of a specific quality. As depicted in Figure 5,
“variance, mean, median of particle diameter” are qualities of diameters of particles. The “mean
of diameter” maybe referred to as a “particle size”.
10 This is modeled as a kind of disposition in BFO [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Usages of the ontology for patent analysis</title>
      <p>The ontology outlined thus far serves as the foundation for a patent document analysis system.
This system is designed to extract compositions and qualities from collections of patent
documents, associate them with their respective values, and store them in a database. Through
the ontology, compositions and property names are standardized, accommodating paraphrases
and synonyms. Moreover, the system is capable of supplementing omitted mentions of objects
or quality names, establishing relationships between objects, qualities, and values whenever
feasible. Further insights into the system's configuration and effectiveness are provided in
Section 6.</p>
      <sec id="sec-5-1">
        <title>5.1. Identification of a triplet of an object, a quantity and its value</title>
        <p>The ontology described thus far enables us to capture the relationship between objects, qualities,
and quality values (plus units) in expressions such as the following. The top line represents an
example sentence found in patent documents (translated from Japanese documents), while the
bottom line represents the types of concepts identifiable through analysis.</p>
        <p>The mean of particle diameter of
quality
cerium oxide
object (powder).</p>
        <p>is 0.1-20 µm.</p>
        <p>quality value (+ unit)</p>
        <p>Firstly, cerium oxide is defined as a composition under the concept of “substance”. However,
it can also be interpreted as referring to the object that has it as a slot. Subsequently, as the term
“mean of particle diameter” is denoted as a “mean,” it can be interpreted as a “quality of a
collection,” as explicated in Section 4.3. Moreover, drawing from the definition of “assemblage
of particles (powder)” presented in Figure 5, it is recognized as a “quality” signifying the mean
of particle diameter within the particle assemblage. Furthermore, according to the definition
outlined in Section 4.1, “particle diameter” signifies the diameter of particles and is understood
to be accompanied by a length unit such as µm. Consequently, leveraging the ontology's
definition, the relationship between these terms can be discerned as a triplet comprising an
object, a quality, and its corresponding value. This framework underpins the discussions
pertaining to issues 1-3 in Section 2.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Identification of objects from quantities</title>
        <p>In the aforementioned example sentence, not only is the relationship recognized, but also
additional information absent in the original text has been extracted. The original sentence
solely denotes the composition of cerium oxide, leaving the object it represents implicit.
Referring to the definition in Figure 5, which specifies that “mean of particle diameter” is a
quality of an “assemblage of particles,” we can infer that it pertains to an “assemblage of
particles” (“powder”) (highlighted with double underlines).</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Identification of quantities from objects and quality units</title>
        <p>In the following example sentence, the quality type “particle diameter” is augmented by
considering the representation of the quality value unit (nm) and the object (particle). This
augmentation is derived from the defined relationship between “particle” and “particle diameter”
illustrated in Figure 1, along with the definition of “length” and “length quantity” presented in
Figures 2 and 3.</p>
        <p>particles of 100 nm → particles with a particle diameter of 100 nm</p>
        <p>object quality quality value</p>
        <p>On the other hand, in the subsequent example sentence, the quality type “thickness” is
supplemented from the context of the “film,” as it aligns with the generic quality of length. This
illustrates how distinct quality representations can be complemented based on the contextual
characteristics of the objects involved.</p>
        <p>film of 2.0 μm → filmy object with thickness of 20 μm</p>
        <p>object quality. quality value</p>
        <p>Consider if in the ontology both a diameter and a circumference are defined as quality roles
of a particle. If the patent document has no description, it is ambiguous for quality roles. This
is domain-dependent, in the inorganic material domain, by default, it refers to “diameter”.</p>
      </sec>
      <sec id="sec-5-4">
        <title>5.4. Identification of reaction-relational qualities</title>
        <p>The “reaction-relational quality” defined in Section 4.2 can be measured by providing specific
values for “excite-reaction quality.” Hence, when presented with the following example
sentence, it is inferred by convention that ε represents permittivity. Referring to the definition
in Figure 4 of Section 4.2, it becomes apparent that excite-reaction qualities such as frequency,
voltage, and temperature are involved in measuring permittivity. Among them, "measurement
frequency" takes Hz as the unit, and 15GHz complements as the value of the measurement
frequency when measuring permittivity. As a result, as shown below, 15GHz complements as
the value of the measurement frequency, and 200 complements as the value of permittivity.
ε at 15 GHz is 200 → permittivity ε at measurement frequency 15 GHz
reaction-relational quality value of excite-reaction quality
is 200
value</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Real usages and their effects</title>
      <p>In this section, we discuss the implementation of systems and the effectiveness of ontologies
described in the previous section.</p>
      <sec id="sec-6-1">
        <title>6.1. Ontologies used in the system</title>
        <p>
          We constructed an ontology to define concepts related to the qualities of inorganic materials
using Hozo [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], which is exported to OWL. This ontology includes 532 comprehensive
definitions of concepts such as properties and subordinate concepts within a specific domain.
Additionally, elements such as titanium or barium, as well as compounds like barium titanate,
are chemically classified and defined with 367 subcategories under the concept of “substance.”
The composition and synonyms within these subject domains are also extensively described.
        </p>
      </sec>
      <sec id="sec-6-2">
        <title>6.2. Overview and procedure of text analysis</title>
        <p>Users prepare a rough patent document set to investigate a particular area that is later processed
through text analysis using our aforementioned ontology before extracting concepts and
numerical ranges for storage into a relational database (RDB). Text analyses are performed by
following procedure.</p>
        <p>1. If any concept from our ontology is found within the text, it is tagged with additional
augmentation like object or quality.
2. Any strings that were not tagged during step 1 are then subjected to morphological
analysis using MeCab [20].
3. Units along with numbers representing values or chemical formulas are identified and
assigned appropriate tags.
4. Phrases consisting of multiple concepts are tagged as a single word that includes those
concepts as attributes. For example, the phrase “multilayer ceramic capacitor” contains
three concepts: “multilayer structure,” “ceramic,” and “capacitor.”
5. Connections between numerical ranges and associated concepts are identified so
expressions like “using 10g particles measuring at 0.1μm” would be grouped together
under one tag while completing missing information on certain abbreviated qualities if
needed.
6. Information regarding objects or qualities obtained from step 5 is stored in an RDB
(Relational Database), while analyzed texts themselves may also be outputted.</p>
      </sec>
      <sec id="sec-6-3">
        <title>6.3. Examples of the complementary effect of ontology</title>
        <p>We present an example of the effectiveness of ontology in actual searches using 139 patent
documents related to Multilayer Ceramic Capacitors (MLCCs) that contain keywords such as
“withstanding voltage” or “dielectric constant.”</p>
        <p>Searching for the string “粒子(particle)” resulted in 101 instances where expressions like
“particles with a content greater than 90mol%” were found. However, by utilizing our ontology,
we complemented the term “粒子(particle)” with its associated characteristic “粒径”(abbreviation
for particle size) thereby increasing the search results to 152. As an illustrative example,
consider the expression “K2CO with a particle size below 1.0 μm.” By completing this phrase
with an object concept called “particle,” categorized under the quality named "particle size," we
successfully included it in our search results. These completions have been verified as accurate
by domain experts, and considering their completions as correct data provided by experts yield
perfect accuracy and precision rate at values of both being equal to one. On the other hand, for
instance, since there is no definition for “amount of warpage” within our ontology database,
expressions such as “the amount of warpage in Sample One is 55 μm” failed to be complemented
due to referring it back generically under “amount”. This led to failures concerning tag
completion regarding “layer.” A completion recall ratio was at approximately 0.88 while also
attaining an F-value close-to 0.93 (Table 1).</p>
        <p>Next, regarding the complementation of quality and excite-reaction qualities based on their
values which discussed in Section 5.4, for example, in the expression "dielectric loss at 1 MHz
is less than or equal to 9.8%", we were able to identify that “dielectric loss” has a excite-reaction
quality with the unit of “Hz” and could be completed as “measurement frequency”. In the
aforementioned collection of patent documents, there were 191 expressions containing the term
“dielectric loss”, but by searching specifically for dielectric losses at a measurement frequency
of 1 MHz only, we were able to narrow down the results to 32 cases. Among these cases, there
were six examples where the term “measurement frequency” was omitted. Additionally, this
search also covered cases such as ranges including 1 MHz like “0.5-2 MHz”.</p>
        <p>Furthermore, when we searched for terms such as “dielectric constant”, we obtained 265 search
results. Among them, there were 26 instances where the term “measurement frequency” was
explicitly mentioned and thus, it was possible to supplement it. There were two cases in which
supplementing the measurement frequency failed. This happened because those patterns are
currently not supported by our Japanese language processing system.</p>
        <p>In total, we were able to accurately complete implicit expressions of measurement frequency
for both dielectric loss and dielectric constant in 108 cases (True-Positive). There were 1404
cases where the frequency was not mentioned at all (True-Negative). Additionally, there were
six instances where only the numerical representation of the frequency was present but could
not be correctly completed as “measurement frequency” (False-Negative). Overall, summarizing
these results, the accuracy was 1.00, precision was 1.00, recall rate was 0.95, and F-value was
0.97 (Table 1).</p>
      </sec>
      <sec id="sec-6-4">
        <title>6.4. Graphical analysis and task evaluation</title>
        <p>After performing the tagging as described in Section 6.2, we conducted an analysis using
Keygraph 0 for patent documents. First, we visualized the results with basic morphological
analysis along with identification of certain chemical formulas and numerical ranges in Figure
6. As indicated by the red circles in the figure, there were instances where excessively dividing
words into fragments, such as “本” (this), and chemical prefix “ジ(di)”, and divisions that lacked
an object making it difficult to interpret their meaning, such as “細かい” (detailed) and “変動”
(variation). Additionally, within some islands on the graph, there was a mixture of words
belonging to various contexts which made it challenging to extract any meaningful
interpretation from them.</p>
        <p>detailed
this</p>
        <p>suppressing electron
carrier concentration related to</p>
        <p>valence band
conduction band</p>
        <p>carrier concentration
formation of
lattice defects</p>
        <p>excitation
conduction</p>
        <p>band gap
electron avalanche</p>
        <p>On the other hand, we conducted an analysis using ontology as shown in Figure 7. With the
utilization of ontology, there was an increase in nodes that could be interpreted with meaningful
phrases such as “suppressing carrier concentration” which made it easier to understand the
meaning of islands.</p>
        <p>By conducting such processing, it is believed that the visualization results move closer to
depicting semantic relationships as a network rather than simply representing morphological
relationships between tokens. As a result, domain experts can more easily evoke the effects and
mechanisms discussed in patents from the visualization results.</p>
        <p>To confirm these effects, an evaluation was conducted with three experts in inorganic
materials who possessed specialized knowledge. They were given the following tasks:
1. Observe the visualization results and identify keywords related to objectives.
2. Modify and label the visualization results to improve readability of island meanings.
3. Identify key technological points described in the patents.</p>
        <p>Table 2 shows the number of patent documents and the required time for the tasks.
According to the feedback from the workers, it was reported that the analysis time was reduced
to approximately 1/5 to 1/10 compared to manual analysis. Additionally, there was a tendency
for less increase in required time compared to an increase in the number of patents.</p>
        <p>Similar tasks were assigned to three different workers who provided their observations as
responses. They expressed that interpreting Keygraph and adjusting calculation parameters for
desired visualization results require trial-and-error processes or training efforts; however, they
could understand connections between keywords and islands, and they believe it is possible to
obtain necessary insights and triggers for new ideas.</p>
        <p>After the tasks, a survey was conducted using a 5-point scale for evaluation. The average
values of the evaluations from the three workers are shown in Table 3. With an average score
of 3.6, it can be considered that this method is deemed useful for patent investigation. Although
these results serve as reference values due to the small sample size, they also indicate variance.
Evaluation items with a variance value of 0.2 had less variability in ratings.</p>
        <p>Furthermore, regarding evaluation items with higher variances such as “Understanding
overall picture of patents” and “Understanding countermeasures/solutions”, interviews revealed
that these were subjective and influenced by users’ interpretations.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>7. Concluding remarks</title>
      <p>This paper has demonstrated the use of context-dependent definition of the ontology realizes
complementation of missing text and thus helps the engineers capture the contents of patent
documents, which has not been achieved by conventional methods. Deployment of the
contextdependency on manufacturing process remains as future work. Future work also include
incorporation of the LLM (Large Language Models)-based techniques.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgements</title>
      <p>The authors express their sincere thanks to Riichiro Mizoguchi for his valuable comments.
[17] OMG, Systems Modeling Language, Model Library for Quantities, Units, Dimensions, and</p>
      <p>Values (QUDV), OMG Document Number: ptc/2009-08-16, pp.221-240 (2009).
[18] Janowicz, K. et al.: SOSA: A lightweight ontology for sensors, observations, samples, and
actuators, Journal of Web Semantics, 56, pp.1-10, (2019).
[19] Mizoguchi, R.: YAMATO: Yet Another More Advanced Top-level Ontology,
https://www.hozo.jp/onto_library/YAMATO101216.pdf (2010).
[20] Mecab: Yet Another Part-of-Speech and Morphological Analyzer,
https://taku910.github.io/mecab/, (2024) (in Japanese).
[21] Llorà, X., Goldberg, D.E., Ohsawa, Y. et al.: Innovation and Creativity support via Chance
Discovery, Genetic Algorithms, New Mathematics and Natural Computation, 2(1),
pp.85100 (2006).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Göpfert</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuckertz</surname>
            , Weinand,
            <given-names>J. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kotzur</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stolten</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Measurement Extraction with Natural Language Processing: A Review, Findings of the Association for Computational Linguistics: EMNLP</article-title>
          <year>2022</year>
          , pp.
          <fpage>2191</fpage>
          -
          <lpage>2215</lpage>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Hao</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Weng</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Valx: A System for Extracting and Structuring Numeric Lab Test Comparison Statements from Text</article-title>
          ,
          <source>Methods of Information in Medicine</source>
          ,
          <volume>55</volume>
          (
          <issue>03</issue>
          ), pp.
          <fpage>266</fpage>
          -
          <lpage>275</lpage>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Muffo</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cocco</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bertino</surname>
          </string-name>
          , E.:
          <article-title>Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition</article-title>
          ,
          <source>Proc. 13th Conf. on Lang. res. And Eval(LREC2022)</source>
          , pp.
          <fpage>291</fpage>
          -
          <lpage>297</lpage>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Rijgersberg</surname>
            , H., van Assem,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Top</surname>
          </string-name>
          , J.:
          <article-title>Ontology of units of measure and related concepts</article-title>
          .
          <source>Semantic Web</source>
          .
          <volume>4</volume>
          (
          <issue>1</issue>
          ),
          <fpage>3</fpage>
          -
          <lpage>13</lpage>
          (
          <year>2013</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Aameri</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chui</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grüninger</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hahmann</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ru</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Foundational Ontologies for Units of Measure</article-title>
          , Applied Ontology,
          <volume>15</volume>
          (
          <issue>3</issue>
          ), pp.
          <fpage>313</fpage>
          -
          <lpage>359</lpage>
          , (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Borgo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Masolo</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <article-title>Ontological foundations of DOLCE</article-title>
          . In R. Poli,
          <string-name>
            <given-names>M.</given-names>
            <surname>Healy</surname>
          </string-name>
          and
          <string-name>
            <given-names>A</given-names>
            .
            <surname>Kameas</surname>
          </string-name>
          (Eds.),
          <source>Theory and Applications of Ontology: Computer Applications</source>
          , pp.
          <fpage>279</fpage>
          -
          <lpage>295</lpage>
          . Springer. doi:
          <volume>10</volume>
          .1007/
          <fpage>978</fpage>
          -90-481-8847-5_
          <fpage>13</fpage>
          . (
          <year>2010</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Arp</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Spear</surname>
            ,
            <given-names>A.D.</given-names>
          </string-name>
          <article-title>Building Ontologies with Basic Formal Ontology</article-title>
          . MIT Press.
          <article-title>(</article-title>
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Mizoguchi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borgo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , YAMATO:
          <article-title>Yet-another more advanced top-level ontology</article-title>
          ,
          <source>Applied Ontology</source>
          ,
          <volume>17</volume>
          (
          <issue>1</issue>
          ), pp.
          <fpage>211</fpage>
          -
          <lpage>232</lpage>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>Borgo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galton</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kutz</surname>
            ,
            <given-names>O</given-names>
          </string-name>
          . (eds.), Foundational ontologies in action, Applied Ontology,
          <volume>17</volume>
          (
          <issue>1</issue>
          ), (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Loebe</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Abstract vs. social roles-Towards a general theoretical account of roles</article-title>
          .
          <source>Applied Ontology</source>
          ,
          <volume>2</volume>
          (
          <issue>2</issue>
          ),
          <fpage>127</fpage>
          -
          <lpage>158</lpage>
          (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>Mizoguchi</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sunagawa</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kozaki</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Kitamura</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>A model of roles within an ontology development tool</article-title>
          : Hozo, Applied Ontology,
          <volume>2</volume>
          (
          <issue>2</issue>
          ),
          <fpage>159</fpage>
          -
          <lpage>179</lpage>
          (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <surname>Zhang</surname>
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>D.:</given-names>
          </string-name>
          <article-title>A survey on units ontologies: architecture, comparison</article-title>
          and reuse, Program,
          <volume>51</volume>
          (
          <issue>2</issue>
          ), pp.
          <fpage>193</fpage>
          -
          <lpage>213</lpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <surname>Keil</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          and
          <string-name>
            <surname>Schindler</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Comparison and evaluation of ontologies for units of measurement</article-title>
          .
          <source>Semantic Web Journal</source>
          ,
          <volume>10</volume>
          (
          <issue>1</issue>
          ),
          <fpage>33</fpage>
          -
          <lpage>51</lpage>
          . doi:
          <volume>10</volume>
          .3233/SW-180310 (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>Martin-Recuerda</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          et al.:
          <article-title>Revisiting Ontologies of Units of Measure for Harmonising Quantity Values - A Use Case</article-title>
          ,
          <source>In Proc. of ISWC</source>
          <year>2020</year>
          , LNCS 21507, pp.
          <fpage>551</fpage>
          -
          <lpage>567</lpage>
          (
          <year>2020</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15] Joint Committee for Guides in Metrology,
          <source>International Vocabulary of Metrology</source>
          , 3rd ed., https://www.bipm.org/en/committees/jc/jcgm/, (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <surname>Gruber</surname>
            ,
            <given-names>T. R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Olsen</surname>
            ,
            <given-names>G.R.:</given-names>
          </string-name>
          <article-title>An ontology for engineering mathematics</article-title>
          ,
          <source>Proc. of Comparison of implemented ontology, ECAI'94 Workshop</source>
          , W13, pp.
          <fpage>93</fpage>
          -
          <lpage>104</lpage>
          (
          <year>1994</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>