<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>PKBD.Onto: A Plugin for Ontological Schemas Generation</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>OS OWL = C</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Matrosov Institute for System Dynamics and Control Theory, Siberian Branch of Russian Academy of Sciences</institution>
          ,
          <addr-line>Lermontov St. 134, Irkutsk</addr-line>
          ,
          <country country="RU">Russia</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The use of Semantic Web technologies (including ontologies) for intelligent systems and knowledge bases engineering is a widespread practice, it is true especially for tasks of conceptualization and formalization. However, tools and approaches used for these tasks in most cases provide only a manual manipulation of concepts and relationships. In this regard, the use of various information sources for automated ontology engineering is relevant. One of these sources is spreadsheets. In this paper, we propose an approach for the automated creation of ontological schemas based on the analysis and transformation of spreadsheets data. The feature of our approach is the original relational canonicalized form of spreadsheets. This form is used for preprocessing spreadsheets and unifying the input data. The proposed approach is implemented in the form of a plugin (PKBD.Onto) for Personal Knowledge Base Designer - software for prototyping rule-based expert systems. The main stages of the approach, the architecture and functions of the plugin, and the case study are also described.</p>
      </abstract>
      <kwd-group>
        <kwd>Spreadsheets</kwd>
        <kwd>Canonical Spreadsheet</kwd>
        <kwd>Ontological Schema</kwd>
        <kwd>OWL</kwd>
        <kwd>Model Transformation</kwd>
        <kwd>Code Generation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Vidiya
The use of Semantic Web technologies, including ontologies [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], for intelligent
systems and knowledge bases engineering is a widespread practice. In most cases,
ontologies and special software (e.g., Protégé, ONTOedit, Menthor Editor, Semaphore
Ontology Editor, OntoStudio, WebOnto, Fluent Editor, etc.) are used by analysts and
domain experts for tasks of knowledge conceptualization and formalization. However,
these tools provide a weak integration with external information sources (e.g.,
databases, texts, tables, conceptual models, etc.) in terms of importing domain concepts
and relationships. This fact reduces the efficiency of the ontology engineering
process. One of the information sources that can be used for the automated creation of
ontologies is spreadsheets. Today, a large volume of arbitrary tables has been
accumulated worldwide [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and presented in the spreadsheet-like formats (HTML,
Copyright © 2020 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0).
      </p>
      <p>EXCEL, and CSV). Arbitrary tables are a valuable data source in business
intelligence and data-driven research.</p>
      <p>
        In our previous papers [
        <xref ref-type="bibr" rid="ref14 ref5">5, 14</xref>
        ] we proposed an approach for automated analysis and
transformation of spreadsheets into conceptual domain models in the form of UML
class diagrams. In this paper, we propose to apply this approach for ontological
schemas generation (ontologies at the TBox level) in the OWL2 DL format [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. A feature
of the proposed approach is the use of the original canonicalized form for
representation of spreadsheets, which provides the unification of input data.
      </p>
      <p>Our approach is implemented in the form of the plugin, namely, PKBD.Onto, for
Personal Knowledge Base Designer (PKBD) [15] – software for prototyping
rulebased expert systems. A case study for the proposed approach and the plugin
description are also presented.
2
2.1</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <sec id="sec-2-1">
        <title>Method for Spreadsheets Transformation</title>
        <p>Rule 2: IF RH corresponds only one CH and at the same time RH contains two
values with the separator (“|”) THEN RH transformed to a class with properties from
CH and with an additional property "Name" that corresponds to RH-2.</p>
        <p>Rule 3: IF RH contains two values with the separator (“|”) and they correspond to
two CH values with the separator (“|”), THEN RH transformed to the first class, CH
transformed to the second class and a relationship stated between them.</p>
        <p>Rule 4: IF RH corresponds to three CH values with the separator (“|”), THEN RH
transformed to the first class with properties CH-1, and CH 2 and 3 transformed to the
second class and a relationship stated between them.</p>
        <p>All obtained parent-child relationships are interpreted as the association and the
cardinality of “1..*” is determined by default.</p>
        <p>By default attribute values are set based on the D column.</p>
        <p>The main results of this algorithm are fragments of conceptual models. These
fragments need to aggregate, including operations for clarifying the names of
concepts, their properties and relationships, and also their possible merging and
separation.</p>
        <p>The following rules used for automatic aggregation of conceptual models
fragments:</p>
        <p>Rule 1: Merge two classes when they have equal names from duplicate fragments
of class diagrams.</p>
        <p>Rule 2: Merge two classes when they have the same structure, i.e. when sets of
attributes are equal. In this case, only the first class with this structure stays in the
model.</p>
        <p>
          Rule 3: Merge two classes when they have similar names. The resulting fragments
of class diagrams can describe the same objects or processes. We suggest using a
simple string comparison method based on the Levenshtein distance [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] to determine
the similarity between two names of classes. If the distance is less than or equal to
three, then we assume the classes to be similar. Note that this is not enough, so we
also look at the structure of classes (names of attributes must partly match).
        </p>
        <p>Rule 4: Create a new association between two classes if homonymous classes and
attributes exist. In this case, a name in one class is equivalent to the attribute name in
another class. At the same time, the attribute of the same name is removed.</p>
        <p>Rule 5: Remove duplicate associations between classes.</p>
        <p>Manual merging and separation operations are performed by using PKBD.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>PKBD: a Tool for Knowledge Base Engineering</title>
        <p>
          We used PKBD when solving problems of knowledge bases of expert systems
engineering, in particular, in the field of ISI [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. PKBD is implemented as a desktop
application designed for non-programmers. The main purpose of PKBD is to prototype
knowledge bases that use the formalism of logical rules.
        </p>
        <p>
          One of PKBD features is a support of the Rule Visual Modeling Language
(RVML) [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ]. RVML is considered as a UML extension. Other PKBD features are the
following:
• a modular architecture that provides the ability to add modules for
supporting knowledge programming languages. Currently, CLIPS and Drools
are supported;
• integrability with conceptual modeling tools when importing and
exporting concepts and relationships.
        </p>
        <p>The PKBD architecture determines the interaction of the following main software
components:
• a knowledge base management module, it provides storage of projects in
the EKB format (the proprietary XML-like format);
• a user interface subsystem includes the following modules: software
wizards for manipulating knowledge base elements, a GUI generation, a Tiny
RVML editor;
• a subsystem for supporting programming language modules, it provides
connection and disconnection of modules, access to their functions for
generating program codes;
• a module of integration with conceptual models sources: IBM Rational</p>
        <p>Rose, StarUML, XMind, CMapTools, and TabbyXL;
• a rule engines control module provides activation of rule engine for
testing knowledge bases;
• a module of interaction with the web-based software called Knowledge</p>
        <p>
          Base Development System (KBDS) [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>
          Main functions of PKBD are:
• designing elements of rule bases (fact templates, facts, and rules) by
nonprogrammers using a set of wizards and defined sources of conceptual
models;
• checking the integrity of the developed knowledge bases (syntactic and
semantic control);
• representing knowledge base elements using RVML;
• generating knowledge base codes in the CLIPS format;
• testing developed knowledge base codes (logical inference) using the
integrated CLIPS rule engine;
• integrating with CASE-tools: IBM Rational Rose, StarUML, XMind, and
CMapTools, regarding import and transformation of conceptual models in
order to highlight the main entities (concepts) and relationships for
creating knowledge base drafts;
• integrating with TabbyXL [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] in terms of import and transformation of
canonical spreadsheet tables in order to highlight the main entities
(concepts) and relationships for creating knowledge base drafts;
• interacting with the KBDS service.
        </p>
        <p>We used PKBD as an open software platform and developed a PKBD.Onto plugin.
This plugin implements our approach for ontological schemas generation in the
OWL2 DL format.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Proposed Approach</title>
      <sec id="sec-3-1">
        <title>Method</title>
        <p>
          The method for generating ontological schemas is based on principles of a model
transformation. A model transformation is one of the key concepts in Model-Driven
Engineering (MDE) [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
        </p>
        <p>From a formal point of view our method can be represented as a chain of
horizontal exogenous transformations:</p>
        <p>T : CS CSV
→ CM XML
where CS CSV is a source spreadsheet presented in a canonicalized form and saved
in CSV format using TabbyXL. The structure of a canonical spreadsheet is described
in Section 2.1; CM XML is a conceptual model resulted from spreadsheet
transformation, which is a form for the internal representation of domain concepts and
relationships for PKBD; OS OWL is a target ontological schema in the OWL2 DL format.</p>
        <sec id="sec-3-1-1">
          <title>Using (2), let’s describe CM XML in more detail:</title>
          <p>CM</p>
          <p>XML =</p>
          <p>C, DT , RL ,
where C is a set of classes; DT is a set of datatypes; RL is a set of relationships
between C . Let’s refine C from (3) as follows:</p>
          <p>C = {c1...cn }, ci = namei , ATi , i = 1, n , when namei is a class name; ATi is a
set of class attributes,</p>
          <p>ATi = {ai,1,..., ai,k }, ai, j = name j , type j , value j , j ∈1, k ,
when name j is an attribute name; type j is an attribute datatype, type j ∈ DT ; value j
is a possible attribute value.</p>
          <p>RL = {rl1...rln}, rli = namei , typei , lhsi , rhsi , i = 1, n , when typei is a relationship
type (inheritance, dependency, association, aggregation, composition, realization);
namei is a relationship name; lhsi is a left side of a relationship,
lhsi = namelhs , cdlhs , c j , when namelhs is a name of a class role at the left
relationship side, cdlhs is a cardinality of the left relationship side, c j is a link of a class at
the left relationship side,
c j ∈ C ; rhsi
is a right side of a relationship,
rhsi = namerhs , cardinalityrhs , ck , when namerhs is a name of a class role at the right
relationship side, cdrhs is a cardinality of the right relationship side, ck is a link of a
class at the right relationship side, ck ∈ C . Wherein, cdlhs , cdrhs = {0,0..1,0..*,1,1..*}.</p>
        </sec>
        <sec id="sec-3-1-2">
          <title>Using (2), let’s describe OS OWL in more detail:</title>
          <p>
            when C is a set of classes; OP is a set of object properties; DP is a set of datatype
properties; DT is a set of XML Schema datatypes. A detailed description of the
OWL 2 DL specification is given in [
            <xref ref-type="bibr" rid="ref6">6</xref>
            ].
          </p>
          <p>
            Analysis and transformation of source spreadsheets ( CS CSV ) and formation of a
conceptual model ( CM XML ) are discussed in detail in [
            <xref ref-type="bibr" rid="ref14 ref5">5, 14</xref>
            ]. In this paper, we will
describe in detail how to obtain ontological schemas ( OS OWL ). For this, using (2),
let’s describe a transformation operator ( T ):
          </p>
          <p>T = TCS −CM ,TCM −OSM ,TOSM −OS ,
TCS −CM : CS CSV</p>
          <p>→ CM XML , TCM −OSM : CM XML → OSM ,
TOSM −OS : OSM
→ OS OWL ,
where TCS −CM is a set of rules for transformation of a source spreadsheet in the CSV
format into a conceptual model, for example, a UML class diagram; TCM −OSM is a
set of rules for transformation of a conceptual model into an ontological schema
model; TOSM −OS is a set of rules for transformation of an ontological schema model into
OWL ontology code at the TBox level.</p>
          <p>Wherein: OSM is an ontological schema model designed for a unified
representation and storage of knowledge extracted from various information sources. This
model abstracts from features of knowledge representation languages and their dialects
used for the implementation of ontologies (e.g., OWL, RDFS, etc.).</p>
          <p>So, using sets of transformation rules ( TCM −OSM and TOSM −OS ), ontological
schemas generation ( OS OWL ) includes four main stages.</p>
          <p>Stage 1: Analysing and transforming an XML structure of PKBD internal
knowledge representation for conceptual models. This stage involves extracting
elements, their properties, and relationships from an XML tree (the depth-first search for
elements).</p>
          <p>Stage 2: Forming an ontological schema model. The main objective of this stage is
obtaining typical ontological fragments in the form of a set of classes and their
relationships (object and datatype properties), which describe a certain domain and based
on the extracted XML elements.</p>
          <p>Stage 3: Generating an ontological schema code in the OWL format based on an
ontological schema model.</p>
          <p>
            Transformations themselves can be described using special transformation
languages, for example, Transformation Model Representation Language (TMRL) [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ]. In
this work, we use a general-purpose language to implement transformations.
Moreover, all transformations can be represented in tabular form (Table 1).
          </p>
          <p>Stage 4: Editing an obtained ontological schema. This stage is additional and
represents a refinement (modification) of OWL code obtained with the aid of various
ontological modeling tools, for example, Protégé and others.</p>
          <p>So, the main result of these stages is a set of ontology classes and their properties,
which define an ontological schema at the TBox level.
3.2</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>PKBD.Onto: a Plugin for PKBD</title>
        <p>The PKBD.Onto plugin is implemented in the form of a Dynamic Link Library (DLL)
that is dynamically connected via a unified PKBD API.</p>
        <p>The unified PKBD API for supporting integration modules with external software
in terms of import and export contains three functions:
• getting a description of DLL including name and version (“DllInfo”
function);
• getting a detailed description of DLL (“About” function);
• executing a main function of DLL, while a conceptual model in the PKBD
format, a resulting file name, and a list of possible parameters are passed
as a parameter (“Execute” function).</p>
        <p>In the PKBD.Onto plugin architecture (Fig. 1) can be distinguished following
components:
• supporting a PKBD format of conceptual models, which provides access
and manipulation of model elements;
• transforming the input model to the OWL2 DL format;
• transforming the input model to a set of linked data in the RDF format
(can be viewed as a mean for obtaining a set of specific facts).</p>
        <p>XML PKBD Parser
OWL DL Generator</p>
        <p>RDF Generator</p>
        <p>Fig. 1. A PKBD.Onto plugin architecture.
3.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Case Study</title>
        <p>Currently, PKBD is used in the educational process at Irkutsk National Research
Technical University (IrNRTU), Institute of Information Technology and Data
Science. Therefore, as an example, let’s consider the educational task of developing an
ontological schema fragment.</p>
        <p>Information on minerals in the form of arbitrary spreadsheets is used as source data
(Fig. 2). To unify the input data, a source arbitrary spreadsheet was preprocessed and
a canonical spreadsheet resulted (Fig. 3).</p>
        <p>Next, the canonical spreadsheet is analyzed using PKBD, in particular, by the
PKBD.Onto plugin. Conceptual model elements are extracted as a result of this
analysis. These elements can be visually represented as an RVML schema (Fig. 4). The
obtained model requires modification, namely, all minerals were aggregated into a
“Diamond” class (template), which must be renamed to “Mineral”.</p>
        <p>Based on the modified conceptual model (Fig. 4), we generated the code of the
ontological schema in the OWL format. Then, this code can be verified in Protégé (Fig. 5).
In this paper, we describe a method and tool for ontological schemas generation
(ontologies at the TBox level) in the form of a plugin for Personal Knowledge Base
Designer. Spreadsheets reduced to a canonicalized form and saved in the CSV format
were used as source data. Resulting OWL ontology codes are syntactically correct and
can be evaluated by end-users.</p>
        <p>
          The PKBD.Onto plugin allows one to create rapid prototypes of spreadsheet-based
ontologies for a specific domain. Modified and refined ontologies can be used for
intelligent systems and knowledge bases engineering [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
5
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgments</title>
      <p>This work was financially supported by the Council for Grants of the President of
Russia (grant No. MK-1647.2020.9), Program of the Fundamental Research of the
Siberian Branch of the Russian Academy of Sciences, project no. IV.38.1.2 (reg. no.
АААА-А17-117032210079-1), project no. IV.38.1.3 (reg. no.
АААА-А17117032210077-7). Results are achieved using the Centre of collective usage
«Integrated information network of Irkutsk scientific educational complex».
15. Yurin, A.Yu., Dorodnykh, N.O.: Personal knowledge base designer: Software for expert
systems prototyping. SoftwareX 11, 100411 (2020). DOI: 10.1016/j.softx.2020.100411</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Berman</surname>
            ,
            <given-names>A.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikolaichuk</surname>
            ,
            <given-names>O.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yurin</surname>
            ,
            <given-names>A.Yu.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuznetsov</surname>
            ,
            <given-names>K.A.</given-names>
          </string-name>
          :
          <article-title>Support of DecisionMaking Based on a Production Approach in the Performance of an Industrial Safety Review</article-title>
          .
          <source>Chemical and Petroleum Engineering</source>
          <volume>50</volume>
          (
          <issue>1-2</issue>
          ),
          <fpage>730</fpage>
          -
          <lpage>738</lpage>
          (
          <year>2015</year>
          ). DOI:
          <volume>10</volume>
          .1007/s10556-015-9970-x
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Da</given-names>
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.R.</surname>
          </string-name>
          :
          <article-title>Model-driven engineering: A survey supported by the unified conceptual model</article-title>
          .
          <source>Computer Languages, Systems &amp; Structures</source>
          <volume>43</volume>
          ,
          <fpage>139</fpage>
          -
          <lpage>155</lpage>
          (
          <year>2015</year>
          ). DOI:
          <volume>10</volume>
          .1016/j.cl.
          <year>2015</year>
          .
          <volume>06</volume>
          .001
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Dorodnykh</surname>
            ,
            <given-names>N.O.</given-names>
          </string-name>
          :
          <article-title>Web-based software for automating development of knowledge bases on the basis of transformation of conceptual models</article-title>
          .
          <source>Open Semantic Technologies for Intelligent Systems</source>
          <volume>1</volume>
          ,
          <fpage>145</fpage>
          -
          <lpage>150</lpage>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Dorodnykh</surname>
            ,
            <given-names>N.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yurin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Yu</surname>
          </string-name>
          .:
          <article-title>A domain-specific language for transformation models</article-title>
          .
          <source>CEUR Workshop Proceedings (ITAMS-</source>
          <year>2018</year>
          )
          <volume>2221</volume>
          ,
          <fpage>70</fpage>
          -
          <lpage>75</lpage>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Dorodnykh</surname>
            ,
            <given-names>N.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yurin</surname>
            ,
            <given-names>A.Yu.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shigarov</surname>
            ,
            <given-names>A.O.</given-names>
          </string-name>
          :
          <article-title>Conceptual Model Engineering for Industrial Safety Inspection Based on Spreadsheet Data Analysis /</article-title>
          / Communications in Computer and Information Science.
          <source>Modelling and Development of Intelligent Systems (MDIS</source>
          <year>2019</year>
          )
          <volume>1126</volume>
          ,
          <fpage>51</fpage>
          -
          <lpage>65</lpage>
          (
          <year>2020</year>
          ). DOI:
          <volume>10</volume>
          .1007/978-3-
          <fpage>030</fpage>
          -39237-
          <issue>6</issue>
          _
          <fpage>4</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Grau</surname>
            ,
            <given-names>B.C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patel-Schneider</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>OWL 2: The next step for OWL</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>6</volume>
          (
          <issue>4</issue>
          ),
          <fpage>309</fpage>
          -
          <lpage>322</lpage>
          (
          <year>2008</year>
          ). DOI:
          <volume>10</volume>
          .1016/j.websem.
          <year>2008</year>
          .
          <volume>05</volume>
          .001
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Grishenko</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          <string-name>
            <surname>Dorodnykh</surname>
            ,
            <given-names>N.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nikolaychuk</surname>
            ,
            <given-names>O.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yurin</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Yu</surname>
          </string-name>
          .:
          <article-title>Designing rulebased expert systems with the aid of the model-driven development approach</article-title>
          .
          <source>Expert Systems</source>
          <volume>35</volume>
          (
          <issue>5</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>23</lpage>
          (
          <year>2018</year>
          ). DOI:
          <volume>10</volume>
          .1111/exsy.12291
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Guarino</surname>
          </string-name>
          , N.:
          <article-title>Formal Ontology in Information Systems</article-title>
          .
          <source>In: the First International Conference on Formal Ontology in Information Systems (FOIS'98) 46</source>
          ,
          <fpage>3</fpage>
          -
          <lpage>15</lpage>
          (
          <year>1998</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Lehmberg</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ritze</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Meusel</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>A large public corpus of web tables containing time and context metadata</article-title>
          .
          <source>Proceedings 25th International Conference Companion on World Wide Web</source>
          ,
          <fpage>75</fpage>
          -
          <lpage>76</lpage>
          (
          <year>2016</year>
          ).
          <source>DOI: 10.1145/2872518</source>
          .2889386
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Levenshtein</surname>
            ,
            <given-names>V.I.</given-names>
          </string-name>
          :
          <article-title>Binary codes capable of correcting deletions, insertions, and reversals</article-title>
          .
          <source>Tech. Rep. 8</source>
          ,
          <string-name>
            <surname>Soviet</surname>
            <given-names>Physics Doklady</given-names>
          </string-name>
          (
          <year>1966</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Shigarov</surname>
            ,
            <given-names>A.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khristyuk</surname>
            ,
            <given-names>V.V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikhailov</surname>
            ,
            <given-names>A.M.:</given-names>
          </string-name>
          <article-title>TabbyXL: Software platform for rulebased spreadsheet data extraction and transformation</article-title>
          .
          <source>SoftwareX</source>
          <volume>10</volume>
          ,
          <issue>100270</issue>
          (
          <year>2019</year>
          ). DOI:
          <volume>10</volume>
          .1016/j.softx.
          <year>2019</year>
          .100270
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Shigarov</surname>
            ,
            <given-names>A.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikhailov</surname>
            ,
            <given-names>A.A.</given-names>
          </string-name>
          :
          <article-title>Rule-based spreadsheet data transformation from arbitrary to relational tables</article-title>
          .
          <source>Information Systems</source>
          <volume>71</volume>
          ,
          <fpage>123</fpage>
          -
          <lpage>136</lpage>
          (
          <year>2017</year>
          ). https://doi.org/10.1016/j.is.
          <year>2017</year>
          .
          <volume>08</volume>
          .004
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Tijerino</surname>
            ,
            <given-names>Y.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Embley</surname>
            ,
            <given-names>D.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lonsdale</surname>
            ,
            <given-names>D.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ding</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nagy</surname>
          </string-name>
          , G.:
          <article-title>Towards Ontology Generation from Tables</article-title>
          .
          <source>World Wide Web</source>
          <volume>8</volume>
          (
          <issue>3</issue>
          ),
          <fpage>261</fpage>
          -
          <lpage>285</lpage>
          (
          <year>2005</year>
          ).
          <source>DOI: 10.1007/s11280- 005-0360-8</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Yurin</surname>
            ,
            <given-names>A.Yu.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dorodnykh</surname>
            ,
            <given-names>N.O.:</given-names>
          </string-name>
          <article-title>A Reverse Engineering Process for Inferring Conceptual Models from Canonicalized Tables</article-title>
          .
          <source>Proceedings of the 2019 International MultiConference on Engineering, Computer and Information Sciences (SIBIRCON</source>
          )
          <fpage>485</fpage>
          -
          <lpage>490</lpage>
          (
          <year>2020</year>
          ).
          <source>DOI: 10.1109/SIBIRCON48586</source>
          .
          <year>2019</year>
          .8958458
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>