<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Full of beans: a study on the alignment of two flowering plants classification systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Yi-Yun Cheng</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Bertram Ludäscher</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Information Sciences, University of Illinois at Urbana-Champaign</institution>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Advancements in technologies such as DNA analysis have given rise to new ways in organizing organisms in biodiversity classification systems. In this paper, we examine the feasibility of aligning two classification systems for flowering plants using a logic-based, Region Connection Calculus (RCC-5) approach. The older “Cronquist system” (1981) classifies plants using their morphological features, while the more recent Angiosperm Phylogeny Group IV (APG IV) (2016) system classifies based on many new methods including genome-level analysis. In our approach, we align pairwise concepts X and Y from two taxonomies using five basic set relations: congruence (X=Y), inclusion (X&gt;Y), inverse inclusion (X&lt;Y), overlap (X&gt;&lt;Y), and disjointness (X!Y). With some of the RCC-5 relationships among the Fabaceae family (beans family) and the Sapindaceae family (maple family) uncertain, we anticipate that the merging of the two classification systems will lead to numerous merged solutions, socalled possible worlds. Our research demonstrates how logic-based alignment with ambiguities can lead to multiple merged solutions, which would not have been feasible when aligning taxonomies, classifications, or other knowledge organization systems (KOS) manually. We believe that this work can introduce a novel approach for aligning KOS, where merged possible worlds can serve as a minimum viable product for engaging domain experts in the loop.</p>
      </abstract>
      <kwd-group>
        <kwd>taxonomy alignment</kwd>
        <kwd>KOS alignment</kwd>
        <kwd>interoperability</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        With the advent of large-scale technologies and datasets, it has become increasingly
difficult to organize information using a stable unitary classification scheme over time.
An ideal work classification system, as noted in [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], should be neither too fine-grained,
nor too esoteric, to stand the test of time. However, a real-life knowledge organization
system (KOS) oftentimes has trade-offs in its stability and granularity, especially when
new developments have been made technologically. For example, the use of DNA
analysis has provided new data signals that in turn have changed the way biologists classify
organisms—traditionally, they may classify organisms based on similarities of
surfacelevel features, while classifying based on similarities in micro-level DNA analysis has
become prevalent now. Therefore, the interoperability among different KOSs over time
addressing the same topic has become more and more important in this current era with
rapid developments and innovations.
      </p>
      <p>
        In the biodiversity communities, taxonomies, a type of KOS under the classification
schemes [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], has always been one of the main focuses of research; especially in the
field of systematics. As such, similar arguments about maintaining a ‘unitary
classification’ over time were made in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The authors [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] stated that it is common for
taxonomists to contradict each other’s or even their own previous taxonomies. To this end,
rather than having a permanent anchor for a specific KOS (taxonomy or classification
scheme), a better approach, perhaps, is to embrace the fact that KOS are dynamic,
timespecific, and responsive to both empirical signals and human classification interests.
Thus, a more principled solution in dealing with the interoperability issues among KOS
is perhaps to compare and align the classifications and at the same time presenting the
disagreements among them [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ][
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <p>
        In this paper, we propose the use of a logic-based approach to compare and
reconcile two major classification systems in the field of plant systematics, with the aim of
demonstrating the feasibility of integrating two classifications that result in numerous
possible solutions. Further, we demonstrate the computational power that can aid us in
aligning KOS which could not have been possible when working with alignments
manually. Specifically, we consider a use case of the flowering plants classification systems
by (1) Arthur Cronquist (1981) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], and (2) Angiosperm Phylogeny Group IV [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] in
order to map plant families (concepts) mentioned in both classifications with one of the
five Region Connection Calculus relations (RCC-5) using an open source, logic-based
tool named Euler/X. The flowering plants we have included in this study are not only
some of the most common families that we see in our everyday lives, such as the
sunflowers (Asteraceae sec. APG IV1), but also those flowering plants by a biologist’s
definition such as the beans family (Fabaceae sec. APG IV). We hope that this work
will further shed light on the possible alignments of the classifications in the
information science community and bring a novel approach for aligning KOS in the future.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Two Flowering Plant Classifications</title>
      <p>
        In line with the traditions of Bentham and Hooker, Takhtajan, and Bessy, the Cronquist
system [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] is an approach in classifying and identifying flowering plants based on
phyletics (classifying resemblances based on evolution and morphological similarity -
similar characters - of the plants). The Cronquist system divides the whole flowering plant
world into two phyla, Magnoliopsida and Liliopsida, with approximately 300 families
included in the former, and 60 families in the latter [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]; this system is said to be the
most “fully developed phyletic system” of flowering plant classification systems2 by
far [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ][
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        However, rapid breakthroughs in DNA studies and technologies have given rise to
a more recent camp of approaches in classifying plants based on phylogenetics. Early
1 Taxonomic Concept Labels (TCLs): name sec. source
2 In biodiversity classification systems, the sequence of the concepts is not taken into account.
cladistic analysis or phylogenetic systematics, established by Willi Hennig, has put
systematics to the task of finding shared, derived character states among any three groups
of organisms to find their common ancestors, or clades [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. Modern phylogenetics
consists of the use of “both morphological and molecular data and modern methods of data
analyses to study evolutionary relationships among organisms” [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The Angiosperm
Phylogeny Group system (APG IV) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], is one classification in this camp that has
become the de facto standard for the classification of plants of the modern era. The most
noticeable differences between the APG IV system and the other plant classification
systems are in the higher-level ranks. Instead of using classes or phyla, the APG uses
clades (e.g. rosid clade, asterid clade), or even other higher-level ranks (e.g. monocots,
eudicots) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Though the APG IV system has become the most recent major
classification systems, the Cronquist system, established almost 40 years ago, still remains
highly influential to this date for its completeness and comprehensiveness, and many
legacy papers with key plant data still used Cronquist’s classifications.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>Reasoning about taxonomies and Euler/X</title>
      <p>
        Taxonomies are one type of KOS with hierarchical structures, similar to classification
schemes [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Taxonomy alignment refers to the mapping and reconciliation of two or
more taxonomies. In our approach, we employ a logic-based approach called Region
Connection Calculus (RCC-5) to align concepts across taxonomies using five possible
relationships: congruence (X=Y), inclusion (X&gt;Y), inverse inclusion (X&lt;Y), overlap
(X&gt;&lt;Y), and disjointness (X!Y). Euler/X (https://github.com/EulerProject/EulerX) is a
logic-based reasoning tool for taxonomy alignment based on set constraints,
specifically RCC-5 and implemented in answer set programming and direct RCC reasoning.
      </p>
      <p>PW4 is_a (inferred)PW62 overlaPpsW(i7nferred) 1</p>
      <p>is_a (input) 2 is_a (inferred) 2
overlaps (inferred) 1 is_a (input) 2
Fig. 1. A simple taxonomy alignment problem with T1, T2 and 7 possible worlds
=
T1.a</p>
      <p>T2.x
T1.b</p>
      <p>T1.c</p>
      <p>T2.y</p>
      <p>T2.z</p>
      <p>Input Taxonomies
T1.a
T2.x
T1.a
T2.x</p>
      <p>T2.z
T1.b
T2.y
T1.b</p>
      <p>PW3</p>
      <p>T1.c
T2.y
T1.c
T2.z</p>
      <p>T1.a</p>
      <p>T2.x
T1.a
T2.x</p>
      <p>Nodes
congruent T2.z 1</p>
      <p>T2 2
T1 T1.c 2</p>
      <p>Edges
overlaps (inferPreWd)5 1
is_a N(inofdereresd) 2
icso_nag(riunepnutt)T2.y 12
T1.a T2 2
T2.x T1</p>
      <p>Edges T1.c 2</p>
      <p>T1.b
T2.y
T1.c</p>
      <p>T2.z
PW1</p>
      <p>T1.b
T2.y
T1.b
T2.z</p>
      <p>T1.b</p>
      <p>T2.z
TN1.oades
coTn2g.xruent 3 T1.c</p>
      <p>Edges T2.y
is_a (inputP)W2 2</p>
      <p>Nodes
congruent 1</p>
      <p>T2 T2.y 2
T1 2</p>
      <p>EdgeTs1.c
T1o.averlaps (inferred) 1
T2.x
is_a (NinofedrerTes2d.)z 2
isc_oang(rinupeuntt) 12</p>
      <p>T2 T1.b 2
T1 2
Edges</p>
      <p>Nodes
congruent 3</p>
      <p>Edges
is_a (input) 2</p>
      <p>Nodes
congruent</p>
      <p>T2
T1
Edges</p>
      <p>
        Given two taxonomies T1 and T2, and a set of articulations (relationships between
concept X in T1 and concept Y in T2, defined in RCC-5 relations), the Taxonomy
Alignment Problem (TAP) is to derive a merged taxonomy T3. A TAP may have zero, one,
or many solutions – inconsistent, unique, or ambiguous solutions respectively. In
previous work [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], we have addressed the challenges of vocabulary confusion and
interoperability between similar but different taxonomies by reconciling the taxonomic
disagreement via unique Euler/X solutions, i.e., where there is only one “possible world”
that includes all logically inferred inter-taxonomy relationships. In this paper, we
further demonstrate features of Euler/X, i.e., its capability to naturally represent ambiguity
via multiple possible worlds. A possible world is a consistent solution where there are
no contradictions on the ways concepts were aligned, and that each concept in the two
taxonomies is “sorted out” with exactly one of the five RCC relations. To show a simple
TAP example and the multiple possible worlds that exist, consider two taxonomies T1
and T2, each with two children (Figure 1). Assume that we only know about these that
the highest-level nodes are congruent to each other (T1.a == T 2.x); how T1.b, T1.c relate
to T2.y, T2.z is unknown. With these underspecified articulations in our TAP,
ambiguities arise. Therefore, we end up with seven possible worlds where PW1 and PW2 depict
how the concepts can be aligned congruently, PW3 to PW6 depict similar but subtle
differences on how T1.b, T1.c is included in or includes T2.y, T2.z, while PW7 depicts
T1.b, T1.c, T2.y, T2.z all overlapping each other (red dashed lines).
4
      </p>
    </sec>
    <sec id="sec-4">
      <title>Method</title>
      <p>
        It is well-known that the basic taxonomic ranks in biological classification include
kingdom, phylum, class, order, family, genus, and species. In this research, we are mainly
focusing on the alignment of the family-level flowering plants. Among the 295,383
flowering plants species [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] within the Magnoliophyta phylum sec. Cronquist, or the
Angiosperms sec. APG IV, we are considering only the 40 most common families or
subfamilies out of a total of 416 flower families [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. These families include
Magnoliaceae, Ranunculaceae, Papaveraceae, Cataceae, Betulaceae, Fabaceae, Rosaceae, just
to name a few.
      </p>
      <p>Both the Cronquist systems and the APG system have these 40 families or some
modifications to the names of these families in their classifications. Each family serves
as a concept in our alignment. In our first alignment study, if the family in both systems
shares the exact same name, we assume (possible incorrectly) that they are congruent
to each other. If there are similar but different names, we will leave the concepts
unmapped at first. To be more specific, if concept X in the Cronquist system is exactly
the same as concept Y in the APG system, we will mark them as [C.X {=} APG.Y].
See Figure 2 for our initial input taxonomies. The following six articulations are the
ones that are uncertain to us because they have different names:
[C.Caesalpiniaceae ? APG.Caesalpinioideae]
[C.Mimosaceae ? APG.Mimosoideae]
[C.Fabaceae ? APG.Faboideae]
[C.Aceraceae ? APG.Sapindaceae]
[C.Sapindaceae ? APG.Sapindaceae]
[C.Hippocastanaceae ? APG.Sapindaceae]
ts
o
c
i
d
u
E
l
a
s
a
B
.</p>
      <p>G
P
A
s
d
li
o
n
g
a
M
.</p>
      <p>G
P
A
s
e
l
a
i
m
a
L
.</p>
      <p>G
P
A
e
d
a
l
C
d
ir
e
t
s
A
.</p>
      <p>G
P</p>
      <p>A
il.tsneaaenPGG il.scspaeaPADG ll.soaneaSPAG l.trsseeaAPAG
A
a
t
y
h
p
o
li
o
n
g
a
M
.</p>
      <p>C
1 3
8 7
ts
o
c
o
n
o
M
.</p>
      <p>G
P
A
s
e
l
a
li
o
n
g
a
M
.</p>
      <p>G
P
A
s
e
l
a
l
u
c
n
u
n
a
R
.</p>
      <p>G
P
A
s s
le le
iilhagp itrcabu
a u
.M .C
G G
P P
A A
s
il.rscsaeaB ll.vsaeaGM l.sabeaFG
G P P
P A A</p>
      <p>A
tems (green=Cronquist; yellow=APG), with the Fabaceae and the Sapinadaceae relationship
unknown on both sides (red dotted lines)</p>
      <p>
        Due to the above six uncertain relationships, we expect that the articulations
between the concepts of the two classifications in our initial input taxonomies are
underspecified and will result in numerous possible worlds. Furthermore, the bold
assumption to mark names that are spelled the same in both taxonomies as congruent is often
not enough, as noted in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. This is the reason we conducted a second round of
alignment to seek out consultation from a domain expert to verify our ‘congruent’
alignments as well as to sort out the underspecified articulations. The possible worlds we
have produced in the first stage thus became the minimum viable product for us to
communicate with the expert and let him/her grasp all possible solutions for the alignment
problem. The domain expert in our research asserted that the congruent alignments are
indeed correct and that the modified articulations for the Fabaceae and the Sapindaceae
families are as follows:
[C.Caesalpiniaceae {=} APG.Caesalpinioideae]
[C.Mimosaceae {=} APG.Mimosoideae]
[C.Fabaceae {=} APG.Faboideae]
[C.Aceraceae {&lt;} APG.Sapindaceae]
[C.Sapindaceae {&lt;} APG.Sapindaceae]
[C.Hippocastanaceae {&lt;} APG.Sapindaceae]
5
      </p>
    </sec>
    <sec id="sec-5">
      <title>Results</title>
      <p>The first round of alignment between the two classification systems resulted in 555
possible worlds—meaning that we have 555 different ways for reconciling these two
classifications (see Figure 3 for a few examples). The alignment resulted in so many
solutions because the articulations for the Fabaceae family (the beans family) and the
Sapindaceae family (the maple family) were left ambiguous and underspecified.</p>
      <p>In the Cronquist system, the three families placed under the order Fabales are:
Caesalpiniaceae sec. Cronquist
Mimosaceae sec. Cronquist
Fabaceae sec. Cronquist
Caesalpinioideae sec. APG IV
Mimosoideae sec. APG IV</p>
      <p>Faboideae sec. APG IV
While in the APG IV system, the order Fabales only consists of one family Fabaceae,
and under the Fabaceae there are three sub-families:</p>
      <p>
        First, the family/subfamily names between the two systems are not exactly the same,
the spelling is different in the suffix ( -ceae vs. -deae); furthermore, the ‘umbrella’
family Fabaceae in the APG system is spelled the same as one of the three bean families in
the Cronquist system (also Fabaceae). Similarly, we raise doubts on whether the
Sapindaceae in the Cronquist system is entirely equivalent to the Sapindaceae sec. APG
IV; though they share the exact same name, the Aceraceae sec. Cronquist, and
Hippocastanaceae sec. Cronquist, also in the order of Sapindales, were totally void in the APG
system. Therefore, in our initial attempt we could not firmly state what the articulations
among these families were, echoing the claim that “names are not enough” in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <sec id="sec-5-1">
        <title>C.Caryophyllaceae APG.Caryophyllaceae</title>
        <p>After consulting the domaiCn.Caeryoxphpylelalrest on our secondCr.Poortulacaceoaef alignm=ent, we AwPGe.Proertulaacbacelaee
und
to refine our alignments betweeC.nPoltyghonealetswo taxonomies and make the rel=ationshiApPGs.Paolmygoonancegae
the beaCn.Csaryfoaphmyllidialeies explicit. DespC.iStaelicatlehs e slight differenCc.ePoslygionnactehaee suffix, the three families
under Cronquist indeed were congruent to the three sCu.Sbali-cafcaeameilies wit=hin the AAPPG.GSalicaceae
sys</p>
      </sec>
      <sec id="sec-5-2">
        <title>C.Violales</title>
        <p>
          tem (Figure 4), and that the Sapindaceae sec. CronquisC.tV,iolAacecaeeraceae se=c. Cronquist, and
Hippocastanaceae sec. CronquCi.sCtapaparraelesall combined as one as Sapindaceae sec.AAPGP.ViolaceIaVe
G
now. C.Malvales C.Cucurbitaceae = APG.Cucurbitaceae
New developments, whether in technologies or in paradigms, always challenge the
views of past major knowledge organization systems. In the biodiversity domain,
molecular data when examined under careful sampling, can provide valuable pointers to
the classification of plants. However, to quote [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]: “... molecular characters are subject
to evolutionary convergence, parallelism, and reversal; therefore, molecular methods
are not a panacea. Molecular evidence should be used with, not in place of,
morphological evidence.” Though the APG system has shown more substantial importance in
recent years due to the advancement in micro-level analysis of the molecular data, the
Cronquist system still maintains its esteemed role for its comprehensiveness and
preciseness in morphologically classifying the flowering plants.
        </p>
        <p>This paper serves as an exploratory research on the comparison and alignment of
two KOS, specifically classification schemes. Our approach suggests that
classifications can coexist with each other while disambiguating the names among concepts in a
merged possible world. Our approach also demonstrates the capability of
computationally solving complex logic-based alignments for cases where, e.g., due to
underspecified relations in the input KOS alignments, manual efforts would likely fail to yield all
555 different ways to merge and reconcile the two KOS.
=
=
=
=
=
=
=
=
=
=
=</p>
      </sec>
      <sec id="sec-5-3">
        <title>APG.Oleaceae</title>
      </sec>
      <sec id="sec-5-4">
        <title>APG.Asclepiadaceae</title>
      </sec>
      <sec id="sec-5-5">
        <title>APG.Caprifoliaceae</title>
      </sec>
      <sec id="sec-5-6">
        <title>APG.Solanaceae</title>
      </sec>
      <sec id="sec-5-7">
        <title>APG.Asteraceae</title>
      </sec>
      <sec id="sec-5-8">
        <title>APG.Ericaceae</title>
      </sec>
      <sec id="sec-5-9">
        <title>APG.Cactaceae</title>
      </sec>
      <sec id="sec-5-10">
        <title>APG.Brassicaceae</title>
      </sec>
      <sec id="sec-5-11">
        <title>APG.Euphorbiaceae</title>
      </sec>
      <sec id="sec-5-12">
        <title>APG.Malvaceae</title>
      </sec>
      <sec id="sec-5-13">
        <title>APG.Fabaceae</title>
      </sec>
      <sec id="sec-5-14">
        <title>APG.Sapindales</title>
      </sec>
      <sec id="sec-5-15">
        <title>APG.Myrtales</title>
      </sec>
      <sec id="sec-5-16">
        <title>APG.Gentianales</title>
      </sec>
      <sec id="sec-5-17">
        <title>APG.Dipsacales</title>
      </sec>
      <sec id="sec-5-18">
        <title>APG.Solanales</title>
      </sec>
      <sec id="sec-5-19">
        <title>APG.Asterales</title>
      </sec>
      <sec id="sec-5-20">
        <title>APG.Ericales</title>
      </sec>
      <sec id="sec-5-21">
        <title>APG.Caryophyllales</title>
        <p>C.Portulacaceae
APG.Portulacaceae
C.Scrophulariales
APG.Lamiaceae
C.Lamiaceae
C.Lamiales</p>
        <p>C.Oleaceae
APG.Oleaceae
C.Scrophulariaceae
APG.Scrophulariaceae</p>
        <p>Nodes</p>
        <p>C
congruent
APG</p>
        <p>Edges
is_a (inferred)
is_a (input)
overlaps (inferred) 12
15
42
9
12
69
C.Hamamelidae
C.Rosidae
APG.RosidClade
APG.BasalEudicots
APG.Ranunculales
APG.Magnoliaceae
APG.Magnoliales
APG.Magnoli ds
C.Magnoliaceae
C.Magnoliales
APG.Asparagales</p>
        <p>C.Liliales
APG.Arecaceae
APG.Arecales
C.Arecaceae
C.Arecales
APG.Alismatales
APG.Araceae
C.Araceae
C.Arales</p>
        <p>APG.CoreEudicots
APG.Hamamelidaceae
APG.Saxifragales
C.Hamamelidaceae
C.Hamamelidales</p>
        <p>C.Fagales
APG.Fagales
APG.Fabaceae
APG.Fabales
C.Fabales
APG.Rosales
APG.Sapindaceae
APG.Sapindales
C.Sapindales
APG.Myrtales
APG.Onagraceae</p>
        <p>C.Myrtales
C.Onagraceae
APG.Malpighiales
APG.Malvaceae
APG.Malvales
C.Malvaceae
C.Malvales
C.Violales
APG.Apiaceae
APG.Apiales
C.Apiaceae</p>
        <p>C.Apiales
APG.Brassicaceae
APG.Brassicales
C.Brassicaceae
C.Capparales
APG.Ericaceae
APG.Ericales
C.Ericaceae
C.Ericales</p>
        <p>C.Asteridae
APG.Ranunculaceae
C.Ranunculaceae
C.Ranunculales
APG.Papaveraceae
C.Papaveraceae
C.Papaverales
APG.Orchidaceae
C.Orchidaceae
C.Orchidales
C.Iridaceae
APG.Iridaceae
APG.Liliaceae
APG.Liliales
C.Liliaceae</p>
        <p>C.Fagaceae
APG.Fagaceae
C.Betulaceae
APG.Betulaceae
C.Fabaceae
APG.Faboideae
C.Caesalpiniaceae
APG.Caesalpinioideae</p>
        <p>C.Mimosaceae
APG.Mimosoideae
APG.Moraceae
C.Moraceae
C.Urticales
APG.Rosaceae
C.Rosaceae</p>
        <p>C.Rosales
line=inferred overlapping relationship from Euler/X)</p>
        <p>Somewhat ironically, the limitation in this study also lies in the strong inferential
power—in which when a parent node only has one child, the RCC reasoner in Euler/X
will collapse the concepts and merge them as the same node. For example, Euler/X
derived that the family Ericaceae and the order Ericales are exactly the same and
merged Cronquist.Ericaceae, APG.Ericaceae, Cronquist.Ericales, and APG.Ericales as
congruent. However, we argue that this limitation is due to the fact that we have only
chosen some 40 major flower families instead of all 416 families. We could add missing
children or artificial children here, or include the full 416 angiosperms families, but this
is beyond the scope of our study. For the purposes of demonstrating the logic-based
taxonomy alignment approach, our smaller use case is sufficient and more tractable .</p>
        <p>It is also worth noting that domain expert opinions are still needed to differentiate
and lead us to the single solution we are looking for. We foresee our logic-based
approach for aligning KOS as an essential preliminary processing steps and a minimum
viable product before bringing the KOS interoperability alignment problems to the
domain experts. KOS alignment problems are usually complicated with a slow learning
curve; domain experts, in our case, plant systematists, may not fully comprehend at first
the reason we need to align different KOS. If we approach them directly with two
classifications and ask them one by one what relationships between each concept are, they
will probably feel befuddled by the situation. Once we have the Euler/X-generated
possible worlds in the first round of alignments, whether a few, tens, hundreds, or even
thousands of PWs, the alignment problem will become concrete to the domain experts
and consulting them for validation of the articulations would be much easier.</p>
        <p>This study, we also believe, has further implications on making our classification
systems more “full of beans” (here we take on the positive connotation, meaning full of
energy), meaning that it may open doors to enable semantic interoperability, and enrich
diversity in classification systems when we work with KOS alignments using the
logicbased RCC-5 approach. We believe that in the future we can implement this approach
for semantic interoperability issues among classifications in the information science
community, or even other higher-level KOS such as ontologies.
7</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgement</title>
      <p>The first author wishes to thank Dr. Stephen R. Downie for the wonderful
introduction to plant systematics. This research is the outcome of the course project of IB335
and the Independent Study taught by Dr. Downie. The authors also thank Dr. Nico M.
Franz and Ms. Ly Dinh for support and kind feedback on this research.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Bowker</surname>
            ,
            <given-names>G. C.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Star</surname>
            ,
            <given-names>S. L.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Sorting things out: Classification and its consequences</article-title>
          . MIT press.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Hodge</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          (
          <year>2000</year>
          ).
          <article-title>Systems of Knowledge Organization for Digital Libraries: Beyond Traditional Authority Files</article-title>
          .
          <source>Digital Library Federation, Council on Library and Information Resources</source>
          , 1755 Massachusetts Ave.,
          <string-name>
            <surname>NW</surname>
          </string-name>
          , Suite 500, Washington, DC 20036.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Franz</surname>
            ,
            <given-names>N. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peet</surname>
            ,
            <given-names>R. K.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Weakley</surname>
            ,
            <given-names>A. S.</given-names>
          </string-name>
          (
          <year>2008</year>
          ).
          <article-title>On the use of taxonomic concepts in support of biodiversity research and taxonomy</article-title>
          . Systematics Association Special Volume,
          <volume>76</volume>
          ,
          <fpage>63</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Cheng, Y.-Y.,
          <string-name>
            <surname>Franz</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scheider</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rodenhasen</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ludäscher</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Agreeing to disagree: reconciling conficting taxonomic views using a logic-based approach</article-title>
          .
          <source>In Proceedings of the ASIS&amp;T 2017 Annual Meeting</source>
          , Washington,
          <string-name>
            <surname>D.C.</surname>
          </string-name>
          , USA, October 27- November 1st. IDEALS: http://hdl.handle.
          <source>net/2142/97907</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Cronquist</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>1981</year>
          ).
          <article-title>An integrated system of classification of flowering plants</article-title>
          . Columbia University Press.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Byng</surname>
            ,
            <given-names>J. W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chase</surname>
            ,
            <given-names>M. W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Christenhusz</surname>
            ,
            <given-names>M. J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fay</surname>
            ,
            <given-names>M. F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Judd</surname>
            ,
            <given-names>W. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mabberley</surname>
            ,
            <given-names>D. J.</given-names>
          </string-name>
          , ... &amp;
          <string-name>
            <surname>Briggs</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>An update of the Angiosperm Phylogeny Group classification for the orders and families of flowering plants: APG IV</article-title>
          .
          <source>Botanical Journal of the Linnean Society</source>
          ,
          <volume>181</volume>
          (
          <issue>1</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>20</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Downie</surname>
            ,
            <given-names>S. R.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <source>Lecture on Historical Systematics. Personal Collection of S. R Downie</source>
          , University of Illinois at Urbana-Champaign, Champaign IL.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Judd</surname>
          </string-name>
          , W. S.,
          <string-name>
            <surname>Campbell</surname>
            ,
            <given-names>C. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kellogg</surname>
            ,
            <given-names>E. A.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Stevens</surname>
            ,
            <given-names>P. F.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Plant systematics. A phylogenetic approach</article-title>
          . Sinauer Associates, Sunderland, Mass., USA,
          <volume>464</volume>
          ,
          <fpage>3</fpage>
          -
          <lpage>4</lpage>
          . 3rd edition
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Vane-Wright</surname>
            ,
            <given-names>R.I.</given-names>
          </string-name>
          <year>2013</year>
          .
          <article-title>Taxonomy, Methods of</article-title>
          . In Levin S.A. (ed.),
          <source>Encyclopedia of Biodiversity (2nd edn)</source>
          <volume>7</volume>
          :
          <fpage>97</fpage>
          -
          <lpage>111</lpage>
          . Waltham, MA: Academic Press.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Christenhusz</surname>
            ,
            <given-names>M. J.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Byng</surname>
            ,
            <given-names>J. W.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>The number of known plants species in the world and its annual increase</article-title>
          .
          <source>Phytotaxa</source>
          ,
          <volume>261</volume>
          (
          <issue>3</issue>
          ),
          <fpage>201</fpage>
          -
          <lpage>217</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Franz</surname>
            ,
            <given-names>N. M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kianmajd</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bowers</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weakley</surname>
            ,
            <given-names>A. S.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Ludäscher</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2016</year>
          ).
          <article-title>Names are not good enough: Reasoning over taxonomic change in the Andropogon complex 1</article-title>
          .
          <source>Semantic Web</source>
          ,
          <volume>7</volume>
          (
          <issue>6</issue>
          ),
          <fpage>645</fpage>
          -
          <lpage>667</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Takhtadzhian</surname>
            ,
            <given-names>A. L.</given-names>
          </string-name>
          (
          <year>1997</year>
          ).
          <article-title>Diversity and classification of flowering plants</article-title>
          . Columbia University Press.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>