<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Mapping Biographical events to ODPs through Lexico-Semantic Patterns?</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>o Antonio Str</string-name>
          <email>marcoantonio.stranisci@unito.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>rio B</string-name>
          <email>valerio.basile@unito.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>] Ross</string-name>
          <email>rossana.damiano@unito.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>] Vivi</string-name>
          <email>viviana.patti@unito.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Dipartimento di Informatica, University of Turin</institution>
          ,
          <addr-line>C.so Svizzera 185</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we present a collection of semantically-encoded biographies of authors who were born in former colony countries from 1945. The data set relies on an ontology that represents the life of an author through the two key concepts of migration from birth place and legal status in a country, both modeled on two Ontology Design Patterns: Time Indexed Person Status and Basic Execution Plan. Together with the resource, we describe a pipeline to convert the textual biographies of the authors gathered from Wikipedia into the roles experienced by them in migrations. The pipeline includes modules for linguistic preprocessing and named entity recognition, and an entity linking step relying on Wikipedia and Wikidata APIs to link places and organizations to their respective countries. A set of lexico-semantic patterns based on verb classes from the Uni ed Verb Index has been developed in order to extract migration-related knowledge from unseen text biographies.</p>
      </abstract>
      <kwd-group>
        <kwd>Biography</kwd>
        <kwd>Immigration</kwd>
        <kwd>Pattern-based information extraction</kwd>
        <kwd>ODP</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Under-representation of non-Western people is an open issue with a long
tradition [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. Ethnic minorities su er this condition in crucial sectors of society, such
as schools [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] and media players [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Even collaborative projects seem to be
a ected by cultural [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] and gender biases. For instance, [
        <xref ref-type="bibr" rid="ref28">28</xref>
        ] observed that most
Wikipedia contributors are European and male, and this may have an in uence
on the creation of contents on this platform [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ].
      </p>
      <p>
        Our work addresses this topic by providing structured knowledge about
writers who su er a lack of representation on Wikipedia due to their ethnic
origin [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]. In this paper, we present a pipeline for the automatic extraction of
? Copyright © 2021 for this paper by its authors. Use permitted under Creative
      </p>
      <p>
        Commons License Attribution 4.0 International (CC BY 4.0).
biographical events from Wikipedia through the adoption of Lexico-Semantic
Patterns [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]; biographical events are semantically described by referring to the
Ontology Design Patterns (ODP) framework [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. The development of a mapping
from raw-text biographies to semantic categories is a preliminary step to linking
the literary production of under-represented writers to their lives.
      </p>
      <p>The paper is structured as follows. In Section 2 we discuss the Linked Data
projects that inspired our work, and review the state-of-the-art approaches to
event extraction and encoding. In Section 3 we present the Ontology of
UnderRepresented Writers, describing how we encoded their biographies through
recurrent semantic patterns, and how we modeled the interplay between the authors
and their places of birth. In Section 4, we present the pipeline for the automatic
extraction of biographical events through Lexico-Semantic patterns. Finally, in
Section 5, we analyze, and evaluate results. A discussion about open issues and
future work concludes the paper.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>
        In recent years, thanks to the availability of sources in a digital form, a new
interest in the study of biographies has arisen is literary, cultural and historical
studies. In particular, three existing Knowledge Graphs share many similarities
with ours: the Orlando project1, Enslaved2, and WeChangEd3. The Orlando
Project is a collection of biographies of 1; 300 British women writers; Enslaved is
a data set of 509; 783 people of the historical slave trade developed from 8
preexisting archives; WeChangeEd is a collection of 1; 800 female editors born between
1710 and 1920, aligned with Wikidata. All these data sets rely on Semantic Web
technologies [
        <xref ref-type="bibr" rid="ref21 ref22 ref27">22,21,27</xref>
        ], which are used to represent socio-demographic
information about the individuals, such as ethnicity, family relationships, and social
status.
      </p>
      <p>The URW project has a similar perspective to these projects in terms of
aiming to represent a group of persons sharing a speci c condition. However,
the concept of \being under-represented" is challenging to model, because it
has blurred boundaries and it can be very subjective. Our project intentionally
does not rely on a taxonomy of ethnicities, choosing instead to fully describe the
interplay between a person and the places where they lived their life, in order to
avoid a Western representation of non-Western writers biographies.</p>
      <p>
        Several approaches aimed at encoding and annotating events have been
proposed in the last years. Despite the common representational goal, these
approaches vary signi cantly, since events can be formalized at di erent levels of
granularity. The Biography Ontology [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], part of an ontology network within the
TrendMiner project [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ], models biographical events as time-dependent knowledge
by directly adding temporal arguments to the materialised triples.
      </p>
      <sec id="sec-2-1">
        <title>1 http://www.artsrn.ualberta.ca/orlando/</title>
      </sec>
      <sec id="sec-2-2">
        <title>2 https://enslaved.org/</title>
      </sec>
      <sec id="sec-2-3">
        <title>3 https://www.wechanged.ugent.be/</title>
        <p>
          Other works analyze events at a word level. The ACE/ERE projects [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ][
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]
rely on the identi cation of the events through the use of a lexical `Trigger'. The
TimeML annotation scheme [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] has been speci cally designed for identifying
all the temporal expressions in a text, and annotating the chronological relation
between them. The Richer Event Description (RED) framework [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] simpli es
the taxonomy of events proposed in TimeML, but adds information about the
causal relations over them.
        </p>
        <p>
          Biographical event extraction from raw text is the subject of works relying
on Wikipedia as a source of knowledge. The Pantheon 1.0 data set [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ] is a
collection of 11,341 biographies available in more than 25 languages in Wikipedia.
Individuals in the data set have been categorized according their occupation by
using a controlled vocabulary relying on Freebase. Information about the number
of page views for each biography is provided as a way to measure its popularity.
        </p>
        <p>
          Other projects have attempted to extract time and geographical information
from biographical texts. Russo et al. [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] collected 782 biographies of people
deported to Nazi concentration camps, extracting relevant dates and places of their
lives. Then, all information has been arranged into a structured representation
by using the TimeML framework [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. The RAMBLE ON application [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] takes
as input a biographical raw text, and automatically detects Motion frames [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]
together with the georeferencing of each place mentioned in frames.
        </p>
        <p>
          Our proposal aims at extracting geographical knowledge and life events jointly,
to provide a semantic model for representing biographies. Unlike existing
approaches, which are focused on detecting the lexical entries triggering an event [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ],
our work provides a mapping between the textual and the semantic level.
Biographical patterns, encoded by adopting the ODP framework, are populated
extracting semantic knowledge from raw text biographies.
3
        </p>
        <p>A Semantic Model for Under-Represented Writers
The semantic model is designed with the purpose of providing a formal and
objective description of authors who are potentially under-represented due to
the context where they were born. In particular, it encodes biographical events
and situations in which a dul:Person is: (i) a writer and (i) has experienced
the condition of being under-represented. In this way, a correlation between
biographical events and literary production of under-represented authors can be
drawn, and employed to gain insight on the motivations and themes re ected in
their narratives. The main components of this semantic model are: the condition
of being under-represented and the identi cation of objective criteria to classify
countries which correlate with this condition.</p>
        <p>
          Biographical patterns. According to our formalization, a writer who is
underrepresented is a person who published one or more literary works, and may have
experienced the process of migrating, intended as the movement from a country
to another, and the condition of living in a given country after leaving one's
place of birth. Within the latter situation, the author's legal or professional
status may be expressed. Our solution to encode these situations draws from
the ODP framework, which provides foundationally sound, re-usable building
blocks for representing common patterns across ontologies. More speci cally,
the urw:Migration pattern refers to the BasicPlanExecution ODP [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], since a
migration represents the execution of a intentionally devised line of action. The
legal status of a person, urw:TimeIndexedPersonStatus (TIPS), relies on
the TimeIndexedPersonRole ODP [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ], since this condition is typically subject to
change and can be modelled as time-bounded role. As can be seen in Figure 1 and
2, both Migration and TIPS describe situations that are the setting for an entity
of the type dul:Person, which refers to person according to the commonsense
intuition, with a dul:Role. A role in a TIPS is a urw:ConditionRole, de ned
by one or more urw:Conditions, such as being a foreign student, a worker, a
refugee. Since multiple conditions could co-occur in de ning a role, each of them
has setting in a separate dul:Classification situation. The urw:Migration
Role in the Migration pattern is de ned by a urw:MigrationReason, namely
the reason of the plan of migrating (e.g.: eeing war, seeking for a job). Both
situations are time-indexed, and take place in one or more speci c urw:Place.
Integration with Existing Resources. In addition to the Migration and
TIPS patterns, existing resources have been integrated in the semantic model:
geographical resources for identifying the countries correlated with the lack of
representation, and linguistic resources for mapping raw text biographical facts
to the ontology. In fact, the TIPS and Migration patterns do not provide
themselves a criterion to identify the under-representation, since they only portray
the condition of living outside one's country. However, an author such as Italo
Calvino, who was born in Italy and moved to France during his life should not
be considered as under-represented, since his birthplace was a wealthy European
country. Hence, three indicators have been encoded in the ontology to identify
a country as under-represented:
{ the country's colonial past;
{ its Human Development Index (HDI)4, a measure of the the global
development of countries provided by the United Nations;
{ its mobility score5, namely the number of countries where a person could
travel with the passport of the country.
        </p>
        <p>In our formalization, an under-represented country must be a former colony, it
must have a medium or lower HDI (below 0:8), and it must fall within the second
half of the ranking of countries by mobility score. The Named Authority List of
countries maintained by the European Union6, an authoritative, comprehensive,
and multilingual reference for country names, has been used to standardize and
index all these sources of geographical knowledge.</p>
        <p>
          Concerning the linguistic resources, we rely on the Ontolex-Lemon model,
which [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] plays the function of mapping the morphological and syntactic
properties of lexical entries to the semantic categories expressed by OWL classes. The
use of this models facilitates the process of converting the raw text of the authors'
biographies into RDF triples by maintaining the lexico-semantic information in
the nal representation, as described in Section 4.
        </p>
        <p>
          Finally, the PROV-O Ontology [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] is a standard to express the provenance
information of a work. In the context of our research, this model is used to identify
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>4 http://hdr.undp.org/en/content/human-development-index-hdi</title>
      </sec>
      <sec id="sec-2-5">
        <title>5 https://www.passportindex.org/</title>
        <p>6 https://op.europa.eu/en/web/eu-vocabularies/dataset/-/resource?uri=http:
//publications.europa.eu/nresource/dataset/country
the LSPs as prov:SoftwareAgent, and the textual Wikipedia biographies as
the source of knowledge from which biographical patterns have been derived.
4</p>
        <p>From Ontology Patterns to Lexico-Semantic Patterns
Before collecting the biographies from Wikipedia, under-represented writers have
been identi ed through the occupation Wikidata property (WDT:P106). Each
person who worked as a writer, novelist, or poet has been collected and classi ed
by retrieving the country of origin associated to her/his birthplace (WDT:P19).
For each author, the biography in English language, if present, has been retrieved
from Wikipedia. The total amount of collected person entities is 114; 675. Writers
who were born from 1945 on, in any Asian or African under-represented country
(see Section 3) have been chosen to highlight only on biographies of people who
experienced or born after the Decolonization process.</p>
        <p>
          Starting from this initial corpus, a pipeline to convert raw texts biographies
in TIPS, and Migration classes based on Lexico-Semantic Patterns (LSP) [
          <xref ref-type="bibr" rid="ref4 ref6">6,4</xref>
          ]
has been developed. LSPs are rules composed of semantic and syntactic elements
related to classes and properties of an ontology. When a rule matches a string of
text, the ontology is automatically populated with one or more RDF triples. An
example of a Lexico-Semantic Pattern, created to extract geographical
information from text, is the following [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]:
        </p>
        <p>The rule $subject : Concept COMP RB? IN? $object : Concept matches
the phrase Administrative territory of Prague is divided into localities
retrieving a mereological relation between Prague, and localities to be
stored in an ontology.</p>
        <p>Our pipeline is based on three steps: text parsing, LSP development,
Information Extraction.
Text parsing. Using the SpaCy library7, each biography has been split in
sentences, and only the ones containing at least one entity of the type Organization
(ORG), or Geopolitical Entity (GPE) have been stored in JSON format, together
with the name of the author, and her/his country and year of birth. Below, there
is an example of an item, in JSON format, referring to the Nigerian writer, and
radio presenter Dotun Adebayo:
author: Dotun Adebayo,
birthPlace: Nigeria,
birthYear: 1960,
places: [(Stationers' Company's Comprehensive School,
ORG),(Stockholm University)],
sentence: He then went on to Stationers' Company's
Comprehensive School in Hornsey, North London, followed by</p>
        <p>Stockholm University, where he studied Literature.</p>
        <p>In parallel, each ORG and GPE has been linked with the respective country.
All the strings identi ed as geopolitical entities or organizations by the SpaCy
Named Entity Recognition module have been used as an input for search through
the Wikipedia API. The rst 10 results of the search have been subsequently
analyzed, and, among them, the rst candidate that holds the Wikidata property
`country' (WDT:P17) has been selected, if any. Only the 25; 554 sentences
containing an ORG or GPE belonging to di erent countries than the birth country
of an author have been selected for the next step.</p>
        <p>LSP development. After the sentences have been collected, a random subset of
them has been analyzed in order to de ne LSP rules for encoding the biographic
facts contained in the raw text into the two main patterns of the URW ontology:
urw:TimeIndexedPersonCondition (TIPS), and urw:Migration.</p>
        <p>Given the structure of these patterns, three key elements have been
identied as necessary in an input sentence to make it a candidate trigger: a verb
expressing a change of place or a condition (e.g.: eeing a country, obtaining
a graduation), a preposition, and an entity of the type Organization (ORG)
or Geo Political Entity (GPE) belonging to a di erent country from the place
of birth. For instance (see Figure 4), the elements in bold face in the sentence
\He [Dotun Adebayo] then went on to Stationers' Company's
Comprehensive School in Hornsey, North London." match the pattern
(escape-51.11)(to|at|in)(GPE|ORG). So, from this rule, the following RDF triples are
extracted:
[ a urw:Migration;
dul:isSettingFor [
a urw:MigrationRole;
dul:isDefinedIn Study_Abroad
dul:isRoleOf Dotun_Adebayo.</p>
      </sec>
      <sec id="sec-2-6">
        <title>7 https://spacy.io/</title>
        <p>
          The subsequent step in the de nition of the LSPs has been the clustering of
verbs through a mapping to general verb types, aimed at reducing the number
of patterns and increasing their recall. To do so, we employed the Uni ed Verb
Index8, a repository resulting from the mapping of several lexical resources that
provides syntactic and semantic frames of English verbs. In particular, we linked
the verbs in our data to the VerbNet classes in Uni ed Verb Index (UVI) [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ].
In the previous example, the relevant class for mapping movement verbs onto
the Migration ontology patters is escape-51.1-1-1, which includes the following
lemmas: depart, disembark, escape, exit, ee, leave, vacate.
        </p>
        <p>
          As anticipated in Section 3, the mapping between LSPs and VerbNet classes
is expressed in the ontology through the Ontolex-Lemon speci cation [
          <xref ref-type="bibr" rid="ref10 ref11">10,11</xref>
          ].
According to this model, each verb is an ontolex:LexicalEntry with a
corresponding set of ontolex:LexicalSenses (WordNet o sets [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]), which represent
the lexicalized sense of the ontolex:LexicalConcept, namely the VerbNet
class. The ontolex:LexicalConcept is the bridge between the lexical entries
and the ontology classes. For instance, the ontolex:LexicalEntry lex leave
        </p>
      </sec>
      <sec id="sec-2-7">
        <title>8 https://uvi.colorado.edu/</title>
        <p>has a corresponding ontolex:LexicalSense which is the v#2009433
WordNet o set. The latter is one of the possible lexicalization of the escape-51.1-1-1
VerbNet class, which is the ontolex:Concept.</p>
        <p>Information Extraction. After the creation and re nement of the LSPs, 53
rules of the form:</p>
        <p>VerbNet class $preposition GPE|ORG
have been formulated and applied to the annotated sentences.</p>
        <p>The following is an example of how the same LSP matches sentences with
different verbs and preposition, and encodes them as urw:TimeIndexedPerson
Status:</p>
        <p>LSP: obtain-13.5.2 fromjforjatjbyjinjas GPEjORG
Ajunwa9 received her BA at University of California, Davis in
2003.</p>
        <p>He held a master's degree in Theatrical Directing which he obtained
from the University of So a
5</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Analysis and evaluation of the results</title>
      <p>From the life events encoding pipeline (Section 4) 12; 147 sentences containing
an instance of TIPS, Migration, or both have been obtained.10
Some preliminary statistics can help to assess the relevance of the data stored
in the KG. The resulting Knowledge Graph includes 2; 618 di erent authors'
biographies, place of birth, and year of birth. 1; 638 of these authors were born
in Asia, 980 in Africa. In total, 39; 167 RDF triples have been stored in the data
set.</p>
      <p>In order to test the precision of the LSP (Lexico-Syntactic Patterns), we
manually evaluated a random sample of 2; 555 sentences, which correspond to the
10% of sentences containing at least one GPE or ORG di erent from the author
country of birth. Each sentence was labelled as expressing a `TIPS|Migration'
(48:5%) or `None' (51:5%), then compared with the patterns.</p>
      <p>Table 1 shows that LSPs performed with a precision of 0:68. The manual
analysis of prediction errors revealed they had several causes. In some cases,
the subject was not the author but another person (e.g., `His father left India</p>
      <sec id="sec-3-1">
        <title>9 https://en.wikipedia.org/wiki/Ifeoma Ajunwa</title>
        <p>
          10 The rst version of the data set is publicly available on the GitHub
repository of the Under-Represented Writers project https://w3id.org/
UnderRepresentedWritersOntology/. 10; 569 sentences expressing at least one
TIPS were detected, 3; 549 with Migration patterns. In 1; 971 cases both are present
in the same sentences.
in early 1963 to study at Oxford University'). Another source of error is the
presence of reported speech of the writer (e.g., `Members of her African
audience have asserted that Thiam does not understand why women may support
FGM'). Finally, both the NER and the entity linking pipeline seem to introduce
false positives (e.g., in the phrase `Shatrughan Sinha, has also spoken in Kumar's
favour on Twitter', Twitter is marked as an organization in the United States).
It is important to mention an imbalance in the performance of the two
biographical patterns: Migration situations are retrieved with a precision of 0:805,
in line with recent ndings from the literature [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ], while precision for TIPS is
0:665. This di erence is probably due to the nature of the latter pattern, which
is highly heterogeneous and needs a deeper analysis to specialize it into speci c
patterns for di erent status types. In order to investigate the low performance of
the TIPS LSP, we conducted a closer analysis of the situations encompassed by
the TIPS pattern. The results (Table 2) show that the type of status described
in the sentences that matched this pattern is varied: it can refer to occupation
(39:2% of the manually evaluated cases), publications (17:8%), education (12%),
awards (8:5%), or involvement in social causes (8%). Since these situation types
are highly consistent with the URW domain, this preliminary categorization
suggests that more speci c rules are needed to encode this information together
with a deeper speci cation of TIPS within the ontology, and that this ability to
discriminate will improve the performance.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions and Future Work</title>
      <p>In this paper we presented a pipeline to extract life events of writers born in an
Asian or African Former Colony Countries from 1945 onwards from Wikipedia
biographies through Lexico-Semantic Patterns. At the present stage, the data
set includes 12; 147 biographical events about 2; 618 authors.</p>
      <p>A manual evaluation of a sample of the results showed a good precision of
Lexico-Semantic Patterns. However, some rules need to be further specialized in
order to extract a taxonomy of TIPS-related conditions. Despite these
limitation, it is important to underline that a pipeline based on a small set of rules has
produced a relatively large corpus, from which holistic knowledge about life's
narratives can be extracted, and generalized to other types of biographies.
Future works must take into account the chronological arrangement of Migration
and TIPS patterns within a whole biography, and generalize Lexico-Semantic
Patterns to other categories of under-represented people { ethnic minorities and
second generation migrants, people with other occupations { which can be
collected in the URW Knowledge Graph.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Baker</surname>
            ,
            <given-names>C.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fillmore</surname>
            ,
            <given-names>C.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lowe</surname>
            ,
            <given-names>J.B.</given-names>
          </string-name>
          :
          <article-title>The Berkeley Framenet project</article-title>
          .
          <source>In: 36th Annual Meeting of the ACL and 17th Int. Conf. on Computational Linguistics</source>
          , Volume
          <volume>1</volume>
          . pp.
          <volume>86</volume>
          {
          <issue>90</issue>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Doddington</surname>
            , G., Mitchell,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Przybocki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramshaw</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Strassel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weischedel</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>The Automatic Content Extraction (ACE) Program { Tasks, Data, and Evaluation</article-title>
          .
          <source>In: Proceedings of the 4th Int. Conf. on Language Resources and Evaluation (LREC'04)</source>
          . ELRA, Lisbon, Portugal (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Fellbaum</surname>
          </string-name>
          , C. (ed.):
          <article-title>WordNet: An Electronic Lexical Database</article-title>
          . Language, Speech, and Communication, MIT Press, Cambridge, MA (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Frasincar</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Borsje</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Levering</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>A semantic web-based approach for building personalized news services</article-title>
          .
          <source>International Journal of E-Business Research (IJEBR) 5</source>
          (
          <issue>3</issue>
          ),
          <volume>35</volume>
          {
          <fpage>53</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Gangemi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Presutti</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Ontology design patterns</article-title>
          . In: Handbook on ontologies, pp.
          <volume>221</volume>
          {
          <fpage>243</fpage>
          . Springer (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>IJntema</surname>
          </string-name>
          , W.,
          <string-name>
            <surname>Sangers</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hogenboom</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frasincar</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>A lexico-semantic pattern language for learning ontology instances from text</article-title>
          .
          <source>Journal of Web Semantics</source>
          <volume>15</volume>
          ,
          <issue>37</issue>
          {
          <fpage>50</fpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Krieger</surname>
            ,
            <given-names>H.U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Declerck</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Tmo|the Federated Ontology of the TrendMiner Project</article-title>
          . In: LREC. pp.
          <volume>4164</volume>
          {
          <issue>4171</issue>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Krieger</surname>
            ,
            <given-names>H.U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Declerck</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>An OWL ontology for biographical knowledge. representing time-dependent factual knowledge</article-title>
          .
          <source>In: BD</source>
          . pp.
          <volume>101</volume>
          {
          <issue>110</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Lebo</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sahoo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McGuinness</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Belhajjame</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cheney</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Corsar</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garijo</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Soiland-Reyes</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zednik</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Prov-o: The PROV ontology</article-title>
          .
          <source>Tech. rep., World Wide Web Consortium</source>
          (
          <year>2013</year>
          ), https://www.w3.org/TR/prov-o/
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>McCrae</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montiel-Ponsoda</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cimiano</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Integrating WordNet and Wiktionary with Lemon</article-title>
          .
          <source>In: Linked Data in Linguistics</source>
          , pp.
          <volume>25</volume>
          {
          <fpage>34</fpage>
          . Springer (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>McCrae</surname>
            ,
            <given-names>J.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bosque-Gil</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gracia</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Buitelaar</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cimiano</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>The OntolexLemon model: development and applications</article-title>
          .
          <source>In: eLex 2017</source>
          . pp.
          <volume>19</volume>
          {
          <issue>21</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Menini</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sprugnoli</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Moretti</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bignotti</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tonelli</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lepri</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          : RAMBLE ON:
          <article-title>Tracing movements of popular historical gures</article-title>
          .
          <source>In: Software Demonstrations of the 15th Conf. of EACL</source>
          . pp.
          <volume>77</volume>
          {
          <issue>80</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Mikander</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , et al.:
          <article-title>Westerners and others in Finnish school textbooks</article-title>
          . University of Helsinki, Institute of Behavioural Sciences,
          <article-title>Studies in Education (</article-title>
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Nishikawa</surname>
            ,
            <given-names>K.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Towner</surname>
            ,
            <given-names>T.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Clawson</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Waltenburg</surname>
            ,
            <given-names>E.N.</given-names>
          </string-name>
          :
          <article-title>Interviewing the interviewers: Journalistic norms and racial diversity in the newsroom</article-title>
          .
          <source>The Howard Journal of Communications</source>
          <volume>20</volume>
          (
          <issue>3</issue>
          ),
          <volume>242</volume>
          {
          <fpage>259</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>O</given-names>
            <surname>'Gorman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Wright-Bettner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Palmer</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Richer Event Description: Integrating event coreference with temporal, causal and bridging annotation</article-title>
          .
          <source>In: Proc. of the 2nd Workshop on Computing News Storylines (CNS</source>
          <year>2016</year>
          ). pp.
          <volume>47</volume>
          {
          <issue>56</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Presutti</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gangemi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Content ontology design patterns as practical building blocks for web ontologies</article-title>
          .
          <source>In: Int. Conference on Conceptual Modeling</source>
          . pp.
          <volume>128</volume>
          {
          <fpage>141</fpage>
          . Springer (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Pustejovsky</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Castano</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ingria</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sauri</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gaizauskas</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Setzer</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.R.:</given-names>
          </string-name>
          <article-title>TimeML: Robust speci cation of event and temporal expressions in text</article-title>
          .
          <source>New directions in question answering 3</source>
          , 28{
          <fpage>34</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Russo</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Caselli</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Monachini</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Extracting and Visualising Biographical Events from Wikipedia</article-title>
          .
          <source>In: BD</source>
          . pp.
          <volume>111</volume>
          {
          <issue>115</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Saeeda</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Med</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ledvinka</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blasko</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kremen</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Entity linking and lexico-semantic patterns for ontology learning</article-title>
          . In: Harth,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Kirrane</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Ngonga</surname>
          </string-name>
          <string-name>
            <surname>Ngomo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.C.</given-names>
            ,
            <surname>Paulheim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Rula</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Gentile</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.L.</given-names>
            ,
            <surname>Haase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Cochez</surname>
          </string-name>
          , M. (eds.)
          <article-title>The Semantic Web</article-title>
          . pp.
          <volume>138</volume>
          {
          <fpage>153</fpage>
          . Springer, Cham (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Schuler</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.K.: VerbNet: A Broad-Coverage</surname>
          </string-name>
          ,
          <article-title>Comprehensive Verb Lexicon</article-title>
          .
          <source>Ph.D. thesis</source>
          , University of Pennsylvania (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Shimizu</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hitzler</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirt</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rehberger</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Estrecha</surname>
            ,
            <given-names>S.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Foley</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sheill</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hawthorne</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mixter</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Watrall</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , et al.:
          <article-title>The Enslaved ontology: Peoples of the historic slave trade</article-title>
          .
          <source>Journal of Web Semantics</source>
          <volume>63</volume>
          ,
          <issue>100567</issue>
          (
          <year>2020</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Simpson</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brown</surname>
          </string-name>
          , S.:
          <article-title>From XML to RDF in the Orlando Project</article-title>
          .
          <source>In: 2013 Int. Conf. on Culture and Computing</source>
          . pp.
          <volume>194</volume>
          {
          <fpage>195</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Song</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bies</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Strassel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Riese</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mott</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ellis</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wright</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kulick</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ryant</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ma</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>From light to rich ERE: Annotation of Entities, Relations, and Events</article-title>
          .
          <source>In: Proc. of the the 3rd Workshop</source>
          on EVENTS: De nition, Detection, Coreference, and Representation. pp.
          <volume>89</volume>
          {
          <issue>98</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Spivak</surname>
          </string-name>
          , G.C.
          <article-title>: Can the subaltern speak?</article-title>
          <source>Die Philosophin</source>
          <volume>14</volume>
          (
          <issue>27</issue>
          ),
          <volume>42</volume>
          {
          <fpage>58</fpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Stranisci</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Patti</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Damiano</surname>
          </string-name>
          , R.:
          <article-title>Representing the Under-Represented: a Dataset of Post-Colonial, and Migrant Writers</article-title>
          .
          <source>In: 3rd Conference on Language, Data and Knowledge (LDK</source>
          <year>2021</year>
          ).
          <article-title>Schloss Dagstuhl-Leibniz-Zentrum fur Informatik (</article-title>
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Peng</surname>
          </string-name>
          , N.:
          <article-title>Men are elected, women are married: Events gender bias on Wikipedia</article-title>
          .
          <source>In: Proc. of the 59th Annual Meeting of the ACL and the 11th International Joint Conference on Natural Language Processing</source>
          (Vol.
          <volume>2</volume>
          )).
          <source>ACL</source>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Van Remoortel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Birkholz</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alesina</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bezari</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>D'Eer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Forestier</surname>
          </string-name>
          , E.: Women editors in europe.
          <source>Journal of European Periodical Studies</source>
          <volume>6</volume>
          (
          <issue>1</issue>
          ), 1{
          <issue>6</issue>
          (
          <year>2021</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>A.Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ronen</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hidalgo</surname>
            ,
            <given-names>C.A.</given-names>
          </string-name>
          :
          <article-title>Pantheon 1.0, a manually veri ed dataset of globally famous biographies</article-title>
          .
          <source>Scienti c data 3(1)</source>
          ,
          <volume>1</volume>
          {
          <fpage>16</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>