<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic National Biography of Finland</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Eero Hyvo¨ nen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Petri Leskinen</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Minna Tamper</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jouni Tuominen</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kirsi Keravuori</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Finnish Literature Society</institution>
          ,
          <addr-line>SKS</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>HELDIG - Helsinki Centre for Digital Humanities, University of Helsinki</institution>
          ,
          <country country="FI">Finland</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Semantic Computing Research Group (SeCo), Aalto University</institution>
          ,
          <country country="FI">Finland</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>The Vision: Biographies as Linked Data</institution>
        </aff>
      </contrib-group>
      <abstract>
        <p>This paper presents the vision of publishing and utilizing textual biographies as Linked (Open) Data on the Semantic Web. As a case study, we publish the live stories of the National Biography of Finland, created by the Finnish Literature Society, as semantic, i.e., machine “understandable” metadata in a SPARQL endpoint using the Linked Data Finland (LDF.fi) service. On top of the data service various Digital Humanities applications are built. The applications include searching and studying individual personal histories as well as historical research of groups of persons using methods of prosopography. The biographical data is enriched by extracting events from unstructured and semi-structured texts, and by linking entities internally and to external data sources. A faceted semantic search engine is provided for filtering groups of people from the data for prosopographical research. An extension of the event-based CIDOC CRM ontology is used as the underlying data model, where lives are seen as chains of interlinked events populated from the data of the biographies and additional data sources, such as museum collections, library databases, and archives.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Nouvelle Biographie ge´ne´rale [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], Biography Portal of the Netherlands [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ],
BiographyNet [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], and Dictionary of Swedish National Biography [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. While the
biographical dictionaries had an unconcealed nationalist agenda well into the 20th century, the
contemporary on-line national biographies have all made an effort to include groups
previously ignored by national history. Pioneering women in all fields are included, as
well as many marginal and minority groups.
      </p>
      <p>The National Biography of Finland started out as an on-line publication in 1997, a
decade before its publication as a 10-volume book series was completed. A selection of
its 6 000 lives was published on-line in Swedish as Biografiskt lexikon fo¨r Finland. In
addition to the national Biography of Finland, the Biographical Centre of the Finnish
Literature Society has published several other peer-reviewed biographical collections
on-line, such as the Finnish Business Leaders, the Finnish Clergy (1552–1920), and the
Finnish Generals and Admirals in the Russian Armed Forces (1809–1917).</p>
      <p>Even if lots of biographical information is available online for humans to read and
interpret, the information is seldom available as machine readable data for 1) Digital
Humanities research and 2) to be used in Cultural Heritage (CH) portals, such as
Europeana4 and Digital Public Library of America5, or in CH applications for the public.
Furthermore, the information is distributed in different national data silos using
heterogeneous formats and is written in different languages. This makes aggregation and
reuse of biographical data challenging.</p>
      <p>A biographical data source can be used to address various research questions from
perspectives of different disciplines and nations, such as:
1. What kind of persons and institutions are actually included in the various national
biographical dictionaries?
2. How do countries portray their “heroes”; is there a place for people that belong
to minorities or opposition groups? Are women portrayed in a different way than
men? What about race and ethnicity? What qualifications are used for selecting the
portrayed individuals?
3. Which disciplines are the scholars representing? Are there national differences?</p>
      <p>What disciplines are considered to be of international value?
4. What kind of persons are included in the corpora at different times? What topics
are considered to be breakthroughs on the long term?
5. What can we learn about the historical groups and institutions — social, religious,
political, etc. — by analyzing the biographical details of their members? What is
the nature of the networks that existed among them?</p>
      <p>To address questions like these, biographies as data are needed, in many cases linked
across nations and languages. This can only be done in multi-disciplinary
collaboration between humanists, computer scientists, and linguists. There is a need for methods
to transform semi-structured biography entries and unstructured texts into structured
forms. We need methods to represent knowledge in an interoperable way across
language barriers, and tooling for data analysis, visualization, and knowledge discovery.</p>
    </sec>
    <sec id="sec-2">
      <title>4 http://www.europeana.eu/portal/en 5 http://dp.la</title>
      <p>
        This paper focuses on Finnish biographies of persons of national importance,
selected and edited by the scholarly editorial board of the National Biography of Finland
and four additional collections: Business Leaders, the Finnish Clergy (1554–1721) and
(1800–1920), and the Finnish Generals and Admirals in the Russian Armed Forces
(1809–1917). In our earlier work [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] on the Semantic National Biography of Finland
(SNBF), we addressed the research question: How can the reading experience of
biographies be enhanced using web technologies? As a solution approach, the idea of
data linking and a spatiotemporal visualization based on an interconnected timeline and
a geographical map view were presented. To continue and complement this work, this
paper presents:
1. An extended aggregated collection of biographies from different databases.
2. A new datamodel Bio CRM for representing the biographical data.
3. A new knowledge extraction pipeline for mining entity references and events from
unstructured and semi-structured texts.
4. A faceted search engine for searching the biographies/persons.
5. New data analysis and visualization tools for research on groups of persons.
      </p>
      <p>We first present the datasets and data model underlying the new version of SNBF.
Then the process of transforming biographies into data is discussed. As end-user
perspectives to the data, a faceted search and browsing application is presented, with
additional data analysis and visualization tools for biographical research based on filtered
datasets. In conclusion, contributions of our work are summarized, related work
discussed, and directions for further research are outlined.
2</p>
      <sec id="sec-2-1">
        <title>Data Model and Datasets</title>
        <p>
          To enrich and link biographical data with related datasets, the data must be made
semantically interoperable, either by data alignments (using, e.g., Dublin Core and the
dumb down principle) or by data transformations into a harmonized form [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. Since
biographies are based on life events we selected the data harmonization approach and
the event-centric CIDOC CRM6 [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] ISO standard as the ontological basis in our case
study. To adapt CIDOC CRM for biographical data it was first extended to a model
we call “Bio CRM”. This model was then populated by instance data from different
biographical databases.
        </p>
        <p>
          Bio CRM7 [
          <xref ref-type="bibr" rid="ref33">33</xref>
          ] is a semantic data model for harmonizing and interlinking
heterogeneous biographical information from different data sources. It is a domain specific
extension of CIDOC CRM, effectively providing compatibility with other cultural
heritage information, too. A natural choice for modeling life stories is the event-based
approach where a person’s life is seen as a sequence of spatiotemporal, possibly
interlinked events from birth to death. The data model includes structures for basic data of
people, personal relations, professions, and events with participants in different roles.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>6 http://www.cidoc-crm.org 7 http://ldf.fi/schema/bioc/</title>
      <p>
        Bio CRM makes a distinction between enduring unary roles of actors, their
enduring binary relationships, and perduring events, where the participants can take different
roles modeled as a role concept hierarchy. Bio CRM provides the general data model for
biographical datasets. The individual datasets may concern different cultures, time
periods, or are collected by different researchers that may introduce extensions for
defining additional event and role types. For representing the roles of actors, we chose an
VIVO/BFO-inspired8 [
        <xref ref-type="bibr" rid="ref32">32</xref>
        ], intuitive, and simple approach where specific roles are
instantiated from the role classes, and connected to actors with the property ”inheres in”.
      </p>
      <p>
        The Bio CRM model can be used as a basis for semantic data validation and
enrichment by reasoning. The data model aims to support principal prosopographical query
types, and is designed to be intuitive in terms of knowledge representation and
writing SPARQL queries in flexible ways. Use cases for data represented using Bio CRM
include prosopographical information retrieval, network analysis, knowledge
discovery, and dynamic analysis. The development of Bio CRM was started in the EU COST
project Reassembling the Republic of Letters9 and was first piloted in the case of
enriching and publishing the printed register of over 10 000 alumni of the Finnish Norssi
high school as Linked Data [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ].
      </p>
      <p>Datasets The National Biography of the Finland10 consists of biographies of
notable Finnish people throughout history. The biographies describe the lives and
achievements of these historical figures, containing vast amounts of references to notable Finnish
and foreign figures, including internal links to other biographies of the National
Biography of Finland. In addition, the text contains references to historical events, notable
works (such as paintings, books, music, and acting), places (such as place of birth and
death), organizations, and dates.</p>
      <p>The data consists of several person registry databases listed in Table 1. These source
datasets ware made available as CSV tables, which were converted into RDF format11,
the foundation of the Semantic Web standard stack. In addition to actors, the resulting
data includes (at the moment) 13 144 biographies, 51 937 family relations, 4953 places,
3101 occupational titles, and 2938 companies extracted from the source data.</p>
      <p>Dataset name # of People
National Biographies 6478
Business Leaders 2235
Finnish Generals and Admirals 1809–1917 481
Finnish Clergy 1554–1721 2716</p>
      <p>Finnish Clergy 1800–1920 1234</p>
    </sec>
    <sec id="sec-4">
      <title>8 http://vivoweb.org 9 http://www.republicofletters.net 10 http://kansallisbiografia.fi 11 http://www.w3.org/RDF/</title>
      <p>To earn the 5th star in the Linked Data 5-star model12, the data was linked not only
internally but also enriched with owl:sameAs links to the external data sources of
Table 2. This facilitates data aggregation of a person described in several data sources.</p>
      <p>Data Source</p>
      <p># of Links Description
Wikipedia 5760
Wikidata 5749
BLF 972
BookSampo 715
WarSampo 243
ULAN 171
VIAF 2272
Geni 4935
Homepages 43
Parliament of Finland 628
http://fi.wikipedia.org
http://www.wikidata.org
Biografiskt Lexikon fo¨r Finland
Finnish fiction literature on the Semantic Web service
Second World War LOD service and portal
Union List of Artist Names Online
Virtual International Authority Files
Family research and family tree data
Personal web sites</p>
      <p>Web pages of Parliament of Finland
This section discusses the process of entity linking and knowledge extraction from
semi-structured and unstructured biographies.</p>
      <p>Semi-structured Data Extraction A simple custom event extractor was created for
transforming biographies into the Bio CRM model represented in RDF. The extractor
analyzes the major parts of a biography: a textual story followed by systematically
titled sections listing major achievements of the person, such as “works”, “awards”, and
“memberships” as snippets. A snippet represents an event and typically contains
mentions of years and places. For example, the biography of architect Eliel Saarinen tells
“WORKS: ...; Suomen Kansallismuseo (National Museum of Finland, 1902–1911;...”
indicating an artistic creation event. Also known family relations were extracted from
the textual descriptions, and for each mentioned relative also a resource was added into
the person ontology of the system.</p>
      <p>
        The system has its own place ontology, and the ARPA linker [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] was used for
finding and linking place names mentioned in the event snippets. Places in Finland were
extracted from the Finnish Gazetteer of Historical Places and Maps (Hipla) databases and
data service13 [
        <xref ref-type="bibr" rid="ref17 ref19">19,17</xref>
        ]. Foreign placenames were linked using the Google Maps APIs14.
For example, the locations of medieval universities in Europe, towns of the Hanseatic
12 http://5stardata.info/en/
13 http://hipla.fi
14 http://developers.google.com/maps/
League, Finnish mansions, churches, and other well-known buildings in Finland were
added to the place ontology by using the Google services.
      </p>
      <p>In order to create temporal links for the events, textual expressions of dates and their
intervals were analyzed and recognized using a set of regular expressions. The actor of
the event snipped was easily determined as the subject person of the biography.</p>
      <p>The result of processing a biography was a list of spatio-temporal events with short
titles (snippet texts) related to the corresponding person. Altogether, first version of the
extracted knowledge graph of the Semantic National Biography of Finland had 37 730
births, 25 552 deaths, 102 300 other biographical events, and 52 000 family relations.
At the moment, the extractor uses only the snippets for event creation—more generic
event extraction from the free biography narrative remains a topic of further research.</p>
      <p>From a data linking viewpoint, the birthday and full name of the persons were
known at this point, which could be used to enrich the data from several external
datasets listed in Table 2. Links were created to Wikipedia, Wikidata, Biografiskt lexikon
fo¨ r Finland BLF15, BookSampo16 Linked Data, WarSampo17 portal, ULAN18 authority
register of The J. Paul Getty Trust, VIAF19, and the genealogical data service Geni20.
Furthermore, some special links, like personal web pages or a person entry at the web
sites of the Finnish Parliament, where extracted from corresponding Wikidata resources.
At the moment no additional information is extracted from the external databases, but
the plan is utilize them in this way in the future, too.</p>
      <p>For entity linking to external databases offering a SPARQL endpoint, the tool SPARQL
ARPA21 was used. In cases where the database provides a REST API, like Wikipedia or
Geni.com, a special Python script was created and used. A database specific script was
used also in the case of BLF, where the data was available as a CSV formatted table.</p>
      <p>
        A Pipeline for Text Analysis In order to identify entities and events from
unstructured texts, a tool is being constructed to extract knowledge from them. The application
uses multiple different linguistic tools to do morphological analysis, part of speech
tagging, and lemmatization. In addition, the tool’s purpose is to transform all the data into
NLP Interchange Format (NIF)22 and to enrich it with linguistic information gathered
from the linguistic tools. This linguistic information can be then used in decision
making in named entity recognition [
        <xref ref-type="bibr" rid="ref11 ref27">27,11</xref>
        ] by providing context to the entities, so that
they can be linked more correctly and efficiently into the corresponding ontologies and
datasets. In addition, the tool’s results can be changed into a format where it is possible
to disambiguate entities also manually if the results are not satisfactory to the user.
      </p>
      <p>Our pipeline model is illustrated in the Fig. 1. In this model, the application can
retrieve the texts from a CSV file or from a SPARQL endpoint. In order to read a CSV
file, the application needs to know the columns for the text and possible document
15 http://www.sls.fi/sv/projekt/blf-biografiskt-lexikon-finland
16 http://www.kirjasampo.fi
17 http://sotasampo.fi/en/
18 http://www.getty.edu/research/tools/vocabularies/ulan/
19 http://www.viaf.org
20 http://www.geni.com
21 http://seco.cs.aalto.fi/projects/dcert/
22 http://persistence.uni-leipzig.org/nlp2rdf/specification/api.html
identifiers. SPARQL endpoint usage requires that the text is in a SPARQL endpoint and
that it is split into ordered text paragraphs. To query the text, the application needs to
be given a query and an endpoint to retrieve the text.</p>
      <p>After the application has acquired the text, it transforms (in the Prepare data module)
the document structure (i.e. the document, its paragraphs and titles) into NIF format.
Each document is represented as an instance of Structure class that has a property that
refers to the document identifier. The document identifier can be read from the CSV
file or by querying the SPARQL endpoint along with the text. In addition, the text is
divided into paragraphs and titles that are divided into multiple different text files for
the pipeline to process one paragraph at a time. The division into paragraphs is done
because each sentence and word needs to be connected to a particular paragraph later
on in the process.</p>
      <p>
        The process that the paragraphs go through is represented in the figure within the
curly brackets. It starts by taking a single text paragraph, transforms it into CoNLL
format using the Finnish dependency parser23. After this, the CoNLL file is transformed
into NIF format by the CoNLL2NIF [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] application. This application constructs from
each word and sentence their corresponding classes (Sentence and Word) and adds the
CoNLL information for each word instance. The last part of the process is a reasoner
that infers missing information for each instance. Currently, the reasoner infers only
information about the order of the words and sentences from the existing information.
After processing a paragraph of text, the application serializes paragraph data into RDF
format.
      </p>
      <p>
        Once the application has processed all the texts, there is a possibility for a user to
upload all the RDF files into a specific SPARQL server automatically. The output of
this application is the given text document in NIF format. This data can be later on used
to identify named entities and mentions of events in the texts. For future work we are
23 http://turkunlp.github.io/Finnish-dep-parser/
planning to implement an application that can use this data to identify named entities
and events, based on semantically interlinked texts and contexts. We envision that this
will help semantic disambiguation of the entities as well as extraction of complex events
significantly. An application of this is, for example, to use a contextual reader [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ] for
giving the readers of the biographies more contextual information about the entities and
events mentioned in the texts.
4
      </p>
      <sec id="sec-4-1">
        <title>Faceted Search Engine and Prosopography</title>
        <p>
          Based on the RDF data, a faceted search and browsing application24 depicted in
Fig. 2 was created using the SPARQL Faceter tool [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ] and AngularJS25 framefork. On
the left, the first column contains a free text search 1) and facets 2a-i) for searching or
filtering person entries included in external databases 2a), by time period 2b), by family
name 2c), by gender 2d), etc. For example, by selecting WarSampo or Wikipedia on the
external dataset facet 2a), people with a history in the WarSampo Second World War
history portal or people having a Wikipedia page can be filtered, and corresponding
personal homepages on these external services can be found. In the result list each
person is presented by an image (if there is one), her/his lifespan 3), and his/her linkage
24 http://semanticcomputing.github.io/nbf
25 http://angularjs.org
to external databases 4). The last column 5) contains original introduction text from
the biographical description. Specially interesting from a data linking perspective is the
facet and column for links to other external data sources. Using the links, the reading
experience of an end user can be extended substantially beyond the biography text in
SNBF.
        </p>
        <p>After clicking on a person’s image or name fields, a personal “homepage” depicted
in Fig. 3 is opened. This page consists of the person’s basic information 2), links to
external databases 3), his relatives 4), and full biographical descriptions 5) – a
biographical description can be much longer than what is shown in the figure. At the top
of the page 1), the user can switch between the person page with textual descriptions or
a timeline page, depicted in Fig. 4. On the timeline page, a list of events relating to the
person is shown on the left 1), events with known locations are shown on the map 2),
and below there is a timeline 3) showing the timespans of each event. The timeline has
four horizontal lines for showing family events, career events, achievements, and
mentions of honour. When an event is hovered on the list or timeline by the mouse cursor,
the corresponding marker on the map is highlighted.</p>
        <p>The faceted search engine provides the end user with a means for filtering and
studying subgroups of historical people in the data service for prosopographical research. The
criterion for filtering the group can be specified flexibly by using the facets. For
example, one can study people having a Wikipedia page, born in the same area during a time
period, having the same education or profession, etc.</p>
        <p>
          A simple tool for such analysis is business graphics, but also other methods and
tools, such as network analysis or knowledge discovery could be applied here to support
Digital Humanities research. To start with, pie charts and histograms, based on Google
graphics, will be added to the system in a similar manner as with our earlier project on
Norssi high school alumni on the Semantic Web [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. To study the behavior of groups
and other phenomena, a map and timeline application similar to the person pages will
be implemented.
5
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Discussion, Related Work, and Future Research</title>
        <p>Our case study suggests that biography publication is a promising application case for
Linked Data. The event-based modeling approach was deemed useful and handy, after
learning basics of the fairly complex CIDOC CRM model.</p>
        <p>For the current RDF version of the biographies, only their semistructured parts have
been considered and linked by using the ARPA tool and custom linkers. The snippet
events could be extracted and aligned with related places, times, and actors fairly
accurately using simple string-based techniques without deeper semantic disambiguation.
However, the precision and recall of event extraction and entity linking have not been
evaluated formally. It is obvious that problems grow with larger datasets and when
analyzing free texts. These issues remain topics of future research.</p>
        <p>
          Biographies have been studied by genealogists (e.g., (Event) GEDCOM26), CH
organizations (e.g., the Getty ULAN27), and semantic web researchers (e.g., BIO
ontology28). Semantic web event models include, e.g., Event Ontology [
          <xref ref-type="bibr" rid="ref30">30</xref>
          ], LODE
ontology29, SEM [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], and Event-Model-F30 [
          <xref ref-type="bibr" rid="ref31">31</xref>
          ]. A history ontology with map
visualizations is presented in [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ], and an ontology of historical events in [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. Visualization
using historical timelines is discussed, e.g., in [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ], and event extraction in [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ].
        </p>
        <p>
          Previous works of applying Linked Data technologies to biographical data include,
e.g., [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ], Biography.net31 [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ], and our own earlier work [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. The conference
proceedings [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] includes several papers on bringing biographical data online, on analyzing
biographies with computational methods, on group portraits and networks, and on
visualizations. Complementing these works, the study of this paper focuses on extracting
linked data from semi-structured biographies. Our work also emphasizes the idea of
enriching the texts with external links to other biographical datasets. As for
applications, faceted search and browsing of biographical data for prosopographical studies
was considered as well as spatiotemporal visualization of life stories.
        </p>
        <p>Our work continues, e.g., on developing new models of biographical data for
prosopographical research, on semantic disambiguation, on finalizing and evaluating the data
extraction and linking process (precision and recall), and on extending the
demonstrator with new tools for solving the DH research questions discussed in section 1. In a
project such as this it is of critical importance to assess the outcome also from the
vantage point of historical and biographical scholarship and to evaluate relevance of the
new knowledge that these methods enable us to find and analyze.</p>
        <p>Acknowledgements Our work is part of the Severi project32, funded mainly by
Tekes. Our work is also part of the Open Science and Research Programme33, funded
by the Ministry of Education and Culture of Finland.
26 http://en.wikipedia.org/wiki/GEDCOM
27 http://www.getty.edu/research/tools/vocabularies/ulan/
28 http://vocab.org/bio/0.1/.html
29 http://linkedevents.org/ontology/
30 http://www.uni-koblenz-landau.de/koblenz/fb4/AGStaab/Research/ontologies/events
31 http://www.biographynet.nl
32 http://seco.cs.aalto.fi/projects/severi
33 https://openscience.fi</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>American</given-names>
            <surname>National Biography</surname>
          </string-name>
          (
          <year>2017</year>
          ), http://www.anb.org/aboutanb.html
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. Biography Portal of the Netherlands (
          <year>2017</year>
          ), http://www.biografischportaal.nl/en
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>BiographyNet</surname>
          </string-name>
          (
          <year>2017</year>
          ), http://www.biographynet.nl
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Dictionary of Swedish National Biography (
          <year>2017</year>
          ), https://sok.riksarkivet.se/Sbl/Start.aspx?lang=en
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Neue</given-names>
            <surname>Deutsche Biographie</surname>
          </string-name>
          (
          <year>2017</year>
          ), http://www.ndb.badw-muenchen.de/ndb aufgaben e.htm
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Nouvelle</surname>
          </string-name>
          <article-title>Biographie ge´ne´rale (</article-title>
          <year>2017</year>
          ), https://fr.wikipedia.org/wiki/Nouvelle Biographie g%
          <article-title>C3%A9n%C3%A9rale</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. Oxford Dictionary of National Biography (
          <year>2017</year>
          ), http://global.oup.com/oxforddnb/info/
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. ter Braake,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Anstke</surname>
          </string-name>
          <string-name>
            <surname>Fokkens</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.S.</given-names>
            ,
            <surname>Declerck</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Wandl-Vogt</surname>
          </string-name>
          , E. (eds.):
          <article-title>BD2015 Biographical Data in a Digital World 2015</article-title>
          . CEUR Workshop Proceedings (
          <year>2015</year>
          ), http://ceurws.org/Vol-
          <volume>1272</volume>
          /
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Chiarcos</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          , Fa¨th, C.:
          <string-name>
            <surname>CoNLL-RDF</surname>
          </string-name>
          :
          <article-title>Linked corpora done in an NLP-friendly way</article-title>
          .
          <source>In: Language</source>
          , Data, and Knowledge - First International Conference LDK. pp.
          <fpage>74</fpage>
          -
          <lpage>88</lpage>
          (
          <year>2017</year>
          ), https://doi.org/10.1007/978-3-
          <fpage>319</fpage>
          -59888-8 6
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Doerr</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The CIDOC CRM-an ontological approach to semantic interoperability of metadata</article-title>
          .
          <source>AI</source>
          Magazine
          <volume>24</volume>
          (
          <issue>3</issue>
          ),
          <fpage>75</fpage>
          -
          <lpage>92</lpage>
          (
          <year>2003</year>
          ), https://doi.org/10.1609/aimag.v24i3.
          <fpage>1720</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Hachey</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radford</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nothman</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Honnibal</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Curran</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          :
          <article-title>Evaluating entity linking with Wikipedia</article-title>
          .
          <source>Artificial Intelligence</source>
          <volume>194</volume>
          ,
          <fpage>130</fpage>
          -
          <lpage>150</lpage>
          (
          <year>January 2013</year>
          ), http://dx.doi.org/10.1016/j.artint.
          <year>2012</year>
          .
          <volume>04</volume>
          .005
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. van Hage,
          <string-name>
            <surname>W.R.</surname>
          </string-name>
          , Malaise´,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Segers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Hollink</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Schreiber</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          :
          <article-title>Design and use of the simple event model (SEM)</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          <volume>9</volume>
          (
          <issue>2</issue>
          ),
          <fpage>128</fpage>
          -
          <lpage>136</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Hogenboom</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frasincar</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kaymak</surname>
          </string-name>
          , U., de Jong, F.:
          <article-title>An overview of event extraction from text</article-title>
          .
          <source>In: DeRiVE</source>
          <year>2011</year>
          , Detection, Representation, and
          <article-title>Exploitation of Events in the Semantic Web (</article-title>
          <year>2011</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>779</volume>
          /
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. Hyvo¨nen, E.:
          <article-title>Publishing and using cultural heritage linked data on the semantic web</article-title>
          . Morgan &amp; Claypool, Palo Alto, CA (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15. Hyvo¨nen, E.,
          <string-name>
            <surname>Alm</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kuittinen</surname>
          </string-name>
          , H.:
          <article-title>Using an ontology of historical events in semantic portals for cultural heritage</article-title>
          .
          <source>In: Proceedings of the Cultural Heritage on the Semantic Web Workshop at the 6th International Semantic Web Conference (ISWC</source>
          <year>2007</year>
          )
          <article-title>(</article-title>
          <year>2007</year>
          ), http://www.cs.vu.nl/ laroyo/CH-SW.html
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16. Hyvo¨nen, E.,
          <string-name>
            <surname>Alonen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ikkala</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , Ma¨kela¨, E.:
          <article-title>Life stories as event-based linked data: Case semantic national biography</article-title>
          .
          <source>In: Proceedings of ISWC 2014 Posters &amp; Demonstrations Track. CEUR Workshop Proceedings (October</source>
          <year>2014</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1272</volume>
          /
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17. Hyvo¨nen, E.,
          <string-name>
            <surname>Ikkala</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tuominen</surname>
          </string-name>
          , J.:
          <article-title>Linked data brokering service for historical places and maps</article-title>
          .
          <source>In: Proceedings of the 1st Workshop on Humanities in the Semantic Web (WHiSe)</source>
          . pp.
          <fpage>39</fpage>
          -
          <lpage>52</lpage>
          . CEUR Workshop Proc. (
          <year>2016</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1608</volume>
          /#paper-
          <fpage>06</fpage>
          , vol
          <volume>1608</volume>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18. Hyvo¨nen, E.,
          <string-name>
            <surname>Leskinen</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heino</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tuominen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sirola</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Reassembling and enriching the life stories in printed biographical registers: Norssi high school alumni on the semantic web</article-title>
          .
          <source>In: Language, Technology and Knowledge</source>
          . pp.
          <fpage>113</fpage>
          -
          <lpage>119</lpage>
          . Springer-Verlag (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Ikkala</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tuominen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Hyvo¨nen, E.:
          <article-title>Contextualizing historical places in a gazetteer by using historical maps and linked data</article-title>
          .
          <source>In: Proceedings of Digital Humanities</source>
          <year>2016</year>
          ,
          <article-title>short papers</article-title>
          . pp.
          <fpage>573</fpage>
          -
          <lpage>577</lpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Jensen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Vizualising complex semantic timelines</article-title>
          .
          <source>NewsBlip Research Papers, Report NBTR2003-001</source>
          (
          <year>2003</year>
          ), http://www.newsblip.com/tr/
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21. Keith:,
          <string-name>
            <surname>T.</surname>
          </string-name>
          :
          <article-title>Changing conceptions of National Biography</article-title>
          . Cambridge University Press (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Koho</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heino</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , Hyvo¨nen, E.:
          <article-title>SPARQL Faceter-Client-side Faceted Search Based on SPARQL</article-title>
          . In: Troncy,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Verborgh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Nixon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Kurz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Schlegel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Vander</surname>
          </string-name>
          <string-name>
            <surname>Sande</surname>
          </string-name>
          , M. (eds.)
          <source>Joint Proc. of the 4th International Workshop on Linked Media and the 3rd Developers Hackshop. CEUR Workshop Proceedings</source>
          (
          <year>2016</year>
          ), http://ceur-ws.org/Vol1615/semdevPaper5.pdf
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Larson</surname>
          </string-name>
          , R.:
          <article-title>Bringing lives to light: Biography in context (</article-title>
          <year>2010</year>
          ), http://metadata.berkeley.edu/Biography
          <source>Final Report.pdf, Final Project Report</source>
          , University of Berkeley
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Leskinen</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tuominen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heino</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , Hyvo¨nen, E.:
          <article-title>An ontology and data infrastructure for publishing and using biographical linked data</article-title>
          .
          <source>In: Proceedings of the Workshop on Humanities in the Semantic Web</source>
          (
          <article-title>WHiSe II)</article-title>
          .
          <source>CEUR Workshop Proceedings (October</source>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25. Ma¨kela¨, E.:
          <article-title>Combining a REST lexical analysis web service with SPARQL for mashup semantic annotation from text</article-title>
          .
          <source>In: Proceedings of the ESWC 2014 demonstration track</source>
          , Springer-Verlag (May
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26. Ma¨kela¨, E.,
          <string-name>
            <surname>Lindquist</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          , Hyvo¨ nen, E.:
          <article-title>CORE - a contextual reader based on linked data</article-title>
          .
          <source>In: Proceedings of Digital Humanities</source>
          <year>2016</year>
          , long papers. pp.
          <fpage>267</fpage>
          -
          <lpage>269</lpage>
          (
          <year>July 2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Mendes</surname>
            ,
            <given-names>P.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jakob</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <article-title>Garc´ıa-</article-title>
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>DBpedia Spotlight: shedding light on the web of documents</article-title>
          .
          <source>In: Proceedings of the 7th international conference on semantic systems</source>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          . ACM (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Nagypal</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Deswarte</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oosthoek</surname>
          </string-name>
          , J.:
          <article-title>Applying the semantic web: The VICODI experience in creating visual contextualization for history</article-title>
          .
          <source>Lit Linguist Computing</source>
          <volume>20</volume>
          (
          <issue>3</issue>
          ),
          <fpage>327</fpage>
          -
          <lpage>349</lpage>
          (
          <year>2005</year>
          ), http://doi.org/10.1093/llc/fqi037
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Ockeloen</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fokkens</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>ter Braake</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vossen</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Boer</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schreiber</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , Legeˆne, S.: BiographyNet:
          <article-title>Managing provenance at multiple levels and from different perspectives</article-title>
          .
          <source>In: Proceedings of the 3rd International Conference on Linked Science (LISC'13)</source>
          . pp.
          <fpage>59</fpage>
          -
          <lpage>71</lpage>
          . CEUR-WS.org (
          <year>2013</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1116</volume>
          /paper7.pdf
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Raimond</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abdallah</surname>
            ,
            <given-names>S.:</given-names>
          </string-name>
          <article-title>The event ontology (</article-title>
          <year>2007</year>
          ), http://motools.sourceforge.net/event/event.html
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Scherp</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saathoff</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Franz</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Event-Model-F</surname>
          </string-name>
          (
          <year>2010</year>
          ), http://www.uni-koblenz-landau.de/koblenz/fb4/AGStaab/Research/ontologies/events
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          32.
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Almeida</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bona</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brochhausen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ceusters</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Courtot</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dipert</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goldfain</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grenon</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hastings</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hogan</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jacuzzo</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Johansson</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mungall</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Natale</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neuhaus</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Overton</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Petosa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rovetto</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ruttenberg</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ressler</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rudniki</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          , Seppa¨la¨,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Schulz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Zheng</surname>
          </string-name>
          , J.:
          <source>Basic formal ontology 2</source>
          .0
          <article-title>- specification and user's guide (</article-title>
          <year>2015</year>
          ), https://github.com/BFO-ontology/BFO/raw/master/docs/bfo2- reference/BFO2-Reference.pdf, June 26
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          33.
          <string-name>
            <surname>Tuominen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Hyvo¨ nen, E.,
          <string-name>
            <surname>Leskinen</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Bio</surname>
            <given-names>CRM</given-names>
          </string-name>
          :
          <article-title>A data model for representing biographical data for prosopographical research</article-title>
          .
          <source>In: Biographical Data in a Digital World (BD2017) (November</source>
          <year>2017</year>
          ), https://doi.org/10.5281/zenodo.1040712
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>