<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Extraction of Career Profiles from Wikipedia</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Firas Dib</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Simon Lindberg</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pierre Nugues</string-name>
          <email>Pierre.Nugues@cs.lth.se</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Lund University LTH, Department of Computer Science</institution>
          ,
          <addr-line>S-221 00 Lund</addr-line>
          ,
          <country country="SE">Sweden</country>
        </aff>
      </contrib-group>
      <fpage>33</fpage>
      <lpage>38</lpage>
      <abstract>
        <p>In this paper, we describe a system that gathers the work experience of a person from her or his Wikipedia page. We first extract an ontology of profession names from the Wikidata graph. We then parse the Wikipedia pages using a dependency parser and we connect persons to professions through the analysis of parts of speech and dependency relations we extract from text. Setting aside the dates, we computed recall and precision scores on a very limited and preliminary test set for which we could reach a recall of 74% and a precision of 95%, showing our approach is promising. Biographies form a category of their own in literature as they typically mix free-form text - a life narrative - with a set of well-defined numerical and nominal properties, such as dates of birth and death, country of origin, titles and decorations, etc. that merely resort to databases. Textual biographies can therefore be associated to structured databases that describe such properties in the form of tables or graphs. Texts and databases are both useful and complementary for humanities research. While text often contains more details on people's life, databases enable researchers to formulate questions like:</p>
      </abstract>
      <kwd-group>
        <kwd>Knowledge extraction</kwd>
        <kwd>Wikidata ontology</kwd>
        <kwd>Dependency parsing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Are there welders who became prime ministers?
and immediately have answers.</p>
      <p>Although there are now scores of digital biographies,
Wikipedia has become the major reference of the internet.
It is free and easy to download; it covers more people than
other online resources; it is open to popular culture; and
it is multilingual. This makes it unique even if Wikipedia
often reuses text and data from older printed biographies
and contains mistakes. In addition to its scope and size,
there are many open computer tools that have been
designed for Wikipedia that make the development of new
programs dedicated to this resource faster.</p>
      <p>We created a system that takes a corpus of Wikipedia pages
describing people as input and that outputs a career profile
for each respective individual. To carry this out, we used
available tools to parse and extract information from text
and then analyze the data.</p>
      <p>A practical use of our system could be to expand Wikidata,
the data repository companion to Wikipedia. As of today,
Wikidata often associates people with the most notable
occupation of their life. The system we describe makes it
possible to build more comprehensive semantic knowledge
bases of career timelines as it extracts all the occupations,
possibly secondary, mentioned in the text.</p>
      <p>In our experiments, we used the Swedish version of
Wikipedia and we are strictly dependent on the Wikipedia
format. Nonetheless, we only used the text itself so this
source could easily be replaced with another one in another
language and another format. In addition, beyond
Wikidata, the techniques we have developed could be applied to
expand any database.</p>
      <p>2.</p>
    </sec>
    <sec id="sec-2">
      <title>Previous Work</title>
      <p>
        The analysis of career profiles from biographies is a specific
case of information extraction that produces tabular data
from raw text. Information extraction has a long history in
natural language processing, starting from the message
understanding conferences (MUC)
        <xref ref-type="bibr" rid="ref5">(Grishman and Sundheim,
1996)</xref>
        , and has been carried out with a variety of techniques
along the time: rule-based, statistical, or hybrid, with a
current focus on machine learning
        <xref ref-type="bibr" rid="ref7">(Mausam et al., 2012)</xref>
        . See
Hobbs et al. (1997) for the description of an early and
oftcited system and Roche and Schabes (1997) for a review.
There are a few papers describing the extraction of
timelines from Wikipedia. Timely YAGO
        <xref ref-type="bibr" rid="ref14">(Wang et al., 2010)</xref>
        is an example of them that is limited to the analysis of
infoboxes, summaries of facts in the form of tabular data
inside the articles, and lists in articles. Exner and Nugues
(2011) is another example that uses semantic role
labeling and the LODE model
        <xref ref-type="bibr" rid="ref11">(Shaw, 2010)</xref>
        to extract events.
Wu and Weld (2010) is a third example that combines
Wikipedia infoboxes and document text to collect data to
train relation classifiers.
      </p>
      <p>Contrary to these works, the system we describe is
dedicated to the extraction of careers through the analysis of the
dependency graphs of the sentences. To collect the
vocabulary associated with occupations, the system creates a
career ontology that it automatically retrieves from the
Wikidata repository. In addition to being automatic, this process
can easily be extended to create multilingual vocabularies.
3.</p>
    </sec>
    <sec id="sec-3">
      <title>Term Extraction</title>
      <p>3.1.</p>
      <sec id="sec-3-1">
        <title>Wikidata: A Semantic Repository</title>
        <p>We used Wikidata as main source of structured knowledge
on human beings and their occupations. Wikidata is a free
data repository from the wikimedia foundation. Wikidata
started as a means to identify named entities across all their
Wikipedia language versions with a unique number.
Go¨ran Persson, for instance, a former Prime Minister of
Sweden, has the identifier Q53747 in Wikidata that links
this entity to the 44 different language versions of his
biography in Wikipedia, while Jacques Delors, a former
president of the European Commission, has the identifier
Q153425 that provides links to the 35 language versions
of Delors’ biography. Figure 1 shows the 10 first links for
these two persons with their language codes, for instance en
for English, de for German, or el for Greek and their name’s
transcription in the corresponding script as in Greek: Zak
Ntelìr, for Jacques Delors.</p>
        <p>The entities reflected by Q-numbers are linked to concepts
or other entities by a set of properties that describes the
entity, Px, where x is a number. Property P31, corresponding
to instance of, applies to Go¨ran Persson with the value
human; P569, date of birth, with the value 20 January 1949;
P26, spouse, Anitra Steen; P106, occupation, politician,
etc.</p>
        <p>P31(Q53747) = human
P569(Q53747) = 20 January 1949
P26(Q53747) = Anitra Steen
P106(Q53747) = politician</p>
        <sec id="sec-3-1-1">
          <title>The values human, Anitra Steen, and politician having</title>
          <p>themselves unique Q-numbers, respectively Q5, Q444325,
and Q82955.</p>
          <p>
            The P39 property, position held, tracks the career of a
person and consists of multiple values. Wikidata lists five
positions held by Go¨ran Persson: Leader of the Opposition,
Minister for Finance, Skolminister (Minister for Schools),
Prime Minister of Sweden, and Member of the Riksdag,
possibly with time values or boundaries (Fig. 2).
Wikidata stores all this information as a graph in the RDF
format. It is similar to earlier projects such as
DBpedia
            <xref ref-type="bibr" rid="ref1">(Auer et al., 2007)</xref>
            , Yago
            <xref ref-type="bibr" rid="ref12">(Suchanek et al., 2007)</xref>
            , or
Freebase
            <xref ref-type="bibr" rid="ref2">(Bollacker et al., 2008)</xref>
            . A key difference
between these earlier works and Wikidata is that Wikidata
is language-agnostic and an integral part of the Wikipedia
structure.
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Extracting Occupations</title>
        <p>The properties such as P106, occupation, are organized as
hierarchies of more specific properties. In the case of
occupation, Figure 3 shows an excerpt of such a hierarchy,
where Wikidata gathers all types of jobs, professions, and
careers.
We processed the Wikidata graph and the concept
hierarchies to create a baseline list of professions. We
considered the Instance of (P31) and Subclass of (P279)
properties that we took as guiding relations to extract the people
careers. We created a list of terms using all the
descendants of the Occupation node that we chose as the root
node since Profession, Job, and Labour are all an
Instance of Occupation. We created this list in a
preprocessing stage, independently and prior to the actual article
parsing.</p>
        <p>4.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>NLP Pipeline</title>
      <p>Although Wikidata covers lots of biographical details, it is
far from being exhaustive and much of the information on
the career timelines still lays in the text. Stefan Lo¨fven,
another Prime Minister of Sweden, provides an example of
this, where his Wikipedia page in English states that:
Lo¨fve´n began his career in 1979 as a welder at
Ha¨gglunds in O¨ rnsko¨ldsvik.
while Wikidata only lists him as a politician1. We
assembled a pipeline of natural language processing components
to analyze the text and extract such information.</p>
      <p>1Both the Wikidata item and the Wikipedia page were
retrieved on May 28, 2015.</p>
      <p>
        We downloaded the Swedish version of Wikipedia and we
first processed the articles to remove the wiki markup. This
markup code enriches the text of Wikipedia articles, for
instance to create the links or to identify the section titles.
We then applied a part-of-speech tagger and a dependency
parser to the text. We split the Wikipedia archive in chunks
allowing for a multithreaded execution in order to speed up
the process:
1. The first step of the pipeline was to parse and remove
the Wikipedia markup. This markup is functionally
similar to HTML or XML, but has a different format
that requires a different parser. We used the Sweble
tool
        <xref ref-type="bibr" rid="ref3">(Dohrn and Riehle, 2013)</xref>
        to carry it out.
2. We then applied a tagger to the text in Swedish and
annotate the words with their parts of speech. We used
Stagger
        <xref ref-type="bibr" rid="ref9">(O¨ stling, 2013)</xref>
        that also includes a named
entity recognition (NER). We used these named entities
further down in the pipeline to extract the persons from
the sentence.
3. Finally, we ran the Maltparser dependency parser
        <xref ref-type="bibr" rid="ref8">(Nivre et al., 2006)</xref>
        on the POS tagged sentences to
have a syntactic representation of them.
5.
      </p>
    </sec>
    <sec id="sec-5">
      <title>Career Parsing</title>
      <p>The career parsing module analyzes the text, sentence by
sentence, to find out the persons, what they are working at,
during what time frame, and tries to connect these elements
together through the dependency graph of the sentence.</p>
      <sec id="sec-5-1">
        <title>5.1. Finding Persons</title>
        <p>The first step of the career parser identifies the mentions of
human beings in each sentence. We applied the following
rules to decide if a word referred to a person:
1. The word matches a regular expression based on the</p>
        <p>Wikipedia page title: The person the page is about;
2. The word is a singular pronoun in Swedish: han “he”,
hon “she”, hans “his”, or hennes “her”;
3. The word is tagged as a person by Stagger’s named
entity recognizer.</p>
        <p>We stored all the persons we found as well as the sentences
they occurred in.
The second step finds the job names mentioned in the
sentences. We used the list of professions we collected from
Wikidata in Sect. 3.2. to check the presence of
corresponding words and extract them. However, this initial profession
list is far from being exhaustive and we applied additional
rules to complete it. To decide if a given a word in a
sentence was a profession, we checked if it was:
1. A job name in the list, without any modification;
2. The compounding of two stems, where the last one is a
profession in the list. We split the word in a prefix and
a suffix and we applied a greedy search on the suffix,
where both the prefix and suffix had to be in a
dictionary of Swedish words. The prefix check was done
to eliminate false positives such as kretsar “circuits”
that could be interpreted as tsar “Czar” preceded by a
meaningless prefix kre.
3. The compounding of two stems separated by a linking
morpheme (fogemorpheme). In Swedish, and other
Germanic languages, it is common to either add an s
between the two stems or change the last vowel of the
first stem. We used two simple morphology rules to
extract them:
(a) If the last letter of the prefix ends with an s, we
remove it and we check if this prefix is a valid word
in the dictionary as with utbildningsminister,
(utbildning + minister), “Minister for Schools”.
(b) If the last letter of the prefix ends in a vowel, we
replace it by another vowel and we see if this
prefix makes up a word as with: fo¨rskolela¨rare
(fo¨rskola + la¨rare) “preschool teacher”.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.3. Finding Verbs</title>
        <p>As noted by Tesnie`re (1966), verbs in European languages
are central elements to describe processes between actors
and circumstances. We started from this observation and
we extracted the verbs hinting at a professional activity
from the sentences. As vocabulary, we used the
following set of Swedish verbs: vara “be”, bli “become”, arbeta
“work”, jobba “work”, and praktisera “practice”.
We then considered that these verbs were potential linking
nodes to relate a person to a job in sentence.</p>
      </sec>
      <sec id="sec-5-3">
        <title>5.4. Finding a Path</title>
        <p>The path finding step links people to jobs. From the
previous steps, the career parser has gathered for each sentence
respective lists of persons, jobs, and verbs. We create a path
between these words by traversing the dependency graph of
a sentence until we find a common ancestor.</p>
        <p>Figure 4 shows the dependency graph of the sentence:
Hon var tidigare kommun- och regionminister
2001-2005.
“Previously, she served as minister for
municipalities and regions (2001-2005)”,
where we link a person mentioned by the feminine singular
pronoun hon, highlighted in green in the figure to a
profession, regionminister, in turquoise, through the verb var, in
purple, and where we extract the path:
hon ! var
och
regionsminister
where the career parser connects a person, Go¨ran Persson,
to two occupations, politiker “politician” and statsminister
“Prime Minister”, through the verbs a¨r “is” and var “was”.
To deal with the case where multiple persons are referenced
in a sentence alongside a job, we introduced two additional
constraints:
1. The path from job to person must include one of the
professional activity verbs;
2. This path must be the shortest one. We search all the
paths between all the persons and all the jobs and we
keep the shortest path for each respective profession.
Figure 6 shows an example of this in a sentence with
two persons and one activity.</p>
      </sec>
      <sec id="sec-5-4">
        <title>5.5. Finding Dates</title>
        <p>Once we have linked an occupation to a person, we extract
the dates from the sentence. We implemented a simple
procedure, where we looked at the words preceding and
following the word representing the job.</p>
        <p>We first try to match the adjacent words to a date
expression. If these words correspond to dates, we use them to
annotate the occupation with time stamps; if the adjacent
words are prepositions or conjunctions, we skip them and
we repeat the matching attempt.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Results</title>
      <p>We processed the complete collection of Swedish
Wikipedia articles referring to a person in Wikidata. We
extracted a total of 267,786 jobs from 170,300 articles.
Figure 7 shows the seven professions we obtained for Barack</p>
      <sec id="sec-6-1">
        <title>Obama:</title>
      </sec>
      <sec id="sec-6-2">
        <title>President,</title>
        <p>A¨mbete “officer”,</p>
      </sec>
      <sec id="sec-6-3">
        <title>Senator,</title>
        <p>Handledare “instructor, supervisor”,
Konsult “consultant”,
Sommaransta¨lld “Summer employee”, and</p>
      </sec>
      <sec id="sec-6-4">
        <title>Journalist,</title>
        <p>Politiker, “politician”,
Talesperson, “spokesperson”, and</p>
        <p>Sjo¨officer, “naval officer”.</p>
        <p>while Figure 8 shows the three ones for Filippa Reinfeldt:</p>
        <p>The third profession of Filippa Reinfeldt is wrong and
corresponds to that of her father.</p>
        <p>We assessed the accuracy of the system using a small and
preliminary test set of 10 random Wikipedia articles about
people that were about one or two paragraphs long
(Table 2). Since the articles were short, they were often to the
point and did not contain any complicated language. This
made the recall easier than if we would have tested against
larger and more complex articles.</p>
        <p>Although a more thorough testing would be necessary to
validate the system, it shows the promising nature of our
approach.</p>
        <p>Recall
74.1%</p>
        <p>Precision
95.2%</p>
        <p>F-score
83.3%
Coreference. While looking for persons in the sentence,
we also check for pronouns. We then assume that the
pronouns are referring to the person of interest. A
coreference solver would make this step more
accurate.</p>
        <p>Swedish only. Our system only supports the Swedish
language. It would however be relatively simple to adapt
it to English as well.</p>
        <p>8.</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgements</title>
      <p>This research was supported by Vetenskapsra˚det and the</p>
      <sec id="sec-7-1">
        <title>Det digitaliserade samha¨llet program.</title>
        <p>9.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>So¨ren Auer, Christian Bizer</article-title>
          , Georgi Kobilarov, Jens Lehmann, Richard Cyganiak, and
          <string-name>
            <given-names>Zachary</given-names>
            <surname>Ives</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>DBpedia: A nucleus for a web of open data</article-title>
          .
          <source>In The Semantic Web, Proceedings of 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference, ISWC 2007 + ASWC</source>
          <year>2007</year>
          , pages
          <fpage>722</fpage>
          -
          <lpage>735</lpage>
          , Busan, Korea,
          <source>November 11-15</source>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Kurt</given-names>
            <surname>Bollacker</surname>
          </string-name>
          , Colin Evans, Praveen Paritosh, Tim Sturge, and
          <string-name>
            <given-names>Jamie</given-names>
            <surname>Taylor</surname>
          </string-name>
          .
          <year>2008</year>
          .
          <article-title>Freebase: A collaboratively created graph database for structuring human knowledge</article-title>
          .
          <source>In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD '08</source>
          , pages
          <fpage>1247</fpage>
          -
          <lpage>1250</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Hannes</given-names>
            <surname>Dohrn</surname>
          </string-name>
          and
          <string-name>
            <given-names>Dirk</given-names>
            <surname>Riehle</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Design and implementation of wiki content transformations and refactorings</article-title>
          .
          <source>In Proceedings of the 9th International Symposium on Open Collaboration, WikiSym '13</source>
          , pages
          <issue>2</issue>
          :
          <fpage>1</fpage>
          -
          <lpage>2</lpage>
          :
          <fpage>10</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>Peter</given-names>
            <surname>Exner</surname>
          </string-name>
          and
          <string-name>
            <given-names>Pierre</given-names>
            <surname>Nugues</surname>
          </string-name>
          .
          <year>2011</year>
          .
          <article-title>Using semantic role labeling to extract events from Wikipedia</article-title>
          .
          <source>In Proceedings of the Workshop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE</source>
          <year>2011</year>
          ). Workshop in conjunction with the 10th
          <source>International Semantic Web Conference 2011 (ISWC</source>
          <year>2011</year>
          ), Bonn, October
          <volume>23</volume>
          -24.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <given-names>Ralph</given-names>
            <surname>Grishman</surname>
          </string-name>
          and
          <string-name>
            <given-names>Beth</given-names>
            <surname>Sundheim</surname>
          </string-name>
          .
          <year>1996</year>
          .
          <article-title>Message understanding conference - 6: A brief history</article-title>
          .
          <source>In Proceedings of the 16th International Conference on Computational Linguistics (COLING)</source>
          , volume
          <volume>1</volume>
          , page 466-
          <fpage>471</fpage>
          , Copenhagen.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Jerry R. Hobbs</surname>
            , Douglas E. Appelt, John Bear, David Israel, Megumi Kameyama,
            <given-names>Mark</given-names>
          </string-name>
          <string-name>
            <surname>Stickel</surname>
            , and
            <given-names>Mabry</given-names>
          </string-name>
          <string-name>
            <surname>Tyson</surname>
          </string-name>
          .
          <year>1997</year>
          .
          <article-title>FASTUS: a cascaded finite-state transducer for extracting information from natural-language text</article-title>
          .
          <source>In Emmanuel Roche and Yves Schabes</source>
          , editors,
          <source>Finite-State Language Processing</source>
          , chapter
          <volume>13</volume>
          , pages
          <fpage>383</fpage>
          -
          <lpage>406</lpage>
          . MIT Press, Cambridge, Massachusetts.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Mausam</surname>
            ,
            <given-names>Michael</given-names>
          </string-name>
          <string-name>
            <surname>Schmitz</surname>
            , Robert Bart, Stephen Soderland, and
            <given-names>Oren</given-names>
          </string-name>
          <string-name>
            <surname>Etzioni</surname>
          </string-name>
          .
          <year>2012</year>
          .
          <article-title>Open language learning for information extraction</article-title>
          .
          <source>In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, EMNLP-CoNLL '12</source>
          , pages
          <fpage>523</fpage>
          -
          <lpage>534</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Joakim</given-names>
            <surname>Nivre</surname>
          </string-name>
          , Johan Hall, and
          <string-name>
            <given-names>Jens</given-names>
            <surname>Nilsson</surname>
          </string-name>
          .
          <year>2006</year>
          .
          <article-title>Maltparser: A data-driven parser-generator for dependency parsing</article-title>
          .
          <source>In Proceedings of the fifth international conference on Language Resources and Evaluation (LREC2006).</source>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Robert</surname>
            <given-names>O</given-names>
          </string-name>
          ¨
          <fpage>stling</fpage>
          .
          <year>2013</year>
          .
          <article-title>Stagger: an open-source part of speech tagger for Swedish</article-title>
          .
          <source>Northern European Journal of Language Technology</source>
          ,
          <volume>3</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Emmanuel</given-names>
            <surname>Roche</surname>
          </string-name>
          and Yves Schabes, editors.
          <year>1997</year>
          .
          <article-title>FiniteState Language Processing</article-title>
          . MIT Press, Cambridge, Massachusetts.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Ryan</given-names>
            <surname>Benjamin Shaw</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Events and Periods as Concepts for Organizing Historical Knowledge</article-title>
          .
          <source>Ph.D. thesis</source>
          , University of California, Berkeley.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Fabian M Suchanek</surname>
            ,
            <given-names>Gjergji</given-names>
          </string-name>
          <string-name>
            <surname>Kasneci</surname>
            , and
            <given-names>Gerhard</given-names>
          </string-name>
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          .
          <year>2007</year>
          .
          <article-title>Yago: a core of semantic knowledge</article-title>
          .
          <source>In Proceedings of the 16th international conference on World Wide Web</source>
          , pages
          <fpage>697</fpage>
          -
          <lpage>706</lpage>
          , Banff. ACM.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          Lucien Tesnie`re.
          <year>1966</year>
          . E´le´ments de syntaxe structurale.
          <source>Klincksieck</source>
          , Paris, 2nd edition.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Yafang</given-names>
            <surname>Wang</surname>
          </string-name>
          , Mingjie Zhu, Lizhen Qu,
          <string-name>
            <given-names>Marc</given-names>
            <surname>Spaniol</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Gerhard</given-names>
            <surname>Weikum</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <string-name>
            <surname>Timely</surname>
            <given-names>YAGO</given-names>
          </string-name>
          :
          <article-title>Harvesting, querying, and visualizing temporal knowledge from wikipedia</article-title>
          .
          <source>In Proceedings of the 13th International Conference on Extending Database Technology, EDBT '10</source>
          , pages
          <fpage>697</fpage>
          -
          <lpage>700</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>Fei</given-names>
            <surname>Wu and Daniel</surname>
          </string-name>
          <string-name>
            <given-names>S.</given-names>
            <surname>Weld</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>Open information extraction using wikipedia</article-title>
          .
          <source>In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL '10</source>
          , pages
          <fpage>118</fpage>
          -
          <lpage>127</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>