<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>NLP &amp; DBpedia: An Upward Knowledge Acquisition Spiral</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Sebastian Hellmann</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Agata Filipowska</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Caroline Barriere</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pablo N. Mendes</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dimitris Kontokostas</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centre de Recherche Informatique de Montreal</institution>
          ,
          <addr-line>Montreal</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Instytut Informatyki Gospodarczej Sp. z o.o.</institution>
          ,
          <addr-line>ul. Rubiez 12G/6, 61-612 Poznan</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Kno.e.sis Center, Wright State University</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Poznan University of Economics, Faculty of Informatics and Electronic Economy, Department of Information Systems</institution>
          ,
          <addr-line>Al. Niepodleglosci 10, 61-875 Poznan</addr-line>
          ,
          <country country="PL">Poland</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>University of Leipzig, Institute of Computer Science, AKSW Group</institution>
          ,
          <addr-line>Augustusplatz 10, D-04009 Leipzig</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Recently, the DBpedia community has experienced an immense increase in activity and we believe, that the time has come to explore the connection between DBpedia &amp; Natural Language Processing (NLP) in a yet unprecedented depth. DBpedia has a long-standing tradition to provide useful data as well as a commitment to reliable Semantic Web technologies and living best practices. As the extraction of the Wikipedia's infoboxes by DBpedia matures, we can shift our focus to new challenges such as extracting information from an unstructured article text as well as becoming a testing ground for multilingual NLP methods. DBpedia has the potential to create an upward knowledge acquisition spiral as it provides a small amount of general knowledge allowing to process text, derive more knowledge, validate this knowledge and improve text processing methods. The goal of this workshop was to present existing research, systems and resources, but also to allow discussion about di erent points of convergence and divergence of the NLP and DBpedia community with a special focus on challenges that lie ahead. We would like to take part in the debate on how to use DBpedia for NLP and NLP for DBpedia.</p>
      </abstract>
      <kwd-group>
        <kwd>DBpedia</kwd>
        <kwd>Natural Language Processing</kwd>
        <kwd>RDF</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Communities interested in Natural Language Processing (NLP) and in the
Semantic Web, in particular DBpedia, come together to explore di erent ways of
collaborating, and helping each other, towards a common goal of understanding
and representing information.</p>
      <p>
        Resources such as DBpedia are a step towards a solution to the knowledge
acquisition bottleneck, so often mentioned in earlier days of NLP [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. A
prerequisite of text processing and understanding is the availability of knowledge
about words, concepts and ways of expressing information. But then, to acquire
such knowledge, we are required to automatically process text or immerse in
costly and error-prone manual knowledge engineering.
      </p>
      <p>Where formerly, there was a chicken and egg problem with a serious
bootstrapping issue, we now have structured data in DBpedia, which is readily
available to turn the bottleneck into an upward knowledge acquisition spiral { a small
amount of general knowledge allowing to process text, create more knowledge,
validate this knowledge and improve text processing for more acquisition (and
so on).</p>
      <p>The recent years have seen a major change, mostly through crowd-sourcing
for the construction of the largest encyclopaedic resource, Wikipedia. Although
rst, mainly made of unstructured data (paragraphs), the addition of infoboxes,
and the expansion of interest towards the Semantic Web, have led to DBpedia
{ one of the largest openly shared structured resource available today.</p>
      <p>
        However, any resource not curated nor scrutinized by experts will be prone
to noise, and that becomes a new and di erent challenge for NLP. Also, any
resource, even as large as DBpedia, is not complete. So far, mainly the
infoboxes, which are already semi-structured, are used to build the RDF repository.
But even then, Aprosio et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] (this volume) mention that more than 50% of
Wikipedia articles do not include an infobox. So if the article text is analysed,
the spiral can turn further, using DBpedia as input for the NLP process and
then create more RDF triples to add and integrate into DBpedia [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>This workshop's aim is right in the knowledge acquisition spiral, bringing
together researchers in both areas to see how NLP can bene t DBpedia and how
DBpedia can bene t NLP. The contributions in the workshop allow to highlight
multiple facets of this duality. In the remainder of this article, we discuss the
contributions to the NLP&amp;DBpedia workshop. Our main interest, however, are
the challenges that the readers can expect to stay unresolved, that is the many
interesting underlying issues brought forward by these articles. Another goal of
this workshop was to present existing research, systems and resources to allow
discussion about di erent points of convergence and divergence of the NLP and
DBpedia community. It is also interesting to illustrate when both communities
actually tackle very similar problems, with di erent approaches.</p>
    </sec>
    <sec id="sec-2">
      <title>Knowledge acquisition and structuring</title>
      <p>As we look at NLP and DBpedia, we see that NLP requires knowledge about
words, not only about concepts. Obviously the notion of labels exists in DBpedia,
but there is more to language than labels. Should this lexical information be
represented the same ways as conceptual information is?</p>
      <p>The separation between lexical, conceptual, terminological, encyclopaedic,
and other kinds of knowledge has been a debate for years. Can a single schema
allow all types of knowledge? Lexical approaches usually start from words, going
from a word to all its senses, and sometimes terminological approaches will start
from concepts, and de ning all the words that illustrate such concept. If DBpedia
is more concept-based, we can then wonder how lexical information would be</p>
      <sec id="sec-2-1">
        <title>6 http://lemurproject.org/clueweb09/</title>
        <p>attached to it, or a more general question of how lexical knowledge has its place
within the Semantic Web?</p>
        <p>
          [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] (this volume) present a lemon lexicon for DBpedia and discuss di erent
issues of lexicalization of conceptual structures.
        </p>
        <p>
          The BabelNet[
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] resource, resulting from a merge of WordNet [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] (a
widelyused lexical resource in NLP) and Wikipedia, is an example of a mixed-level
representation in which lexical, conceptual and encyclopaedic knowledge is
combined. BabelNet is used in the work of [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] (this volume) for the task of QALD
(Question Answering over Linked Data) as we will see in the next section. Also
[
          <xref ref-type="bibr" rid="ref27">27</xref>
          ] (this volume) talk of developing their own representation, SAR-Graphs
(Semantically Associated Relations Graphs) to express not only lexical knowledge,
but sentence-based knowledge, that is useful for verbalizing simple predicates but
also combined predicates (child of child, for example). These three contributions
stimulate a debate on the granularity of the representation of any language
resource. Such debate is present in corpus studies, where experts study the value of
not only terms, but also phrases (phraseology) in the understanding of language
use [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ].
4
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>NLP tasks and applications</title>
      <p>Although di erent tasks are mentioned in our workshop's contributions, three of
them are more prominent, that of NER (Named Entity Recognition), Relation
Extraction, and Question Answering over Linked Data (QALD).
4.1</p>
      <sec id="sec-3-1">
        <title>Named Entity Recognition</title>
        <p>
          Named Entity Recognition is de ned as the task of assigning a class to entities
found in a text, such as person, location, organization, date, etc. NER is a
well-recognized task in the NLP community since the beginning of the Message
Understanding Conferences (MUC) in 1987 (see [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] for a good overview of
information extraction and the early MUC conferences). Although not called as
such at the time, early work on information extraction looked at text to nd
Who did What When How discovering entities such as places, people and dates.
Extracted entities were not necessarily typed, or classi ed, but as information
extraction templates were used, such types were implicitly given by the roles the
entities lled (Agent, Place, Date).
        </p>
        <p>
          Later on, researchers, such as Sekine ([
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]) de ned a hierarchical schema
of classes for the NER task. Although, the more ne-grained the classes are,
however, the more di cult it is to obtain (or even measure) classi cation
results. Obviously, integration and comparison of these hierarchies can have high
complexity, if no reference hierarchy is agreed upon. One such reference
hierarchy is the recently created NERD ontology [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ], however, containing only 84
types7 which is coarse grained when compared to the over 500 DBpedia Ontology
classes8, which are used in [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] (this volume).
        </p>
        <p>
          As mentioned in [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] (this volume) Named Entity Disambiguation is a further
step towards identifying not only that an entity is a Person, but who this person
actually is by establishing a link to a more speci c reference id or URI in a
knowledge base. New names are given to the NED or NERD task, that of Entity
linking and "wiki ers" [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] (this volume) and the list of emerging tools, which
belong to this class of wiki ers is quite huge and growing steadily: Zemanta,
OpenCalais, Ontos, Evri, Extractiv, Alchemy API and many more9.
        </p>
        <p>
          Wikipedia (and therefore DBpedia) is limited to encyclopaedic knowledge,
but often terminological knowledge (how di erent terms describe di erent
domain speci c concepts) as well as lexical knowledge (common words) are available
for interlinking with text, thus resembling Word-Sense Disambiguation (WSD),
i.e. taking any word in a text and being able to connect the appropriate URI.
In [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] (this volume), both tasks (NED and WSD) are tackled using BabelNet.
4.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Relation extraction</title>
        <p>The task of relation extraction is sometimes seen as a step following that of
NER. After entities are extracted, it would be interesting to see how they are
related. But sometimes a more "template-like" strategy, as was suggested in early
information extraction is done. For example, a system would look for "merger"
relations between companies, to nd out which companies merged. In such case,
the relation is known in advance, and we look in text for both the relation and
the participants in such relation.</p>
        <p>
          Di erent types of relations have been investigated over the years, and as NLP
and DBpedia come closer, relations found in DBpedia tend to be used. [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] (this
volume) focus on ten di erent relations found in DBpedia. They identify such
relations in text through developed lexical extraction rules. The work of [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] (this
volume) focuses on seven di erent properties found in DBpedia. By properties,
they mean relations for which the subject is most likely a named entity, but the
object could be a literal, such as the property populationTotal. The line is fuzzy
between properties and relations (for example, both contributions mentioned
above use the birthDate as a relation to extract in text), and could bring an
interesting discussion and debate about this topic. The work of [
          <xref ref-type="bibr" rid="ref27">27</xref>
          ] (this volume)
does not target any speci c relation and is mostly about the development of
a representational schema (as mentioned before) for the English expression of
relations.
        </p>
        <p>
          The explicit expression of relations in text is a topic of interest in the NLP
community for a while. Di erent methods, either statistical [
          <xref ref-type="bibr" rid="ref25">25</xref>
          ] or pattern-based
        </p>
        <sec id="sec-3-2-1">
          <title>7 accessed Oct. 10th, 2013 http://nerd.eurecom.fr/ontology</title>
          <p>
            8 An up to date version can be downloaded fromhttp://mappings.dbpedia.org/
server/ontology/dbpedia.owl
9 http://en.wikipedia.org/wiki/Knowledge_extraction#Tools contains an
up-todate overview
are developed and experimented on [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ]. This is an interesting place for NLP and
the Semantic Web to meet as both communities are interested in nding links
between concepts and extract facts.
4.3
          </p>
        </sec>
      </sec>
      <sec id="sec-3-3">
        <title>Question Answering over Linked Data</title>
        <p>
          The tasks of Information Retrieval and Question Answering, within the NLP
community, provided some of the early attempts towards a more systematized
approach to making the eld of NLP grow. Those tasks encouraged the
development of challenges and competitions with common data (TREC, [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]) which
we discuss in the next section. The more recent task of Question Answering
over Linked Data10 is a very interesting task, certainly promoting a
communication and shared interest between the NLP and the Semantic Web community,
and also providing some early attempts within the Semantic Web community at
sharing data and evaluation standards.
        </p>
        <p>
          Three contributions look into QALD. The work of [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] (this volume), addresses
the task of QALD, with a particular strategy which involves NED and word sense
disambiguation, as we mentioned above. In [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] (this volume), the QALD task is
not just tackled, but they go further into the study of inconsistency detection
when gathering knowledge to answer questions. They look into English, German,
French and Italian chapters of DBpedia, and try to detect inconsistencies and
supporting evidence among the di erent answers. In [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] (this volume) the task
of QALD is not performed in itself, but it is mentioned as an extrinsic evaluation
of the coverage of the lemon lexicon, saying that the verbalizations found in the
lexicon cover many of the questions.
5
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Resources</title>
      <p>As most workshop contributions combine some techniques from NLP with the
Semantic Web, they talk about di erent resources that would be useful to
the community. We don't want to reinvent the wheel. Obviously, even if
alternative Semantic Web resources, such as Yago (http://www.mpi-inf.mpg.de/
yago-naga/yago/) and Freebase (http://www.freebase.com) exist, this
workshop focuses on DBpedia, which therefore is the Semantic Web resource most
referred to in the di erent contributions.</p>
      <p>
        On the NLP side, many frameworks and typical resources exist as well.
Wordnet (http://wordnet.princeton.edu/) for example, has been a resource much
used in the community for English. More recently, Babelnet (http://babelnet.
org), mentioned earlier, has been developed to merge Wikipedia and Wordnet.
Also GATE, an open source development framework (http://gate.ac.uk), is used
in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] (this volume).
      </p>
      <p>We can think that the primary resource for NLP is text, but which text?
There has been work in NLP on di erent types of texts, from news articles to
10 The rst challenge started in 2011, and information can be found at http://
greententacle.techfak.uni-bielefeld.de/~cunger/qald/
scienti c articles, to blogs, to web data. In the present day, textual content is
abundant, and the appropriateness of which text should be analysed for which
purpose is a pertinent question. In fact, if we see NLP for DBpedia, at the
service of expanding DBpedia, then the chosen text should be informative, factual,
accurate. As we saw above, mining Wikipedia for more information is an
interesting direction, it is not the only one. We also saw (with NELL) that a large
crawled Web corpus is a possibility, as it brings large coverage, but it can also
bring noise.</p>
      <p>
        Di erent ways of ltering noise exists, either by trying to evaluate the source
of information (trust), or by looking at how consistent or inconsistent di erent
information is, looking at redundancy and con icts. In [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] (this volume), the
general problem of inconsistent information is tackled.
      </p>
      <p>
        If we reverse our point of view and see DBpedia at the service of NLP, then
the text on which NLP techniques are used is quite arbitrary and depends on
further purposes and applications. For example, in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] (this volume), both news
articles and tweets are explored, which are two very di erent types of texts.
      </p>
      <p>
        The question of language is valid whether we are looking at "NLP for
DBpedia" or "DBpedia for NLP". In [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] (this volume), French text is analysed,
and in [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] (this volume), four di erent language chapters of DBpedia are used.
This is a minority of contributions exploring other languages than English. As
always, work on English is more prominent than that on other language, and it
brings awareness that it would be interesting for both communities to work on
di erent languages.
5.1
      </p>
      <sec id="sec-4-1">
        <title>Gold and silver standards</title>
        <p>
          The topic of evaluation is both an important one, and a much debated one. In
NLP, there has been a tendency in the past 15 years to perform experiments
for which there are well de ned gold standards and datasets. There has been an
increase in the number of competitions and challenges in many sub- elds of NLP,
such as automatic summarization [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], word-sense disambiguation [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], textual
entailment [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], etc.
        </p>
        <p>
          In the Semantic Web community, there is less of such rigid evaluation, as
the eld is younger than NLP, and is still looking at pushing the eld with
di erent ideas and concepts without imposing rigid evaluations. Certainly, one
of the purposes of this workshop was to start discussion towards bringing more
of gold standards and evaluation datasets into the community. Although there
are some competitions in other areas, such as the OAEI (Ontology Alignment
Evaluation Initiative11) which has been happening for a few years now, as well
as the QALD (see above) and the plethora of benchmarks for triplestores such
as the DBPSB (DBepdia SPARQL Benchmark [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]). In the eld of NER/NED,
there are not many datasets or gold standards and only few challenges. The work
of [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] (this volume) paves the way towards the standardization of NER and NED
benchmarking in an implemented benchmarking system.
11 http://oaei.ontologymatching.org/
        </p>
        <p>
          As a rst important step to develop such a gold standard, it is also good to
review and question existing work. The work of [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ] (this volume) is an extensive
comparison of NED benchmarks and characterizes them to see, if they could be
biased for particular types of algorithms, or types of test data. The contribution
therefore opens the debate as to how we should develop such benchmarks and
provides a solid foundation to built upon.
        </p>
        <p>
          When gold standards are hard (costly, time-consuming) to develop, it can
be interesting to develop silver standards that are the results of well-known
methods, or the combined results of di erent methods. Such standards do not
replace gold standards, but they at least give an indication of the direction of
progress for particular algorithms. One possibility when two communities come
together is to take the results of one to become the "silver standard" of the other.
[
          <xref ref-type="bibr" rid="ref18">18</xref>
          ] (this volume) describes such a silver standard and discusses its bene ts as
well as its limitations.
        </p>
        <p>
          In some work, such as [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] (this volume) and [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] (this volume), DBpedia's
network of relations is used as a gold standard in relation extraction. Also
Wikipedia/DBpedia entities have become the most predominant link targets
in NED. [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] reports of 7 out of 10 tools that attach Wikipedia/DBpedia URLs
as annotations (3 out of 10 for the DBpedia Ontology). Although this is an
interesting way to proceed, we can debate whether we are using gold or silver
standards and how to unify benchmarks for comparison.
6
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Summary</title>
      <p>We conclude by highlighting a few issues brought forward by the contributions in
this workshop. First, the selected papers discuss many problems that have been
recognized within the NLP community for a long time, but have only recently
been introduced to Semantic Web researchers. The main challenges here concern:
{ consensus upon annotation guidelines,
{ development of extraction rules and agreed upon hierarchies that may be
used to unify semantic enrichment and benchmarks,
{ identi cation of well-de ned tasks and problem classes,
{ transferability of NLP tasks, resources and tools to other research
communities (e.g. library and life sciences) as well as other languages and application
areas,
{ building practical resources and infrastructures, which do not target one
single research question, but can be exploited in a more universal manner
by NLP tools,
{ unlock higher layers of semantic annotation to enable state-of-the art
OWLbased reasoning on a combination of noisy NLP data and LOD and DBpedia
based knowledge structures.</p>
      <p>Second, and perhaps more importantly, new possibilities emerge from the
combination of the communities, and we hope to further push such possibilities
to have more NLP for DBpedia and more DBpedia for NLP, continuing the
knowledge spiral, and ghting together to open the knowledge acquisition
bottleneck. We hope that the readers of this volume will nd all papers interesting.
We invite you to join our community and attend future workshop editions.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments.</title>
      <p>We especially thank all contributors to DBpedia and the DBpedia
Internationalisation committee12. This work was supported by grants from the European
Union's 7th Framework Programme provided for the projects LOD2 (GA no.
257943) and GeoKnow (GA no. 318159).</p>
      <sec id="sec-6-1">
        <title>Programme Commitee</title>
        <p>We would like to thank all reviewers that have helped us and especially the
authors with their comments and feedback.
12 http://wiki.dbpedia.org/Internationalization</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>A. P.</given-names>
            <surname>Aprosio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Giuliano</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L. Alberto</given-names>
            <surname>Lavelli</surname>
          </string-name>
          .
          <article-title>Extending the Coverage of DBpedia Properties using Distant Supervision over Wikipedia</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          -25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>A.</given-names>
            <surname>Auger</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Barriere</surname>
          </string-name>
          . Probing Semantic Relations:
          <article-title>Exploration and identi - cation in specialized texts</article-title>
          .
          <source>John Benjamins, benjamins edition</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>E.</given-names>
            <surname>Cabrio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Cojan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Villata</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Gandon</surname>
          </string-name>
          .
          <article-title>Argumentation-based Inconsistencies Detection for Question-Answering over DBpedia</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          -25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>A.</given-names>
            <surname>Carlson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Betteridge</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Kisiel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Settles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. R. H.</given-names>
            <surname>Jr.</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T. M.</given-names>
            <surname>Mitchell</surname>
          </string-name>
          .
          <article-title>Toward an architecture for never-ending language learning</article-title>
          . In M. Fox and D. Poole, editors,
          <source>AAAI. AAAI Press</source>
          ,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>D.</given-names>
            <surname>Cristea</surname>
          </string-name>
          .
          <article-title>Textual entailment</article-title>
          .
          <source>Computational Linguistics</source>
          , (June):
          <volume>1140</volume>
          {
          <fpage>1143</fpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>M.</given-names>
            <surname>Dojchinovski</surname>
          </string-name>
          and
          <string-name>
            <given-names>T.</given-names>
            <surname>Kliegr</surname>
          </string-name>
          .
          <article-title>Datasets and GATE Evaluation Framework for Benchmarking Wikipedia-Based NER Systems</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          -25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>A.</given-names>
            <surname>Dutta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Meilicke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Niepert</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Ponzetto</surname>
          </string-name>
          . Integrating Open and Closed Information Extraction:
          <article-title>Challenges and First Steps</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          -25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>K.</given-names>
            <surname>Elbedweihy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Wrigley</surname>
          </string-name>
          , and
          <string-name>
            <given-names>F.</given-names>
            <surname>Ciravegna</surname>
          </string-name>
          .
          <article-title>Using BabelNet in Bridging the Gap Between Natural Language Queries and Linked Data Concepts</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          -25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9. C. Fellbaum, editor.
          <source>WordNet: an electronic lexical database</source>
          . MIT Press,
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>W. A.</given-names>
            <surname>Gale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. W.</given-names>
            <surname>Church</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Yarowsky</surname>
          </string-name>
          .
          <article-title>A method for disambiguating word senses in a large corpus</article-title>
          .
          <source>Computers and the Humanities</source>
          ,
          <volume>26</volume>
          (
          <issue>5-6</issue>
          ):
          <volume>415</volume>
          {
          <fpage>439</fpage>
          ,
          <year>1992</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>R.</given-names>
            <surname>Grishman</surname>
          </string-name>
          .
          <source>Information Extraction: Techniques and Challenges. New York, i(4):</source>
          <volume>10</volume>
          {
          <fpage>27</fpage>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>M.</given-names>
            <surname>Heder</surname>
          </string-name>
          and
          <string-name>
            <given-names>P. N.</given-names>
            <surname>Mendes</surname>
          </string-name>
          .
          <article-title>Round-trip semantics with sztakipedia and dbpedia spotlight</article-title>
          . In A.
          <string-name>
            <surname>Mille</surname>
            ,
            <given-names>F. L.</given-names>
          </string-name>
          <string-name>
            <surname>Gandon</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Misselis</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Rabinovich</surname>
          </string-name>
          , and S. Staab, editors,
          <source>WWW (Companion Volume)</source>
          , pages
          <fpage>357</fpage>
          {
          <fpage>360</fpage>
          . ACM,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>M. Morsey</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Lehmann</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Auer</surname>
          </string-name>
          , and A.
          <string-name>
            <surname>-C. Ngonga Ngomo. DBpedia SPARQL</surname>
          </string-name>
          <article-title>Benchmark { Performance Assessment with Real Queries on Real Data</article-title>
          .
          <source>In ISWC</source>
          <year>2011</year>
          ,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>R.</given-names>
            <surname>Navigli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Jurgens</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Vannella</surname>
          </string-name>
          . SemEval-2013
          <source>Task</source>
          <volume>12</volume>
          :
          <article-title>Multilingual Word Sense Disambiguation</article-title>
          .
          <source>In Proceedings of the 7th International Workshop on Semantic Evaluation SemEval 2013 in conjunction with the Second Joint Conference on Lexical and Computational Semantics SEM</source>
          <year>2013</year>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>R.</given-names>
            <surname>Navigli</surname>
          </string-name>
          and
          <string-name>
            <given-names>S. P.</given-names>
            <surname>Ponzetto</surname>
          </string-name>
          . Babelnet:
          <article-title>The automatic construction, evaluation and application of a wide-coverage multilingual semantic network</article-title>
          .
          <source>Artif</source>
          . Intell.,
          <volume>193</volume>
          :
          <fpage>217</fpage>
          {
          <fpage>250</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>K.</given-names>
            <surname>Nebhi</surname>
          </string-name>
          .
          <article-title>A Rule-Based Relation Extraction System using DBpedia and Syntactic Parsing</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          -25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17. T. Okumura,
          <string-name>
            <given-names>Manabu</given-names>
            <surname>Fukusima</surname>
          </string-name>
          and
          <string-name>
            <given-names>H.</given-names>
            <surname>Nanba</surname>
          </string-name>
          .
          <article-title>Text Summarization Challenge 2 - Text summarization evaluation at NTCIR Workshop 3</article-title>
          .
          <source>In Proceedings of the HLT-NAACL 03 Text Summarization Workshop</source>
          , pages
          <volume>49</volume>
          {
          <fpage>56</fpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>H.</given-names>
            <surname>Paulheim. DBpediaNYD</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A Silver</given-names>
            <surname>Standard</surname>
          </string-name>
          <article-title>Benchmark Dataset for Semantic Relatedness in DBpedia</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          -25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <given-names>H.</given-names>
            <surname>Paulheim</surname>
          </string-name>
          and
          <string-name>
            <given-names>S. P.</given-names>
            <surname>Ponzetto</surname>
          </string-name>
          .
          <article-title>Extending DBpedia with Wikipedia List Pages</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          - 25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20. G. Rizzo,
          <string-name>
            <given-names>R.</given-names>
            <surname>Troncy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hellmann</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M.</given-names>
            <surname>Bruemmer</surname>
          </string-name>
          .
          <article-title>NERD meets NIF: Lifting NLP extraction results to the linked data cloud</article-title>
          .
          <source>In LDOW</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <given-names>S.</given-names>
            <surname>Sekine</surname>
          </string-name>
          and
          <string-name>
            <given-names>C.</given-names>
            <surname>Nobata</surname>
          </string-name>
          .
          <article-title>De nition, dictionaries and tagger for extended named entity hierarchy</article-title>
          . In A. Zampolli and M. T. Lino, editors,
          <source>Proceedings of the Language Resources and Evaluation Conference LREC</source>
          , pages
          <year>1977</year>
          {
          <year>1980</year>
          .
          <string-name>
            <given-names>European</given-names>
            <surname>Language Resources Association</surname>
          </string-name>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <given-names>K. Sparck</given-names>
            <surname>Jones</surname>
          </string-name>
          .
          <source>Further re ections on TREC. Information Processing &amp; Management</source>
          ,
          <volume>36</volume>
          (
          <issue>1</issue>
          ):
          <volume>37</volume>
          {
          <fpage>85</fpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <given-names>N.</given-names>
            <surname>Steinmetz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Knuth</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Sack</surname>
          </string-name>
          .
          <article-title>Statistical Analyses of Named Entity Disambiguation Benchmarks</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          -25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <given-names>M.</given-names>
            <surname>Stubbs</surname>
          </string-name>
          .
          <article-title>An example of frequent English phraseology: Distribution, structures and functions</article-title>
          . In R. Facchinetti, editor,
          <source>Corpus linguistics 25 years on, number 62</source>
          , pages
          <fpage>89</fpage>
          {
          <fpage>105</fpage>
          385. Rodopi,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>P. D. Turney</surname>
            and
            <given-names>M. L.</given-names>
          </string-name>
          <string-name>
            <surname>Littman</surname>
          </string-name>
          .
          <article-title>Corpus-based Learning of Analogies and Semantic Relations</article-title>
          .
          <source>Machine Learning</source>
          ,
          <volume>60</volume>
          (
          <issue>1-3</issue>
          ):1{
          <issue>3</issue>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>C. Unger</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Mccrae</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Walter</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Winter</surname>
            , and
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Cimiano</surname>
          </string-name>
          .
          <article-title>A lemon lexicon for DBpedia</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          -25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <given-names>H.</given-names>
            <surname>Uszkoreit</surname>
          </string-name>
          and
          <string-name>
            <given-names>F.</given-names>
            <surname>Xu</surname>
          </string-name>
          .
          <article-title>From Strings to Things SAR-Graphs: A New Type of Resource for Connecting Knowledge and Language</article-title>
          .
          <source>In Proceedings of 1st International Workshop on NLP and DBpedia, October</source>
          <volume>21</volume>
          -25, Sydney, Australia, volume
          <volume>1064</volume>
          <source>of NLP &amp; DBpedia</source>
          <year>2013</year>
          , Sydney, Australia,
          <year>October 2013</year>
          . CEUR Workshop Proceedings.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>