<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Extending an Information Retrieval System through Time Event Extraction</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Pierpaolo Basile</string-name>
          <email>fpierpaolo.basile@uniba.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Annalina Caputo</string-name>
          <email>annalina.caputo@uniba.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giovanni Semeraro</string-name>
          <email>giovanni.semeraro@uniba.it</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lucia Siciliani</string-name>
          <email>siciliani.lu@gmail.comg</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science - University of Bari Aldo Moro Via E. Orabona</institution>
          ,
          <addr-line>4 - 70125 Bari</addr-line>
          ,
          <country country="IT">ITALY</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we propose an innovative Information Retrieval system able to manage temporal information. The system allows temporal constraints in a classical keyword-based search. Information about temporal events is automatically extracted from text at indexing time and stored in an ad-hoc data structure exploited by the retrieval module for searching relevant documents. Our system can search textual information that refers to specific period of times. We perform an exploratory case study indexing all Italian Wikipedia articles.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Identifying specific pieces of information related to a particular time period is a
key task for searching past events. Although this task seems to be marginal for
Web users [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], many search domains, like enterprise search, or lately developed
information access tasks, such as Question Answering [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] and Entity Search,
would benefit from techniques able to handle temporal information.
The capability of extracting and representing temporal events mentioned in a text
can enable the retrieval of documents relevant for a given topic pertaining to a
specific time. Nonetheless, the notion of temporal in the retrieval context has
often being associated with the dynamic dimension of a piece of information, i.e.
how it changes over time, in order to promote freshness in results. Such kind of
approaches focus on when the document was published (timestamp) rather than
the temporal event mentioned in its content (focus time). While traditional search
engines take into account temporal information related to a document as a whole,
our search engine aims to extract and index single events occurring in the texts,
and to enable the retrieval of topics related to specific temporal events mentioned
in the documents. In particular, we are interested in retrieving documents that
are relevant for the user query, and also match some temporal constraints. For
example, the user could be interested in a particular topic —strumenti musicali
(musical instrument)— related to a specific time period —inventati tra il 1300 ed
il 1500 (invented between 1300 and 1500)—.
      </p>
      <p>However, looking for happenings in a specific time span requires further, and
more advanced, techniques able to treat temporal information. Therefore, our goal
is to merge features of both Information Retrieval (IRS) and Temporal Extraction
Systems (TES). While an IRS allows us to handle and access the information
included in texts, TES locate temporal expressions. We define this kind of system
“Time-Aware IR” (TAIR).</p>
      <p>
        In the past, several attempts have been made to exploit temporal information in
IR systems [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], with an up-to-date literature review and categorization provided
in [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Most of these approaches exploit time information related to the document
in order to improve the ranking (recent documents are more relevant) [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], cluster
documents using temporal attributes [
        <xref ref-type="bibr" rid="ref1 ref3">1,3</xref>
        ], or exploit temporal information for
effectively present documents to the user [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. However, just a handful of work have
focused on temporal queries, that is the capability of querying a collection with
both free text and temporal expression [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Alonso et al. pointed out as this kind
of tasks needs the combination of results from both the traditional keyword-based
and the temporal retrieval that can give rise to two different result sets.
Vandenbussche and Teisse`dre [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ] dealt with temporal search in the context of both the
Web of Content and the Web of Data, but differently from our system, they
relied on an ontology of time for temporal queries [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. Kanhabua and Nørva˚g [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]
defined semantic- and temporal-based features for a learning to rank approach
by extracting named entities and temporal events from the text. Similarly to our
approach, Arikan et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] considered the query as composed by a keyword and
a temporal part. Then, the two queries were addressed by computing two
different language model-based weights. Exploiting a similar model, Berberich et
al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] developed a framework for dealing with uncertainty in temporal queries.
However, both approaches drawn the probability of the temporal query out of the
whole document, thus neglecting the pertinence of temporal events at a sentence
level. In order to overcome such a limitation, Matthews et al. [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] introduced
two different types of indexes, at a document and a sentence level, with the latter
associated with content date.
      </p>
      <p>
        Preliminary to indexing and retrieval, the Information Extraction phase aims to
extract temporal information, and its associated events, from text. In this area
[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], several approaches aim at building structured knowledge sources of
temporal events. In [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] the authors describe an extension of the YAGO knowledge
base, in which entities, facts, and events are anchored in both time and space.
Other work exploit Wikipedia to extract temporal events, such as those reported
in [
        <xref ref-type="bibr" rid="ref10 ref14 ref25">10, 14, 25</xref>
        ]. Temporal extraction systems can locate temporal expressions and
normalize them making this information available for further processing.
Currently, there are different tools that can make this kind of analysis on documents,
like SUTime [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] or HeidelTime [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] and other systems which took part in
TempEval evaluation campaigns. Temporal extraction is not the main focus of this
paper, then we remand the interested reader to the TempEval description task
papers [
        <xref ref-type="bibr" rid="ref22 ref24">22,24</xref>
        ] for a wider overview of the latest state-of-the-art temporal extraction
systems.
      </p>
      <p>The paper is organized as follows: Section 2 provides details about the model
behind our TAIR system, while Section 3 describes the implementation of our
model. Section 4 reports some use cases of the TAIR system which show the
potential of our approach, while Section 5 closes the paper.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Time-Aware IR Model</title>
      <p>
        A TAIR model should be able to tackle some problems that emerge from temporal
search [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], that is: 1) the extraction and normalization of temporal references,
2) the representation of the temporal expressions associated to documents, and 3)
the ranking under the constraint of keyword- and temporal-queries.
Our TAIR model consists of three main components responsible to deal with
these issues, as sketched in Figure 1:
Text processing It automatically extracts time expressions from text. The
extracted expressions are normalized in a standard format and sent to the
indexing component;
Indexing This component is dedicated to index both textual and temporal
information. During the indexing, text fragments are linked to time expressions.
The idea behind this approach is that the context of a temporal expression is
relevant;
Search It analyzes the user query composed by both keywords and temporal
constraints, and performs the search over the index in order to retrieve
relevant information.
2.1
      </p>
      <sec id="sec-2-1">
        <title>Text Processing Component</title>
        <p>
          Given a document as input, the text processing component provides as output
the normalized temporal expressions extracted from the text, along with
information about positions in which the temporal expressions are found. For this
purpose we adopt a standard annotation language for temporal expressions called
TimeML [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ]. We are interested in expressions tagged with the TIMEX3 tag that
is used to mark up explicit temporal expressions, such as times, dates and
durations. In TIMEX3 the value of the temporal expression is normalized according
to 2002 TIDES guideline, an extension of the ISO-8601 standard, and is stored
in an attribute called value. An example of TIMEX3 annotation for the sentence
“before the 23th May 1980” is reported below:
&lt;TimeML&gt;
before the
&lt;TIMEX3 tid="t3" type="DATE" value="1980-05-23"&gt;
23th May 1980
&lt;/TIMEX3&gt;
&lt;/TimeML&gt;
Where tid is a unique identifier, type can assume one of the types between:
DATE, TIME, DURATION, and SET, while the value attribute contains the
temporal information that varies accordingly to the type.
        </p>
        <p>ISO-8601 normalizes temporal expressions in several formats. For example, “May
1980” is normalized as “1980-05”, while “23th May 1980” as “1980-05-23”. We
choose to normalize all dates using the pattern yyyy-mm-dd. All temporal
expressions not compliant to the pattern, such as “1980”, must be normalized retaining
the lexicographic order between dates. Our solution consists in normalizing all
temporal expressions in the form of yyyy or yyyy-mm to the last day of the
previous year or month, respectively. In our previous example, the expression “1980”
is normalized as 19791231. Similarly, the expression “1980-05” is normalized
as “1980-04-30”. Moreover, the text processing component applies several
normalization rules to correctly identify seasons, for example the TimeML tag for
Spring “yyyy-SP” is normalized as “yyyy-03-20”.</p>
        <p>Using the correct normalization, the order between periods is respected. In
conclusion the text processing component extracts temporal information and
correctly normalized them to make different time periods comparable.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>The Indexing Component</title>
        <p>After the text processing step, we need to store and index data. In our model we
propose to store both documents and temporal expressions in three separated data
indexes, as reported in Figure 1.</p>
        <p>The first index (docrep) stores the text of each document (without processing)
with an id, a numeric value that unequivocally identifies the document. This
index is used to store the document content only for the presentation purpose. The
second index (doc) is a traditional inverted index in which the text of each
document is indexed and used for keyword-based search. Finally, the last index (time)
stores temporal expressions found in each document. For each temporal
expression, we store the following information:
– The document id;
– The normalized value of the time expression according to the normalization
procedure described in Section 2.1;
– The start and end offset of the expression in the document, useful for
highlighting;
– The context of the expression: the context is defined by taking all the words
that can be found within n characters before and after the time expression.
The context is indexed and used by the search component during the retrieval
step. The idea is to keep trace of the context where the time expression
occurred. The context is tokenized and indexed and exploited in conjunction
with the keyword-based search, as we explained in Section 2.3.</p>
        <p>It is important to note that a document could have many temporal expressions,
for each of these an entry in the time index is created. For example, given the</p>
        <p>Wikipedia page in Figure 2, we store its whole content as reported in Table 1a,
while we tokenize and index the page as shown in Table 1b. The most interesting
part of the indexing step is the storage of temporal expressions. As depicted in
Table 1c, for each temporal expression we store the normalized time value, in
this case “13961231”, and the start and end offset of the expression in the text.
Finally, we tokenize and index the context in which the expression occurs. In
Table 1c, in italics is reported the left context, while the right context is reported
in bold. Examples are reported according to the Italian version of Wikipedia, but
the indexing step is language independent.
2.3</p>
      </sec>
      <sec id="sec-2-3">
        <title>The Search Component</title>
        <p>The search component retrieves relevant documents according to the user query
q containing temporal constraints. For this reason we need to make temporal
expressions in the query compliant with the expressions stored in the index. The
query is processed by the Text Component in order to extract and normalize the
time expressions.</p>
        <p>The query q is represented by two parts: qk contains keywords, while qt only the
normalized time expressions. qk is used to retrieve from the doc index a first
results set RSdoc. Thus, both qk and qt are used to query the time index producing
the results set RStime. The search in time index is limited to those documents
belonging to RSdoc. In RStime, text fragments have to match the time constraints
expressed in qt, while the matching with the keyword-based query qk is optional.
The optional matching with qk has the effect of promoting those contexts that
satisfy both the temporal constraints and the query topics, while not completely
removing poorly matching results. The motivation behind this approach is twofold:
through RSdoc we retrieve those documents relevant for the query topic, while
RStime contains the text fragments that match the time query qt and are related
to the query topic.</p>
        <p>For example given the query q =“clavicembalo [1300 TO 1400]”, we identify
the two fields: qk =“clavicembalo” and qt = [12991231 TO 13991231]. It is</p>
        <sec id="sec-2-3-1">
          <title>F ield V alue</title>
          <p>ID 42
Content Con il termine
clavicembalo (altrimenti detto
gravicembalo, arpicordo,
cimbalo, cembalo) si indica una
famiglia di strumenti
musicali a corde [...]
(a) docrep index.</p>
        </sec>
        <sec id="sec-2-3-2">
          <title>F ield V alue</title>
          <p>ID 42
Content f‘Con’, ‘il’, ‘termine’,
‘clavicembalo’,
‘altrimenti’, ‘detto’,
‘gravicembalo’, ‘arpicordo’,
‘cimbalo’, ‘cembalo’, ‘si’,
‘indica’, ‘una’, ‘famiglia’,
‘di’, ‘strumenti’,
‘musicali’, ‘a’, ‘corde’ [...]
g
(b) doc index.</p>
        </sec>
        <sec id="sec-2-3-3">
          <title>F ield V alue</title>
          <p>ID 42
Time 13961231
Start Offset 350
End Offset 354
Context f‘Il’, ‘termine’, ‘stesso’, ‘che’, ‘compare’, ‘per’, ‘la’,
‘prima’, ‘volta’, ‘in’, ‘un’, ‘documento’, ‘del’, ‘deriva’,
‘dal’, ‘latino’, ‘clavis’, ‘chiave’ [...] g</p>
          <p>(c) time index.</p>
          <p>Table 1: The three indices used by the system.
important to underline that in this example we adopted a particular syntax to
identify range queries, more details about the system implementation are reported
in Section 3.</p>
          <p>
            The retrieval step produces two results sets: RSdoc and RStime. Considering the
query q in the previous example: RSdoc contains the doc 42 with a relevance
score sdoc. While the results set RStime contains the temporal expression
reported in Table 1c with a score stime. The last step is to combine the two results
sets. The idea is to promote text fragments in RStime that comes from
documents that belong to RSdoc. We simply boost the score of each result in RStime
multiplying its score by the score assigned to its origin document in RSdoc. In
our example the temporal expression occurring in RStime obtains a final score
computed as: sdoc stime. We have chosen to boost score rather than linearly
combine them, in this way we avoid the use of combination parameters.
Finally, we sort the re-ranked RStime and provide it to the user as final result of
the search. It is important to underline that our system does not produce a list of
document as a classical search engine does, but we provide all the text passages
that are both relevant for the query and compliant to temporal constraints.
We implemented our TAIR model in a freely available system1 as an open-source
software under the GNU license V.3. The system is developed in JAVA and
extends the indexing and search open-source API Apache Lucene2.
The text processing component is based on the HeidelTime tool3 [
            <xref ref-type="bibr" rid="ref21">21</xref>
            ] to extract
temporal information. We adopt this tool for two reasons: 1) it obtained good
performance in the TempEval-3 task, and 2) it is able to analyze text written in
several languages including the Italian. HeidelTime is a rule based system that
can be extended to support other languages or specific domains.
          </p>
          <p>Our system provides all the expected functionalities: text analysis, indexing and
search. The query language supports all operators provided by the Lucene query
syntax4. Moreover the temporal query qt can be formulated using natural time
expressions, for example “12 May 2014” or “yesterday”. The search component
tries to automatically translate the user query in the proper time expressions.
However, the user can directly formulate qt using normalized time expressions
and query operators. Table 2 shows some time operators.</p>
          <p>Currently the system does not provide a GUI for searching and visualizing the
results, but it is designed as an API. As future works we plan to extend the API
with REST Web functionalities.
4</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Use case</title>
      <p>We decided to set up a case study to show the potentialities of the proposed IR
framework. The case study involves the indexing of a large collection of
docu1 https://github.com/pippokill/TAIR
2 http://lucene.apache.org/
3 https://code.google.com/p/heideltime/
4 http://lucene.apache.org/core/4_8_1/queryparser/org/apache/
lucene/queryparser/classic/package-summary.html
ments and a set of example queries exploiting specific scenarios in which
temporal expressions play a key role. Moreover, another goal is to provide performance
information about the system in terms of indexing and query time, and index
space.</p>
      <p>We propose an exploratory use case indexing all Italian Wikipedia articles. Our
choice is based on the fact that Wikipedia is freely available and contains millions
of documents with many temporal events. We need to set some parameters: we
index only documents with at least 4,000 characters, remove special pages (e.g.
category pages), we set the context size in temporal index to 256 characters.
We perform the experiment on a virtual machine with four virtual cores and 32GB
of RAM. Table 3 reports some statistics related to the indexing step. The indexing
time is very high due to the complexity of the temporal extraction algorithm and
the huge number of documents. We speed up the temporal event extraction
implementing a multi threads architecture, in particular in this evaluation we enable
four threads for the extraction.</p>
      <p>One of the most appropriate scenarios consists in finding events that happened in
a specific date. For example, one query could be interested in listing all events
happened on 29 April 1981. In this case the time query is “19810429” while the
keyword query is empty. The first three results are shown in Table 4.</p>
      <p>We report in bold the temporal expressions that match the query. It is important
to note that in the first result the year “1981” appears distant from both the month
and the day, but the Text Processing component is able to correctly recognize and
normalize the date.</p>
      <p>Another interesting scenario is to find events related to a specific topic in a
particular time period. For example, Table 5 reports the first three results for the
query: “terremoti tra il 1600 ed il 1700” (earthquakes between 1600 and 1700).
This query is split in its keyword qk =“terremoti” (earthquakes) and temporal
component qt = [15991231 TO 16991231].</p>
      <p>As reported in Table 6, the first two results regard events whose time interval
encompasses the time expressed in the query, since they took place in 1984, while
the third result shows an event that completely fulfil the time requirements
expressed in the temporal query.</p>
      <p>Result Rank Wikipedia page
1 Paul Breitner
2
3</p>
      <p>Time Context
nel 1981, richiamato da Jupp Derwall,
nel frattempo divenuto nuovo
commissario tecnico della Germania Ovest, e
con il quale aveva comunque avuto
accese discussioni a distanza. Il “nuovo
debutto” avviene ad Amburgo il 29
aprile contro l’Austria.
...E tu vivrai nel terrore! Warbeck e Catriona McColl, presente
L’aldila` nei contenuti speciali del DVD edito
dalla NoShame. Accoglienza. Il film
usc`ı in Italia il 29 aprile 1981 e incasso`
in totale 747.615.662 lire. Distribuito
per i mercati esteri dalla VIP
International, ottenne un ottimo successo
RCS Media Group L’operazione venne perfezionata il 29
aprile 1981. Quel giorno una societa`
dell’Ambrosiano (quindi di Calvi), la
“Centrale Finanziaria S.p.A.” effettu o`
l’acquisto del 40% di azioni Rizzoli</p>
      <p>Table 4: Results for the query “19810429”
Result Rank Wikipedia page Time Context
1 Terremoto della Cal- Il terremoto dell’8 giugno 1638 fu un
abria dell’8 giugno disastroso terremoto che colp`ı la
Cal1638 abria, in particolare il Crotonese e parte
del territorio gia` colpito nei giorni 27 e
28 marzo del 1638
2 Eruzione dell’Etna del 1669 10 marzo - M = 4.8 Nicolosi
1669 Terremoto con effetti distruttivi nel
catanese in particolare a Nicolosi in
seguito all’eruzione dell’Etna conosciuta
come Eruzione dell’Etna del 1669. Il 25
febbraio e l’8 e 10 marzo del 1669 una
serie di violenti terremoti.
3 Terremoto del Val di l’evento catastrofico di maggiori
diNoto del 1693 mensioni che abbia colpito la Sicilia
orientale in tempi storici.Il terremoto
del 9 Gennaio 1693
Table 5: Results for the query “earthquakes between 1600 and 1700”
We proposed a “Time-Aware” IR system able to extract, index, and retrieve
temporal information. The system expands a classical keyword-based search through
temporal constraints. Temporal expressions, automatically extracted from
documents, are indexed through a structure that enables both keyword- and
timematching. As a result, TAIR retrieves a list of text fragments that match the
temporal constraints, and are relevant for the query topic. We proposed a preliminary
case study indexing all the Italian Wikipedia and described some retrieval
scenarios which would benefit from the proposed IR model.</p>
      <p>As future work we plan to improve both recognition and normalization of time
expressions, extending some particular TimeML specifications that in this
preliminary work were not taken into account during the normalization process.
Moreover, we will perform a deep “in-vitro” evaluation on a standard document
collection.</p>
    </sec>
    <sec id="sec-4">
      <title>Acknowledgements</title>
      <p>This work fulfils the research objectives of the projects PON 01 00850
ASKHealth (Advanced System for the interpretation and sharing of knowledge in
health care) and PON 02 00563 3470993 project “VINCENTE - A Virtual
collective INtelligenCe ENvironment to develop sustainable Technology
Entrepreneurship ecosystems” funded by the Italian Ministry of University and Research
(MIUR).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Alonso</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gertz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Clustering of Search Results Using Temporal Attributes</article-title>
          .
          <source>In: Proceedings of the 29th annual international ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          . pp.
          <fpage>597</fpage>
          -
          <lpage>598</lpage>
          . ACM (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Alonso</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gertz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baeza-Yates</surname>
          </string-name>
          , R.:
          <article-title>On the Value of Temporal Information in Information Retrieval</article-title>
          .
          <source>SIGIR Forum</source>
          <volume>41</volume>
          (
          <issue>2</issue>
          ),
          <fpage>35</fpage>
          -
          <lpage>41</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Alonso</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gertz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Baeza-Yates</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :
          <article-title>Clustering and Exploring Search Results Using Timeline Constructions</article-title>
          .
          <source>In: Proceedings of the 18th ACM Conference on Information and Knowledge Management</source>
          . pp.
          <fpage>97</fpage>
          -
          <lpage>106</lpage>
          . CIKM '09,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Alonso</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          , Stro¨tgen, J.,
          <string-name>
            <surname>Baeza-Yates</surname>
            ,
            <given-names>R.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gertz</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Temporal Information Retrieval: Challenges and Opportunities</article-title>
          .
          <source>In: Proceedings of the 1st International Temporal Web Analytics Workshop (TWAW</source>
          <year>2011</year>
          ). vol.
          <volume>11</volume>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>8</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Arikan</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bedathur</surname>
            ,
            <given-names>S.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berberich</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Time Will Tell: Leveraging Temporal Expressions in IR</article-title>
          . In: Baeza-Yates,
          <string-name>
            <given-names>R.A.</given-names>
            ,
            <surname>Boldi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Ribeiro-Neto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.A.</given-names>
            ,
            <surname>Cambazoglu</surname>
          </string-name>
          , B.B. (eds.)
          <source>Proceedings of the 2ND International Conference on Web Search and Web Data Mining, WSDM</source>
          <year>2009</year>
          , Barcelona, Spain, February 9-
          <issue>11</issue>
          ,
          <year>2009</year>
          . ACM (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Berberich</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bedathur</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Alonso</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          , G.:
          <article-title>A Language Modeling Approach for Temporal Information Needs</article-title>
          .
          <source>In: Proceedings of the 32Nd European Conference on Advances in Information Retrieval</source>
          . pp.
          <fpage>13</fpage>
          -
          <lpage>25</lpage>
          . ECIR'
          <year>2010</year>
          , Springer-Verlag (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Campos</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dias</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jorge</surname>
            ,
            <given-names>A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jatowt</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Survey of Temporal Information Retrieval and Related Applications</article-title>
          .
          <source>ACM Computing Surveys</source>
          <volume>47</volume>
          (
          <issue>2</issue>
          ),
          <volume>15</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
          :
          <fpage>41</fpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <issue>8</issue>
          .
          <string-name>
            <surname>Chang</surname>
            ,
            <given-names>A.X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manning</surname>
          </string-name>
          , C.D.:
          <article-title>SUTime: A library for recognizing and normalizing time expressions</article-title>
          .
          <source>In: LREC</source>
          . pp.
          <fpage>3735</fpage>
          -
          <lpage>3740</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Elsas</surname>
            ,
            <given-names>J.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dumais</surname>
          </string-name>
          , S.T.:
          <article-title>Leveraging Temporal Dynamics of Document Content in Relevance Ranking</article-title>
          .
          <source>In: Proceedings of the 3rd ACM International Conference on Web Search and Data Mining</source>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          . WSDM '10,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Hienert</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luciano</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Extraction of Historical Events from Wikipedia</article-title>
          .
          <source>In: Proceedings of the First International Workshop on Knowledge Discovery and Data Mining Meets Linked Open Data</source>
          . pp.
          <fpage>25</fpage>
          -
          <lpage>36</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Hobbs</surname>
            ,
            <given-names>J.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>An Ontology of Time for the Semantic Web</article-title>
          .
          <source>ACM Transactions on Asian Language Information Processing (TALIP) - Special Issue on Temporal Information Processing</source>
          <volume>3</volume>
          (
          <issue>1</issue>
          ),
          <fpage>66</fpage>
          -
          <lpage>85</lpage>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Hoffart</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suchanek</surname>
            ,
            <given-names>F.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Berberich</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          , G.:
          <article-title>YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia</article-title>
          .
          <source>Artificial Intelligence</source>
          <volume>194</volume>
          ,
          <fpage>28</fpage>
          -
          <lpage>61</lpage>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Kanhabua</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          , Nørva˚g, K.:
          <article-title>Learning to Rank Search Results for Timesensitive Queries</article-title>
          .
          <source>In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management</source>
          . pp.
          <fpage>2463</fpage>
          -
          <lpage>2466</lpage>
          . CIKM '12,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Kuzey</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weikum</surname>
          </string-name>
          , G.:
          <article-title>Extraction of temporal facts and events from Wikipedia</article-title>
          .
          <source>In: Proceedings of the 2nd Temporal Web Analytics Workshop</source>
          . pp.
          <fpage>25</fpage>
          -
          <lpage>32</lpage>
          . ACM (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Ling</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Weld</surname>
            ,
            <given-names>D.S.</given-names>
          </string-name>
          :
          <article-title>Temporal Information Extraction</article-title>
          .
          <source>In: Proceedings of the 24th Conference on Artificial Intelligence (AAAI</source>
          <year>2010</year>
          ). Atlanta, GA. (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Matthews</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tolchinsky</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blanco</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Atserias</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mika</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaragoza</surname>
          </string-name>
          , H.:
          <article-title>Searching through time in the New York Times</article-title>
          .
          <source>In: Proceedings of the Fourth Workshop on Human-Computer Interaction and Information Retrieval (HCIR 10)</source>
          . pp.
          <fpage>41</fpage>
          -
          <lpage>44</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Matthews</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tolchinsky</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blanco</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Atserias</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mika</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zaragoza</surname>
          </string-name>
          , H.:
          <article-title>Searching through time in the New York Times</article-title>
          .
          <source>In: Proceedings of the 4th Workshop on Human-Computer Interaction and Information Retrieval</source>
          ,
          <source>HCIR Challenge</source>
          <year>2010</year>
          . pp.
          <fpage>41</fpage>
          -
          <lpage>44</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Nunes</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ribeiro</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>David</surname>
          </string-name>
          , G.:
          <article-title>Use of temporal expressions in web search</article-title>
          .
          <source>In: Proceedings of the IR Research, 30th European Conference on Advances in Information Retrieval</source>
          , pp.
          <fpage>580</fpage>
          -
          <lpage>584</lpage>
          . ECIR'
          <volume>08</volume>
          , Springer-Verlag (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Pustejovsky</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Castano</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ingria</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sauri</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gaizauskas</surname>
            ,
            <given-names>R.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Setzer</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          :
          <source>TimeML: Robust Specification of Event and Temporal Expressions in Text. New Directions in Question Answering</source>
          <volume>3</volume>
          ,
          <fpage>28</fpage>
          -
          <lpage>34</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20. Saur´ı, R.,
          <string-name>
            <surname>Knippen</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verhagen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pustejovsky</surname>
          </string-name>
          , J.:
          <article-title>Evita: A Robust Event Recognizer for QA Systems</article-title>
          .
          <source>In: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing</source>
          . pp.
          <fpage>700</fpage>
          -
          <lpage>707</lpage>
          . ACL (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21. Stro¨ tgen, J.,
          <string-name>
            <surname>Zell</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gertz</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>HeidelTime: Tuning English and Developing Spanish Resources for TempEval-3</article-title>
          .
          <source>In: 2nd Joint Conference on Lexical and Computational Semantics (*SEM)</source>
          , Volume
          <volume>2</volume>
          :
          <source>Proceedings of the 7th International Workshop on Semantic Evaluation</source>
          . pp.
          <fpage>15</fpage>
          -
          <lpage>19</lpage>
          . ACL (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>UzZaman</surname>
          </string-name>
          , N.,
          <string-name>
            <surname>Llorens</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Derczynski</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Allen</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verhagen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pustejovsky</surname>
          </string-name>
          , J.:
          <article-title>Semeval-2013 task 1: Tempeval-3: Evaluating time expressions, events, and temporal relations</article-title>
          .
          <source>In: 2nd Joint Conference on Lexical and Computational Semantics (*SEM)</source>
          , Volume
          <volume>2</volume>
          :
          <source>Proceedings of the 7th International Workshop on Semantic Evaluation</source>
          . pp.
          <fpage>1</fpage>
          -
          <lpage>9</lpage>
          . ACL (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Vandenbussche</surname>
          </string-name>
          , P.Y., Teisse`dre, C.:
          <article-title>Events Retrieval Using Enhanced Semantic Web Knowledge</article-title>
          . In: Workshop DeRIVE 2011 (
          <article-title>Detection, Representation, and Exploitation of Events in the Semantic Web) in cunjunction with 10th</article-title>
          <source>International Semantic Web Conference 2011 (ISWC</source>
          <year>2011</year>
          )
          <article-title>(</article-title>
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Verhagen</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sauri</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Caselli</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pustejovsky</surname>
          </string-name>
          , J.: SemEval-2010
          <source>Task 13: TempEval-2. In: Proceedings of the 5th International Workshop on Semantic Evaluation</source>
          . pp.
          <fpage>57</fpage>
          -
          <lpage>62</lpage>
          . ACL (
          <year>July 2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Whiting</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , Jose, J.,
          <string-name>
            <surname>Alonso</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          :
          <article-title>Wikipedia As a Time Machine</article-title>
          .
          <source>In: Proceedings of the Companion Publication of the 23rd International Conference on World Wide Web Companion</source>
          . pp.
          <fpage>857</fpage>
          -
          <lpage>862</lpage>
          . International World Wide Web Conferences Steering Committee (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>