<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>SINAI at CLEF eHealth 2018 Task 3. Using cTAKES to remove noise from expanding queries with Google</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Manuel Carlos D az-Galiano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pilar Lopez-Ubeda</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Maria-Teresa Martin-Valdivia</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>L. Alfonso Uren~a-Lopez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, Advanced Studies Center in ICT (CEATIC) Universidad de Jaen</institution>
          ,
          <addr-line>Campus Las Lagunillas, 23071, Jaen</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In this paper we present our participation as SINAI research group from the Universidad de Jaen at Task 3 \Consumer Health Search" speci cally in sub-task 1 \Ad-hoc Search". The main objective of the task is to provide relevant information to people seeking health advice on the web. We apply the query expansion technique using the most famous search engine at the moment: Google. We search additional information related to the query using the search engine. We identify the medical concepts in Google results using cTAKES. This recognizer provides UMLS concepts from a given text. In this way, we avoid introducing noise with words that are not related to the user query. Our system improves NDCG@10 measurement by 48% over the previous year. We also signi cantly reduces the response time of the Information Retrieval System (IRS) by 90% compared to previous years.</p>
      </abstract>
      <kwd-group>
        <kwd>Retrieval Information</kwd>
        <kwd>Google expander</kwd>
        <kwd>Named entity recognition</kwd>
        <kwd>cTAKES</kwd>
        <kwd>UMLS</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>CLEF 2018 [15] consists of an independent peer-reviewed conference on a broad
range of issues in the elds of multilingual and multimodal information access
evaluation, and a set of labs and workshops designed to test di erent aspects of
mono and cross-language Information retrieval system.</p>
      <p>
        This year we have participated in the Information Retrieval (IR) task and
we will continue exploring the same problems and issues identi ed in 2014 [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ],
2015 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], 2016 [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and 2017 CLEF eHealth information retrieval challenges [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
However, the 2018 task [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] uses a new web corpus and a new set of queries
compared to previous years.
      </p>
      <p>This task deals the problem of retrieving relevant information in the biomedicine
domain. People are constantly looking for information by querying the most
important search engines. With these searches, the user tries to obtain more
information about diseases, illnesses, treatments, etc.</p>
      <p>
        Our research group SINAI has a large experience participating in several
tasks of other editions of CLEF eHealth. In previous years, we have participated
in this competition making use the Google search engine[
        <xref ref-type="bibr" rid="ref10 ref11 ref2">2,11,10</xref>
        ]. We consider
that Google provides extra knowledge as it is the most widely used search engine
in the world.
      </p>
      <p>For our approach, we use the clinical Text Analysis and Knowledge
Extraction System (cTAKES)1. cTAKES is an open-source Natural Language
Processing (NLP) system for information extraction from electronic medical record
clinical free-text [13] including types of clinical named entities mapped to
various biomedical terminologies/ontologies such as the Uni ed Medical Language
System (UMLS).</p>
      <p>This paper is organized as follows: In the next section, we introduce the
resources provided by the organizers. Our approach is described in Section 3. In
Section 4 we include the results obtained and nally, we expose the conclusions
and future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Resources</title>
      <p>2.1</p>
      <sec id="sec-2-1">
        <title>Dataset</title>
        <p>For task 3 of CLEF 2018 the corpus of documents consists of web passages
acquired from CommonCrawl2. For the creation, an initial list of websites was
selected, this list was constructed by sending the queries to Microsoft Bing APIs
and discarding the unreliable websites.</p>
        <p>The collection consists of 1,903 domains including some of the most famous
such as facebook.com, answer.com and health.com. Each web page is in the
original format as crawled from CommonCrawl, thus it may be html, xhtml, xml,
etc. It is important to have a global vision for this type of task, as each website
can provide relevant information from every point of view.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Indri index</title>
        <p>The organization also provided several indexes created due to the large size of
the collection because there are groups that do not have the possibility of dealing
with a corpus of this size. This makes it easier to compare the systems between
the participants.</p>
        <p>After uncompressed, each document of the collection was extended with the
traditional TREC format following the Figure 1.</p>
        <p>The collection was indexed as shown in the Figure 2.</p>
        <sec id="sec-2-2-1">
          <title>1 http://ctakes.apache.org/ (last visited: May 31, 2018)</title>
          <p>
            2 http://commoncrawl.org/2018/03/february-2018-crawl-archive-now-available/ (last
visited: May 31, 2018)
&lt;doc&gt; &lt;docno&gt; DOC ID &lt;/docno&gt; ORIGINAL CONTENT &lt;doc&gt;
The query set for 2018 consists of 50 queries issued by the general public to
the HON3 (Health On the Net) and TRIP4 search services. The queries and the
process to obtain them are described in [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ]. Queries are formatted one per line
in the tab-separated query le, with the rst string being the query id, and the
second string being the query text. The queries for the English language follow
the structure of Figure 3.
&lt;query&gt;
&lt;id&gt; 195001 &lt;/id&gt;
&lt;en&gt; affective treatments for chronic lyme disease &lt;/en&gt;
&lt;/query&gt;
          </p>
        </sec>
        <sec id="sec-2-2-2">
          <title>3 https://hon.ch/en/ (last visited: May 31, 2018) 4 https://www.tripdatabase.com/ (last visited: May 31, 2018)</title>
          <p>We found out that entering everything provided by Google, it is included a
lot of noise to the nal query and for this reason, we decided to use a biomedical
entity recognizer.</p>
          <p>The Figure 4 detail the procedure followed. The IRS (explained in section
2.2) uses an information retrieval model described in [14].
The new document collection contains web pages from di erent websites. We
consider a good idea to simulated a typical web search and so, we have tried to
integrate the knowledge from the most popular web search engine: Google.</p>
          <p>We have performed a search for each query using the Google API for the
extraction of titles and snippets. The query was cleared of punctuation marks.
A total of 10 results have been taken into account and added to the query the
titles and snippets of those results.</p>
          <p>The average word count of the expanded query with 10 titles and 10 snippets
is 330 words, out of which 29% are stop-words.
3.2</p>
          <p>cTAKES: recogniser of medical entities
To select the terms used to expand the query, we use cTAKES, a tool for the
recognition of medical entities. This system identi es UMLS concepts in the
query expanded by Google. We create a new query only with terms detected
with cTAKES.</p>
          <p>To build the nal query, the concepts returned by cTAKES will be assigned
a weight according to the number of occurrences obtained in Google, and nally,
we add those words existing in the original query and that have not been detected
with cTAKES with a xed weight W , thus informing the system that these words
are also relevant to the query. In this case, the stop-words of original query are
not included.</p>
          <p>To determine the value of W , several experiments have been performed using
the 2017 relevant judgments and queries. In these experiments we observe that
the most optimal value is equal to the number of documents selected in Google.
In our case W = 10.</p>
          <p>Figure 5 shows the result of expanding the query following the process in
Figure 3. We have used the Indri query language5, where #combine allows us to
use multiple words as a unique term and the #weight operator assign varying
weights to each expressions.
&lt;query&gt;
&lt;type&gt;indri&lt;/type&gt;
&lt;number&gt;195001&lt;/number&gt;
&lt;text&gt;#weight(
1 #combine(bacteria) 15 #combine(treatment)
4 #combine(treatments) 1 #combine(microscopy)
1 #combine(direct microscopy) 1 #combine(test)
1 #combine(tips) 1 #combine(prevention)
3 #combine(therapy) 3 #combine(oxygen therapy)
3 #combine(hyperbaric oxygen therapy) 1 #combine(sick)
1 #combine(diagnosis) 1 #combine(illness)
20 #combine(disease) 20 #combine(lyme disease)
1 #combine(infection) 1 #combine(fibromyalgia)
1 #combine(conditions) 5 #combine(antibiotics)
6 #combine(antibiotic) 2 #combine(rise)
3 #combine(oxygen) 10 #combine(affective)
10 #combine(chronic) 10 #combine(lyme)
)
&lt;/text&gt;
&lt;/query&gt;
5 https://www.lemurproject.org/lemur/IndriQueryLanguage.php</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results</title>
      <p>This year, we have a new corpus for this task, therefore, we do not have the
results obtained for our system. But in the Table 4 we can see the results of
the experiments carried out in this year using the 2017 collection and the 2017
relevant judgments.</p>
      <p>Run
Baseline
Google
Google + cTAKES
Compared to the experiment of the last year (Google) where we didn't use
cTAKES to lter, this new experiment (Google + cTAKES ) improves NDCG@10
value by almost 48%, whereas the BPref and RBP values only decreases almost
11% and 22% respectively. We are analyzing the obtain results to understand
why these values are lower.</p>
      <p>In addition, the time required by the IRS to process the query is signi cantly
reduced, we improve in 90% of the total time invested. A query with Google
expansion without cTAKES lter contains 390 words on average, of which 23% are
stop-words. However, a ltered query with cTAKES contains about 30 weighted
terms.
5</p>
    </sec>
    <sec id="sec-4">
      <title>Conclusion</title>
      <p>In this paper, we have presented a method to expand queries with the most
popular search engine at the moment, adding more information to the user's
query. We have observed that this method introduces a lot of noise into the
nal query, so we have tried to lter and keep the words most related to the
biomedical domain.</p>
      <p>For this reason, we use the medical entity recognizer cTAKES to lter Google
results and we obtain a nal query focused on the biomedical domain, and we
also use the weighted words, giving more importance to some identi ed terms.</p>
      <p>
        In the future, we will continue this work to improve our systems by adding
knowledge to them. We will study di erent medical ontologies for the expansion
of queries [
        <xref ref-type="bibr" rid="ref1 ref3">3,1</xref>
        ]. Through these ontologies we will be able to extract knowledge to
get closer to the user needs. Besides, we will use disambiguation algorithms to
select the most appropriate biomedical terms for the query, using UMLS graph
similar to the work presented in [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This work has been partially supported by a grant from Fondo Europeo de
Desarrollo Regional (FEDER) and REDES project (TIN2015-65136-C2-1-R) from
the Spanish Government.
13. Savova, G.K., Masanz, J.J., Ogren, P.V., Zheng, J., Sohn, S., Kipper-Schuler, K.C.,
Chute, C.G.: Mayo clinical text analysis and knowledge extraction system (ctakes):
architecture, component evaluation and applications. Journal of the American
Medical Informatics Association 17(5), 507{513 (2010)
14. Strohman, T., Metzler, D., Turtle, H., Croft, W.B.: Indri: A language model-based
search engine for complex queries. In: Proceedings of the International Conference
on Intelligent Analysis. vol. 2, pp. 2{6. Citeseer (2005)
15. Suominen, H., Kelly, L., Goeuriot, L., Kanoulas, E., Azzopardi, L., Spijker, R., Li,
D., Neveol, A., Ramadier, L., Robert, A., Palotti, J., Jimmy, Zuccon, G.: Overview
of the clef ehealth evaluation lab 2018. In: CLEF 2018 - 8th Conference and Labs
of the Evaluation Forum, Lecture Notes in Computer Science (LNCS). Springer
(2018)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verma</surname>
            ,
            <given-names>R.:</given-names>
          </string-name>
          <article-title>A query-based medical information summarization system using ontology knowledge</article-title>
          .
          <source>In: Computer-Based Medical Systems</source>
          ,
          <year>2006</year>
          .
          <source>CBMS</source>
          <year>2006</year>
          .
          <article-title>19th IEEE International Symposium on</article-title>
          . pp.
          <volume>37</volume>
          {
          <fpage>42</fpage>
          .
          <string-name>
            <surname>IEEE</surname>
          </string-name>
          (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>D</given-names>
            <surname>az-Galiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.C.</given-names>
            ,
            <surname>Mart</surname>
          </string-name>
          n-Valdivia,
          <string-name>
            <given-names>M.T.</given-names>
            ,
            <surname>Jimenez-Zafra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.M.</given-names>
            ,
            <surname>Andreu</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          ,
          <article-title>Uren~a-</article-title>
          <string-name>
            <surname>Lopez</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          :
          <source>SINAI at CLEF eHealth 2017 Task</source>
          <volume>3</volume>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>D</given-names>
            <surname>az-Galiano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.C.</given-names>
            ,
            <surname>Mart</surname>
          </string-name>
          n-Valdivia,
          <string-name>
            <surname>M.T.</surname>
          </string-name>
          ,
          <article-title>Uren~a-</article-title>
          <string-name>
            <surname>Lopez</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Query expansion with a medical ontology to improve a multimodal information retrieval system</article-title>
          .
          <source>Computers in biology and medicine 39(4)</source>
          ,
          <volume>396</volume>
          {
          <fpage>403</fpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanbury</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hegarty</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hodmon</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kriewel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lupu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Markonis</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pecina</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schneller</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Meta-analysis of the second phase of empirical and user-centered evaluations (</article-title>
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suominen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hanlen</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neveol</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grouin</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palotti</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zuccon</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Overview of the clef ehealth evaluation lab 2015</article-title>
          . In:
          <article-title>International Conference of the Cross-Language Evaluation Forum for European Languages</article-title>
          . pp.
          <volume>429</volume>
          {
          <fpage>443</fpage>
          . Springer (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suominen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neveol</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Robert</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kanoulas</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spijker</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palotti</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zuccon</surname>
          </string-name>
          , G.:
          <article-title>Clef 2017 ehealth evaluation lab overview</article-title>
          . In:
          <article-title>International Conference of the Cross-Language Evaluation Forum for European Languages</article-title>
          . pp.
          <volume>291</volume>
          {
          <fpage>303</fpage>
          . Springer (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Jimmy</surname>
            , Zuccon,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palotti</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Overview of the clef 2018 consumer health search task</article-title>
          . In:
          <article-title>CLEF 2018 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suominen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Neveol</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Palotti</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zuccon</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Overview of the clef ehealth evaluation lab 2016</article-title>
          . In:
          <article-title>International Conference of the CrossLanguage Evaluation Forum for European Languages</article-title>
          . pp.
          <volume>255</volume>
          {
          <fpage>266</fpage>
          . Springer (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kelly</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Goeuriot</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Suominen</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schreck</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leroy</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mowery</surname>
            ,
            <given-names>D.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Velupillai</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chapman</surname>
            ,
            <given-names>W.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Martinez</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zuccon</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          , et al.:
          <article-title>Overview of the share/clef ehealth evaluation lab 2014</article-title>
          . In:
          <article-title>International Conference of the Cross-Language Evaluation Forum for European Languages</article-title>
          . pp.
          <volume>172</volume>
          {
          <fpage>191</fpage>
          . Springer (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Mart</surname>
            nez-Santiago,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montejo-Raez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Sinai at clef adhoc robust track 2007: applying google search engine for robust cross-lingual retrieval</article-title>
          .
          <source>In: Workshop of the Cross-Language Evaluation Forum for European Languages</source>
          . pp.
          <volume>137</volume>
          {
          <fpage>142</fpage>
          . Springer (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Martinez-Santiago</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Montejo-Raez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garc</surname>
            a-Cumbreras,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Urena-Lopez</surname>
            ,
            <given-names>L.A.</given-names>
          </string-name>
          :
          <article-title>Sinai at clef 2006 ad hoc robust multilingual track: query expansion using the google search engine</article-title>
          . In:
          <article-title>Workshop of the Cross-Language Evaluation Forum for European Languages</article-title>
          . pp.
          <volume>119</volume>
          {
          <fpage>126</fpage>
          . Springer (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Montejo-Raez</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            nez-Camara,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mart</surname>
            n-Valdivia,
            <given-names>M.T.</given-names>
          </string-name>
          , Uren~aLopez,
          <string-name>
            <surname>L.A.</surname>
          </string-name>
          :
          <article-title>Ranked wordnet graph for sentiment polarity classi cation in twitter</article-title>
          .
          <source>Computer Speech &amp; Language</source>
          <volume>28</volume>
          (
          <issue>1</issue>
          ),
          <volume>93</volume>
          {
          <fpage>107</fpage>
          (
          <year>2014</year>
          ), http://www.sciencedirect.com/science/article/pii/S0885230813000284
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>