<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ECNU at 2018 eHealth Task 2: Technologically Assisted Reviews in Empirical Medicine</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Huaying Wu</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tingting Wang</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jiayi Chen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Su Chen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Qinmin Hu</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Liang He</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science &amp; Technology, East China Normal University</institution>
          ,
          <addr-line>Shanghai, 200062</addr-line>
          ,
          <country country="CN">China</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Computer Science, Ryerson University</institution>
          ,
          <addr-line>Toronto</addr-line>
          ,
          <country country="CA">Canada</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>The 2018 CLEF eHealth Task 2 has two sub-tasks in order to write a systematic review of evidence-based medicine. Researchers are required to retrieve relevant documents given by medical database for each query (sub-task 1) and re-rank the documents with the results of the Boolean search as the starting point(sub-task 2). We adopt BM25 with query expansion to acquire basic relationship and utilize a customized Paragraph2Vector to represent queries / documents trained by the training set of Boolean search. To compute the relevant score of given query-document pair, cosine similarity and logistic regression are taken in our experiments. Finally, we nd that the combination has a better performance.</p>
      </abstract>
      <kwd-group>
        <kwd>Query expansion</kwd>
        <kwd>Paragraph2Vector</kwd>
        <kwd>Health Information Retrieval</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The ECNUica participates in CLEF 2018 eHealth Task 2 : TAR in Empirical
Medicine, which proposes to do a sorting problem based on query-documents
similarity in Systematic Reviews. There are multiple stages contained in
Systematic Reviews: Boolean Search in each query, Screening queries title and
Abstract, and Document Checking. The task focus on the rst and second stages
of the process.</p>
      <p>In the Boolean Search stage, Participants need to do a basic binary
classication for each document based on every query. Boolean query with relevant
information constituted, which submits to a medical database containing details
of medical studies built by experts, need to be classi ed into relevant or
irrelevant. The database returns a set of potential relevant studies. In the following
steps, Participants decide which ones are indeed relevant by screening titles,
abstracts and full documents.</p>
      <p>
        There are two sub-tasks for this task. One is to nd documents in high
relevance for each given query. The other one is to re-rank the documents retrieved
in the rst step given by experts. According to our work in past two years, we try
to manage the data with learning-to-rank [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], word2vec [
        <xref ref-type="bibr" rid="ref1 ref7">1, 7</xref>
        ], relevance based
relation between a query and documents [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], which perform well in other runs
[
        <xref ref-type="bibr" rid="ref4 ref5 ref7">4, 5, 7</xref>
        ].
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Methods</title>
      <p>In sub-task 1, we choose BM25 algorithm to acquire a baseline of Boolean search.
Furthermore, query expansion based on MeSH and pseudo relevance feedback
(PRF) is taken to get a better result. In sub-task 2, we employ Paragraph2Vector
to represent query and documents for similarity calculation.
2.1</p>
      <sec id="sec-2-1">
        <title>Boolean search</title>
        <p>Query Expansion In this stage, we do query expansion to improve retrieval
precision. For better performance of experiments, we compare the expansion
with PRF, MeSH and RPF + MeSH.</p>
        <p>{ The PRF returns top-10 relative features for each query.
{ The MeSH database is applied to extract medical terms from titles.</p>
        <p>We choose DescriptorName part from raw data as keywords of document,
which describes theme of document with a series of words, and the words form
MeSH as expansion. We do not use any part of protocol in both tasks. Thus,
the query we use in both tasks contains: title and objective from original query,
expansion from DescriptorName or MeSH. The results show that both PRF and
MeSH can improve performance.</p>
        <p>Model Training In the model selection stage, we compare the result of BM25,
DRF BM25 and PL2. For each algorithm, experiment is based on method
(BM25, DEF BM25, PL2) only, method with PRF, method with MeSH, method
with both PRF and MeSH. One-hot is used to represent every query for relative
score calculation.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Ranking</title>
        <p>
          Paragraph2Vector T. Mikolov proposed paragraph vector [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ], which presented
an unsupervised algorithm that learns xed-length pieces of texts. With this
method, we use Paragraph2Vector Model to represent all selected documents
from words to xed-length vector.
        </p>
        <p>
          Under this framework, we should know how to learn vector representation
of words rst [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ]. The objective of the word vector model is to maximize the
average log probability.
        </p>
        <p>After training word vectors, we use softmax function as activate function to
learn the softmax weights and paragraph vectors on documents.
Logistic Regression With all CLEF 2017 eHealth training and testing queries
and CLEF 2018 eHealth training queries as training dataset, we train a logistic
regression model as a classi er. For each document given query, calculation about
the relationship with the LR classi er is taken care. The text is the input of model
while return a score of relevance.
3
3.1</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experiments</title>
      <sec id="sec-3-1">
        <title>Dataset</title>
        <p>For sub-task 1, we are provided with a test set consisting of 20 topics of
Diagnostic Test Accuracy (DTA) reviews as follows.</p>
        <p>{ Topic-ID.
{ The title of the review, written by Cochrane experts.
{ A part of the protocol.
{ The entire PubMED database</p>
        <p>For sub-task 2, we are provided with di erent data in the same reviews as
follows.</p>
        <p>{ The Boolean query manually constructed by Cochrane experts
{ The set of PubMED Document Identi ers (PID's) returned by running the
query in MEDLINE.</p>
        <p>For training, we choose the CLEF eHealth 2017 queries and documents with
all training part and testing part, CLEF eHealth 2018 queries and documents
with training part.
3.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Runs</title>
      </sec>
      <sec id="sec-3-3">
        <title>In Sub-task 1:</title>
        <p>We submit three runs for each sub-task whose descriptions are as follows.
{ ECNU TASK1 RUN1 BM25: The result retreived on entire PubMed dataset
by terrier platform with BM25 model and pseudo relevance feedback.
{ ECNU TASK1 RUN2 LR: Rerank all documents by a Logistic Regression
classi er and Paragraph Vector.</p>
        <p>{ ECNU TASK1 RUN3 COMBINE: A combination of previous two runs.
In Sub-task 2:
{ ECNU TASK2 RUN1 TFIDF: Rerank the pids by vector space model. Each
document is represented as a vocabulary-size vector. Each dimension is the
tf-idf score of a certain word. We use cosine similarity to rerank the
document.
{ ECNU TASK2 RUN2 LR: a Logistic Regression classi er is used to rerank
documents based on Paragraph Vector.
{ ECNU TASK2 RUN3 COMBINE: A combination of previous two runs.
Task2 Combine recall@100% 0.992</p>
        <p>num rels 3964</p>
        <p>Table 1. Evaluations in Task1 and Task2
Summary of Runs The run-3 of each sub-task shows better performance in
training. We list some results in Table 1, which shows that BM25+PRF perform
best compared to other methods. Both MeSH and PRF are employed for query
expansion. But during experiments, performance declines when we take them
simultaneously.</p>
        <p>Experiments choose ap, recall@100, rels found, num rels as evaluation
metrics, where ap presents average precision in documents, recall@100 shows the
recall score at top-100 documents, num rels reveals the number of total recalled
documents and rels found in sub-task1 shows the number of documents we nd
in the experiments.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Conclusions and Future Work</title>
      <p>In the CLEF eHealth 2018 Task 2 TAR, ECNUica team take advantages of the
Paragraph2Vector model. Combining with Statistical method, logistic regression
with TF-IDF shows better performance, compared to LR method only or
TFIDF only. Although the representation of queries and documents can be taken to
compute similarities by cosine distance, there are many aspects of our method
which need improvement. In the future work, we will focus on more features
from text and better out methods.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgement</title>
      <p>We thank reviewers for their review comments on this paper.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Mikolov</surname>
            <given-names>T</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sutskever</surname>
            <given-names>I</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            <given-names>K</given-names>
          </string-name>
          , et al.
          <article-title>Distributed Representations of Words and Phrases and their Compositionality[J]</article-title>
          .
          <source>Advances in Neural Information Processing Systems</source>
          ,
          <year>2013</year>
          ,
          <volume>26</volume>
          :
          <fpage>3111</fpage>
          -
          <lpage>3119</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Le</surname>
            <given-names>Q V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mikolov</surname>
            <given-names>T</given-names>
          </string-name>
          . Distributed Representations of Sentences and Documents[J].
          <year>2014</year>
          ,
          <article-title>4:II-1188.</article-title>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Lavrenko</surname>
            <given-names>V</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Croft W B. Relevance</surname>
          </string-name>
          based language models[C]// International ACM SIGIR Conference.
          <year>2001</year>
          :
          <fpage>120</fpage>
          -
          <lpage>127</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Yang</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            <given-names>Q.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Haacke</surname>
            <given-names>E.M.:</given-names>
          </string-name>
          <article-title>ECNU at 2015 eHealth Task 2: User-centred Health Information Retrieval</article-title>
          .
          <source>Proceedings of the ShARe/CLEF eHealth Evaluation Lab</source>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Yang</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            <given-names>Q.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>ECNU at 2015 CDS Track: Two Re-ranking Methods in Medical Information Retrieval</article-title>
          .
          <source>Proceedings of the 2015 Text Retrieval Conference</source>
          . (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Yang</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            <given-names>Q.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            <given-names>H.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            <given-names>Y.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Luo</surname>
            <given-names>G.</given-names>
          </string-name>
          <year>h</year>
          .:
          <source>ECNU at 2016 eHealth Task</source>
          <volume>3</volume>
          :
          <string-name>
            <surname>Patient-centred Information</surname>
            <given-names>Retrieval.</given-names>
          </string-name>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Chen</surname>
          </string-name>
          j.y.,
          <string-name>
            <surname>Chen</surname>
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Song</surname>
            <given-names>Y.</given-names>
          </string-name>
          , ,
          <string-name>
            <surname>Liu</surname>
            <given-names>H.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            <given-names>Y.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hu</surname>
            <given-names>Q.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>He</surname>
            <given-names>L.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Yang</surname>
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>ECNU at 2017 eHealth Task 2: Technologically Assisted Reviews in Empirical Medicine</article-title>
          .
          <article-title>(</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Suominen</surname>
          </string-name>
          ,
          <article-title>Hanna and Kelly, Liadh and Goeuriot, Lorraine and Kanoulas, Evangelos and Azzopardi, Leif and Spijker, Rene and Li, Dan and Nvol, Aurlie and Ramadier, Lionel and Robert, Aude and Zuccon, Guido and Palotti, Joao. Overview of the CLEF eHealth Evaluation Lab 2018</article-title>
          .
          <source>CLEF 2018 - 8th Conference and Labs of the Evaluation Forum, Lecture Notes in Computer Science (LNCS)</source>
          , Springer, September,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Kanoulas</surname>
          </string-name>
          ,
          <article-title>Evangelos and Spijker, Rene and Li, Dan and Azzopardi, Leif</article-title>
          .
          <article-title>CLEF 2018 Technology Assisted Reviews in Empirical Medicine Overview</article-title>
          .
          <article-title>CLEF 2018 Evaluation Labs</article-title>
          and Workshop: Online Working Notes, CEUR-WS,
          <year>September</year>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>