<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>The CL-SciSumm Shared Task 2018: Results and Key Insights</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kokil Jaidka</string-name>
          <email>jaidka@sas.upenn.edu</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Michihiro Yasunaga</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Muthu Kumar Chandrasekaran</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dragomir Radev</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Min-Yen Kan</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Computer Science, Yale University</institution>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Computing, National University of Singapore</institution>
          ,
          <country country="SG">Singapore</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Smart Systems Institute, National University of Singapore</institution>
          ,
          <country country="SG">Singapore</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>University of Pennsylvania</institution>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>This overview describes the o cial results of the CL-SciSumm Shared Task 2018 { the rst medium-scale shared task on scienti c document summarization in the computational linguistics (CL) domain. This year, the dataset comprised 60 annotated sets of citing and reference papers from the open access research papers in the CL domain. The Shared Task was organized as a part of the 41st Annual Conference of the Special Interest Group in Information Retrieval (SIGIR), held in Ann Arbor, USA in July 2018. We compare the participating systems in terms of two evaluation metrics. The annotated dataset and evaluation scripts can be accessed and used by the community from: https://github.com/WING-NUS/scisumm-corpus.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        CL-SciSumm explores summarization of scienti c research in the domain of
computational linguistics. The Shared Task dataset comprises the set of citation
sentences (i.e., \citances") that reference a speci c paper as a (community-created)
summary of a topic or paper [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. Citances for a reference paper are considered a
synopses of its key points and also its key contributions and importance within
an academic community [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. The advantage of using citances is that they are
embedded with meta-commentary and o er a contextual, interpretative layer
to the cited text. Citances o er a view of the cited paper which could
complement the reader's context, possibly as a scholar [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] or a writer of a literature
review [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>The CL-SciSumm Shared Task is aimed at bringing together the
summarization community to address challenges in scienti c communication
summarization. It encourages the incorporation of new kinds of information in automatic
scienti c paper summarization, such as the facets of research information
being summarized in the research paper, and the use of new resources, such as
the mini-summaries written in other papers by other scholars, and concept
taxonomies developed for computational linguistics. Over time, we anticipate that
the Shared Task will spur the creation of other new resources, tools, methods
and evaluation frameworks.</p>
      <p>
        CL-SciSumm task was rst conducted at TAC 2014 as part of the larger
BioMedSumm Task5. It was organized in 2016 [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and 2017 [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] as a part of the
Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural
Language Processing for Digital Libraries (BIRNDL) workshop [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] at the Joint
Conference on Digital Libraries (JCDL6) in 2016, and the annual ACM
Conference on Research and Development in Information Retrieval (SIGIR7) in 2017
[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. This paper provides the results for the CL-SciSumm 2018 Task being held
as part of the BIRNDL 2018 workshop at SIGIR 2018 in Ann Arbor, Michigan.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Task</title>
      <p>CL-SciSumm de nes two serially dependent tasks that participants could
attempt, given a canonical training and testing set of papers.</p>
      <p>Given: A topic consisting of a Reference Paper (RP) and ten or more Citing
Papers (CPs) that all contain citations to the RP. In each CP, the text spans
(i.e., citances) have been identi ed that pertain to a particular citation to the
RP. Additionally, the dataset provides three types of summaries for each RP:
{ the abstract, written by the authors of the research paper.
{ the community summary, collated from the reference spans of its citances.
{ a human-written summary, written by the annotators of the CL-SciSumm
annotation e ort.</p>
      <p>Task 1A: For each citance, identify the spans of text (cited text spans) in the
RP that most accurately re ect the citance. These are of the granularity of a
sentence fragment, a full sentence, or several consecutive sentences (no more than 5).
Task 1B: For each cited text span, identify what facet of the paper it belongs
to, from a prede ned set of facets.</p>
      <p>Task 2: Finally, generate a structured summary of the RP from the cited text
spans of the RP. The length of the summary should not exceed 250 words. This
was an optional bonus task.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Development</title>
      <p>
        The CL-SciSumm 2018 corpus comprises a training set that is randomly sampled
research papers (Reference papers, RPs) from the ACL Anthology corpus and the
citing papers (CPs) for those RPs which had at least ten citations. The prepared
5 http://www.nist.gov/tac/2014
6 http://www.jcdl2016.org/
7 http://sigir.org/sigir2017/
dataset then comprised annotated citing sentences for a research paper, mapped
to the sentences in the RP which they referenced. Summaries of the RP were also
included. The CL-SciSumm 2018 corpus included a re ned version of the
CLSciSumm 2017 corpus of 40 RPs as a training set, in order to encourage teams
from the previous edition to participate. For details of the general procedure
followed to construct and annotate the CL-SciSumm corpus, the changes made
to the procedure in CL-SciSumm-2016 and the re nement of the training set in
2017, please see [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>The test set was an additional corpus of 20 RPs which was picked out of
the ACL Anthology Network corpus (AAN), which automatically identi es and
connects the citing papers and citances for each of thousands of highly-cited RPs.
Therefore, we expect that that characteristics of the test set could be somewhat
di erent from the training set. For this year's test corpus, every RP and its citing
papers were annotated three times by three independent annotators, and three
sets of human summaries were also created.
3.1</p>
      <p>Annotation
The annotation scheme was unchanged from what was followed in previous
editions of the task and the original BiomedSumm task developed by Cohen et.
al8: Given each RP and its associated CPs, the annotation group was instructed
to nd citations to the RP in each CP. Speci cally, the citation text, citation
marker, reference text, and discourse facet were identi ed for each citation of
the RP found in the CP.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Overview of Approaches</title>
      <p>Ten systems participated in Task 1 and a subset of three also participated in
Task 2. The following paragraphs discuss the approaches followed by the
participating systems, in lexicographic order by team name.</p>
      <p>
        System 2: The team from the Beijing University of Posts and
Telecommunications' Center for Intelligence Science and Technology [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] developed models
based on their 2017 system. For Task 1A, they adopted Word Movers Distance
(WMD) and improve LDA model to calculate sentence similarity for citation
linkage. For Task 1B they presented both rule-based systems, and supervised
machine learning algorithms such as: Decision Trees and K-nearest Neighbor.
For Task 2, in order to improve the performance of summarization, they also
added WMD sentence similarity to construct new kernel matrix used in
Determinantal Point Processes (DPPs).
      </p>
      <p>
        System 4: The team from Thomson Reuters, Center for Cognitive
Computing [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] participated in Task 1A and B. For Task 1A, they treated the
citation linkage prediction as a binary classi cation problem and utilized various
      </p>
      <sec id="sec-4-1">
        <title>8 http://www.nist.gov/tac/2014</title>
        <p>similarity-based features, positional features and frequency-based features. For
Task 1B, they treated the discourse facet prediction as a multi-label classi cation
task using the same set of features.</p>
        <p>
          System 6: The National University of Defense Technology team [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]
participated in Task 1A and B. For Task 1A, they used a random forest model
using multiple features. Additionally, they integrated random forest model with
BM25 and VSM model and applied a voting strategy to select the most related
text spans. Lastly, they explored the language model with word embeddings and
integrated it into the voting system to improve the performance. For task 1B,
they used a multi-features random forest classi er.
        </p>
        <p>
          System 7: The Nanjing University of Science and Technology team (NJUST)
[
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] participated in all of the tasks (Tasks 1A, 1B and 2). For Task 1A, they used
a weighted voting-based ensemble of classi ers (linear support vector machine
(SVM), SVM using a radial basis function kernel, Decision Tree and Logistic
Regression) to identify the reference span. For Task 1B, they used a dictionary
for each discourse facet, a supervised topic model, and XGBOOST. For Task 2,
they grouped sentences into three clusters (motivation, approach and conclusion)
and then extracted sentences from each cluster to combine into a summary.
        </p>
        <p>
          System 8: The International Institute of Information Technology team [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]
participated in Task 1A and B. They treated Task 1A as a text-matching
problem, where they constructed a matching matrix whose entries represent the
similarities between words, and used convolutional neural networks (CNN) on top
to capture rich matching patterns. For Task 1B, they used SVM with tf-idf and
naive bayes features.
        </p>
        <p>
          System 9: The Klick Labs team [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] participated in Task 1A and B. For Task
1A, they explored word embedding-based similarity measures to identify
reference spans. They also studied several variations such as reference span cuto
optimization, normalized embeddings, and average embeddings. They treated
Task 2B as a multi-class classi cation problem, where they constructed the
feature vector for each sentence as the average of word embeddings of the terms in
the sentence.
        </p>
        <p>
          System 10: The University of Houston team [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ] adopted sentence
similarity methods using Siamese Deep learning Networks and Positional Language
Model approach for Task 1A. They tackled Task 1B using a rule-based method
augmented by WordNet expansion, similarly to last year.
        </p>
        <p>
          System 11: The LaSTUS/TALN+INCO team [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] participated in all of the
tasks (Tasks 1A, 1B and 2). For Task 1A, B, they proposed models that use
Jaccard similarity, BabelNet synset embeddings cosine similarity, or convolutional
neural network over word embeddings. For Task 2, they generated a summary
by selecting the sentences from the RP that are most relevant to the CPs using
various features. They used CNN to learn the relation between a sentence and
a scoring value indicating its relevance.
        </p>
        <p>
          System 12: The NLP-NITMZ team [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] participated in all of the tasks (Tasks
1A, 1B and 2). For task 1A and 1B they extracted each citing papers (CP)
text span that contains citations to the reference paper (RP). They used cosine
similarity and Jaccard Similarity to measure the sentence similarity between
CPs and RP, and picked the reference spans most similar to the citing sentence
(Task 1A). For Task 1B, they applied rule based methods to extract the facets.
For Task 2, they built a summary generation system using the OpenNMT tool.
        </p>
        <p>
          System 20: Team Magma [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] treated Task 1A as a binary classi cation
problem and explored several classi ers with di erent feature sets. They found
that Logistic regression with content-based features derived on topic and word
similarities, in the ACL reference corpus, performed the best.
5
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Evaluation</title>
      <p>An automatic evaluation script was used to measure system performance for
Task 1A, in terms of the sentence ID overlaps between the sentences identi ed
in system output, versus the gold standard created by human annotators. The
raw number of overlapping sentences were used to calculate the precision, recall
and F1 score for each system.</p>
      <p>
        We followed the approach in most SemEval tasks in reporting the overall
system performance as its micro-averaged performance over all topics in the blind
test set. Additionally, we calculated lexical overlaps in terms of the ROUGE-2
and ROUGE-SU4 scores [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] between the system output and the human
annotated gold standard reference spans. It should be noted that this year, the
average performance on every task was obtained by calculating the average
performance on each of three independent sets of annotations for Task 1, and the
performance on the human summary was also an average of performances on
three human summaries.
      </p>
      <p>
        ROUGE scoring was used for Tasks 1a and Task 2. Recall-Oriented
Understudy for Gisting Evaluation (ROUGE) is a set of metrics used to
automatically evaluate summarization systems [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] by measuring the overlap
between computer-generated summaries and multiple human written reference
summaries. ROUGE{2 measures the bigram overlap between the candidate
computergenerated summary and the reference summaries. More generally, ROUGE{N
measures the n-gram overlap. ROUGE-SU uses skip-bigram plus unigram
overlaps. Similar to CL-SciSumm 2017, CL-SciSumm 2018 also uses ROUGE-2 and
ROUGE-SU4 for its evaluation.
      </p>
      <p>Task 1B was evaluated as a proportion of the correctly classi ed discourse
facets by the system, contingent on the expected response of Task 1A. As it is a
multi-label classi cation, this task was also scored based on the precision, recall
and F1 scores.</p>
      <p>Task 2 was optional, and also evaluated using the ROUGE{2 and ROUGE{
SU4 scores between the system output and three types of gold standard
summaries of the research paper: the reference paper's abstract, a community
summary, and a human summary.</p>
      <p>The evaluation scripts have been provided at the CL-SciSumm Github
repository9 where the participants may run their own evaluation and report the results.
6</p>
    </sec>
    <sec id="sec-6">
      <title>Results</title>
      <p>This section compares the participating systems in terms of their performance.
Three of the ten systems that did Task 1 also did the bonus Task 2. The
results are provided in Table 1 and Table 2. The detailed implementation of the
individual runs are described in the system papers included in this proceedings
volume.</p>
      <p>
        For Task 1A, on using sentence overlap (F1 score) as the metric, the best
performance was by four runs from NUDT [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Their performance was closely
followed by three runs from CIST [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. The third best system was UPF-TALN
[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. When ROUGE-based F1 is used as a metric, the best performance is by
Klick Labs [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] followed by NUDT [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] and then NLP-NITMZ [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        The best performance in Task 1B was by several runs submitted by CIST
[
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] followed by NJUST [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. Klick Labs [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] was the second runner-up.
      </p>
      <p>
        For Task 2, TALN-UPF had the best performance against the abstract and
human summaries, and the second-best performance against community
summaries [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. NLP-NITMZ had the best performance against the community
summaries [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] and were the second runners-up in the evaluation against human
summaries. CIST [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] summaries had the second best performance against human
and abstract summaries and nished as second runners-up against community
summaries.
7
      </p>
    </sec>
    <sec id="sec-7">
      <title>Conclusion and Future Work</title>
      <p>
        Ten teams participated in this year's shared task, on a corpus that was 33%
larger than the 2017 corpus. In follow-up work, we plan to release a detailed
comparison of the annotations as well as a micro-level error analysis to identify
possible gaps in document or annotation quality. We will also aim to expand the
part of the corpus with multiple annotations, in the coming few months.
Furthermore, we expect to release other resources complementary to the CL-scienti c
summarization task, such as semantic concepts from the ACL Anthology
Network [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
      </p>
      <p>We believe that the large improvements in Task 1A this year are a sign of
forthcoming breakthroughs in information retrieval and summarization
methods, and we hope that the community will not give up on the challenging task
of generating scienti c summaries for computational linguistics. Based on the
experience of running this task for four years, we believe that lexical methods
would work well with the structural and semantic characteristics that are unique
to scienti c documents, and perhaps will be complemented with domain-speci c</p>
      <sec id="sec-7-1">
        <title>9 github.com/WING-NUS/scisumm-corpus</title>
        <p>
          word embeddings in a deep learning framework. The Shared Task has
demonstrated potential as a transfer learning task [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] and is also expected to allow the
generalization of its methods to other areas of scienti c summarization.
        </p>
        <p>Acknowledgement. We would like to thank Microsoft Research Asia,
for their generous funding. We would also like to thank Vasudeva Varma
and colleagues at IIIT-Hyderabad, India and University of Hyderabad
for their e orts in convening and organizing our annotation workshops.
We are grateful to our hardworking and talented annotators:
Akansha Gehlot, Ankita Patel, Fathima Vardha, Swastika Bhattacharya and
Sweta Kumari, without whom the Cl-SciSumm corpus { and ultimately
the Shared Task itself { would not have been possible. We would also like
to thank Rahul Jha and Dragomir Radev for sharing their software to
prepare the XML versions of papers and constructing the test corpus for
2018. Finally, we acknowledge the continued advice of Hoa Dang, Lucy
Vanderwende and Anita de Waard from the pilot stage of this task. We
are grateful to Kevin B. Cohen and colleagues for their support, and
for sharing their annotation schema, export scripts and the Knowtator
package implementation on the Protege software { all of which have been
indispensable for this shared task.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1. Abura'ed,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Bravo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Chiruzzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Saggion1</surname>
          </string-name>
          , H.:
          <article-title>LaSTUS/TALN+INCO @ CL-SciSumm 2018 - Using Regression and Convolutions for Cross-document Semantic Linking and Summarization of Scholarly Literature</article-title>
          .
          <source>In: Proceedings of the 3nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2018)</source>
          . Ann Arbor,
          <source>Michigan (July</source>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Agrawal</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mittal</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <string-name>
            <surname>IIIT-H @</surname>
          </string-name>
          CLScisumm
          <article-title>-18</article-title>
          .
          <source>In: Proceedings of the 3nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2018)</source>
          . Ann Arbor,
          <source>Michigan (July</source>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Alonso</surname>
            ,
            <given-names>H.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Makki</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <string-name>
            <surname>CL-SciSumm Shared Task - Team Magma</surname>
          </string-name>
          .
          <source>In: Proceedings of the 3nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2018)</source>
          . Ann Arbor,
          <source>Michigan (July</source>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Baruah</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kolla</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Klick Labs at CL-SciSumm 2018</article-title>
          .
          <source>In: Proceedings of the 3nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2018)</source>
          . Ann Arbor,
          <source>Michigan (July</source>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Conroy</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Vector space and language models for scienti c document summarization</article-title>
          .
          <source>In: NAACL-HLT</source>
          . pp.
          <volume>186</volume>
          {
          <fpage>191</fpage>
          . Association of Computational Linguistics, Newark, NJ, USA (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Davoodi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Madan</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gu</surname>
          </string-name>
          , J.: CLSciSumm Shared Task:
          <article-title>On the Contribution of Similarity measure and Natural Language Processing Features for Citing Problem</article-title>
          .
          <source>In: Proceedings of the 3nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2018)</source>
          . Ann Arbor,
          <source>Michigan (July</source>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Debnath</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Achom</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pakray</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>NLP-NITMZ @ CLScisumm-18</article-title>
          .
          <source>In: Proceedings of the 3nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2018)</source>
          . Ann Arbor,
          <source>Michigan (July</source>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Jaidka</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chandrasekaran</surname>
            ,
            <given-names>M.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jain</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kan</surname>
          </string-name>
          , M.Y.:
          <article-title>The cl-scisumm shared task 2018: Results and key insights (</article-title>
          <year>2017</year>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-1888/ editorial.pdf
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Jaidka</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chandrasekaran</surname>
            ,
            <given-names>M.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rustagi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kan</surname>
          </string-name>
          , M.Y.:
          <article-title>Insights from clscisumm 2016: the faceted scienti c document summarization shared task</article-title>
          .
          <source>International Journal on Digital</source>
          Libraries pp.
          <volume>1</volume>
          {
          <issue>9</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Jaidka</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khoo</surname>
            ,
            <given-names>C.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Na</surname>
            ,
            <given-names>J.C.</given-names>
          </string-name>
          :
          <article-title>Deconstructing human literature reviews{a framework for multi-document summarization</article-title>
          .
          <source>In: Proc. of ENLG</source>
          . pp.
          <volume>125</volume>
          {
          <issue>135</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>K.S.:</given-names>
          </string-name>
          <article-title>Automatic summarising: The state of the art</article-title>
          .
          <source>Information Processing and Management</source>
          <volume>43</volume>
          (
          <issue>6</issue>
          ),
          <volume>1449</volume>
          {
          <fpage>1481</fpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>C.Y.</given-names>
          </string-name>
          :
          <article-title>Rouge: A package for automatic evaluation of summaries</article-title>
          .
          <source>Text summarization branches out: Proceedings of the ACL-04 workshop 8</source>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Ma</surname>
          </string-name>
          , S.,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <surname>C.</surname>
          </string-name>
          : NJUST@CLSciSumm-
          <fpage>17</fpage>
          .
          <source>In: Proc. of the 2nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2017)</source>
          . Tokyo, Japan (
          <year>August 2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Ma</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , Zhang, H.,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <surname>C.</surname>
          </string-name>
          : NJUST @ CLSciSumm-
          <fpage>18</fpage>
          .
          <source>In: Proceedings of the 3nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2018)</source>
          . Ann Arbor,
          <source>Michigan (July</source>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chandrasekaran</surname>
            ,
            <given-names>M.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jaidka</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Editorial for the 2nd joint workshop on bibliometric-enhanced information retrieval and natural language processing for digital libraries (BIRNDL) at SIGIR 2017</article-title>
          .
          <source>In: Proceedings of the 2nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL</source>
          <year>2017</year>
          )
          <article-title>co-located with the 40th</article-title>
          <source>International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR</source>
          <year>2017</year>
          ), Tokyo, Japan,
          <year>August 11</year>
          ,
          <year>2017</year>
          . pp.
          <volume>1</volume>
          {
          <issue>6</issue>
          (
          <issue>2017</issue>
          ), http://ceur-ws.
          <source>org/</source>
          Vol-1888/editorial.pdf
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frommholz</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cabanac</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolfram</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Editorial for the Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL) at JCDL 2016</article-title>
          .
          <source>In: Proc. of the Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2016)</source>
          . pp.
          <volume>1</volume>
          {
          <issue>5</issue>
          . Newark, NJ, USA (
          <year>June 2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Moraes</surname>
            ,
            <given-names>L.F.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Das</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karimi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Verma</surname>
          </string-name>
          , R.: University of Houston @ CLSciSumm
          <year>2018</year>
          .
          <source>In: Proceedings of the 3nd Joint Workshop on Bibliometricenhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2018)</source>
          . Ann Arbor,
          <source>Michigan (July</source>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Nakov</surname>
            ,
            <given-names>P.I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schwartz</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hearst</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          : Citances:
          <article-title>Citation sentences for semantic analysis of bioscience text</article-title>
          .
          <source>In: Proceedings of the SIGIR'04 workshop on Search and Discovery in Bioinformatics</source>
          . pp.
          <volume>81</volume>
          {
          <issue>88</issue>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Qazvinian</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Scienti c paper summarization using citation summary networks</article-title>
          .
          <source>In: Proceedings of the 22nd International Conference on Computational Linguistics-Volume</source>
          <volume>1</volume>
          . pp.
          <volume>689</volume>
          {
          <fpage>696</fpage>
          .
          <string-name>
            <surname>ACL</surname>
          </string-name>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Muthukrishnan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qazvinian</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>The acl anthology network corpus</article-title>
          .
          <source>In: Proceedings of the 2009 Workshop on Text and Citation Analysis for Scholarly Digital Libraries</source>
          . pp.
          <volume>54</volume>
          {
          <fpage>61</fpage>
          .
          <article-title>Association for Computational Linguistics (</article-title>
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhou</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tang</surname>
          </string-name>
          , J.:
          <source>NUDT @ CLSciSumm-18. In: Proceedings of the 3nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2018)</source>
          . Ann Arbor,
          <source>Michigan (July</source>
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>