<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>LaSTUS-TALN+INCO @ CL-SciSumm 2019</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Luis Chiruzzo</string-name>
          <email>luischir@fing.edu.uy</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>ggion</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universidad de la Republica</institution>
          ,
          <addr-line>Facultad de Ingenier a, INCO, Montevideo</addr-line>
          ,
          <country country="UY">Uruguay</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universitat Pompeu Fabra</institution>
          ,
          <addr-line>DTIC, LaSTUS-TALN, C/Tanger 122, Barcelona (08018)</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <abstract>
        <p>In this paper we present several systems developed to participate in the 4th Computational Linguistics Scienti c Document Summarization Shared challenge which addresses the problem of summarizing a scienti c paper using information from its citation network (i.e., the papers that cite the given paper). Given a cluster of scienti c documents where one is a reference paper (RP) and the remaining documents are papers citing the reference, two tasks are proposed: (i) to identify which sentences in the reference paper are being cited and why they are cited, and (ii) to produce a citation-based summary of the reference paper using the information in the cluster. Our systems are based on both supervised (LSTM and convolutional neural networks) and unsupervised techniques using word embedding representations and features computed from the linguistic and semantic analysis of the documents.</p>
      </abstract>
      <kwd-group>
        <kwd>Citation-based Summarization Scienti c Document Analysis Convolutional Neural Networks Text-similarity Measures</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Although scienti c summarization has always been an important research topic
in the area of natural language processing (NLP) [
        <xref ref-type="bibr" rid="ref13 ref19 ref24 ref25">13, 19, 24, 25</xref>
        ] in recent years
new summarization approaches have emerged which take advantage of the
citations that a scienti c article has received in order to extract and summarize its
main contributions [
        <xref ref-type="bibr" rid="ref1 ref20 ref21">20, 21, 1</xref>
        ].
      </p>
      <p>
        The interest in the area has motivated the development of a series of
evaluation exercises in scienti c summarization in the Computational Linguistics (CL)
domain known as the Computational Linguistics Scienti c Document
Summarization Shared Task which started in 2014 as a pilot [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and which is now a well
developed challenge in its fourth year [
        <xref ref-type="bibr" rid="ref7 ref8">7, 8</xref>
        ].
      </p>
      <p>In this challenge, given a cluster of n documents where one is a reference
paper (RP) and the n 1 remaining documents are papers (i.e., citing papers
(CPs)) citing the reference paper, participants of the challenge have to develop
automatic procedures to simulate the following tasks: given a cluster of n
documents where one is a reference paper and the n 1 remaining documents are
papers containing citations to it:</p>
      <p>The challange has the following tasks:
{ Task 1A: For each citance in the citing papers (i.e., text spans containing
a citation), identify the cited spans of text in the reference paper that most
accurately re ect the citance.
{ Task 1B: For each cited text span, identify which discourse facet it belongs
to, among: Aim, Hypothesis, Implication, Results, or Method.
{ Task 2: Finally, an optional task consists on generating a structured
summary of the reference paper with up to 250 words from the cited text spans.</p>
      <p>
        In this paper we report the systems developed at LaSTUS-TALN+INCO
to participate in CL-SciSumm 2019 [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. We include a supervised system based
on recurrent neural networks and an unsupervised system based on sentence
similarity for Task 1A, one supervised approach for Task 1B, and one supervised
approach for Task 2. Except for the recurrent neural network method, the rest of
the systems for Tasks 1A and 1B follow similar approaches to the ones reported
in [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], achieving good performance in previous editions of the task. The
approach for Task 2 follows the method described in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] which, according to
o cial results [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], was the winning approach in CL-SciSumm 2018.
2
      </p>
    </sec>
    <sec id="sec-2">
      <title>Task 1</title>
      <p>
        We tried a supervised and an unsupervised approach for Task 1A. We separated
the CL SciSumm 2018 corpus of documents in 75% for training and 25% for
development evaluation. We also used the 978 documents from ScisummNet
2019 automatically annotated following [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ] for pre-training our neural network
models.
2.1
      </p>
      <sec id="sec-2-1">
        <title>Supervised approach</title>
        <p>Our supervised approach consists in a neural network architecture for nding
out which sentences from the reference document are most the likely candidates
for being referenced by a given citation.</p>
        <p>Network architecture The neural networks have the following structure:
{ Input layer - Two sentences: the citation text and a sentence from the
reference document.
{ Embeddings layer - We tried with two collections of embeddings: Google</p>
        <p>
          News3 300 dimensions vectors and BabelNet[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ][
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] 300 dimensions vectors.
{ LSTM layers - One, two or three stacked bidirectional LSTM layers.
{ Dense layer - One fully connected layer.
{ Output layer - One unit indicating the probability that the sentence from
the reference document corresponds to the citation.
3 https://code.google.com/archive/p/word2vec/
        </p>
        <p>We carried di erent experiments using word embeddings or BabelNet synset
embeddings, the tokens in the input layer were words or synsets depending on
the experiment. The LSTM layers combine up to three layers and a dense layer
with sizes 150, 300, or 450. In all of our experiments we aimed to optimize against
our development set, which contains 25% of the CL-SciSumm 2018 training set.
Pre-training and Training We separated the training of the models in two
stages: pre-training and training. The 978 clusters of documents from the Yale
corpus were used to do a pre-training of the LSTM models. During pre-training,
we trained the models using 70% of the Yale corpus optimizing against the
remaining 30% using early stopping.</p>
        <p>After this pre-training phase was over, we trained the resulting model
using our CL-SciSumm 2018 training partition. We found out that, in general,
pre-training with the Yale corpus and then training with CL-SciSumm 2019
achieved better results than only training with CL-SciSumm, even if the Yale
data was automatically annotated. For the training stage, we used early stopping
optimizing against 20% of our training corpus.
2.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Unsupervised approach</title>
        <p>
          As in previous editions [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ][
          <xref ref-type="bibr" rid="ref4">4</xref>
          ][
          <xref ref-type="bibr" rid="ref3">3</xref>
          ], we used an unsupervised approach consisting in
comparing all the sentences in a reference document with a citation and returning
the most similar one according to certain metric. In this case, we transformed
all sentences and citations into BabelNet synsets and we took the centroid of
the synsets as a way of creating a sentence embedding. Then we used cosine
similarity two nd out which of the candidate sentences were more suitable.
2.3
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Voting System</title>
        <p>We submitted a voting system which considers sentences picked by two or more
of the previous mentioned systems for Task 1.
2.4</p>
      </sec>
      <sec id="sec-2-4">
        <title>Development results</title>
        <p>
          In this section, we describe our extractive text summarization approach based on
convolutional neural networks which extends on our previous work on trainable
summarization [
          <xref ref-type="bibr" rid="ref23 ref4">23, 4</xref>
          ]. The network generates a summary by selecting the most
relevant sentences from the RP using linguistic and semantic features from RP
and CPs. The aim of our CNN is to learn the relation between a sentence and a
scoring value indicating its relevance.
3.1
        </p>
      </sec>
      <sec id="sec-2-5">
        <title>Context Features</title>
        <p>In order to extract the linguistic information from both sources (RP and CPs),
we developed a complex feature extraction method to characterize each sentence
in the RP and its relation with the corresponding CPs.</p>
        <p>We extracted a set of numeric features some of which are based on comparing
a sentence to its (document or cluster) context:
{ Sentence Abstract Similarity Scores: the similarity of a sentence vector to
the author abstract vectors (three features).
{ Sentence Centroid Similarity Scores: the similarity of a sentence vector to
the article centroid (three features).
{ First Sentence Similarity Scores: the similarity of a sentence vector to the
vector of the rst sentence, that is, the title of the RP (three features).
{ Position Score: a score representing the position of the sentence in the article.</p>
        <p>
          Sentences at the beginning of the article have high scores and sentence at
the end of the article have low scores.
{ Position in Section Score: a score representing the position of the sentence
in the section of the article. Sentences in rst section get higher scores,
sentences in last section get low scores.
{ Position in a Speci c Section Score: a score representing the position of the
sentence in a particular section. Sentences at the beginning of the section
get higher scores and sentences at the end of the section get lower scores.
{ TextRank Normalized Scores: a sentence vector is computed to obtain a
normalized score using the TextRank algorithm [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] (three features).
{ Term Frequency Score: we sum up the tf*idf values of all words in the
sentence. Then, the obtained value is normalized using the set of scores from
the whole article.
{ Citation Marker Score: the ratio of the number of citation markers in the
sentence to the total number of citation markers in the article.
{ Rhetorical Class Probability Scores: probability of a sentence being in one
of ve possible rhetorical categories calculated by the Dr. Inventor
framework [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ].
{ Citing Paper Maximum Similarity Scores: each RP sentence vector is
compared to each citation vector in each CP to get the maximum possible cosine
similarity (three features).
{ Citing Paper Minimum Similarity Scores: each RP sentence vector is
compared to each citation vector in each CP to get the minimum possible cosine
similarity (three features).
{ Citing Paper Average Similarity Scores: each RP sentence vector is compared
to each citation vector and the average cosine value obtained (three features).
3.2
        </p>
      </sec>
      <sec id="sec-2-6">
        <title>Scoring Values</title>
        <p>
          As commented above, our CNN learns the relation between features and a score,
that is, a regression task by devising various scoring functions to represent the
likelihood of a sentence belonging to a summary (for abstract, community and
human). The nomenclature followed to symbolize a scoring function is SCSum,
where SC is the speci c scoring function (which is indicated bellow) and Sum
is any summary type: abstract (Abs), community (Com) or human (Hum). The
scoring functions are de ned bellow:
{ Cosine Distance: we calculated the maximum cosine similarity between each
sentence vector in the RP with each vector in the gold standard summaries.
This method produced three scoring functions (SUMMA (SUSum), ACL
(ACLSum), and Google (GoSum)) for each summary type.
{ ROUGE-2 Similarity: we also calculated similarities based on the overlap
of bigrams between sentences in the RP and gold standard summaries. In
this regard, each sentence in the RP is compared with each gold standard
summary using ROUGE-2 [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ]. The precision value from this comparison is
taken for the scoring function and is symbolized as R2Sum.
{ Scoring Functions Average: Moreover, we computed the average between all
scoring functions (SUMMA, ACL, Google and ROUGE-2) for each summary
type. In addition, we also calculated a simpli ed average with vectors do
not based on word-frequencies (ACL, Google and ROUGE-2). These scoring
functions are indicated as AvSum and SAvSum, respectively.
        </p>
        <p>Finally, these computation produced eighteen di erent functions to learn:
SUMMA (SU ), ACL (ACL) and Google (Go) vectors, ROUGE-2 (R2), Average
(Av) and Simpli ed Average (SAv) times abstract (Abs), community (Com),
human (Hum) summaries.</p>
      </sec>
      <sec id="sec-2-7">
        <title>Convolution Model</title>
        <p>
          Regarding the neural network hyperparameters, the CNN was de ned with the
Adadelta updater [
          <xref ref-type="bibr" rid="ref26">26</xref>
          ] and the gradients were computed using back-propagation
as Kim [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] and Nguyen [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ]. Also we used the sigmoid activation function, a
dropout rate of 0.5, l2 constraint of 3. For the convolutions, we applied 3 lter
window sizes (3, 4 and 5) to context features and 4 lter window sizes (2, 3, 4 and
5) to word embeddings. For each window were applied 150 lters for convolution.
Finally, for learning the regression task we applied a Mean Squared Error (MSE)
as loss function.
4
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Challenge Submissions</title>
      <sec id="sec-3-1">
        <title>For task 1, we sent the following four submissions:</title>
        <p>
          { run1: LSTM trained with Babelnet vectors with three layers of size 150.
{ run2: BabelNet centroids cosine similarity.
{ run3: LSTM trained with Google News vectors with two layers of size 150.
{ run4: Voting scheme based on [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>For task 2, the submissions we sent are the following:</title>
        <p>{ Similarity with the abstract from all similarity scores except SUMMA.
{ Similarity with the abstract from all scores.
{ Rouge based score similarity with the abstract.
{ ACL cosine similarity based score with the abstract.</p>
        <p>
          Finally, based on [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] we presented the results of a classi er that addresses
Task 1B of identifying the discourse facet for each identi ed cited sentence.
5
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Results</title>
      <p>
        The performance of our systems for task 1 over the test set is shown in table 2.
We can see that the LSTM approached underperformed compared to their results
over the development corpus, one possible cause for this is that the systems could
have over t to the training and development data. Out of the methods we tried,
the system that performs best for task 1 is still the voting scheme based on [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
The performance of our systems for task 2 over the test set is shown in table 3.
6
      </p>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>We have described the systems developed to participate in Tasks 1a, 1b and 2
in the CL-SciSumm 2019 summarization challenge. For Task 1a { which aimed
at identifying cited sentences {, we implemented supervised and unsupervised
methods. Our supervised systems are based on LSTM neural networks, while the
Run
run4 Voting scheme
run2 BabelNet centroids
run3 Google News LSTM
run1 BabelNet LSTM
unsupervised techniques take advantage of BabelNet synset embedding
representations. We also included a system that uses a voting scheme based on several
supervised and unsupervised approaches with many di erent system con
gurations.</p>
      <p>Regarding Task 2 { summarization proper {, we have developed a neural
network based on convolutions to learn a speci c scoring function. The CNN
model was fed by a combination of word embedding with sentence relevance and
citation features extracted from each document cluster (RP and CPs).</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>This work is (partly) supported by the Spanish Ministry of Economy and
Competitiveness under the Maria de Maeztu Units of Excellence Programme
(MDM2015-0502).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Abu-Jbara</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ezra</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          :
          <article-title>Purpose and polarity of citation: Towards nlp-based bibliometrics</article-title>
          .
          <source>In: HLT-NAACL</source>
          . pp.
          <volume>596</volume>
          {
          <issue>606</issue>
          (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2. AbuRa'ed,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Bravo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Chiruzzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Saggion</surname>
          </string-name>
          , H.:
          <article-title>Lastus/taln+ inco@ clscisumm 2018-using regression and convolutions for cross-document semantic linking and summarization of scholarly literature</article-title>
          .
          <source>In: Proceedings of the 3nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL2018)</source>
          . Ann Arbor,
          <source>Michigan (July</source>
          <year>2018</year>
          ) (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3. AbuRa'ed,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Chiruzzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Saggion</surname>
          </string-name>
          , H.:
          <article-title>What sentence are you referring to and why? identifying cited sentences in scienti c literature</article-title>
          .
          <source>In: RANLP 2017. International Conference Recent Advances in Natural Language Processing; 2017 Sep</source>
          <volume>2</volume>
          -8; Varna, Bulgaria.[
          <string-name>
            <surname>Stroudsburg</surname>
          </string-name>
          (PA)]: ACL;
          <year>2017</year>
          . p.
          <fpage>9</fpage>
          -
          <lpage>17</lpage>
          . ACL (
          <article-title>Association for Computational Linguistics) (</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. AbuRa'ed,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Chiruzzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Saggion</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Accuosto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Bravo</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          : Lastus/taln @ clscisumm-
          <fpage>17</fpage>
          :
          <article-title>Cross-document sentence matching and scienti c text summarization systems</article-title>
          . In:
          <article-title>Proceedings of the Computational Linguistics Scienti c Summarization Shared Task (CL-SciSumm 2017) organized as a part of the 2nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL</article-title>
          <year>2017</year>
          )
          <article-title>and co-located with the 40th</article-title>
          <source>International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR</source>
          <year>2017</year>
          ), Tokyo, Japan,
          <year>August 11</year>
          ,
          <year>2017</year>
          . pp.
          <volume>55</volume>
          {
          <issue>66</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Camacho-Collados</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pilehvar</surname>
            ,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Navigli</surname>
          </string-name>
          , R.:
          <article-title>Nasari: Integrating explicit knowledge and corpus statistics for a multilingual representation of concepts and entities</article-title>
          .
          <source>Arti cial Intelligence</source>
          <volume>240</volume>
          ,
          <fpage>36</fpage>
          {
          <fpage>64</fpage>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Chandrasekaran</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Freitag</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kan</surname>
          </string-name>
          , M.Y.:
          <article-title>Overview and Results: CL-SciSumm SharedTask 2019</article-title>
          .
          <source>Proceedings of the 4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL</source>
          <year>2019</year>
          ) @
          <source>SIGIR</source>
          <year>2019</year>
          (
          <year>2019</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Jaidka</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chandrasekaran</surname>
            ,
            <given-names>M.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jain</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kan</surname>
          </string-name>
          , M.Y.:
          <article-title>The cl-scisumm shared task 2017: results and key insights</article-title>
          . In:
          <article-title>Proceedings of the Computational Linguistics Scienti c Summarization Shared Task (CL-SciSumm 2017), organized as a part of the 2nd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL</article-title>
          <year>2017</year>
          )
          <article-title>(</article-title>
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Jaidka</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chandrasekaran</surname>
            ,
            <given-names>M.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rustagi</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kan</surname>
          </string-name>
          , M.Y.:
          <article-title>Insights from clscisumm 2016: the faceted scienti c document summarization shared task</article-title>
          .
          <source>International Journal on Digital</source>
          Libraries pp.
          <volume>1</volume>
          {
          <issue>9</issue>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Jaidka</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chandrasekaran</surname>
            ,
            <given-names>M.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Elizalde</surname>
            ,
            <given-names>B.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jha</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kan</surname>
            ,
            <given-names>M.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Khanna</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Molla-Aliod</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ronzano</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saggion</surname>
          </string-name>
          , H.:
          <article-title>The computational linguistics summarization pilot task</article-title>
          .
          <source>In: Proceedings of TAC</source>
          <year>2014</year>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Jaidka</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yasunaga</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chandrasekaran</surname>
            ,
            <given-names>M.K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kan</surname>
          </string-name>
          , M.Y.:
          <article-title>The CLSciSumm Shared Task 2018: Results and Key Insights</article-title>
          .
          <source>Proceedings of the 3rd Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL</source>
          <year>2018</year>
          )
          <article-title>co-located with the 41st</article-title>
          <source>International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR</source>
          <year>2018</year>
          ) (
          <year>July 2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Convolutional neural networks for sentence classi cation</article-title>
          .
          <source>arXiv preprint arXiv:1408.5882</source>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Lin</surname>
            ,
            <given-names>C.Y.</given-names>
          </string-name>
          :
          <article-title>ROUGE: A package for automatic evaluation of summaries</article-title>
          .
          <source>In: Text summarization branches out: Proceedings of the ACL-04 workshop</source>
          . vol.
          <volume>8</volume>
          .
          <string-name>
            <surname>Barcelona</surname>
          </string-name>
          ,
          <string-name>
            <surname>Spain</surname>
          </string-name>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Luhn</surname>
            ,
            <given-names>H.P.:</given-names>
          </string-name>
          <article-title>The automatic creation of literature abstracts</article-title>
          .
          <source>IBM J. Res. Dev</source>
          .
          <volume>2</volume>
          (
          <issue>2</issue>
          ),
          <volume>159</volume>
          {165 (Apr
          <year>1958</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Ma</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          , Zhang, H.,
          <string-name>
            <surname>Xu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <surname>C.</surname>
          </string-name>
          : Njust@
          <fpage>clscisumm</fpage>
          -
          <lpage>18</lpage>
          . In: BIRNDL@ SIGIR. pp.
          <volume>114</volume>
          {
          <issue>129</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Mihalcea</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tarau</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : Textrank:
          <article-title>Bringing order into text</article-title>
          .
          <source>In: Proceedings of the 2004 conference on empirical methods in natural language processing</source>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Navigli</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ponzetto</surname>
            ,
            <given-names>S.P.:</given-names>
          </string-name>
          <article-title>BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network</article-title>
          .
          <source>Artif. Intell</source>
          .
          <volume>193</volume>
          ,
          <issue>217</issue>
          {250 (Dec
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Nguyen</surname>
            ,
            <given-names>T.H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grishman</surname>
          </string-name>
          , R.:
          <article-title>Relation extraction: Perspective from convolutional neural networks</article-title>
          .
          <source>In: Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing</source>
          . pp.
          <volume>39</volume>
          {
          <issue>48</issue>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Nomoto</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Resolving citation links with neural networks</article-title>
          .
          <source>Frontiers in Research Metrics and Analytics</source>
          <volume>3</volume>
          ,
          <issue>31</issue>
          (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Paice</surname>
            ,
            <given-names>C.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jones</surname>
            ,
            <given-names>P.A.</given-names>
          </string-name>
          :
          <article-title>The identi cation of important concepts in highly structured technical papers</article-title>
          .
          <source>In: Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          . pp.
          <volume>69</volume>
          {
          <fpage>78</fpage>
          . SIGIR '93,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>1993</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Qazvinian</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          :
          <article-title>Scienti c paper summarization using citation summary networks</article-title>
          .
          <source>In: Proceedings of the 22Nd International Conference on Computational Linguistics - Volume 1</source>
          . pp.
          <volume>689</volume>
          {
          <fpage>696</fpage>
          . COLING '
          <volume>08</volume>
          ,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computational Linguistics, Stroudsburg, PA, USA (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Qazvinian</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          :
          <article-title>Identifying non-explicit citing sentences for citationbased summarization</article-title>
          .
          <source>In: ACL</source>
          <year>2010</year>
          ,
          <article-title>Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics</article-title>
          ,
          <source>July 11-16</source>
          ,
          <year>2010</year>
          , Uppsala, Sweden. pp.
          <volume>555</volume>
          {
          <issue>564</issue>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Ronzano</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Saggion</surname>
          </string-name>
          , H.: Dr. Inventor Framework:
          <article-title>Extracting structured information from scienti c publications</article-title>
          .
          <source>In: International Conference on Discovery Science</source>
          . pp.
          <volume>209</volume>
          {
          <fpage>220</fpage>
          . Springer (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Saggion</surname>
            , H., AbuRa'ed,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ronzano</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Trainable citation-enhanced summarization of scienti c articles</article-title>
          .
          <source>In: Proceedings of the Joint Workshop on Bibliometricenhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL) co-located with the Joint Conference on Digital Libraries</source>
          <year>2016</year>
          (
          <article-title>JCDL 2016), Newark</article-title>
          , NJ, USA, June 23,
          <year>2016</year>
          . pp.
          <volume>175</volume>
          {
          <issue>186</issue>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>Saggion</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lapalme</surname>
          </string-name>
          , G.:
          <article-title>Concept identi cation and presentation in the context of technical text summarization</article-title>
          .
          <source>In: Proceedings of the 2000 NAACL-ANLP Workshop on Automatic Summarization</source>
          . pp.
          <volume>1</volume>
          {
          <fpage>10</fpage>
          . Association for Computational Linguistics, Stroudsburg, PA, USA (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Saggion</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lapalme</surname>
          </string-name>
          , G.:
          <article-title>Generating indicative-informative summaries with sumum</article-title>
          .
          <source>Comput. Linguist</source>
          .
          <volume>28</volume>
          (
          <issue>4</issue>
          ),
          <volume>497</volume>
          {526 (Dec
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Zeiler</surname>
          </string-name>
          , M.D.:
          <article-title>Adadelta: an adaptive learning rate method</article-title>
          .
          <source>arXiv preprint arXiv:1212.5701</source>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>