<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Editorial for the Second Workshop on Mining Scientific Papers: Computational Linguistics and Bibliometrics (CLBib2017)</article-title>
      </title-group>
      <contrib-group>
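        <contrib contrib-type="author">
          <name>
            <surname>Atanassova</surname>
            <given-names>Iana</given-names>
          </name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Bertin</surname>
            <given-names>Marc</given-names>
          </name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Mayr</surname>
            <given-names>Philipp</given-names>
          </name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centre Tesnière - CRIT, University of Bourgogne Franche-Comté</institution>
          ,
          <addr-line>Besançon</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>GESIS - Leibniz-Institute for the Social Sciences</institution>
          ,
          <addr-line>Cologne</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>ELICO Laboratory, University of Lyon</institution>
          ,
          <addr-line>Lyon</addr-line>
          ,
          <country country="FR">France</country>
        </aff>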
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <abstract>
        <p>Scope and Motivation. The CLBib workshops aim to bring together researchers in bibliometrics and computational linguistics in order to study the ways bibliometrics can benefit from large-scale text analytics and sense mining of scientific papers, thus exploring the interdisciplinarity of Bibliometrics and Natural Language Processing. Working with full text allows us to go beyond the metadata used in bibliometrics. Full text offers a new field of investigation, where the major problems arise around the organization and structure of text, the extraction of information and its representation on the level of metadata. Furthermore, the study of contexts around in-text citations offers new perspectives related to the semantic dimension of citations. The analyses of citation contexts and the semantic categorization of publications will allow us to rethink co-citation networks, bibliographic coupling and other bibliometric techniques.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>The first edition of this workshop<sup>1</sup>, co-located with the International Society of Scientometrics and Informetrics
Conference (ISSI) in 2015, attracted more than 70 participants and six full paper contributions, showing strong
interest in these topics within the community. From a technical point of view, as noted during the first edition of the workshop,
the efforts to provide articles in machine-readable formats and the rise of Open Access publishing have resulted
in a number of standardized formats for scientific papers, full-text datasets and corpora for research experiments,
and a number of open source tools for versatile text processing.</p>
      <p>The goal of this second edition of the CLBib workshop, co-located with the ISSI 2017 conference, is to continue
to encourage collaboration between these two domains and to answer questions such as: How can we enhance
author network analysis and Bibliometrics using data obtained by text analytics? What insights can NLP provide
on the structure of scientific writing, on citation networks, and on in-text citation analysis? Natural Language
Processing and Bibliometrics meet again in this second workshop in a context where Open Access is at the heart
of exchanges between scientists and publishers and raises many economic and ethical issues, but also new research
problems through access to articles in full text. Indeed, the possibility of enriching the metadata currently used
in bibliometrics with information from the text is an essential step towards building the tools of tomorrow.</p>
      <p>As the CLBib 2017 workshop was held in China, at Wuhan University, the discussions raised important
questions not only around the processing of scientific papers but also on the need to take into account the
multilingual aspect of scientific production. Even if English is essential today on the international stage,
national-level publications can also be rich in information and relevant for bibliometric studies. The linguistic
aspect, which is increasingly present at the ISSI conference, must be taken into consideration; it highlights
the importance of this workshop series and the growing interest in Natural Language Processing among
bibliometricians and in other communities.</p>
    </sec>
    <sec id="sec-2">
      <title>Overview of the papers</title>
      <p>
        1 See the proceedings of the first edition of the workshop: http://ceur-ws.org/Vol-1384/, [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        2 https://easychair.org/cfp/CLBib2017
      </p>
      <p>
        Gu Dongxiao and Shi Jin [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Among the methods used are social network analysis, keyword and
co-word analysis, and clustering. Starting from the hypothesis that authors who work on similar topics and keywords
are potential collaborators, this paper provides a method for author similarity analysis.
      </p>
    </sec>
    <sec id="sec-3">
      <title>Outlook</title>
      <p>Interest in this interdisciplinary research has grown in recent years (see e.g. the workshops
BIRNDL - "Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries"
and WoSP - "Workshop on Mining Scientific Publications"), and the CLBib workshop series has so far
shown that both Natural Language Processing and Bibliometrics can benefit from addressing the problem
of full-text processing of papers.</p>
      <p>As a result of this workshop series, a new Research Topic "Mining Scientific Papers: NLP-enhanced
Bibliometrics"<sup>3</sup> has been launched as part of the Open Access journal "Frontiers in Research Metrics and Analytics".
We intend to continue the effort to bring both communities together and foster the development
of semantic technologies dedicated to Bibliometrics and Scientometrics.</p>
      <p>Acknowledgements.
Part of this research has been funded by the FEDER (Fonds européen de développement régional) and selected
by the French-Swiss programme Interreg V: Webso+ project<sup>4</sup>.</p>
      <p>3 https://www.frontiersin.org/research-topics/7043/mining-scientific-papers-nlp-enhanced-bibliometrics</p>
      <p>4 http://tesniere.univ-fcomte.fr/projet-webso/</p>
      <p>[9] Wang, J., Ma, S., Zhang, C.: CitationAS: A summary generation tool based on clustering of retrieved
citation content. In: Atanassova, I., Bertin, M., Mayr, P. (eds.) 2nd Workshop on Mining Scientific Papers:
Computational Linguistics and Bibliometrics co-located with 16th International Conference on Scientometrics
and Informetrics (ISSI 2017). CEUR-WS.org (2017)</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Atanassova</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bertin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Editorial for the first workshop on mining scientific papers: Computational linguistics and bibliometrics</article-title>
          . In:
          <source>Proceedings of the First Workshop on Mining Scientific Papers: Computational Linguistics and Bibliometrics co-located with 15th International Society of Scientometrics and Informetrics Conference (ISSI 2015), Istanbul, Turkey, June 29, 2015</source>
          . pp.
          <fpage>1</fpage>
          –
          <lpage>4</lpage>
          (
          <year>2015</year>
          ), http://ceur-ws.org/Vol-1384/editorial.pdf
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Bertin</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Atanassova</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lariviere</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gingras</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>The invariant distribution of references in scientific articles</article-title>
          .
          <source>Journal of the Association for Information Science and Technology (JASIST)</source>
          <volume>67</volume>
          (
          <issue>1</issue>
          ),
          <fpage>164</fpage>
          –
          <lpage>177</lpage>
          (
          <year>2016</year>
          ), http://dx.doi.org/10.1002/asi.23367
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature</article-title>
          .
          <source>Journal of the American Society for Information Science and Technology</source>
          <volume>57</volume>
          (
          <issue>3</issue>
          ),
          <fpage>359</fpage>
          –
          <lpage>377</lpage>
          (
          <year>2006</year>
          ), http://dx.doi.org/10.1002/asi.20317
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Gu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bichindaritz</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Temporal evolution, research themes, and emerging trends in case-based reasoning literature</article-title>
          . In:
          <string-name><surname>Atanassova</surname>, <given-names>I.</given-names></string-name>,
          <string-name><surname>Bertin</surname>, <given-names>M.</given-names></string-name>,
          <string-name><surname>Mayr</surname>, <given-names>P.</given-names></string-name>
          (eds.)
          <source>2nd Workshop on Mining Scientific Papers: Computational Linguistics and Bibliometrics co-located with 16th International Conference on Scientometrics and Informetrics (ISSI 2017)</source>
          . CEUR-WS.org (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>He</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Understanding the changing roles of scientific publications via citation embeddings</article-title>
          . In:
          <string-name><surname>Atanassova</surname>, <given-names>I.</given-names></string-name>,
          <string-name><surname>Bertin</surname>, <given-names>M.</given-names></string-name>,
          <string-name><surname>Mayr</surname>, <given-names>P.</given-names></string-name>
          (eds.)
          <source>2nd Workshop on Mining Scientific Papers: Computational Linguistics and Bibliometrics co-located with 16th International Conference on Scientometrics and Informetrics (ISSI 2017)</source>
          . CEUR-WS.org (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scharnhorst</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Combining bibliometrics and information retrieval: preface</article-title>
          .
          <source>Scientometrics</source>
          <volume>102</volume>
          (
          <issue>3</issue>
          ),
          <fpage>2191</fpage>
          –
          <lpage>2192</lpage>
          (Mar
          <year>2015</year>
          ), https://doi.org/10.1007/s11192-015-1529-2
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <surname>Peng</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gu</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Mining the potential collaborative relationships based on the author keyword coupling analysis and social network analysis</article-title>
          .
          In:
          <string-name><surname>Atanassova</surname>, <given-names>I.</given-names></string-name>,
          <string-name><surname>Bertin</surname>, <given-names>M.</given-names></string-name>,
          <string-name><surname>Mayr</surname>, <given-names>P.</given-names></string-name>
          (eds.)
          <source>2nd Workshop on Mining Scientific Papers: Computational Linguistics and Bibliometrics co-located with 16th International Conference on Scientometrics and Informetrics (ISSI 2017)</source>
          . CEUR-WS.org (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>Shotton</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>CiTO, the citation typing ontology</article-title>
          .
          <source>Journal of Biomedical Semantics</source>
          <volume>1</volume>
          (
          <issue>1</issue>
          ),
          <fpage>S6</fpage>
          (Jun
          <year>2010</year>
          ), https://doi.org/10.1186/2041-1480-1-S1-S6
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>