<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Editorial for the 5th Bibliometric-enhanced Information Retrieval Workshop at ECIR 2017</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Philipp Mayr</string-name>
          <email>philipp.mayr@gesis.org</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ingo Frommholz</string-name>
          <email>ingo.frommholz@beds.ac.uk</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Guillaume Cabanac</string-name>
          <email>guillaume.cabanac@univ-tlse3.fr</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>GESIS - Leibniz-Institute for the Social Sciences</institution>
          ,
          <addr-line>Cologne</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Institute for Research in Applicable Computing, University of Bedfordshire</institution>
          ,
          <addr-line>Luton</addr-line>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Toulouse, Computer Science Department</institution>
          ,
          <addr-line>IRIT UMR 5505</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2017</year>
      </pub-date>
      <abstract>
        <p>Overview of the papers This year 16 papers were submitted to the workshop, 11 of which were finally accepted for presentation and inclusion in the proceedings: 6 regular papers and 5 posters. The workshop featured one keynote talk, three full paper sessions and one poster session. The following section briefly describes the keynote and sessions.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>
        Following the successful workshops at ECIR 20144, 20155 and 20166 [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ],
respectively, this workshop was the fifth in a series of events that brought together
experts of communities which often have been perceived as different ones:
bibliometrics / scientometrics / informetrics on the one hand and information
retrieval on the other. Our motivation as organizers of the workshop started from
the observation that main discourses in both fields are different, that
communities are only partly overlapping and from the belief that a knowledge transfer
would be profitable for both sides [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. This fifth full-day Bibliometric-enhanced
Information Retrieval (BIR) workshop7 at ECIR 2017 aimed to foster a
common ground for the incorporation of bibliometric-enhanced services into
scholarly search engine interfaces. In particular we addressed specific communities,
as well as studies on large, cross-domain collections like Web of Science, Scopus
or Mendeley. This fifth BIR workshop addressed explicitly both scholarly and
industrial researchers.
2.1
      </p>
      <sec id="sec-1-1">
        <title>Keynote</title>
        <p>
          The invited paper “Real-World Recommender Systems for Academia: The Pain
and Gain in Building, Operating, and Researching them” [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] by Joeran Beel
(Trinity College Dublin, Ireland) gives a very insightful overview of the
practical experiences in building scholarly document recommender systems for Digital
Libraries. The authors Beel and Dinesh report about their research with three
different recommender systems which have been implemented and operated in
the last six years. They present empirical results of various studies, discuss
challenges like running A/B testing with real-world scholarly recommender systems
and perform research against competitive benchmarks.
2.2
        </p>
      </sec>
      <sec id="sec-1-2">
        <title>Session 1: Full papers</title>
        <p>
          In the paper “Manuscript Matcher: A Content and Bibliometrics-based Scholarly
Journal Recommendation System” [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], Jason Rollins, Meredith McCusker, Joel
Carlson and Jon Stroll present a scholarly journal recommendation system called
Manuscript Matcher which is developed and run by Clarivate (formerly
Thomson Reuters). The use case of the tool is uploading manuscript title, abstract
and references to Manuscript Matcher and getting back bibliometric-informed
recommendations of journals (“best fit” publications). The authors present user
feedback of the recommendation system and future directions.
        </p>
        <p>
          In their paper “Use of Locality Sensitive Hashing (LSH) Algorithm to Match
Web of Science and SCOPUS” [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], Mehmet Ali Abdulhayoglu and Bart Thijs
report on an attempt to match the records of two flagship bibliographic databases.
They considered various metadata (e.g., publication title, venue name, bylines)
whilst disregarding identifiers such as DOIs, as these are not always available
or assigned. Their efficient approach based on LSH found a 70% intersection
between these in about an hour. This research contributes to the understanding
of the coverage of leading bibliographic databases.
2.3
        </p>
      </sec>
      <sec id="sec-1-3">
        <title>Session 2: Full papers</title>
        <p>
          The paper “Academic Search in Response to Major Scientific Events” [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] by Li
and de Rijke describes search behaviour of academic and web users in occurrence
of major scientific events (the Nobel Prize announcements of Chemistry, Physics
and Medicine in 2014). The authors compare the query patterns in the query log
of the academic search engine ScienceDirect with the data provided by Google
Trends. Google Trend is used as a proxy to observe users on the web. They found
unique trends for the academic searchers, which are different from users of a web
search engine.
        </p>
        <p>
          The paper “Exploring Choice Overload in Related-Article Recommendations
in Digital Libraries” [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] by Beierle, Aizawa and Beel studies choice overload in
scholarly document recommendation in the social sciences search engine sowiport.
The authors used click-through rate of different amounts of recommendations
as a measure of recommendation effectiveness. Their preliminary results show
lower click-through rates for higher numbers of recommendations. According to
the experiments, users in the social sciences seem to feel quickly overloaded by
increasing choice.
2.4
        </p>
      </sec>
      <sec id="sec-1-4">
        <title>Session 3: Full papers</title>
        <p>
          The article “Computing Interdisciplinarity of Scholarly Objects using an
AuthorCitation-Text Model” [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] by Seo, Jung, Kim and Myaeng discusses the
computation of the degree of interdisciplinarity of a scholarly object (e.g., an article).
To this end, three different sources are used: the author network, the citation
network and the actual text. Furthermore, an alternative to measure
interdisciplinarity is discussed. Experiments show that the combination of the three
aspects author, citations and text of articles can accurately predict the
discipline distributions.
        </p>
        <p>
          In their paper “Detecting Automatically Generated Sentences with
Grammatical Structure Similarity” [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], Nguyen Minh Tien and Cyril Labbé tackle the
issue of spotting machine generated texts at the sentence level. They introduce a
grammatical structure similarity and benchmark it to detect passages stemming
from known generators: 80% positive detection rate and less than 1% false
detection rate. Editorial workflows could integrate this effective approach to detect
questionable manuscripts that editorial staff should check before sending to peer
review.
2.5
        </p>
      </sec>
      <sec id="sec-1-5">
        <title>Poster session</title>
        <p>
          Langer and Beel discuss the use of Lucene in the Docear research paper
recommender in their article “Apache Lucene as Content-Based-Filtering
Recommender System: 3 Lessons Learned” [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]. They compare Lucene’s relevance score
to the click-through rate of a document, finding that Lucene’s scores indeed can
be used to determine relevance. The authors also observed that returning ten
recommendations out of the top 50 results might be sensible. Furthermore, Lucene
is suitable to approximate the recommendation effectiveness.
        </p>
        <p>
          In their paper “Extending Scientific Literature Search by Including the
Author’s Writing Style” [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], Andi Rexha, Mark Kröll, Hermann Ziak, and Roman
Kern consider authors’ writing style as a potential feature for paper retrieval and
recommendation. They report the results of a pilot study questioning the extent
to which individuals identify similarities in authorship. This is a challenging task,
even for humans.
        </p>
        <p>
          In his paper “Drakkar: a graph based All-Nearest Neighbour search algorithm
for bibliographic coupling” [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], Bart Thijs discusses the creation of bibliographic
coupling graphs based on citations. The proposed algorithm utilizes a bipartite
graph constituted by the citing publications and the cited references as well as
directed citations.
        </p>
        <p>
          Siebert, Dinesh and Feyer discuss how scientific recommender systems can be
improved by incorporating scientometric measures. In their paper “Extending a
Research-Paper Recommendation System with Scientometric Measures” [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] the
authors evaluate different reranking approaches in the context of the Mr. DLib
research paper recommender system. Readership data is used as an
approximation for citation.
        </p>
        <p>
          In their paper “Semantic embedding for information retrieval”, Wang and
Koopman [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] combine bibliometric measures with word embeddings. Word
embedding results of well-known systems such as Word2Vec/Doc2Vec and GloVe
are compared to the Ariadne approach, showing that Ariadne exhibits a
competitive performance in a document embedding for information retrieval task.
3
        </p>
      </sec>
    </sec>
    <sec id="sec-2">
      <title>Outlook</title>
      <p>With this continuing workshop series we have built up a sequence of explorations,
visions, results documented in scholarly discourse, and created a sustainable
bridge between bibliometrics and IR.</p>
      <p>This year, the authors of accepted papers at the 5th BIR workshop were
invited to submit extended versions to a Special Issue on “Bibliometric-enhanced
IR” of the Scientometrics 8 journal to be published in 2018.</p>
      <p>As a next iteration we will organize a Joint Workshop on
Bibliometricenhanced Information Retrieval and Natural Language Processing for Digital
Libraries (BIRNDL 2017)9 at the 40th International ACM SIGIR Conference on
Research and Development in Information Retrieval (SIGIR 2017). The BIRNDL
workshop will be co-organized together with the natural language processing
group community and includes a shared task (the CL-SciSumm Shared Task10).
The shared task tackles automatic paper summarization in the Computational
Linguistics (CL) domain.
8 http://www.springer.com/journal/11192
9 http://wing.comp.nus.edu.sg/birndl-sigir2017/
10 http://wing.comp.nus.edu.sg/cl-scisumm2017/</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frommholz</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cabanac</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <source>Report on the 3rd International Workshop on Bibliometric-enhanced Information Retrieval (BIR</source>
          <year>2016</year>
          ).
          <source>SIGIR Forum</source>
          <volume>50</volume>
          (
          <issue>1</issue>
          ) (
          <year>2016</year>
          )
          <fpage>28</fpage>
          -
          <lpage>34</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scharnhorst</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Scientometrics and Information Retrieval: weak-links revitalized</article-title>
          .
          <source>Scientometrics</source>
          <volume>102</volume>
          (
          <issue>3</issue>
          ) (
          <year>2015</year>
          )
          <fpage>2193</fpage>
          -
          <lpage>2199</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Beel</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dinesh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Real-World Recommender Systems for Academia: The Pain and Gain in Developing, Operating, and Researching them</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometric-enhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ), CEURWS.org (
          <year>2017</year>
          )
          <fpage>6</fpage>
          -
          <lpage>17</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Rollins</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCusker</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carlson</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stroll</surname>
          </string-name>
          , J.:
          <article-title>Manuscript Matcher: A Content and Bibliometrics-based Scholarly Journal Recommendation System</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometric-enhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ),
          <article-title>CEUR-WS.org (</article-title>
          <year>2017</year>
          )
          <fpage>18</fpage>
          -
          <lpage>29</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Abdulhayoglu</surname>
            ,
            <given-names>M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thijs</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Use of Locality Sensitive Hashing (LSH) algorithm to match Web of Science and SCOPUS</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometric-enhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ),
          <article-title>CEUR-WS.org (</article-title>
          <year>2017</year>
          )
          <fpage>30</fpage>
          -
          <lpage>40</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>de Rijke</surname>
          </string-name>
          , M.: Academic Search in Response to Major Scientific Events.
          <source>In: Proc. of the 5th Workshop on Bibliometric-enhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ),
          <article-title>CEUR-WS.org (</article-title>
          <year>2017</year>
          )
          <fpage>41</fpage>
          -
          <lpage>50</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Beierle</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Aizawa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beel</surname>
          </string-name>
          , J.:
          <article-title>Exploring Choice Overload in Related-Article Recommendations in Digital Libraries</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometricenhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ),
          <article-title>CEUR-WS.org (</article-title>
          <year>2017</year>
          )
          <fpage>51</fpage>
          -
          <lpage>61</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Seo</surname>
            ,
            <given-names>M.G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jung</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>K.m.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Myaeng</surname>
            ,
            <given-names>S.H.</given-names>
          </string-name>
          :
          <article-title>Computing Interdisciplinarity of Scholarly Objects using an Author-Citation-Text Model</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometric-enhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ), CEURWS.org (
          <year>2017</year>
          )
          <fpage>62</fpage>
          -
          <lpage>72</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Nguyen</surname>
            ,
            <given-names>M.T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Labbé</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Detecting Automatically Generated Sentences with Grammatical Structure Similarity</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometricenhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ),
          <article-title>CEUR-WS.org (</article-title>
          <year>2017</year>
          )
          <fpage>73</fpage>
          -
          <lpage>84</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Langer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Beel</surname>
          </string-name>
          , J.:
          <article-title>Apache Lucene as Content-Based-Filtering Recommender System: 3 Lessons Learned</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometric-enhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ),
          <article-title>CEUR-WS.org (</article-title>
          <year>2017</year>
          )
          <fpage>85</fpage>
          -
          <lpage>92</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Rexha</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kröll</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ziak</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kern</surname>
          </string-name>
          , R.:
          <article-title>Extending Scientific Literature Search by Including the Author's Writing Style</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometric-enhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ),
          <article-title>CEUR-WS.org (</article-title>
          <year>2017</year>
          )
          <fpage>93</fpage>
          -
          <lpage>100</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Thijs</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Drakkar: a graph based All-Nearest Neighbour Search Algorithm for Bibliographic Coupling</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometric-enhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ),
          <article-title>CEUR-WS.org (</article-title>
          <year>2017</year>
          )
          <fpage>101</fpage>
          -
          <lpage>111</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Siebert</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dinesh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Feyer</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Extending a Research Paper Recommendation System with Bibliometric Measures</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometricenhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ),
          <article-title>CEUR-WS.org (</article-title>
          <year>2017</year>
          )
          <fpage>112</fpage>
          -
          <lpage>121</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Koopman</surname>
          </string-name>
          , R.:
          <article-title>Semantic Embedding for Information Retrieval</article-title>
          .
          <source>In: Proc. of the 5th Workshop on Bibliometric-enhanced Information Retrieval (BIR</source>
          <year>2017</year>
          ),
          <article-title>CEUR-WS.org (</article-title>
          <year>2017</year>
          )
          <fpage>122</fpage>
          -
          <lpage>132</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>