<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Bibliometrics, Information Retrieval and Natural Language Processing: Natural Synergies to Support Digital Library Research</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Dietmar Wolfram</string-name>
          <email>dwolfram@uwm.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Information Studies, University of Wisconsin-Milwaukee P.</institution>
          <addr-line>O. Box 413, Milwaukee, WI U.S.A. 53201</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2016</year>
      </pub-date>
      <fpage>6</fpage>
      <lpage>13</lpage>
      <abstract>
        <p>Historically, researchers have not fully capitalized on the potential synergies that exist between bibliometrics and information retrieval (IR). Knowledge of regularities in information production and use, as well as citation relationships in bibliographic databases that are studied in bibliometrics, can benefit IR system design and evaluation. Similarly, techniques developed for IR and database technology have made the investigation of large-scale bibliometric phenomena feasible. Both fields of study have also benefitted directly from developments in natural language processing (NLP), which have provided new tools and techniques to explore research problems in bibliometrics and IR. Digital libraries, with their full text, multimedia content, along with searching and browsing capabilities, represent ideal environments in which to investigate the mutually beneficial relationships that can be forged among bibliometrics, IR and NLP. This brief presentation highlights the symbiotic relationship that exists among bibliometrics, IR and NLP.</p>
      </abstract>
      <kwd-group>
        <kwd>bibliometrics</kwd>
        <kwd>information retrieval</kwd>
        <kwd>digital libraries</kwd>
        <kwd>natural language processing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Both information retrieval (IR) and bibliometrics have long histories as
distinct areas of investigation in information science. IR has focused on
the storage, representation and retrieval of documents (text or other
media) from the system and user perspectives. Bibliometrics and its
allied areas (informetrics, scientometrics, webmetrics, or simply
“metrics”) have focused on discovering and understanding regularities that
exist in the way information is produced and used--but this simple
definition belies the breadth of research undertaken. Units of analysis may
be words, metadata fields, publications, authors, journals, research
groups, institutions, sub-fields, disciplines or geographic regions.
Applications of the study of these regularities and underlying processes
extend to equally diverse areas such as science policy, subject indexing
and IR system design, including digital libraries (DLs).</p>
      <p>
        The documentary contents of bibliographic IR systems and their
associated indexes provide much of the data that metrics researchers rely
on today to conduct their research. Conversely, many of the processes
within IR are directly informed by metrics research. Surprisingly, there
has been little overlap in the research agendas of these two areas of
study over the history of information science. This is changing with the
growing recognition of the common interests and the tools and
techniques used by IR and metrics researchers that can help to advance the
research agendas of each field [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. The recent international BIR
workshops [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">2,3,4</xref>
        ] have brought together researchers with combined interests
in bibliometrics and IR. The BIR research presentations demonstrate
the potential synergies that exist between these two areas. With large,
full text databases now commonplace, both IR and bibliometrics have
also benefitted from developments in natural language processing
(NLP) and computational linguistics, where text-based techniques have
improved document retrieval and the ability to explore relationships
among entities of interest in metrics research.
      </p>
      <p>In a sense, digital libraries represent an ideal environment in which
to study the intersection of IR, bibliometrics and NLP, whether the DLs
consist of repositories of formal publications or heterogeneous
collections of multimedia documents. With content representation and search
functionality found in more traditional IR environments, hyperlinks that
mimic relationships similar to those found in citation analysis, and full
text contents that lend themselves to NLP analysis. This brief overview
outlines how developments in IR and bibliometrics, along with
language-based methods, have helped to advance research in both areas.
These intersections are most evident in bibliographic databases, but
also extend to more heterogeneous digital libraries.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Information Retrieval Research</title>
      <p>
        IR research has focused on the design of more efficient and effective
systems to match documents to queries to meet the information needs
of users. Early IR research was more system-centered, but
usercentered approaches are now employed alongside system-centered
approaches in the design and evaluation of IR systems. Bibliometrics
provides useful methods for the analysis of both system content and
system usage, to better understand system processes and user dynamics. In
essence, IR processes represent forms of information production and
use [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], where observed patterns follow classic power law distributions
or possibly unimodal distributions (e.g., lognormal or Poisson-like).
Furthermore, metric studies of scientific communication can provide
frameworks to assist in the design of IR systems.
      </p>
      <p>
        Aspects of IR system content that lend themselves to bibliometric
modeling include index term frequency distributions, indexing
exhaustivity or term assignment, term co-occurrence frequency
distributions, database and index growth, and more recently, aspects of
webbased IR such as frequency distributions of inlinks/outlinks and
document persistence [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Applications of bibliometric modeling to IR
systems have included the development of simulations to model IR system
processes to better understand system interactions. These models also
can be used to observe retrieval efficiency under different situations to
identify preferred file structures or for space planning [
        <xref ref-type="bibr" rid="ref7 ref8">7,8</xref>
        ].
      </p>
      <p>
        User interactions with IR systems as recorded by transaction logs
may be similarly modeled. Characteristics that can be modeled include
search terms used per query, frequency distributions of query terms,
query term co-occurrence, query frequency distributions, search session
length based on queries or other actions, user search and browsing
regularities, and site/document visitation frequency. Search and
browsing patterns may be modeled using network analysis methods, similar
to those applied in citation analysis, to identify issues with interface use
[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], or clustering techniques may be applied to identify larger scale
search session patterns [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>
        Beyond mathematical modeling of specific bibliometric
characteristics of IR systems, higher level aspects of science models also can
inform the search and retrieval process of scientific literature. Mutschke
and Mayr [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], for instance, recognize this important observation and
demonstrate that retrieval performance can benefit by ranking results
based on our understanding of models of science.
      </p>
      <p>
        Citation-based connections between bibliographic records serve as a
readily exploitable data source for expanding searching and browsing
options for users. These ideas have been implemented in experimental
systems over the past several decades such as I3R [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and BIRS [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ],
with the latter also supporting query expansion through visualizations
of relationships among searchable entities. The importance of citation
linkages is also recognized by commercial database vendors such as
EBSCO and ProQuest, which now provide active hyperlinks to
references included in full text articles and to citing articles, where
available.
      </p>
      <p>For much of the history of IR, the research focus was on the
representation and retrieval of document surrogates with limited natural
language. Representation and retrieval were based on metadata fields,
keywords or controlled vocabularies. Models to support query and
document matching were based on bag-of-words approaches that treated
terms independently of one another. Context and semantics played no
role in determining the aboutness of documents. At the same time,
computational challenges increased as the size of databases grew. IR
models such as the vector space model became more computationally
expensive due to the high dimensionality associated with representing
documents and processing document-query matching.</p>
      <p>
        IR research has been more responsive to taking advantage of
advanced NLP techniques to provide a more natural search environment
for users and to extend retrieval evaluation beyond simple term
independence models. Dimensionality reduction techniques that lower the
computational overhead associated with IR processing have become
commonplace since early approaches based on latent semantic analysis
were introduced over 25 years ago. Successors based on language
modeling [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], including topic modeling [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], have provided novel ways to
tackle information search and retrieval. The applications have more
recently included the recommendation of scientific articles [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] and the
identification of search terms [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]. Language-based techniques have
also been used to identify additional index terms for documents based
on language surrounding citations appearing in citing documents [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
3
The relationship between IR and bibliometrics research can be
considered in some ways symbiotic [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. As noted above, methods used in
metrics research have informed IR research. Conversely, tools and
techniques developed to support IR research have been adapted to
support metrics research. An exemplar is the development and application
of PageRank [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ] used by Google to rank webpages. It was directly
influenced by citation analysis methods used in bibliometrics research.
The developers recognized the parallels between citations and webpage
inlinks. The effectiveness of PageRank is evident in Google’s
longstanding success as a search engine. More recently,
PageRankbased approaches have been reapplied to citation analysis research
problems [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ]. Other similar network-based approaches, such as
Eigenfactor (URL: http://www.eigenfactor.org), provide
additional ways to rank journals or support methods to rank or recommend
articles [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ]. Furthermore, the h-index, initially developed to measure
author impact, has been demonstrated to have application for the ranking
of webpages, similar to PageRank, but with less computational
overhead [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
      </p>
      <p>
        Citation analysis has been a core area of bibliometrics research for
over 50 years. Citation relationships in the form of direct citations,
cocitations or bibliographic coupling create a network of linkages among
documents, authors, and publication venues that allows one to visualize
the intellectual structure of a field [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ], or all of science [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]. Similarly,
co-authorship-based studies provide another type of linkage that allows
researchers to gain deeper insights into the dynamics of research
communities [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. One limitation inherent in citation-based studies is that
relationships among entities of interest only exist if the linkages exist.
The lack of direct citation, co-citation, bibliographic coupling or
coauthorship should not preclude the possibility of relationships.
      </p>
      <p>
        This situation can be addressed using approaches that integrate the
vocabulary used by the entities of interest, usually employing keyword
or controlled vocabulary co-occurrence, which takes bibliometric
studies more into the realm of IR research. Still, there are concerns with
keyword-based approaches in bibliometrics research that mirror the
same bag-of-words issues found in IR research. The assumption of
independence of vocabulary in bibliographic entities, whether as
keywords or subject terms, creates the same issues observed earlier in IR
studies. The ability to work with full text and natural language can
improve these analyses. More recent research has moved beyond simple
co-occurrence to identify relationships among entities of interest. Of
particular note has been the application of topic modeling techniques,
such as Latent Dirichlet Analysis (LDA) [
        <xref ref-type="bibr" rid="ref27">27</xref>
        ], to bibliographic records
to identify relationships among bibliometric entities of interest, for
instance, authors [
        <xref ref-type="bibr" rid="ref28 ref29">28,29</xref>
        ], that may not be reflected through citation or
collaboration. Tang et al. [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] have developed a searchable system
initially called ArnetMiner (now AMiner, URL: http://aminer.org)
that profiles researchers based on publication content and supports
expertise search.
4
      </p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>
        The application of bibliometric methods for IR research and vice
versa has evolved over the past forty years, from bibliometric modeling of
IR system processes to the exploitation of citation relationships to
provide extended browsing capabilities to identify potentially relevant
documents based on citation linkages. More recently, the adoption of
language-based methods from NLP and computational linguistics has
benefitted both IR and bibliometrics research. Applications of
language-based approaches are still relatively new in bibliometric
contexts. Link-based analysis (citations, co-authorship, hyperlinks) in
combination with textual analysis can capitalize on the strengths of
both approaches [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ]. The analysis of full text collections--whether
bibliographic databases or heterogeneous, multimedia digital
libraries-offers many opportunities for further study. The applications of
citation-based methods and emerging language-based methods are evident
in the range of presentations given at the BIRNDL workshop, which
include the use of citation methods, text mining and topic modeling to
enhance retrieval in full text or digital library environments, scholarly
communication and our understanding of disciplinary boundaries.
5
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scharnhorst</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Scientometrics and Information Retrieval: Weak-links Revitalized</article-title>
          .
          <source>Scientometrics</source>
          <volume>102</volume>
          ,
          <fpage>2193</fpage>
          -
          <lpage>2199</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scharnhorst</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Larsen</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schaer</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mutschke</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Bibliometric-enhanced Information retrieval</article-title>
          .
          <source>In Advances in Information Retrieval</source>
          (pp.
          <fpage>798</fpage>
          -
          <lpage>801</lpage>
          ). Springer International Publishing (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frommholz</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scharnhorst</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mutschke</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Bibliometric-enhanced Information Retrieval: 2nd International BIR Workshop</article-title>
          . In Advances in Information Retrieval (pp.
          <fpage>845</fpage>
          -
          <lpage>848</lpage>
          ). Springer International Publishing (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frommholz</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cabanac</surname>
          </string-name>
          , G.:
          <article-title>Editorial for the 3rd</article-title>
          <source>Bibliometric-Enhanced Information Retrieval Workshop at ECIR</source>
          <year>2016</year>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Egghe</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Power Laws in the Information Production Process: Lotkaian Informetrics</article-title>
          .
          <source>Elsevier</source>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Wolfram</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          : Applied Informetrics for Information Retrieval Research. Libraries
          <string-name>
            <surname>Unlimited</surname>
          </string-name>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Wolfram</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Applying Informetric Characteristics of Databases to IR System File Design</article-title>
          ,
          <string-name>
            <surname>Part</surname>
            <given-names>I</given-names>
          </string-name>
          :
          <article-title>Informetric Models</article-title>
          .
          <source>Information Processing and Management</source>
          ,
          <volume>28</volume>
          (
          <issue>1</issue>
          ),
          <fpage>121</fpage>
          -
          <lpage>133</lpage>
          (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Wolfram</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>Applying Informetric Characteristics of Databases to IR System Design</article-title>
          ,
          <string-name>
            <surname>Part</surname>
            <given-names>II</given-names>
          </string-name>
          :
          <article-title>Simulation Comparisons</article-title>
          .
          <source>Information Processing and Management</source>
          ,
          <volume>28</volume>
          (
          <issue>1</issue>
          ),
          <fpage>135</fpage>
          -
          <lpage>151</lpage>
          (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9. Han,
          <string-name>
            <given-names>H.J.</given-names>
            ,
            <surname>Joo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Wolfram</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          :
          <article-title>Using Transaction Logs to Better Understand User Search Session Patterns in an Image-Based Digital Library</article-title>
          .
          <source>Journal of the Korean Biblia Society for Library and Information Science</source>
          .
          <volume>25</volume>
          (
          <issue>1</issue>
          ),
          <fpage>19</fpage>
          -
          <lpage>37</lpage>
          (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Wolfram</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , Zhang, J.:
          <article-title>Identifying Web Search Session Patterns Using Cluster Analysis: A Comparison of Three Search Environments</article-title>
          .
          <source>Journal of the American Society for Information Science and Technology</source>
          ,
          <volume>60</volume>
          (
          <issue>5</issue>
          ),
          <fpage>896</fpage>
          -
          <lpage>910</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Mutschke</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mayr</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Science Models for Search: A Study on Combining Scholarly Information Retrieval and Scientometrics</article-title>
          . Scientometrics,
          <volume>102</volume>
          ,
          <fpage>2323</fpage>
          -
          <lpage>2345</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Croft</surname>
            ,
            <given-names>W.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>R.H.: I R</given-names>
          </string-name>
          :
          <article-title>A New Approach to the Design of Document Retrieval Systems</article-title>
          .
          <source>Journal of the American Society for Information Science</source>
          ,
          <volume>38</volume>
          (
          <issue>6</issue>
          ),
          <fpage>389</fpage>
          -
          <lpage>404</lpage>
          (
          <year>1987</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Ding</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chowdhury</surname>
            ,
            <given-names>G. G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Foo</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qian</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <article-title>Bibliometric information retrieval system (BIRS): A web search interface utilizing bibliometric research results</article-title>
          .
          <source>Journal of the American Society for Information Science</source>
          ,
          <volume>51</volume>
          (
          <issue>13</issue>
          ),
          <fpage>1190</fpage>
          -
          <lpage>1204</lpage>
          (
          <year>2000</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Ponte</surname>
            ,
            <given-names>J.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Croft</surname>
          </string-name>
          , W.B.:
          <article-title>A Language Modeling Approach to Information Retrieval</article-title>
          .
          <source>In Proceedings of the 21st annual international ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          (pp.
          <fpage>275</fpage>
          -
          <lpage>281</lpage>
          ). ACM (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Wei</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Croft</surname>
          </string-name>
          , W. B.:
          <article-title>LDA-based Document Models for Ad-Hoc Retrieval</article-title>
          .
          <source>In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval</source>
          (pp.
          <fpage>178</fpage>
          -
          <lpage>185</lpage>
          ). ACM (
          <year>2006</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Blei</surname>
            ,
            <given-names>D.M.:</given-names>
          </string-name>
          <article-title>Collaborative Topic Modeling For Recommending Scientific Articles</article-title>
          .
          <source>In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          (pp.
          <fpage>448</fpage>
          -
          <lpage>456</lpage>
          ). ACM (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Koopman</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Scharnhorst</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Englebienne</surname>
          </string-name>
          , G.:
          <article-title>Ariadne's Thread: Interactive Navigation in a World of Networked Information</article-title>
          .
          <source>In Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems</source>
          (pp.
          <fpage>1833</fpage>
          -
          <lpage>1838</lpage>
          ). ACM (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Ritchie</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Robertson</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teufel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Comparing Citation Contexts for Information Retrieval</article-title>
          .
          <source>In Proceedings of the 17th ACM conference on Information and knowledge management</source>
          (pp.
          <fpage>213</fpage>
          -
          <lpage>222</lpage>
          ). ACM (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Wolfram</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>The Symbiotic Relationship between Information Retrieval and Informetrics</article-title>
          . Scientometrics,
          <volume>102</volume>
          ,
          <fpage>2201</fpage>
          -
          <lpage>2214</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Page</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motwani</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Winograd</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>The PageRank Citation Ranking: Bringing Order to the Web</article-title>
          . http://ilpubs.stanford.edu:
          <volume>8090</volume>
          /422/1/1999-
          <fpage>66</fpage>
          .pdf (
          <year>1999</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Waltman</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yan</surname>
          </string-name>
          , E.:
          <article-title>PageRank-related Methods for Analyzing Citation Networks</article-title>
          . In Y. Ding,
          <string-name>
            <given-names>R.</given-names>
            <surname>Rousseau</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Wolfram</surname>
          </string-name>
          (Eds.).
          <source>Measuring Scholarly Impact</source>
          . (pp.
          <fpage>83</fpage>
          -
          <lpage>100</lpage>
          ). Springer (
          <year>2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>West</surname>
            ,
            <given-names>J. D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wesley-Smith</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bergstrom</surname>
          </string-name>
          , C. T.:
          <article-title>A Recommendation System Based on Hierarchical Clustering of an Article-level Citation Network</article-title>
          .
          <source>IEEE Transactions on Big Data (Forthcoming)</source>
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Bar-Ilan</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Levene</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>The hw-rank: An h-index Variant for Ranking Web Pages</article-title>
          , Scientometrics,
          <volume>102</volume>
          ,
          <fpage>2247</fpage>
          -
          <lpage>2253</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <surname>White</surname>
            ,
            <given-names>H.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>McCain</surname>
            ,
            <given-names>K.W.</given-names>
          </string-name>
          :
          <article-title>Visualizing a Discipline: An Author Co-Citation Analysis of Information Science,</article-title>
          <year>1972</year>
          -
          <fpage>1995</fpage>
          .
          <source>Journal of the American Society for Information Science</source>
          ,
          <volume>49</volume>
          ,
          <fpage>327</fpage>
          -
          <lpage>355</lpage>
          (
          <year>1998</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Boyack</surname>
            ,
            <given-names>K.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klavans</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Börner</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          : Scientometrics,
          <volume>64</volume>
          ,
          <fpage>351</fpage>
          -
          <lpage>374</lpage>
          (
          <year>2005</year>
          )
          <article-title>Mapping the Backbone of Science</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Glänzel</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schubert</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Analysing Scientific Networks through Co-Authorship</article-title>
          .
          <source>In Handbook of Quantitative Science and Technology Research</source>
          (pp.
          <fpage>257</fpage>
          -
          <lpage>276</lpage>
          ). Springer Netherlands (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Blei</surname>
            ,
            <given-names>D.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ng</surname>
            ,
            <given-names>A.Y.</given-names>
          </string-name>
          , &amp;
          <string-name>
            <surname>Jordan</surname>
            ,
            <given-names>M.J.</given-names>
          </string-name>
          :
          <article-title>Latent Dirichlet allocation</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>3</volume>
          ,
          <fpage>993</fpage>
          -
          <lpage>1022</lpage>
          (
          <year>2003</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Rosen-Zvi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Griffiths</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Steyvers</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smyth</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>The Author-topic Model for Authors and Documents</article-title>
          .
          <source>In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence</source>
          (pp.
          <fpage>487</fpage>
          -
          <lpage>494</lpage>
          ). AUAI Press (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wolfram</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          : Measuring Author Research Relatedness:
          <article-title>A Comparison of WordBased, Topic-Based and Author Co-Citation Approaches</article-title>
          .
          <source>Journal of the American Society for Information Science and Technology</source>
          ,
          <volume>63</volume>
          ,
          <fpage>1973</fpage>
          -
          <lpage>1986</lpage>
          (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Zhang, J.,
          <string-name>
            <surname>Yao</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , Zhang,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Su</surname>
          </string-name>
          ,
          <string-name>
            <surname>Z.</surname>
          </string-name>
          :
          <article-title>ArnetMiner: Extraction and Mining of Academic Social Networks</article-title>
          .
          <source>In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining</source>
          (pp.
          <fpage>990</fpage>
          -
          <lpage>998</lpage>
          ). ACM (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Zitt</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Meso-level Retrieval: IR-bibliometrics Interplay And Hybrid Citation-Words Methods In Scientific Fields Delineation</article-title>
          , Scientometrics,
          <volume>102</volume>
          ,
          <fpage>2223</fpage>
          -
          <lpage>2245</lpage>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>