<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Browsing citation clusters for academic literature search: A simulation study with systematic reviews</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Juan Pablo Bascur[</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Suzan Verberne</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nees Jan van Eck</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ludo Waltman</string-name>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centre for Science and Technology Studies, Leiden University</institution>
          ,
          <addr-line>Leiden</addr-line>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Leiden Institute for Advanced Computer Science, Leiden University</institution>
          ,
          <country country="NL">The Netherlands</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <fpage>53</fpage>
      <lpage>65</lpage>
      <abstract>
        <p>Our aim is to test if citation clusters can be useful in academic literature search for systematic reviews. We performed an initial offline evaluation using simulated user behaviour on a browsing tool for academic literature search over a set of 17 systematic reviews. To perform the evaluation, we clustered papers in a citation network obtained from the Web of Science database. The clustering solution was a system of seven hierarchical levels of clusters that allowed the simulated user to navigate from larger to smaller clusters. We simulated five user models with different emphasis on precision and recall. We found that citation clusters are more helpful for tasks focused on recall than for precision-oriented tasks. Our future research includes evaluation on a larger set, and a comparison to query search, followed by a study with real users.</p>
      </abstract>
      <kwd-group>
        <kwd>Academic literature search</kwd>
        <kwd>Evaluation</kwd>
        <kwd>Simulation of interaction</kwd>
        <kwd>Macroscope</kwd>
        <kwd>Scatter/Gather</kwd>
        <kwd>Citation network</kwd>
        <kwd>Document clustering</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>Knowledgeworkers need special information retrieval (IR) tools because their IR
tasks and practices differ from the general public and from each other [1]. Several
special IR tools for academic knowledge workers have been proposed, some of which
visualize the search space of literature [2, 3]. These tools are sometimes called
macroscopes, that is, tools for visualizing big or complex data [4]. Macroscopes
facilitate document search through browsing because the visual content and context
provides additional information. This information is particularly helpful when Boolean
queries are inadequate for an IR task, e.g. if the user does not know the relevant terms
to search for. It is difficult to perform offline evaluations of IR macroscopes because
there are no standards for the simulation of the stopping point in a browsing task. An
analysis of this difficulty of stopping point simulation for document retrieval can be
found in the work of Maxwell et al. [5].</p>
      <p>
        To assist knowledge workers, we have prototyped an IR macroscope for academic
search literature based on citation clusters. We refer to this tool as SciMacro (Science
Macroscope). SciMacro clusters papers based on their citations and summarizes the
content of each cluster. This is different from previous works that cluster papers based
on their textual content, like Iris [6], or use citations to find related papers, like
PaperPoles [
        <xref ref-type="bibr" rid="ref8">7</xref>
        ]. The user can obtain smaller clusters from the papers of a given cluster
for a more detailed visualization. As an example, let us consider a fictive user that has
a set of documents from a multidisciplinary journal and that wants to know which ones
are related to the visualization of big data. The user provides his initial set of documents
to SciMacro, which are then clustered into 3 clusters. The descriptors of the clusters are
Mathematics, Statistics and Physics. Then, the user selects the Statistics cluster and gets
3 smaller clusters from the next clustering level labelled Visual, Modeling and
Bayesian. Because these new clusters were created for documents from the Statistics
cluster, the user knows that they are also related to statistics. Finally, the user selects
the Visual cluster, and obtains the smaller clusters Analytic, GIS, and Big. Now the user
sees the Big cluster related to visualization and statistics, enabling him to screen the
documents in that cluster.
      </p>
      <p>In this paper, we evaluate the performance of SciMacro at retrieving relevant
scientific literature for systematic reviews. Our work is of particular relevance because
the potential of IR macroscopes that allow for cluster-based browsing of the complete
document set of all scientific literature has not been studied before.</p>
      <p>The task goals in our study were to find the relevant literature for 17 systematic
reviews (SRs). SRs are review articles that report the relevant literature found by the
authors [8, 9]. We used a public test set of these reports to simulate the IR tasks [10].
We obtained the results of the SciMacro IR tasks from a simulation of five user models
with different emphasis on precision and recall.</p>
      <p>We address the following research questions:
1. What is the potential of SciMacro for finding the relevant literature for SRs?
2. How do the user’s preferences affect the performance of SciMacro?
The main contribution of this paper is that it presents the first evaluation of SR search
through citation clusters.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Related work</title>
      <p>
        The idea of using citations for academic IR browsing is not new: A number of prior
works [11, 3] have proposed IR tools that visualize papers based on their citation
networks, while others [
        <xref ref-type="bibr" rid="ref8">6, 7</xref>
        ] have proposed IR tools that visualize clusters of papers.
The tool Citation Gecko [2] visualizes a citation network that expands from given
papers, while the work by Haunschild and Marx [12] uses a citation network to find the
seminal paper on a topic. On the other hand, text processing can also be used for IR
browsing: A number of prior works [13, 14] have proposed IR tools that cluster the
semantics of the papers, while others [15, 16] have proposed IR tools that suggest terms
for complex Boolean queries.
      </p>
      <p>Our tool, SciMacro, belongs to the academic IR browsing tools that visualize clusters
of papers. One possibility is to cluster papers based on textual similarity (see for
instance the tool Iris [6]). SciMacro does not use textual similarity, but instead uses
citations. Citations represent primarily the intellectual relation between papers, while
text may represent (broader) topical clustering.</p>
      <p>Following citations is a common strategy for authors of SRs [17]. Some IR tools
have been proposed for SRs that classify the relevance of papers combining their
citations and text [18]. Other tools reduce the workload of the authors by ranking their
search results [19, 20]. These tools are different from SciMacro because they are not
based on browsing. For a more complete overview of IR techniques for SRs, we refer
to [21].</p>
      <p>For non-academic IR browsing, a prominent model is the Scatter/Gather browsing
model [22, 23], which has inspired the development of SciMacro. In this browsing
model, a cluster of documents is split into smaller clusters, each with their own label.
The clusters can also be combined to give the user control over the clustering solution.
Scatter/Gather has been previously used for clustering web services [24] and web
search results [25]. SciMacro is the first to use the Scatter/Gather model for academic
literature search.</p>
      <p>During the SIGIR 2010 Workshop on the Simulation of Interaction it was argued
that the simulation of different search types (browsing, directed and drifting) requires
different user models [26]. We follow up on this work, but instead of simulating query
search we simulate the browsing behaviour in citation clusters.</p>
      <p>Beyond simulations, Mahdi et al. [27] proposed a framework for evaluating
browsing tasks with real users. We also want to highlight the work of Leuski [28], who
worked with real users to evaluate a web search tool in which clusters of search results
are presented. His work emphasized that clustering the search results gives the user a
sense of control over the feedback process, which is a highly valued feature in
professional search [1]. In line with this, we designed SciMacro in such a way that the
user also has control over the feedback process.
3
3.1</p>
    </sec>
    <sec id="sec-3">
      <title>Methods</title>
      <sec id="sec-3-1">
        <title>Search model using queries</title>
        <p>
          We model the search process of the authors of SRs as an IR task. We start from the
following idea: When an author of a SR decides to read the full text of a document,
based on the abstract and/or title, we consider this document to be relevant. Therefore,
we argue that an IR tool should find all documents that the user considers relevant
enough to read for a SR. With this consideration in mind, we decided to use the SRs
published by the Cochrane Library database [
          <xref ref-type="bibr" rid="ref5">29</xref>
          ], which requires authors to report the
documents of which they read the full text, regardless of whether they included these
documents in the SR or not. We will refer to this set of documents as the relevant
documents of a particular SR.
3.2
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>Search model using SciMacro</title>
        <p>This search model retrieves the documents of a given cluster (see Section 3.4 for an
explanation of the clustering). To select a cluster, we defined the following simulation
protocol, similar to a greedy algorithm (see Figure 1 for an example):
1. Select the cluster from level 1 with the highest relevance score (the score is explained
below).
2. If the selected cluster has subclusters, obtain the highest score of the subclusters.</p>
        <p>Otherwise, retrieve the selected cluster.
3. If the highest score of the subclusters is higher than the score of the selected cluster,
select the subcluster with the highest score and go back to step 2. Otherwise, retrieve
the selected cluster.</p>
        <p>The goal of this protocol is to simulate the behaviour of a real user. In our simulation
the user has perfect knowledge of the relevance scores of the clusters; this is a common
simplification in simulation for evaluation [10]. In a real situation the user has to deduce
this knowledge from the cluster labels.1</p>
        <p>To evaluate this IR task, we need to know the number of relevant retrieved
documents, non-relevant retrieved documents and relevant non-retrieved documents
from the simulation:
● The total number of retrieved documents equals the number of documents in the
retrieved cluster.
● The number of relevant retrieved documents equals the number of relevant
documents that are in the retrieved cluster.
● The number of non-relevant retrieved documents equals the total number of retrieved
documents minus the number of relevant retrieved documents.
● The relevant non-retrieved documents are the documents of the SR that are not in
the retrieved cluster.</p>
        <p>With these values we can also calculate the weighted F-score of each cluster:
(1)</p>
        <p>We created five user models that differ in which F-score they prefer: F0.25, F0.5, F1,
F2 or F4. The different F-scores reflect the different needs of real users: for example, a
real user that wants a short overview of a topic will emphasize precision over recall.
Lower subscript F-scores emphasize precision, and higher subscript F-scores
emphasize recall. F1 gives an equal weight to precision and recall. For each user model,
the relevance score in the cluster selection protocol is given by the weighted F-score.
The goal is to maximize this F-score.</p>
        <p>1 Generating informative cluster labels is outside the scope of this paper.</p>
        <p>We obtained the SRs and their relevant documents from the dataset published by
Scells et al. [10]. It contains the PubMed ids of 177 randomly selected SRs published
by the Cochrane Library plus their relevant documents, excluding relevant documents
that lack PubMed ids. We selected the SRs from this collection that had 10 or more
relevant documents.</p>
        <p>The citation network was created based on the in-house Web of Science database at
the Centre for Science and Technology Studies (CWTS) at Leiden University, the
Netherlands, which includes papers published since 1980 (as well as a small number of
papers published in earlier years). We excluded the documents in this database without
PubMed id. We transformed citation links into undirected links (to comply with the
requirements of our clustering algorithm). The SRs in the Cochrane Library database
include the relevant documents in their reference list. Therefore our database also
contains these citation links. These links are advantageous for the clustering algorithm,
but they would not exist in a real IR task, so we excluded from the citation network all
documents published in the same year as the SR and in later years. We selected as our
focal year the year with the largest number of SRs. This was the year 2014 with 17 SRs.
Therefore our citation network only included documents published before 2014. In the
end, the citation network contained over 13.2 million documents and 280.4 million
citation links. The SR set contained the 17 SRs published in 2014 and for each SR it
contained the relevant documents present in the citation network.
3.4</p>
      </sec>
      <sec id="sec-3-3">
        <title>Clustering</title>
        <p>We clustered the citation network with the Leiden algorithm [30] based on the
methodology developed by Waltman and van Eck [31]. However, they built the
clustering hierarchy in a bottom-up manner while we took a top-down approach. Also,
they merged small clusters, which we did not do. The use of citation links to cluster
documents is common practice in the field of bibliometrics. Textual information, for
instance from the titles and abstracts of documents, is also often used to cluster
documents. In this paper, we choose to use citation links. The use of textbased
cooccurrence links could be explored in future research. We refer to Waltman et al. [32]
for a comparison of different approaches for clustering documents. The clustering
algorithm maximizes the following quality function:
(!, … , " ) = ∑# ∑$ *# , $+*#$ − +
(2)
In this quality function, i and j are documents, xi is the cluster of document i, and r is
the resolution parameter. aij equals 1 if there is a citation link between documents i and
j, otherwise aij equals 0. δ equals 1 if i and j are in the same cluster, otherwise δ equals
0. The value of r is given for each level of the clustering hierarchy (see below). The
quality function ensures that related documents tend to be assigned to the same cluster.
The higher the value of the resolution parameter, the larger the number of clusters and
the smaller the number of documents per cluster.</p>
        <p>We created a hierarchical clustering consisting of 7 levels. Starting from the highest
level, at each level we applied our clustering algorithm to the citation network of the
documents of each cluster obtained at the prior level, except at the highest level, where
we applied the clustering algorithm to all documents in our citation network. At each
lower level we multiplied the value of the resolution parameter by 10 to obtain smaller
clusters.</p>
        <p>At the highest level, we used the value 10-7 for the resolution parameter. This is
similar to the value of 8*10-8 used by Waltman and van Eck [31] in their clustering
solution with the lowest granularity. Using the value 10-7, we obtained a clustering
solution in which 40% of the documents belonged to the largest cluster and 88% of the
documents belonged to the 10 largest clusters. The size of the 15 largest clusters is
shown in Figure 2.</p>
        <p>In the end, we obtained a nested system of clusters and subclusters that was used for
the evaluation. We removed clusters with fewer than 5 documents to avoid precision
artefacts.
Fscores and precision are 10 to 20 times higher for SciMacro than for the Boolean query
searches, while the recall is 20% to 60% lower, depending on the user model. Therefore,
we expect that SciMacro will perform well in precision-focussed tasks when compared
with query-based search methods.</p>
        <p>Fig. 3. Performance of SciMacro by F-score. X-axis: F-score variant. Y-axis: F-score value.
Quadrants: the quadrants have no specific meaning, they serve to better visualize the 17 SRs.</p>
        <p>Lines: SRs.</p>
        <sec id="sec-3-3-1">
          <title>Boolean query SciMacro</title>
          <p>F0.25
0.181
F0.5
0.012
0.180
F1
0.018
0.222
F2
0.298
F3
0.053
F4
0.422</p>
        </sec>
        <sec id="sec-3-3-2">
          <title>Boolean query SciMacro F0.25 SciMacro F4</title>
        </sec>
        <sec id="sec-3-3-3">
          <title>SciMacro F0.25</title>
        </sec>
        <sec id="sec-3-3-4">
          <title>SciMacro F4</title>
        </sec>
        <sec id="sec-3-3-5">
          <title>Precision 0.010 0.191 ± 0.086 0.102 ± 0.073</title>
          <p>The answers to our research questions can be summarized as follows:
What is the potential of SciMacro for finding the relevant literature for SRs? The
preliminary results presented in this paper do not allow us to draw strong conclusions.
We will need further simulations on a larger benchmark set, a more direct comparison
to query search, and ultimately a follow-up user study to answer this question.
However, our informed guess, based on the results reported in prior work, is that
SciMacro will perform well in precision-focussed tasks when compared with
querybased search methods.</p>
          <p>How do the user’s preferences affect the performance of SciMacro? Our simulation
has shown that SciMacro performs better at tasks focused on recall than precision.</p>
          <p>At the moment, we are working on evaluating SciMacro using our current simulation
setup and comparing it to other academic IR methods. Also, we are currently expanding
our evaluation set with more SR tasks to present more rigorous results on the potential
of SciMacro.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.
31.</p>
        </sec>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <volume>54</volume>
          ,
          <fpage>1042</fpage>
          -
          <lpage>1057</lpage>
          (
          <year>2018</year>
          ). https://doi.org/10.1016/j.ipm.
          <year>2018</year>
          .
          <volume>07</volume>
          .003 Citation Gecko, https://github.com/CitationGecko/gecko-react
          <string-name>
            <surname>Choo</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kim</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kannan</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stolper</surname>
            ,
            <given-names>C.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stasko</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Drake</surname>
            ,
            <given-names>B.L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Park</surname>
          </string-name>
          , H.:
          <article-title>VisIRR: Visual analytics for information retrieval and recommendation with large-scale document data</article-title>
          .
          <source>In: 2014 IEEE Conference on Visual Analytics Science and Technology (VAST)</source>
          . pp.
          <fpage>243</fpage>
          -
          <lpage>244</lpage>
          . IEEE, Paris, France (
          <year>2014</year>
          ) Börner,
          <string-name>
            <surname>K.</surname>
          </string-name>
          :
          <article-title>Plug-and-play macroscopes</article-title>
          .
          <source>Commun. ACM</source>
          .
          <volume>54</volume>
          ,
          <issue>60</issue>
          (
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          https://doi.org/10.1145/1897852.1897871 Maxwell,
          <string-name>
            <surname>D.M.</surname>
          </string-name>
          :
          <article-title>Modelling Search and</article-title>
          Stopping in Interactive Information Retrieval, (
          <year>2019</year>
          )
          <article-title>iris</article-title>
          .ai, https://iris.ai/ He,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Ping</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            ,
            <surname>Lou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            ,
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>PaperPoles: Facilitating adaptive visual exploration of scientific publications by citation links</article-title>
          .
          <source>J. Assoc. Inf. Sci. Technol</source>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <volume>70</volume>
          ,
          <fpage>843</fpage>
          -
          <lpage>857</lpage>
          (
          <year>2019</year>
          ). https://doi.org/10.1002/asi.24171 Giang,
          <string-name>
            <given-names>H.T.N.</given-names>
            ,
            <surname>Ahmed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.M.</given-names>
            ,
            <surname>Fala</surname>
          </string-name>
          , R.Y.,
          <string-name>
            <surname>Khattab</surname>
            ,
            <given-names>M.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Othman</surname>
            ,
            <given-names>M.H.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Abdelrahman</surname>
            ,
            <given-names>S.A.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Thao</surname>
            ,
            <given-names>L.P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gabl</surname>
            ,
            <given-names>A.E.A.E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Elrashedy</surname>
            ,
            <given-names>S.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>P.N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hirayama</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Salem</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huy</surname>
          </string-name>
          , N.T.:
          <article-title>Methodological steps used by authors of systematic reviews and meta-analyses of clinical trials: a cross-sectional study</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <source>BMC Med. Res. Methodol</source>
          .
          <volume>19</volume>
          , (
          <year>2019</year>
          ). https://doi.org/10.1186/s12874- 0190780-2 Peterson,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Pearce</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.F.</given-names>
            ,
            <surname>Ferguson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.A.</given-names>
            ,
            <surname>Langford</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.A.</surname>
          </string-name>
          :
          <article-title>Understanding scoping reviews: Definition, purpose, and process</article-title>
          .
          <source>J. Am. Assoc. Nurse Pract.</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <volume>29</volume>
          ,
          <fpage>12</fpage>
          -
          <lpage>16</lpage>
          (
          <year>2017</year>
          ). https://doi.org/10.1002/
          <fpage>2327</fpage>
          -
          <lpage>6924</lpage>
          .12380 Scells,
          <string-name>
            <given-names>H.</given-names>
            ,
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Koopman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Deacon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Azzopardi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Geva</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.:</surname>
          </string-name>
          <article-title>A Test Collection for Evaluating Retrieval of Studies for Inclusion in Systematic Reviews</article-title>
          .
          <source>In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '17</source>
          . pp.
          <fpage>1237</fpage>
          -
          <lpage>1240</lpage>
          . ACM Press, Shinjuku, Tokyo, Japan (
          <year>2017</year>
          )
          <article-title>Nakazawa</article-title>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Itoh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Saito</surname>
          </string-name>
          ,
          <string-name>
            <surname>T.</surname>
          </string-name>
          :
          <article-title>Analytics and visualization of citation network applying topic-based clustering</article-title>
          .
          <source>J. Vis</source>
          .
          <volume>21</volume>
          ,
          <fpage>681</fpage>
          -
          <lpage>693</lpage>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          https://doi.org/10.1007/s12650-018-0483-5 Haunschild,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Marx</surname>
          </string-name>
          ,
          <string-name>
            <surname>W.</surname>
          </string-name>
          :
          <article-title>Discovering seminal works with marker papers</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>(2019) Mirylenka</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Passerini</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>ScienScan - An Efficient Visualization and Browsing Tool for Academic Search</article-title>
          . In: Salinesi,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Norrie</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.C.</surname>
          </string-name>
          , and Pastor, Ó. (eds.)
          <source>Advanced Information Systems Engineering</source>
          . pp.
          <fpage>667</fpage>
          -
          <lpage>671</lpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg (
          <year>2013</year>
          )
          <article-title>Open Knowledge Maps: A Visual Interface to the World's Scientific Knowledge</article-title>
          ., https://openknowledgemaps.org
          <string-name>
            <surname>Smith</surname>
            ,
            <given-names>P.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krawczak</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shute</surname>
            ,
            <given-names>S.J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chignell</surname>
            ,
            <given-names>M.H.</given-names>
          </string-name>
          :
          <article-title>Bibliographic information retrieval systems: increasing cognitive compatibility</article-title>
          .
          <source>Inf. Serv. Use.</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <volume>7</volume>
          ,
          <fpage>95</fpage>
          -
          <lpage>102</lpage>
          (
          <year>1987</year>
          ). https://doi.org/10.3233/ISU-1987
          <string-name>
            <surname>-</surname>
            74-502 Zhu,
            <given-names>Y.</given-names>
          </string-name>
          :
          <source>Graph-based Interactive Bibliographic Information Retrieval Systems</source>
          , (
          <year>2017</year>
          ) Horsley,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Dingwall</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            ,
            <surname>Sampson</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          :
          <article-title>Checking reference lists to find additional studies for systematic reviews</article-title>
          .
          <source>Cochrane Database Syst. Rev</source>
          . (
          <year>2011</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          https://doi.org/10.1002/14651858.MR000026.pub2 Portenoy,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>West</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.D.</surname>
          </string-name>
          :
          <article-title>Supervised Learning for Automated Literature Review</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <source>In: Proceedings of the 4th Joint Workshop on Bibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL</source>
          <year>2019</year>
          ). p.
          <volume>9</volume>
          (
          <issue>2019</issue>
          ) Martinez,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Karimi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Cavedon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Baldwin</surname>
          </string-name>
          ,
          <string-name>
            <surname>T.</surname>
          </string-name>
          :
          <article-title>Facilitating Biomedical Systematic Reviews Using Ranked Text Retrieval and Classification</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Australasian</given-names>
            <surname>Document Computing Symposium (ADCS) (2008) Karimi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Pohl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Scholer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Cavedon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Zobel</surname>
          </string-name>
          , J.:
          <article-title>Boolean versus ranked querying for biomedical systematic reviews</article-title>
          .
          <source>BMC Med</source>
          . Inform. Decis. Mak.
          <volume>10</volume>
          , (
          <year>2010</year>
          ). https://doi.org/10.1186/
          <fpage>1472</fpage>
          -6947-10-58
          <string-name>
            <given-names>O</given-names>
            <surname>'Mara-Eves</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Thomas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>McNaught</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Miwa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Ananiadou</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.:</surname>
          </string-name>
          <article-title>Using text mining for study identification in systematic reviews: a systematic review of current approaches</article-title>
          .
          <source>Syst. Rev. 4</source>
          , (
          <year>2015</year>
          ). https://doi.org/10.1186/2046-40534-5
          <string-name>
            <surname>Cutting</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Karger</surname>
            ,
            <given-names>D.R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pedersen</surname>
            ,
            <given-names>J.O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tukey</surname>
            ,
            <given-names>J.W.</given-names>
          </string-name>
          :
          <article-title>Scatter/Gather: A Cluster-based Approach to Browsing Large Document Collections</article-title>
          .
          <source>Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval.</source>
          (
          <year>1992</year>
          ) Pirolli,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Schank</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Hearst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Diehl</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>Scatter/gather browsing communicates the topic structure of a very large text collection</article-title>
          .
          <source>In: Proceedings of the SIGCHI conference on Human factors in computing systems common ground - CHI '96</source>
          . pp.
          <fpage>213</fpage>
          -
          <lpage>220</lpage>
          . ACM Press, Vancouver, British Columbia,
          <string-name>
            <surname>Canada</surname>
          </string-name>
          (
          <year>1996</year>
          ) Farsandaj,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <surname>C.</surname>
          </string-name>
          :
          <article-title>Scatter/Gather browsing of web service QoS data</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Future</given-names>
            <surname>Gener</surname>
          </string-name>
          .
          <source>Comput. Syst</source>
          .
          <volume>28</volume>
          ,
          <fpage>1145</fpage>
          -
          <lpage>1154</lpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          https://doi.org/10.1016/j.future.
          <year>2011</year>
          .
          <volume>08</volume>
          .020 Gong,
          <string-name>
            <given-names>X.</given-names>
            ,
            <surname>Ke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            ,
            <surname>Khare</surname>
          </string-name>
          , R.:
          <article-title>Studying scatter/gather browsing for web search</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>Proc. Am. Soc. Inf. Sci. Technol</source>
          .
          <volume>49</volume>
          ,
          <issue>1</issue>
          -
          <fpage>4</fpage>
          (
          <year>2012</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          https://doi.org/10.1002/meet.14504901328 Azzopardi,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Järvelin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            ,
            <surname>Kamps</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ,
            <surname>Smucker</surname>
          </string-name>
          , M.D.
          <source>: Report on the SIGIR 2010 workshop on the simulation of interaction. ACM SIGIR Forum</source>
          .
          <volume>44</volume>
          ,
          <issue>35</issue>
          (
          <year>2011</year>
          ). https://doi.org/10.1145/1924475.1924484 Mahdi,
          <string-name>
            <given-names>M.N.</given-names>
            ,
            <surname>Ahmad</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.R.</given-names>
            ,
            <surname>Ismail</surname>
          </string-name>
          , R.:
          <article-title>Evaluating Search Results in Exploratory Search</article-title>
          .
          <source>Int. J. Eng. Technol</source>
          .
          <volume>7</volume>
          ,
          <issue>276</issue>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          https://doi.org/10.14419/ijet.v7i4.
          <volume>35</volume>
          .22746 Leuski,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>Evaluating Document Clustering for Interactive Information Retrieval</article-title>
          . .
          <source>K</source>
          .
          <volume>13</volume>
          (
          <issue>2001</issue>
          )
          <article-title>Chalmers, I.: The Cochrane collaboration: preparing, maintaining, and disseminating systematic reviews of the effects of health care</article-title>
          .
          <source>Annals of the New York Academy of Sciences</source>
          <volume>703</volume>
          (
          <issue>1</issue>
          )
          <fpage>156</fpage>
          -
          <lpage>165</lpage>
          (
          <year>1993</year>
          )
          <string-name>
            <surname>Traag</surname>
            ,
            <given-names>V.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Waltman</surname>
          </string-name>
          , L.,
          <string-name>
            <surname>van Eck</surname>
            ,
            <given-names>N.J.</given-names>
          </string-name>
          :
          <article-title>From Louvain to Leiden: guaranteeing well-connected communities</article-title>
          .
          <source>Sci. Rep</source>
          .
          <volume>9</volume>
          , (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          https://doi.org/10.1038/s41598-019-41695-z
          <string-name>
            <surname>Waltman</surname>
          </string-name>
          , L.,
          <string-name>
            <surname>van Eck</surname>
            ,
            <given-names>N.J.:</given-names>
          </string-name>
          <article-title>A New Methodology for Constructing a PublicationLevel Classification System of Science</article-title>
          .
          <source>J. Am. Soc. Inf. Sci. Technol</source>
          .
          <volume>63</volume>
          ,
          <fpage>2378</fpage>
          -
          <lpage>2392</lpage>
          (
          <year>2012</year>
          ). https://doi.org/10.1002/asi.22748 Waltman,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Boyack</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.W.</given-names>
            ,
            <surname>Colavizza</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            , &amp;
            <surname>Van Eck</surname>
          </string-name>
          ,
          <string-name>
            <surname>N.J.</surname>
          </string-name>
          <article-title>(in press): A principled methodology for comparing relatedness measures for clustering publications</article-title>
          .
          <source>Quantitative Science Studies</source>
          . arXiv:
          <year>1901</year>
          .
          <volume>06815</volume>
          (
          <issue>2019</issue>
          ) Scells,
          <string-name>
            <given-names>H.</given-names>
            , &amp;
            <surname>Zuccon</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          :
          <article-title>Generating better queries for systematic reviews</article-title>
          .
          <source>In The 41st International ACM SIGIR Conference on Research &amp; Development in Information Retrieval</source>
          (pp.
          <fpage>475</fpage>
          -
          <lpage>484</lpage>
          ). (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>