<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Importance Assessment in Scholarly Networks</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Saurav Manchanda</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>George Karypis</string-name>
          <email>karypisg@umn.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of Minnesota</institution>
          ,
          <addr-line>Twin Cities</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We present approaches to estimate content-aware bibliometrics to quantitatively measure the scholarly impact of a publication. Traditional measures to assess quality-related aspects such as citation counts and h-index, do not take into account the content of the publications, which limits their ability to provide rigorous quality-related metrics and can significantly skew the results. Our proposed metric, denoted by Content Informed Index (CII), uses the content of the paper as a source of distant-supervision, to weight the edges of a citation network. These content-aware weights quantify the information in the citation i.e., these weights quantify the extent to which the cited-node informs the citing-node. The weights convert the original unweighted citation network to a weighted one. Consequently, this weighted network can be used to derive impact metrics for the various entities involved, like the publications, authors etc. We evaluate the weights estimated by our approach on three manually annotated datasets, where the annotations quantify the extent of information in the citation. Particularly, we evaluate how well the ranking imposed by our approach associates with the ranking imposed by the manual annotations. The proposed approach achieves up to 103% improvement in performance as compared to second best performing approach.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>Scientific, engineering, and technological (SET) innovations
have been the drivers behind many of the significant
positive advances in our modern economy, society, and life.
To measure various impact-related aspects of these
innovations various quantitative metrics have been developed and
deployed. These metrics play an important role as they are
used to influence how resources are allocated, assess the
performance of personnel, identify intellectual property
(IP)related takeover targets, value a company’s intangible assets
(IP is such an asset), and identify strategic and/or emerging
competitors.</p>
      <p>
        Citation networks of peered-reviewed scholarly
publications (e.g., journal/conference articles and patents) have
widely been used and studied in order to derive such metrics
for the various entities involved (e.g., articles, researchers,
institutions, companies, journals, conferences, countries,
etc.
        <xref ref-type="bibr" rid="ref2">(Aguinis et al. 2012)</xref>
        ). However, most of these
traditional metrics, such as citation counts and h-index treat all
citations and publications equally, and do not take into
account the content of the publications and the context in
which a prior scholarly work was cited. Another related
line of work, such as PageRank
        <xref ref-type="bibr" rid="ref58">(Page et al. 1999)</xref>
        and
HITS
        <xref ref-type="bibr" rid="ref35">(Kleinberg 1999)</xref>
        takes the node centrality into
consideration (as a proxy for publication influence), but still
operate in an content-agnostic manner. These content-agnostic
metrics fail to reliably measure the scholarly impact of an
article as they do not differentiate between the possible
reasons a scholarly work is being cited. Being content-agnostic,
these metrics can be easily manipulated by the presence of
malicious entities, such as publication venues indulging in
self-citations, which leads to high impact factor, or a group
of scholars citing each others’ work. For example, Journal
Citation Reports (JCR)1 routinely suppresses many journals
that indulge in citation stacking, a practice where the
reviewers and journal editors pressure authors to cite papers that
either they wrote or that are published in “their” journal. Thus,
there is a need to establish content-aware metrics to
accurately and quantitatively measure various innovation-related
aspects such as their significance, novelty, impact, and
market value. Such metrics are essential for ensuring that
SETdriven innovations will play an ever more significant role in
the future.
      </p>
      <p>
        In this paper, we propose machine-learning-driven
approaches, that automatically estimate the weights of the
edges in a citation network, such that edges with higher
weights correspond to higher-impact citations. There has
been considerable effort in the past to identify important
citations
        <xref ref-type="bibr" rid="ref30 ref34 ref5 ref70">(Valenzuela, Ha, and Etzioni 2015; Jurgens et al.
2018; Cohan et al. 2019)</xref>
        . These approaches treat this task
as a supervised text-classification problem, and thus, require
the availability of training data with ground truth
annotations. However, generating such labeled data is difficult and
time consuming, especially when the meaning of the labels
is user-defined. In contrast, our approaches are distant
supervised, that require no manual annotation. The proposed
approaches leverage the readily available content of the
papers as a source of distant-supervision. Specifically, we
for1http://help.incites.clarivate.com/incitesLiveJCR/JCRGroup/
titleSuppressions.html
mulate the problem as how well the linear combination of
the representations of the cited publication explains the
representation of the citing publication. The weights in this
linear-combination quantify the extent to which the
citedpublication informs the citing-publication. We evaluate the
weights estimated by our approach on three manually
annotated datasets, where the annotations quantify the extent
of information in the citation. Particularly, we evaluate how
well the ranking imposed by our approach associates with
the ranking imposed by the manual annotations. The
proposed approach achieves up to 103% improvement in
performance as compared to second best performing approach.
      </p>
      <p>While our discussion and evaluation focuses on
identifying informing citations, our approach is not restricted
to this domain, and can be used to derive impact metrics
for the various involved entities. For example, the
contentaware weights estimated by the proposed approach convert
the original unweighted citation network to a weighted one.
Consequently, this weighted network can be used to derive
impact metrics for the various involved entities, like the
publications, authors etc. For example, to find the impact of
a publication, the sum of weights outgoing from its
corresponding node can be used to quantify the impact of the
publication, instead of using vanilla citation count.</p>
      <p>The reminder of the paper is organized as follows. Section
2 presents the related literature review. The paper discusses
the proposed method in Section 3 followed by the
experiments in Section 4. Section 5 discusses the results. Finally,
Section 6 corresponds to the conclusions.</p>
      <p>2</p>
    </sec>
    <sec id="sec-2">
      <title>Related Work</title>
      <p>The research areas relevant to the work present in this paper
belong to citation indexing, citation recommendation, link
prediction approaches, distant-supervised credit attribution
approaches and citation-intent classification approaches. We
briefly discuss these areas below:</p>
      <sec id="sec-2-1">
        <title>Citation Indexing</title>
        <p>
          A citation index indexes the links between publications that
authors make when they cite other publications. Citation
indexes aim to improve the dissemination and retrieval of
scientific literature. CiteSeer
          <xref ref-type="bibr" rid="ref12 ref37">(Giles, Bollacker, and Lawrence
1998; Li et al. 2006)</xref>
          is a first automated citation indexing
system that works by downloading publications from the
Web and converting them to text. It then parses the papers to
extract the citations and the context in which the citations are
made in the body of the paper, storing this information in a
database. Other examples of popular citation indices include
Google Scholar2, Web of Science3 by Clarivate Analytics,
Scopus4 by Elsevier and Semantic Scholar5. Some examples
of subject-specific citation indices include INSPIRE-HEP6
which covers high energy physics, PubMed7, which covers
2https://scholar.google.com/
3http://www.webofknowledge.com/
4https://www.scopus.com/
5https://www.semanticscholar.org/
6https://inspirehep.net/
7https://pubmed.ncbi.nlm.nih.gov/
life sciences and biomedical topics, and Astrophysics Data
System8 which covers astronomy and physics.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>Citation recommendation</title>
        <p>
          Citation recommendation describes the task of
recommending citations for a given text. It is an essential task, as
all claims written by the authors need to be backed up
in order to ensure reliability and truthfulness. The
approaches developed for citation recommendation can be
grouped into 4 groups as follows
          <xref ref-type="bibr" rid="ref10 ref46">(Fa¨rber and Jatowt 2020)</xref>
          :
hand-crafted feature based approaches, topic-modelling
based approaches, machine-translation based approaches,
and neural-network based approaches. Hand-crafted feature
based approaches are based on features are are manually
engineered by the developers. For example, text similarity
between the citation context and the candidate papers can
be used as one of the text-based features. Examples of
papers that propose hand-crafted feature based approaches
include
          <xref ref-type="bibr" rid="ref10 ref17 ref23 ref39 ref40 ref46 ref60 ref65">(Fa¨rber and Jatowt 2020; He et al. 2011; LIU, YAN,
and YAN 2016; Livne et al. 2014; Rokach et al. 1978)</xref>
          .
Topic modeling based approaches represent the candidate
papers’ text and the citation contexts by means of abstract
topics, and thereby exploiting the latent semantic structure
of texts. Examples of topic modeling based approaches
include
          <xref ref-type="bibr" rid="ref24 ref31">(He et al. 2010; Kataria, Mitra, and Bhatia 2010)</xref>
          .
The machine-translation based approaches apply the idea
of translating the citation context into the cited document
to find the candidate-papers worth citing. Examples in this
category include
          <xref ref-type="bibr" rid="ref22 ref26">(He et al. 2012; Huang et al. 2012)</xref>
          .
Finally, the popular examples of neural-network based
models include
          <xref ref-type="bibr" rid="ref20 ref36 ref44 ref66 ref66 ref68 ref72 ref72 ref9 ref9">(Ebesu and Fang 2017; Han et al. 2018; Huang
et al. 2015; Kobayashi, Shimbo, and Matsumoto 2018; Tang,
Wan, and Zhang 2014; Yin and Li 2017)</xref>
          .
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>Link-prediction</title>
        <p>
          A link is a connection between two nodes in a network.
As such, link-prediction is the problem of predicting the
existence of a link between two nodes in a network. A
good link-prediction model predicts the likelihood of a link
between two nodes, so it can not only be used to
predict new links, but to also curate the graph by filtering
less-likely links that are already present. Thus, the
linkprediction can be a useful tool to find likely citations in
a citation network. The citation recommendation task
described previously can be thought of as a special case of
linkprediction. Following the taxonomy described in
          <xref ref-type="bibr" rid="ref17 ref50">(Mart´ınez,
Berzal, and Cubero 2016)</xref>
          , link-prediction approaches can be
broadly categorized into three categories: similarity-based
approaches, probabilistic and statistical approaches and
algorithmic approaches. The similarity based approaches
assume that nodes tend to form links with other similar nodes,
and that two nodes are similar if they are connected to
similar nodes or are near in the network according to a given
similarity function. Examples of popular similarity functions
include number of common neighbors
          <xref ref-type="bibr" rid="ref38">(Liben-Nowell and
Kleinberg 2007)</xref>
          , Adamic-Adar index
          <xref ref-type="bibr" rid="ref1 ref62">(Adamic and Adar
8http://ads.harvard.edu/
2003)</xref>
          , etc. The probabilistic and statistical approaches
assume that the network has a known structure. These
approaches estimates the model parameters of the network
structure using statistical methods, and use these
parameters to calculate the likelihood of the presence of a link
between two nodes. Examples of probabilistic and
statistical approaches include
          <xref ref-type="bibr" rid="ref18 ref28 ref38 ref71">(Guimera` and Sales-Pardo 2009;
Huang 2010; Wang, Satuluri, and Parthasarathy 2007)</xref>
          .
Algorithmic approaches directly uses the link-prediction as
supervision to build the model. For example, link-prediction
task can be formulated as a binary classification task where
the positive instances are the pair of nodes which are
connected in the network, and negative instances are the
unconnected nodes. Examples include
          <xref ref-type="bibr" rid="ref51 ref60">(Menon and Elkan 2011;
Bliss et al. 2014)</xref>
          . Unsupervised or self-supervised node
embedding (such as DeepWalk
          <xref ref-type="bibr" rid="ref60">(Perozzi, Al-Rfou, and Skiena
2014)</xref>
          , node2vec
          <xref ref-type="bibr" rid="ref17">(Grover and Leskovec 2016)</xref>
          ), followed by
training a binary classifier and Graph Neural network
approaches such as GraphSage
          <xref ref-type="bibr" rid="ref19 ref66 ref72 ref9">(Hamilton, Ying, and Leskovec
2017)</xref>
          belong to this category.
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>Distant-supervised credit-attribution</title>
        <p>
          Various distant-supervised approaches have been developed
for credit-attribution, but the prior have primarily focused on
text documents. A document may be associated with
multiple labels but all the labels do not apply with equal
specificity to the individual parts of the documents. Credit
attribution problem refers to identifying the specificity of
labels to different parts of the document. Various
probabilistic and neural-network based approaches have been
developed to address the credit-attribution problem, such as
Labeled Latent Dirichlet Allocation (LLDA) (Ramage et al.
2009), Partially Labeled Dirichlet Allocation (PLDA)
          <xref ref-type="bibr" rid="ref51 ref64">(Ramage, Manning, and Dumais 2011)</xref>
          , Multi-Label Topic
Model (MLTM)
          <xref ref-type="bibr" rid="ref66 ref72 ref9">(Soleimani and Miller 2017)</xref>
          , Segmentation
with Refinement (SEG-REFINE)
          <xref ref-type="bibr" rid="ref44">(Manchanda and Karypis
2018)</xref>
          , and Credit Attribution with Attention (CAWA)
          <xref ref-type="bibr" rid="ref10 ref46 ref49">(Manchanda and Karypis 2020)</xref>
          .
        </p>
        <p>
          Another line of work uses distant-supervised
creditattribution for query-understanding in product search.
Examples include, (i) using the reformulation logs as a source
of distant-supervision to estimate a weight for each term in
the query that indicates the importance of the term towards
expressing the query’s product intent
          <xref ref-type="bibr" rid="ref47 ref48 ref5">(Manchanda, Sharma,
and Karypis 2019a,b)</xref>
          ; and (ii) annotating individual terms
in a query with the corresponding intended product
characteristics, using the characteristics of the engaged products
as a source of distant-supervision
          <xref ref-type="bibr" rid="ref10 ref46 ref49">(Manchanda, Sharma, and
Karypis 2020)</xref>
          .
        </p>
      </sec>
      <sec id="sec-2-5">
        <title>Citation-intent classification</title>
        <p>
          There is a large body of work studying the intent of
citations and devising categorization systems. In general, these
approaches treat citation-intent classification as a text
classification problem, and require the availability of training
data with ground truth annotations. Representative examples
include rule based approaches
          <xref ref-type="bibr" rid="ref1 ref11 ref62">(Pham and Hoffmann 2003;
Garzone and Mercer 2000)</xref>
          as well as machine-learning
Paper 2
        </p>
        <p>Paper 3</p>
        <p>Paper 4
Unit Normalization</p>
        <p>Paper 1
(Citing paper)
---[Paper
4]---[Paper
3]---[Paper
2]Representation of the historical concepts</p>
        <p>
          Minimize the explanation loss
driven approaches
          <xref ref-type="bibr" rid="ref30 ref34 ref5 ref70">(Valenzuela, Ha, and Etzioni 2015;
Jurgens et al. 2018; Cohan et al. 2019)</xref>
          . Generating labeled data
for for these supervised approaches is difficult and time
consuming, especially when the meaning of the labels is
userdefined. In contrast, our approaches are distant supervised,
that require no manual annotation.
        </p>
        <p>3</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Content-Informed Index (CII)</title>
      <p>In the absence of labels that define the impact, we assume
that the extent to which a cited paper informs the citing
paper is an indication of the citation’s impact. Specifically, we
assume that each paper Pi can be represented as a set of
concepts Ci. Further, we assume that each paper Pi is build on
top of a set of historical concepts Hi, and its novelty Ni is
the new set of concepts it proposes. The contribution of a
cited paper Pj towards the citing paper Pi is the set of
concepts Cji = Cj \ Hi. In other terms, the set of concepts Ci
is given by:</p>
      <p>Ci = Ni [ Hi = Ni [ [[PicitesPj Cji]:
The task at hand is to quantify the extent to which Cji
contributes towards Hi. To achieve this task, we look into the
following directions:
• How do we supervise the exercise? We minimize the
novelty of paper Pi, by trying to explain the concepts in
paper Pi (denoted by Ci) using the historical concepts,
i.e., the concepts of the papers it cites (Cj ). We call the
loss associated with this minimization as the explanation
loss. This gives rise to the following optimization
problem:
minimize X</p>
      <p>Ni = minimize X Ci</p>
      <p>
        Hi:
i
i
To proceed in this direction, we need to answer two
questions, (i) How to represent the the set of concepts
associated with the paper Pi?, and (ii) How do we represent the
set of historical concepts Hi? As we show next, we use
the textual content of the papers to estimate the
representations of Ci and Hi. Thus, we formulate our problem as a
distant-supervised problem, and the content of the papers
acts as a source of distant-supervision.
• How to represent the set of concepts associated with
a paper? For simplicity, we represent the set of
concepts associated with a paper (Ci) as a pretrained
vector representation (embedding) of its abstract, such as
Word2Vec
        <xref ref-type="bibr" rid="ref52">(Mikolov et al. 2013)</xref>
        , GloVe
        <xref ref-type="bibr" rid="ref59">(Pennington,
Socher, and Manning 2014)</xref>
        , BERT
        <xref ref-type="bibr" rid="ref7">(Devlin et al. 2018)</xref>
        ,
ELMo
        <xref ref-type="bibr" rid="ref61">(Peters et al. 2018)</xref>
        , etc. In this paper, we use the
pretrained representations pretrained on scientific
documents provided by ScispaCy
        <xref ref-type="bibr" rid="ref54">(Neumann et al. 2019)</xref>
        . The
representation of Ci is denoted by r(Ci).
• How do we represent the set of historical concepts Hi?
As the set of historical concepts Hi is a union of the
borrowed concepts from the cited papers (Cj ), we simply
represent the set of historical concepts as a weighted
linear combination of the representation of the concepts of
the cited papers, i.e.,
r(Hi)
      </p>
      <p>=
subject to</p>
      <p>X
Pi cites Pj</p>
      <p>X
Pi cites Pj
w~ji
w~jir(Cj )
w~j2i = 1
0; 8(i; j):</p>
      <p>P
w~j2i =</p>
      <sec id="sec-3-1">
        <title>We have the constrained norm condition (</title>
        <p>Pi cites Pj
1) to make the representation of r(Hi) agnostic to the
number of cited-papers (a paper can cite multiple papers
to reference the same borrowed concepts).</p>
        <p>The weights w~ji can be thought of as normalized
similarity measure between the concepts of the cited paper,
and the citation context. Thus, to estimate w~ji, we first
estimate unnormalized w~ji, denoted by wji, and then
normalize wji so as to have unit norm. The unnormalized
weight wji is precisely the extent to which Cj contributes
towards Hi (and hence Ci), i.e., the weight that we wish to
estimate in this paper. We estimate wji as a multilayer
perceptron, that takes as input the representations of the cited
paper and the citation context. We use the representation
associated with the corresponding concepts as the
representations of the cited papers (r(Cj )). Similar to r(Cj ),
we use the ScispaCy vector representation for the citation
context as the representation of the context, and denote it
by r(j ! i).</p>
        <p>The above discussion leads to the following formulation:
minimize</p>
        <p>f
subject to</p>
        <p>X jjr(Ci)</p>
        <p>i
w~ji = r</p>
        <p>X</p>
        <p>Pi cites Pj
wji</p>
        <p>P
Pi cites Pj
wj2i
w~jir(Cj )jj2
; 8(i; j);
wji = f (r(Cj ); r(Cji)); 8(i; j);
wji
wji
0; 8(i; j);
b; 8(i; j):
(1)</p>
        <p>The max-bound constraint (wji b) is introduced to limit
the projection space of the weights wji. This is because,
without this constraint, for a given citing paper Pi, if the
set of weights wji minimize Equation (1), then so will any
scalar multiplication of the weights wji. This can potentially
lead to the estimated weights being incomparable across
different citing papers. Having a max bound on the
estimated weights helps avoid this scenario. To take care of
the constraints, the function f ( ) can be implemented as a
L2 regularized multilayer perceptron, with a single output
node, and a non-negative mapping at the output node. Not
that we do not explicitly set the max-bound b, but it is
implicitly set by the L2 regularization of the weights of the
function f . The L2 regularization parameter is treated as
a hyperparameter. Figure 1 shows an overview of
ContentInformed Index (CII).</p>
        <p>4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experimental methodology</title>
      <sec id="sec-4-1">
        <title>Evaluation methodology and metrics</title>
        <p>
          We need to evaluate how well the weights estimated by our
proposed approach quantifies the extent to which a cited
paper informs the citing paper. To this extent, we leverage
various manually annotated datasets (explained later in
Section 4), where the annotations quantify the extent of
information in the citation. The task inherently becomes an
ordinal association, and we need to evaluate how well the
ranking imposed by our proposed method associates with
the ranking imposed by the manual annotations. As a
measure of rank correlation, we use the non-parametric Somers’
Delta
          <xref ref-type="bibr" rid="ref67">(Somers 1962)</xref>
          (denoted by ). Values of range
from 1(100% negative association, or perfect inversion)
to +1(100% positive association, or perfect agreement). A
value of zero indicates the absence of association. Formally,
given a dependent variable (i.e., the predicted weights by our
model) and an independent variable (i.e., the manually
annotated ground truth), is the difference between the number
of concordant and discordant pairs, divided by the number
of pairs with independent variable values in the pair being
unequal.
        </p>
      </sec>
      <sec id="sec-4-2">
        <title>Relation of to other metrics: When the independent</title>
        <p>
          variable has only two distinct classes (binary variable), the
area under the receiver operating characteristic curve (AUC
ROC) statistic is equivalent to
          <xref ref-type="bibr" rid="ref56">(Newson 2002)</xref>
          . Thus,
can also be visualized as a generalization of AUC ROC
towards ordinal classification with multiple classes. Further,
as the dependent variable (the weights estimated by our
proposed approach) is real valued, having two tied values on the
independent variable is very difficult. Thus, for our case,
is equivalent to Goodman and Kruskal’s Gamma
          <xref ref-type="bibr" rid="ref13 ref14 ref15 ref16">(Goodman
and Kruskal 1959, 1963, 1972, 1979)</xref>
          , and just a scaled
variant of Kendall’s coefficient
          <xref ref-type="bibr" rid="ref32">(Kendall 1938)</xref>
          , with are other
popular measures of ordinal association.
        </p>
      </sec>
      <sec id="sec-4-3">
        <title>Baselines</title>
        <p>
          We choose representative baselines from diverse categories
as discussed below:
Link-prediction approaches: The citation weights that
we estimate in this paper can also looked from the
linkprediction perspective, i.e., assigning a score to every
citation (link) in the citation graph, the score portraying the
likelihood of the existence of a link. Thus, the citations that are
noisy, i.e., the edges that do not make sense with the respect
to underlying link-prediction model get smaller weights. We
compare against two link-prediction methods, one based on
classic network embedding approach, and other belonging
to recent Graph Neural Network (GNN) based approaches.
• DeepWalk
          <xref ref-type="bibr" rid="ref60">(Perozzi, Al-Rfou, and Skiena 2014)</xref>
          :
DeepWalk is a popular method to learn node embeddings.
DeepWalk borrows ideas from language modeling and
incorporates them with network concepts. Its main proposition
is that linked nodes tend to be similar and they should
have similar embeddings as well. Once we have node
embeddings as the output of DeepWalk, we train a
binary classifier, with the positive instances as the pairs of
nodes which are connected in the network, and negative
instances are the unconnected nodes (generated using
negative sampling). We provide results using two different
classifiers: Logistic Regression (denoted by DeepWalk+LR)
and Multilayer Perceptron (denoted by DeepWalk+MLP).
Note that Deepwalk is a transductive model, and only
considers the network topology, i.e., DeepWalk does not use
the content of the papers to estimate the model.
• GraphSage
          <xref ref-type="bibr" rid="ref19 ref66 ref72 ref9">(Hamilton, Ying, and Leskovec 2017)</xref>
          :
GraphSAGE is a Graph Concolutional Network (GCN) based
framework for inductive representation learning on large
graphs. GraphSage is trained with the link-prediction loss,
so we do not use a second step (as in DeepWalk) to train
separate classifier. Note that, GraphSage is an inductive
model, so also considers the content of the papers in
addition to topology of the network to estimate the model.
Text-similarity based baselines: We can think of the
function f as a similarity measure between the cited
paper and the citation context. Thus, we consider the following
similarity measures as our baselines: We use the same
pretrained representations as we used as an input to CII, and
cosine similarity as the similarity measure, which is a
popular similarity measure for text data.
• Similarity-Abstract-Context: Similarity between the cited
abstract and the citation context.
• Similarity-Context-Abstract: Similarity between the
citing abstract and the citation context.
• Similarity-Abstract-Abstract: Similarity between the
cited abstract and citing abstract.
        </p>
        <p>To calculate each of the above similarity measures, we use
the same pretrained representations as we used as an input to
CII, and cosine similarity as the similarity measure, which is
a popular similarity measure for text data. The baselines
belonging to this category can also be thought of as
similaritybased link prediction approaches.</p>
        <p>
          In addition, we also consider another simple baseline,
referred to as Reference Frequency, where we assume that
more frequently the cited paper is referenced in the citing
paper, the higher the chances of the cited paper informing the
citing paper. This assumption has also been used as a feature
in prior supervised approaches
          <xref ref-type="bibr" rid="ref34 ref70">(Valenzuela, Ha, and Etzioni
2015)</xref>
          . The absolute frequency of referencing a cited-paper
may provide a good signal regarding the information
borrowed from the cited paper, when comparing with other
papers being cited by the same citing paper. However, as the
citation-behavior differs between papers, the absolute
frequency may not be comparable across different citing
papers. Thus, we also provide results after doing
normalization of the absolute frequency of the citation references for
each citing paper. We provide results for mean, max, and
min normalization. Specifically, given a citation and the
corresponding citing paper, the information weight for a
citation is calculated by dividing the number of references of
that citation, by the mean, max, and min of references of all
the citations in that citing paper, respectively.
        </p>
      </sec>
      <sec id="sec-4-4">
        <title>Datasets</title>
      </sec>
      <sec id="sec-4-5">
        <title>The Semantic Scholar Open Research Corpus (S2ORC):</title>
        <p>
          The S2ORC
          <xref ref-type="bibr" rid="ref41">(Lo et al. 2020)</xref>
          dataset is a citation graph of
81:1 million academic publications and 380:5 million
citation edges. We only consider the publications for which
fulltext is available and abstract contains at least 50 words. This
leaves us with a total of 5; 653; 297 papers, and 30; 533; 111
edges (citations).
        </p>
        <p>
          ACL-
          <xref ref-type="bibr" rid="ref27">2015: The ACL-2015</xref>
          <xref ref-type="bibr" rid="ref34 ref70">(Valenzuela, Ha, and Etzioni
2015)</xref>
          dataset contains 465 citations gathered from the ACL
anthology9, represented as tuples of (cited paper, citing
paper), with ordinal labels ranging from 0 to 3, in increasing
order of importance. The citations were annotated by one
expert, followed by annotation by another expert on a subset of
the dataset, to verify the inter-annotator agreement. We only
use the citations for which we have the inter-annotator
agreement, and the citations are present in the S2ORC dataset we
described before. The selected dataset contains 300 citations
among 316 unique publications. The total number of unique
citing publications are 283 and the total number of unique
cited publications are 38.
        </p>
        <p>
          ACL-ARC: The ACL-ARC
          <xref ref-type="bibr" rid="ref30">(Jurgens et al. 2018)</xref>
          is a
dataset of citation intents based on a sample of papers from
the ACL Anthology Reference Corpus
          <xref ref-type="bibr" rid="ref3">(Bird et al. 2008)</xref>
          and includes 1,941 citation instances from 186 papers and
is annotated by domain experts. The dataset provides ACL
IDs for the papers in the ACL corpus, but does not provide
an identifier to the papers outside the ACL corpus,
making it difficult to map many citations to the S2ORC
corpus. However, it provided the titles of those papers, and
we used these titles to map these papers to the papers in
the S2ORC dataset, if we found matching titles. The
annotations in ACL-ARC are provided at individual
citationcontext level, leading to multiple annotations for some of the
(cited paper, citing paper) pair. If this is the case, we chose
the highest-informing annotation for such (cited paper,
citing paper) pairs. The selected dataset contains 460 citations
among 547 unique publications. The total number of unique
        </p>
        <sec id="sec-4-5-1">
          <title>9https://www.aclweb.org/anthology/</title>
          <p>citing publications are 145 and the total number of unique
cited publications are 413.</p>
          <p>
            SciCite
            <xref ref-type="bibr" rid="ref5">(Cohan et al. 2019)</xref>
            SciCite is a dataset of
citation intents based on a sample of papers from the Semantic
Scholar corpus10, consisting of papers in general computer
science and medicine domains. Citation intent was labeled
using crowdsourcing. The annotators were asked to identify
the intent of a citation, and were directed to select among
three citation intent options: Method, Result/Comparison
and Background. This resulted in a total 9; 159
crowdsourced instances. We use the citations that are present in
the S2ORC dataset we described before. Similar to
ACLARC, the annotations are provided at individual
citationcontext level, leading to multiple annotations for some of the
(cited paper, citing paper) pair. For such cases, we chose the
highest-informing annotation for the (cited paper, citing
paper) pairs. The selected dataset contains 352 citations among
704 unique publications. There is no repeated citing or cited
publication in this dataset, thus, the total number of unique
citing publications as well as unique citing publications are
352 each.
          </p>
        </sec>
      </sec>
      <sec id="sec-4-6">
        <title>Parameter selection</title>
        <p>
          We treat one of the evaluation datasets (ACL-ARC) as the
validation set, and chose the hyperparameters of our
approaches and baselines with respect to best performance
on this dataset. For DeepWalk, we use the implementation
provided here11, with the default parameters, except the
dimensionality of the estimated representations, which is set
to 200 (for the sake of fairness, as the used 200
dimensional text representations for CII). For the models that
require learning, i.e., the logistic regression part of Deepwalk,
MLP part of Deepwalk, GraphSage, and CII, we used the
ADAM
          <xref ref-type="bibr" rid="ref34">(Kingma and Ba 2015)</xref>
          optimizer, with initial
learning rate of 0:0001, and further use step learning rate
scheduler, by exponentially decaying the learning rate by a factor
of 0:2 every epoch. We use L2 regularization of 0:0001. The
function f in CII was implemented as a multilayer
perceptron, with three hidden layers, with 256, 64, and 8 neurons,
10https://www.semanticscholar.org/
11https://github.com/xgfs/deepwalk-c
ACL-2015
        </p>
        <p>ACL-ARC
respectively. We use the same network architecture for the
MLP that we train on top of DeepWalk representations. We
train the logistic regression and MLP parts of Deepwalk,
GraphSage, and CII for a maximum of 50 epochs, and do
early-stopping if the validation performance does not
improve for 5 epochs. For GraphSage, we use the
implementation provided by DGL12. We used mini-batch size of 1024
for training the models.</p>
        <p>5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Results and discussion</title>
      <sec id="sec-5-1">
        <title>Quantitative analysis</title>
        <p>Table 1 shows the performance of the various approaches on
the Somers’ Delta ( ) for each of the datasets ACL-2015,
ACL-ARC and SciCite. For ACL-2015 and SciCite, the
proposed approach CII outperforms the competing approaches;
while for the ACL-ARC dataset, CII performs at par with
the best performing approach. The improvement of CII over
the second best performing approach is 22% and 103%, on
the ACL-2015 and SciCite datasets, respectively.</p>
        <p>Interestingly, the simplest baseline, Reference-frequency
and its normalized forms are the second best performing
approaches. While Reference-frequency performs at par with
the CII on the ACL-ARC dataset, it does not perform as
good on the other two datasets. This can be attributed to
the fact that the number of unique citing papers in
ACLARC dataset are relatively small. Thus, many citations in
ACL-ARC are shared by the same citing paper, which is not
the case with the other two datasets. Thus, as mentioned in
Section 4, absolute frequency of referencing a cited-paper
may provide a good signal regarding the information
borrowed from the cited paper, when comparing with other
papers being cited by the same citing paper. Further, even the
normalized forms of the Reference-frequency lead to only
marginal increase in performance for the ACL-2015 and
SciCite datasets. Thus, the simple normalizations (such as
mean, max and min normalization used in this paper), are
not sufficient to address the difference in citation-behavior
that occurs between different papers.</p>
        <p>12https://github.com/dmlc/dgl/blob/master/examples/pytorch/
graphsage</p>
        <p>Furthermore, we observe that simple similarity based
approaches, such as cosine-similarity between pairs of various
entities (each combination of citing abstract, citing abstract,
and citation-context) performs close to random scoring (
value of close to zero). This validates that the simple
similarity measures, like cosine similarity are not sufficient to
manifest the the information that a cited-paper lends to the
citing-paper; thus, showing the necessity of more expressive
approaches, like CII.</p>
        <p>In addition, the other learning-based
link-predictionbased approaches perform considerably worse than the
simple baseline reference-frequency. While on ACL-2015 and
SciCite datasets, they perform close to random scoring, the
performance on ACL-ARC dataset is better than the random
baseline.</p>
      </sec>
      <sec id="sec-5-2">
        <title>Qualitative analysis</title>
        <p>In order to understand the patterns that the proposed
approach CII learns, we look into the data instances with the
highest and lowest predicted weights. As the function f
takes as input both the abstract of the cited paper and the
citation context, the learnt patterns can be a complex function
of the cited paper abstract and the citation context. Thus, for
simplicity, we limit the discussion in this section to
understand the linguistic patterns in the citation context, and how
these patterns associate with the weights predicted for them.</p>
        <p>In this direction, we select 10; 000 citation-contexts
corresponding to citations with highest predicted weights, and
plot the word clouds for these contexts. We repeat the same
exercise for the citation-contexts with the lowest predicted
weights. Figures 2 and 3 shows the wordclouds for the
highest weighted citations and lowest weighted citations,
respectively. These figures show some clear discriminatory
patters between the highest-weighted and lowest-weighted
citations, that relate well with the information carried by a
citation. For example, the words such as ‘used’ and ‘using’ are
very frequent in the citation contexts of the highest weighted
citations. This is expected, as such verbs provide a strong
signal that the cited work was indeed employed by the citing
paper, and hence the cited paper informed the citing work.
Another interesting pattern in the highest weighted citations
is the presence of words like ‘fig’, ‘figure’ and ‘table’. Such
words are usually present when the authors present or
describe important concepts, such as methods and results. As
such, citations in these important sections indicates that the
cited work is used or extended in the citing paper, which
signals importance.</p>
        <p>On the other hand, the wordcloud for the least weighted
citations (Figure 3) is dominated by weasle words such as
‘may’, ‘many’, ‘however’, etc. The words such as ‘many’
commonly occur in the related work section of the paper,
where the paper presents some examples of other related
works to emphasize the problem that the citing paper is
solving. The words like ‘may’, ‘however’, ‘but’ etc are
commonly used to describe some limitation of the cited work.
Such citations are expected to be incidental, carrying less
information, as compared to other citations.
In this paper, we presented approaches to estimate
contentaware bibliometrics to accurately quantitatively measure the
scholarly impact of a publication. Our distant-supervised
approaches use the content of the publications to weight
the edges of a citation network, where the weights quantify
the extent to which the cited-publication informs the
citingpublication. Experiments on the three manually annotated
datasets show the advantage of using the proposed method
on the competing approaches. Our work makes a step
towards developing content-aware bibliometrics, and envision
that the proposed method will serve as a motivation to
develop other rigorous quality-related metrics.</p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Adamic</surname>
            ,
            <given-names>L. A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Adar</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <year>2003</year>
          .
          <article-title>Friends and neighbors on the web</article-title>
          .
          <source>Social networks</source>
          <volume>25</volume>
          (
          <issue>3</issue>
          ):
          <fpage>211</fpage>
          -
          <lpage>230</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Aguinis</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ;
          <article-title>Sua´rez-Gonza´lez, I.;</article-title>
          <string-name>
            <surname>Lannelongue</surname>
            , G.; and Joo,
            <given-names>H.</given-names>
          </string-name>
          <year>2012</year>
          .
          <article-title>Scholarly impact revisited</article-title>
          .
          <source>Academy of Management Perspectives</source>
          <volume>26</volume>
          (
          <issue>2</issue>
          ):
          <fpage>105</fpage>
          -
          <lpage>132</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Bird</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; Dale,
          <string-name>
            <surname>R.</surname>
          </string-name>
          ; Dorr,
          <string-name>
            <given-names>B. J.</given-names>
            ;
            <surname>Gibson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ;
            <surname>Joseph</surname>
          </string-name>
          , M. T.;
          <string-name>
            <surname>Kan</surname>
          </string-name>
          , M.-Y.;
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Powley</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Radev</surname>
            ,
            <given-names>D. R.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Tan</surname>
            ,
            <given-names>Y. F.</given-names>
          </string-name>
          <year>2008</year>
          .
          <article-title>The acl anthology reference corpus: A reference dataset for bibliographic research in computational linguistics</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          2014.
          <article-title>An evolutionary algorithm approach to link prediction in dynamic social networks</article-title>
          .
          <source>Journal of Computational Science</source>
          <volume>5</volume>
          (
          <issue>5</issue>
          ):
          <fpage>750</fpage>
          -
          <lpage>764</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Cohan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ammar</surname>
            , W.; van Zuylen,
            <given-names>M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Cady</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <article-title>Structural Scaffolds for Citation Intent Classification in Scientific Publications</article-title>
          .
          <source>In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies</source>
          , Volume
          <volume>1</volume>
          (Long and Short Papers),
          <fpage>3586</fpage>
          -
          <lpage>3596</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Devlin</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; Chang, M.-W.;
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Toutanova</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <article-title>Bert: Pre-training of deep bidirectional transformers for language understanding</article-title>
          . arXiv preprint arXiv:
          <year>1810</year>
          .04805 .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <surname>Ebesu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ; and Fang,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          <year>2017</year>
          .
          <article-title>Neural citation network for context-aware citation recommendation</article-title>
          .
          <source>In Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval</source>
          ,
          <fpage>1093</fpage>
          -
          <lpage>1096</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <article-title>Fa¨rber, M.;</article-title>
          and
          <string-name>
            <surname>Jatowt</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>Citation Recommendation: Approaches and Datasets</article-title>
          . arXiv preprint arXiv:
          <year>2002</year>
          .06961 .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <surname>Garzone</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and Mercer,
          <string-name>
            <surname>R. E.</surname>
          </string-name>
          <year>2000</year>
          .
          <article-title>Towards an automated citation classifier</article-title>
          .
          <source>In Conference of the canadian society for computational studies of intelligence</source>
          ,
          <fpage>337</fpage>
          -
          <lpage>346</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>C. L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Bollacker</surname>
            , K. D.; and Lawrence,
            <given-names>S.</given-names>
          </string-name>
          <year>1998</year>
          .
          <article-title>CiteSeer: An automatic citation indexing system</article-title>
          .
          <source>In Proceedings of the third ACM conference on Digital libraries</source>
          ,
          <fpage>89</fpage>
          -
          <lpage>98</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <surname>Goodman</surname>
            ,
            <given-names>L. A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Kruskal</surname>
            ,
            <given-names>W. H.</given-names>
          </string-name>
          <year>1959</year>
          .
          <article-title>Measures of association for cross classifications</article-title>
          . II:
          <article-title>Further discussion and references</article-title>
          .
          <source>Journal of the American Statistical Association</source>
          <volume>54</volume>
          (
          <issue>285</issue>
          ):
          <fpage>123</fpage>
          -
          <lpage>163</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <surname>Goodman</surname>
            ,
            <given-names>L. A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Kruskal</surname>
            ,
            <given-names>W. H.</given-names>
          </string-name>
          <year>1963</year>
          .
          <article-title>Measures of association for cross classifications III: Approximate sampling theory</article-title>
          .
          <source>Journal of the American Statistical Association</source>
          <volume>58</volume>
          (
          <issue>302</issue>
          ):
          <fpage>310</fpage>
          -
          <lpage>364</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Goodman</surname>
            ,
            <given-names>L. A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Kruskal</surname>
            ,
            <given-names>W. H.</given-names>
          </string-name>
          <year>1972</year>
          .
          <article-title>Measures of association for cross classifications, IV: Simplification of asymptotic variances</article-title>
          .
          <source>Journal of the American Statistical Association</source>
          <volume>67</volume>
          (
          <issue>338</issue>
          ):
          <fpage>415</fpage>
          -
          <lpage>421</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <surname>Goodman</surname>
            ,
            <given-names>L. A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Kruskal</surname>
            ,
            <given-names>W. H.</given-names>
          </string-name>
          <year>1979</year>
          .
          <article-title>Measures of association for cross classifications</article-title>
          .
          <source>In Measures of association for cross classifications</source>
          ,
          <fpage>2</fpage>
          -
          <lpage>34</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Grover</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Leskovec</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2016</year>
          . node2vec:
          <article-title>Scalable feature learning for networks</article-title>
          .
          <source>In Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          ,
          <fpage>855</fpage>
          -
          <lpage>864</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Guimera</surname>
            <given-names>`</given-names>
          </string-name>
          , R.; and
          <string-name>
            <surname>Sales-Pardo</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2009</year>
          .
          <article-title>Missing and spurious interactions and the reconstruction of complex networks</article-title>
          .
          <source>Proceedings of the National Academy of Sciences</source>
          <volume>106</volume>
          (
          <issue>52</issue>
          ):
          <fpage>22073</fpage>
          -
          <lpage>22078</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <surname>Hamilton</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ying</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Leskovec</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>Inductive representation learning on large graphs</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          ,
          <volume>1024</volume>
          -
          <fpage>1034</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>Han</surname>
            ,
            <given-names>J</given-names>
          </string-name>
          .; Song,
          <string-name>
            <given-names>Y.</given-names>
            ;
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W. X.</given-names>
            ;
            <surname>Shi</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          ; and Zhang,
          <string-name>
            <surname>H.</surname>
          </string-name>
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <article-title>hyperdoc2vec: Distributed representations of hypertext documents</article-title>
          . arXiv preprint arXiv:
          <year>1805</year>
          .03793 .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <surname>He</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Nie</surname>
          </string-name>
          , J.-Y.;
          <string-name>
            <surname>Lu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>W. X.</given-names>
          </string-name>
          <year>2012</year>
          .
          <article-title>Positionaligned translation model for citation recommendation</article-title>
          .
          <source>In International symposium on string processing and information retrieval</source>
          ,
          <fpage>251</fpage>
          -
          <lpage>263</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Kifer</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Pei</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; Mitra,
          <string-name>
            <given-names>P.</given-names>
            ; and
            <surname>Giles</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. L.</surname>
          </string-name>
          <year>2011</year>
          .
          <article-title>Citation recommendation without author supervision</article-title>
          .
          <source>In Proceedings of the fourth ACM international conference on Web search and data mining</source>
          ,
          <fpage>755</fpage>
          -
          <lpage>764</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>He</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Pei</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Kifer</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ; Mitra,
          <string-name>
            <given-names>P.</given-names>
            ; and
            <surname>Giles</surname>
          </string-name>
          ,
          <string-name>
            <surname>L.</surname>
          </string-name>
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <article-title>Context-aware citation recommendation</article-title>
          .
          <source>In Proceedings of the 19th international conference on World wide web</source>
          ,
          <fpage>421</fpage>
          -
          <lpage>430</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ; Kataria,
          <string-name>
            <given-names>S.</given-names>
            ;
            <surname>Caragea</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ;
            <surname>Mitra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ;
            <surname>Giles</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. L.</given-names>
            ; and
            <surname>Rokach</surname>
          </string-name>
          ,
          <string-name>
            <surname>L.</surname>
          </string-name>
          <year>2012</year>
          .
          <article-title>Recommending citations: translating papers into references</article-title>
          .
          <source>In Proceedings of the 21st ACM international conference on Information and knowledge management</source>
          ,
          <year>1910</year>
          -
          <fpage>1914</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          2015.
          <article-title>A neural probabilistic model for context based citation recommendation</article-title>
          .
          <source>In Twenty-ninth AAAI conference on artificial intelligence.</source>
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          <year>2010</year>
          .
          <article-title>Link prediction based on graph topology: The predictive value of generalized clustering coefficient</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          <string-name>
            <surname>Available at SSRN</surname>
          </string-name>
          1634014 .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          <string-name>
            <surname>Jurgens</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Kumar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; Hoover,
          <string-name>
            <given-names>R.</given-names>
            ;
            <surname>McFarland</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ; and
            <surname>Jurafsky</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          <year>2018</year>
          .
          <article-title>Measuring the evolution of a scientific field through citation frames</article-title>
          .
          <source>Transactions of the Association for Computational Linguistics</source>
          <volume>6</volume>
          :
          <fpage>391</fpage>
          -
          <lpage>406</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          <string-name>
            <surname>Kataria</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; Mitra,
          <string-name>
            <given-names>P.</given-names>
            ; and
            <surname>Bhatia</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          <year>2010</year>
          .
          <article-title>Utilizing Context in Generative Bayesian Models for Linked Corpus</article-title>
          . In Aaai, volume
          <volume>10</volume>
          ,
          <article-title>1</article-title>
          . Citeseer.
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          <string-name>
            <surname>Kendall</surname>
            ,
            <given-names>M. G.</given-names>
          </string-name>
          <year>1938</year>
          .
          <article-title>A new measure of rank correlation</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          <source>Biometrika</source>
          <volume>30</volume>
          (
          <issue>1</issue>
          /2):
          <fpage>81</fpage>
          -
          <lpage>93</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          <string-name>
            <surname>Kingma</surname>
            ,
            <given-names>D. P.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Ba</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>Adam: A Method for Stochastic Optimization</article-title>
          URL http://arxiv.org/abs/1412.
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          <string-name>
            <surname>Kleinberg</surname>
            ,
            <given-names>J. M.</given-names>
          </string-name>
          <year>1999</year>
          .
          <article-title>Authoritative sources in a hyperlinked environment</article-title>
          .
          <source>Journal of the ACM (JACM) 46</source>
          (
          <issue>5</issue>
          ):
          <fpage>604</fpage>
          -
          <lpage>632</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          <string-name>
            <surname>Kobayashi</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Shimbo</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and Matsumoto,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          <year>2018</year>
          .
          <article-title>Citation recommendation using distributed representation of discourse facets in scientific articles</article-title>
          .
          <source>In Proceedings of the 18th ACM/IEEE on joint conference on digital libraries</source>
          ,
          <fpage>243</fpage>
          -
          <lpage>251</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Councill</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Lee</surname>
          </string-name>
          , W.-C.; and
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>C. L.</given-names>
          </string-name>
          <year>2006</year>
          .
          <article-title>CiteSeerx: an architecture and web service design for an academic document search engine</article-title>
          .
          <source>In Proceedings of the 15th international conference on World Wide Web</source>
          ,
          <fpage>883</fpage>
          -
          <lpage>884</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          <string-name>
            <surname>Liben-Nowell</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Kleinberg</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2007</year>
          .
          <article-title>The linkprediction problem for social networks</article-title>
          .
          <source>Journal of the American society for information science and technology 58</source>
          <volume>(7)</volume>
          :
          <fpage>1019</fpage>
          -
          <lpage>1031</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          <string-name>
            <surname>LIU</surname>
            ,
            <given-names>Y.; YAN</given-names>
          </string-name>
          , R.; and YAN,
          <string-name>
            <surname>H.</surname>
          </string-name>
          <year>2016</year>
          .
          <article-title>Personalized Citation Recommendation Based on User's Preference and Language Model</article-title>
          .
          <source>Journal of Chinese Information Processing (2)</source>
          :
          <fpage>18</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          <string-name>
            <surname>Livne</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Gokuladas</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Teevan</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; Dumais, S. T.; and
          <string-name>
            <surname>Adar</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>CiteSight: supporting contextual citation recommendation using differential search</article-title>
          .
          <source>In Proceedings of the 37th international ACM SIGIR conference on Research &amp; development in information retrieval</source>
          ,
          <fpage>807</fpage>
          -
          <lpage>816</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          <string-name>
            <surname>Lo</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>L. L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Neumann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Kinney</surname>
          </string-name>
          , R.; and
          <string-name>
            <surname>Weld</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <year>2020</year>
          .
          <article-title>S2ORC: The Semantic Scholar Open Research Corpus</article-title>
          .
          <source>In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics</source>
          ,
          <fpage>4969</fpage>
          -
          <lpage>4983</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref42">
        <mixed-citation>
          <article-title>Online: Association for Computational Linguistics</article-title>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <year>2020</year>
          .acl-main.447. URL https://www.aclweb.
        </mixed-citation>
      </ref>
      <ref id="ref43">
        <mixed-citation>
          org/anthology/2020.acl-main.
          <volume>447</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref44">
        <mixed-citation>
          <string-name>
            <surname>Manchanda</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and Karypis,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <year>2018</year>
          .
          <article-title>Text segmentation on multilabel documents: A distant-supervised approach</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref45">
        <mixed-citation>
          <source>In 2018 IEEE International Conference on Data Mining (ICDM)</source>
          ,
          <fpage>1170</fpage>
          -
          <lpage>1175</lpage>
          . IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref46">
        <mixed-citation>
          <string-name>
            <surname>Manchanda</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and Karypis,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <year>2020</year>
          .
          <article-title>CAWA: An Attention-Network for Credit Attribution</article-title>
          .
          <source>In AAAI</source>
          ,
          <fpage>8472</fpage>
          -
          <lpage>8479</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref47">
        <mixed-citation>
          <string-name>
            <surname>Manchanda</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; Sharma,
          <string-name>
            <surname>M.</surname>
          </string-name>
          ; and Karypis,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <year>2019a</year>
          .
          <article-title>Intent term selection and refinement in e-commerce queries</article-title>
          . arXiv preprint arXiv:
          <year>1908</year>
          .08564 .
        </mixed-citation>
      </ref>
      <ref id="ref48">
        <mixed-citation>
          <string-name>
            <surname>Manchanda</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; Sharma,
          <string-name>
            <surname>M.</surname>
          </string-name>
          ; and Karypis,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <year>2019b</year>
          .
          <article-title>Intent term weighting in e-commerce queries</article-title>
          .
          <source>In Proceedings of the 28th ACM International Conference on Information and Knowledge Management</source>
          ,
          <fpage>2345</fpage>
          -
          <lpage>2348</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref49">
        <mixed-citation>
          <string-name>
            <surname>Manchanda</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; Sharma,
          <string-name>
            <surname>M.</surname>
          </string-name>
          ; and Karypis,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <year>2020</year>
          .
          <article-title>DistantSupervised Slot-Filling for E-Commerce Queries</article-title>
          . arXiv preprint arXiv:
          <year>2012</year>
          .08134 .
        </mixed-citation>
      </ref>
      <ref id="ref50">
        <mixed-citation>
          <string-name>
            <surname>Mart</surname>
          </string-name>
          ´ınez, V.;
          <string-name>
            <surname>Berzal</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Cubero</surname>
          </string-name>
          , J.-C.
          <year>2016</year>
          .
          <article-title>A survey of link prediction in complex networks</article-title>
          .
          <source>ACM computing surveys (CSUR) 49</source>
          (
          <issue>4</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>33</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref51">
        <mixed-citation>
          <string-name>
            <surname>Menon</surname>
            ,
            <given-names>A. K.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Elkan</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <year>2011</year>
          .
          <article-title>Link prediction via matrix factorization</article-title>
          .
          <source>In Joint european conference on machine learning and knowledge discovery in databases</source>
          ,
          <volume>437</volume>
          -
          <fpage>452</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref52">
        <mixed-citation>
          <string-name>
            <surname>Mikolov</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Corrado</surname>
          </string-name>
          , G.; and
          <string-name>
            <surname>Dean</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2013</year>
          .
          <article-title>Efficient estimation of word representations in vector space</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref53">
        <mixed-citation>
          <source>arXiv preprint arXiv:1301</source>
          .
          <fpage>3781</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref54">
        <mixed-citation>
          <string-name>
            <surname>Neumann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>King</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Beltagy</surname>
            ,
            <given-names>I.;</given-names>
          </string-name>
          and
          <string-name>
            <surname>Ammar</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <year>2019</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref55">
        <mixed-citation>
          <article-title>ScispaCy: Fast and Robust Models for Biomedical Natural Language Processing</article-title>
          .
          <source>In Proceedings of the 18th BioNLP Workshop and Shared Task</source>
          ,
          <fpage>319</fpage>
          -
          <lpage>327</lpage>
          . Florence, Italy:
          <article-title>Association for Computational Linguistics</article-title>
          . doi:
          <volume>10</volume>
          .18653/v1/
          <fpage>W19</fpage>
          -5034. URL https://www.aclweb.org/anthology/W19- 5034.
        </mixed-citation>
      </ref>
      <ref id="ref56">
        <mixed-citation>
          <string-name>
            <surname>Newson</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <year>2002</year>
          .
          <article-title>Parameters behind “nonparametric” statistics: Kendall's tau, Somers' D and median differences</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref57">
        <mixed-citation>
          <source>The Stata Journal</source>
          <volume>2</volume>
          (
          <issue>1</issue>
          ):
          <fpage>45</fpage>
          -
          <lpage>64</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref58">
        <mixed-citation>
          <string-name>
            <surname>Page</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Brin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; Motwani, R.; and Winograd,
          <string-name>
            <surname>T.</surname>
          </string-name>
          <year>1999</year>
          .
          <article-title>The PageRank citation ranking: Bringing order to the web</article-title>
          .
          <source>Technical report</source>
          , Stanford InfoLab.
        </mixed-citation>
      </ref>
      <ref id="ref59">
        <mixed-citation>
          <string-name>
            <surname>Pennington</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; Socher, R.; and
          <string-name>
            <surname>Manning</surname>
            ,
            <given-names>C. D.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Glove: Global vectors for word representation</article-title>
          .
          <source>In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP)</source>
          ,
          <fpage>1532</fpage>
          -
          <lpage>1543</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref60">
        <mixed-citation>
          <string-name>
            <surname>Perozzi</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Al-Rfou</surname>
          </string-name>
          , R.; and
          <string-name>
            <surname>Skiena</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2014</year>
          .
          <article-title>Deepwalk: Online learning of social representations</article-title>
          .
          <source>In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          ,
          <fpage>701</fpage>
          -
          <lpage>710</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref61">
        <mixed-citation>
          <string-name>
            <surname>Peters</surname>
            ,
            <given-names>M. E.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Neumann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Iyyer</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Gardner</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Clark</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Lee</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Zettlemoyer</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <year>2018</year>
          .
          <article-title>Deep contextualized word representations</article-title>
          .
          <source>arXiv preprint arXiv:1802</source>
          .05365 .
        </mixed-citation>
      </ref>
      <ref id="ref62">
        <mixed-citation>
          <string-name>
            <surname>Pham</surname>
            ,
            <given-names>S. B.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Hoffmann</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2003</year>
          .
          <article-title>A new approach for scientific citation classification using cue phrases</article-title>
          .
          <source>In Australasian Joint Conference on Artificial Intelligence</source>
          ,
          <fpage>759</fpage>
          -
          <lpage>771</lpage>
          . Springer.
        </mixed-citation>
      </ref>
      <ref id="ref63">
        <mixed-citation>
          2009.
          <article-title>Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora</article-title>
          .
          <source>In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1-</source>
          Volume
          <volume>1</volume>
          ,
          <fpage>248</fpage>
          -
          <lpage>256</lpage>
          . Association for Computational Linguistics.
        </mixed-citation>
      </ref>
      <ref id="ref64">
        <mixed-citation>
          <string-name>
            <surname>Ramage</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Manning</surname>
          </string-name>
          , C. D.; and
          <string-name>
            <surname>Dumais</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2011</year>
          .
          <article-title>Partially labeled topic models for interpretable text mining</article-title>
          .
          <source>In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          ,
          <fpage>457</fpage>
          -
          <lpage>465</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref65">
        <mixed-citation>
          <string-name>
            <surname>Rokach</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Mitra</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Kataria</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <year>1978</year>
          .
          <article-title>A supervised learning method for context-aware citation recommendation in a large corpus. INVITED SPEAKER: Analyzing the Performance of Top-</article-title>
          K Retrieval Algorithms
          <year>1978</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref66">
        <mixed-citation>
          <string-name>
            <surname>Soleimani</surname>
          </string-name>
          , H.; and
          <string-name>
            <surname>Miller</surname>
            ,
            <given-names>D. J.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>Semisupervised, multilabel, multi-instance learning for structured data</article-title>
          .
          <source>Neural computation 29</source>
          <volume>(4)</volume>
          :
          <fpage>1053</fpage>
          -
          <lpage>1102</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref67">
        <mixed-citation>
          <string-name>
            <surname>Somers</surname>
            ,
            <given-names>R. H.</given-names>
          </string-name>
          <year>1962</year>
          .
          <article-title>A new asymmetric measure of association for ordinal variables</article-title>
          .
          <source>American sociological review 799-811.</source>
        </mixed-citation>
      </ref>
      <ref id="ref68">
        <mixed-citation>
          <string-name>
            <surname>Tang</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wan</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ; and Zhang,
          <string-name>
            <surname>X.</surname>
          </string-name>
          <year>2014</year>
          .
          <article-title>Cross-language context-aware citation recommendation in scientific articles</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref69">
        <mixed-citation>
          <source>In Proceedings of the 37th international ACM SIGIR conference on Research &amp; development in information retrieval</source>
          ,
          <fpage>817</fpage>
          -
          <lpage>826</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref70">
        <mixed-citation>
          <string-name>
            <surname>Valenzuela</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ha</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Etzioni</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          <year>2015</year>
          .
          <article-title>Identifying meaningful citations</article-title>
          .
          <source>In Workshops at the twenty-ninth AAAI conference on artificial intelligence.</source>
        </mixed-citation>
      </ref>
      <ref id="ref71">
        <mixed-citation>
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Satuluri</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Parthasarathy</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2007</year>
          .
          <article-title>Local probabilistic models for link prediction</article-title>
          .
          <source>In Seventh IEEE international conference on data mining (ICDM</source>
          <year>2007</year>
          ),
          <fpage>322</fpage>
          -
          <lpage>331</lpage>
          . IEEE.
        </mixed-citation>
      </ref>
      <ref id="ref72">
        <mixed-citation>
          <string-name>
            <surname>Yin</surname>
            ,
            <given-names>J.;</given-names>
          </string-name>
          and
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>Personalized citation recommendation via convolutional neural networks. In Asia-Pacific web (APWeb) and web-age information management (WAIM) joint conference on web and big data</article-title>
          ,
          <fpage>285</fpage>
          -
          <lpage>293</lpage>
          . Springer.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>