<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">A preliminary study to compare deep learning with rule-based approaches for citation classification</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Julien</forename><surname>Perier-Camby</surname></persName>
							<email>julien.perier-camby@etu.univ-lyon.fr</email>
							<affiliation key="aff0">
								<orgName type="laboratory">Laboratoire LIRIS</orgName>
								<orgName type="institution">Université Claude Bernard Lyon 1</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Marc</forename><surname>Bertin</surname></persName>
							<email>marc.bertin@univ-lyon1.fr</email>
							<affiliation key="aff1">
								<orgName type="laboratory">Laboratoire ELICO</orgName>
								<orgName type="institution">Université Claude Bernard Lyon 1</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Iana</forename><surname>Atanassova</surname></persName>
							<email>iana.atanassova@univ-fcomte.fr</email>
							<affiliation key="aff2">
								<orgName type="department">CRIT</orgName>
								<orgName type="institution">Université de Bourgogne Franche-Comté</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Frédéric</forename><surname>Armetta</surname></persName>
							<email>frederic.armetta@univ-lyon1.fr</email>
							<affiliation key="aff0">
								<orgName type="laboratory">Laboratoire LIRIS</orgName>
								<orgName type="institution">Université Claude Bernard Lyon 1</orgName>
								<address>
									<country key="FR">France</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">A preliminary study to compare deep learning with rule-based approaches for citation classification</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">EA8D2F56D286D8ED0DBFEB51C1D014BF</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T20:03+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Biattentive Classification Network</term>
					<term>Citation Classification</term>
					<term>Citation Analysis</term>
					<term>Citation Contexts. BIR 2019 Workshop on Bibliometric-enhanced Information Retrieval</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Categorization of semantic relationships between scientific papers is a key to characterize the condition of a research field and to identify influential works. Recently, new approaches based on Deep Learning have demonstrated good capacities to tackle Natural Language Processing problems, such as text classification and information extraction. In this paper, we show how deep learning algorithms can automatically learn to classify citations, and could provide a relevant alternative when compared with methods based on pattern extractions from the recent state of the art. The paper discusses their appropriateness given the requirement of large datasets to train neural networks.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>The categorization of semantic relationships is at the very heart of bibliometrics and Natural Languages Processing research. As described by Garfield more than 50 years ago <ref type="bibr" target="#b6">[7]</ref>, understanding how scholars use and frame citations is an essential prerequisite to characterize the state of a scientific field and to identify influential works. The research on citation acts has already proposed numerous empirical studies and models, in particular through the production of ontologies such as CiTO (see <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b4">5]</ref>) or studies on the analysis of sentiments applied to the context of citations <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b10">11]</ref>.</p><p>Most of the studies in this field rely on Rule-based Information Extraction in order to categorize and semantically annotate citation acts. The general idea of such approaches in Natural Language Processing is to propose a categorization of citation contexts through the identification of patterns or text structures <ref type="bibr" target="#b15">[16,</ref><ref type="bibr" target="#b8">9,</ref><ref type="bibr" target="#b1">2,</ref><ref type="bibr" target="#b9">10,</ref><ref type="bibr" target="#b0">1]</ref>. Nevertheless, the declarative nature of rule-based approaches leads to drawbacks and tends to be replaced by machine learning alternatives <ref type="bibr" target="#b3">[4]</ref>.</p><p>To our knowledge, deep learning methods have not yet been applied to categorize citations in texts, i.e. to determine a class for each of the citation acts. The reasons for this are mainly because few datasets are publicly available, and because they tend to be small and unbalanced, making them difficult to use for the development of deep learning approaches. If we consider the progress enabled by deep learning in any domains, it is nevertheless interesting to show how deep learning approaches behave within this innovative context.</p><p>In this paper, we aim to compare the most efficient rule-based approach from the state of the art used for categorization <ref type="bibr" target="#b7">[8]</ref> to a famous deep learning approach well-known for its ability to identify sentence meanings <ref type="bibr" target="#b14">[15]</ref>. In section 2, we describe how rule-based approaches have been applied to categorize citations and the main principles of deep learning approaches. We discuss the advantages and drawbacks for both approaches, and underline the challenges in training a neural network with a dataset that is limited in size and unbalanced between labeled categories or classes. This section introduces the Biattentive Classification Network (BCN, <ref type="bibr" target="#b11">[12]</ref>) combined with Embeddings from Language Models (ELMo, <ref type="bibr" target="#b14">[15]</ref>) word representations that we experiment. Section 3 introduces the corpus and the protocol that we use for evaluation. The results are presented and discussed in section 4. The conclusion is presented in section 5.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Categorization of Semantic Relationships</head><p>In this section, we describe and discuss the general functioning of rule-based and deep learning approaches and their requirements. Rather that giving the detailed description of each of the approaches, we present their general properties for the sake of comparison.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Rule-based information extraction</head><p>In rule-based approaches, one has to define a set of discourse features which can be relevant to characterize the sentences semantics dedicated to different scopes. A state-of-the-art method for rule-based information extraction applied to citation framing has been proposed by <ref type="bibr" target="#b7">[8]</ref>, using pattern-based features, topicbased features and prototypical argument features. As a final step, a training phase is used to weight the relevance of each of the available patterns depending on the class to predict. This is usually done through shallow machine learning models (for instance, k-nearest neighbors <ref type="bibr" target="#b16">[17]</ref> or random forest <ref type="bibr" target="#b7">[8]</ref>). Such models require smaller sizes of training datasets to provide satisfying results, compared to deep neural networks.</p><p>It should be noted that rule-based methods suffer only slightly from unbalanced datasets as the features are hand-crafted, and therefore inferred on wider knowledge and not limited to the sample in the training dataset. If a citation class is under represented, the classifier could still capture part of the meaning, as the knowledge used for the capturing is provided by an expert. Thus, the lack of balance in the dataset, only slightly degrades the classifier learning.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Deep learning information extraction</head><p>Deep learning algorithms are artificial neural networks that learn to perform tasks by learning from samples. For the specific problem we address, the network takes as input some selected characteristics of the citation and learns to give as an output the appropriate prediction (citation class). The efficiency of such algorithms does not rely on any task-specific rules, but rather benefits from non linear functions dedicated to capture complex patterns during the learning phase in order to produce a model capable of categorizing new samples.</p><p>Deep learning algorithms are highly sensitive to the quality of the training data as they do not rely on any external knowledge. As for any machine learning algorithm, the training data should be as balanced as possible, i.e. the variables have to be independent and identically distributed, and the training dataset should be large enough for the system to learn. For the so addressed problem, we need a dataset that is large and balanced across the different citation classes. In fact, if a citation class is underrepresented in the dataset, its characteristics will need to be extracted from a smaller number of samples and the inference mechanism will provide sub-optimal results.</p><p>For the purpose of comparison, we have selected the BCN model (Biattentive Classification Network, <ref type="bibr" target="#b11">[12]</ref>) designed to handle sentence classification tasks. ELMo (Embeddings from Language Models, <ref type="bibr" target="#b14">[15]</ref>) is designed to extract word representations, and can be used to encode sentences to pass through classifiers. BCN complemented by ELMo is the current state of the art on fine-grained (five-class) sentiment classification (SST-5, Stanford Sentiment Treebank). It is one of the best available algorithms from the state of the art for inference in text understanding.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Method and experimental setup</head><p>The dataset that we use for the training and the evaluation of the BCN model is the one used in <ref type="bibr" target="#b7">[8]</ref>. This dataset has been fully annotated manually, which makes it particularly accurate to study the ability of an automatic classifier to imitate human performances. Table <ref type="table" target="#tab_0">1</ref> presents the six classes used for the labelling of citations.</p><p>In order to underline the citation act to classify, every in-text reference is replaced in turn by a marker ('[X]'). The so formatted sentences (one marker for each sentence) are passed through the neural network for inference. We used in this paper the BCN model implemented by the AllenNLP library <ref type="bibr" target="#b5">[6]</ref>, which is a high-level framework built on PyTorch <ref type="bibr" target="#b12">[13]</ref>.</p><p>The evaluation has been done using k-fold cross-validation, with k = 10, for the learning and testing of the network to provide statistically significant results. The original samples have been randomly partitioned into 10 equally sized subsamples. The learning has been performed on 9 subsamples and tested on the remaining one for each of the combinations. The reported results correspond to the average results over the 10 training sessions <ref type="foot" target="#foot_0">4</ref> . M icro − F 1 and M acro − F 1 scores are used to report the global efficiency of the network for each class, where M icro − F 1 stands for the weighted arithmetic average and M acro − F 1 stands for the non-weighted arithmetic average of the F 1 score for each class.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Results and discussion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.1">Global results</head><p>The selected deep learning and rule-based approaches performances are presented on table 2. Jurgens et al. <ref type="bibr" target="#b7">[8]</ref> only reports the M acro − F 1 metric as a base for comparison. Because of their rarity, accurate samples of significant size for such a study are difficult to acquire and this can be a major obstacle to clearly identifying the potential of deep learning approaches for citation categorization. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Results by class</head><p>Table <ref type="table" target="#tab_2">3</ref> presents the results of the F 1 score and the sample sizes for the different classes. One can note that some classes are more challenging to predict than others with the BCN model. The classes are highly imbalanced in the dataset. For this reason, we consider the M icro − average metric, which aggregates the contributions of all classes in an average value.</p><p>For each class, the reported efficiency clearly correlates with the size of the available data. While rule-based approaches become effective mainly thanks to expert knowledge, neural networks are completely dependant on samples for their learning. As a result, it is not surprising that the model performs poorly for small samples, as in the case of the class "FUTURE". Under-represented classes pull down the M acro − F 1 value, and sightly influence the M icro − F 1 value. On the other hand, the classifier performs well for larger samples, e.g. the classes "BACKGROUND" and "USES" with F 1 scores of 0.720 and 0.640. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conclusion</head><p>The paper describes the citation classification which is a central problem leading to many applications in bibliometrics. In this work, we are interested in studying deep learning abilities to capture the semantics of citations when compared with rule-based approaches. To do so, we compare two approaches from the recent state of the art.</p><p>We still can not define an upper bound for the application of deep learning approaches to citation classification because the experiment is based on a limited dataset compared to the datasets generally used in deep learning. New datasets need to be created to delineate more precisely the F 1 score that can be reached by such approaches. The results encourage the use of neural networks for the cases where large samples are available. In the cases when large samples are not available, it is clear that efforts invested into rule-based approaches prove reliable and can guarantee more accurate output.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Scheme used for the labelling of citations, extracted from<ref type="bibr" target="#b7">[8]</ref> </figDesc><table><row><cell>Class</cell><cell>Description</cell></row><row><cell>BACKGROUND</cell><cell>Provides relevant information for this domain.</cell></row><row><cell></cell><cell>e.g. "This is often referred to as incorporating deterministic</cell></row><row><cell></cell><cell>closure (Dörre, 1993)."</cell></row><row><cell>MOTIVATION</cell><cell>Illustrates need for data, goals, methods, etc.</cell></row><row><cell></cell><cell>e.g. "As shown in Meurers (1994), this is a well-motivated con-</cell></row><row><cell></cell><cell>vention [...]"</cell></row><row><cell>USES</cell><cell>Uses data, methods, etc.</cell></row><row><cell></cell><cell>e.g. "The head words can be automatically extracted [...] in the</cell></row><row><cell></cell><cell>manner described by Magerman (1994)."</cell></row><row><cell>EXTENSION</cell><cell>Extends data, methods, etc.</cell></row><row><cell></cell><cell>e.g. "[...] we improve a two-dimensional multimodal version of</cell></row><row><cell></cell><cell>LDA (Andrews et al, 2009) [...]</cell></row><row><cell cols="2">COMPARISON OR Expresses similarity/differences.</cell></row><row><cell>CONTRAST</cell><cell>e.g. "Other approaches use less deep linguistic resources (e.g.,</cell></row><row><cell></cell><cell>POS-tags Stymne (2008)) [...]"</cell></row><row><cell>FUTURE</cell><cell>Is a potential avenue for future work.</cell></row></table><note>e.g. "[...] but we plan to do so in the near future using the algorithm of Littlestone and Warmuth (1992)."</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Experimental results</figDesc><table><row><cell>Approach</cell><cell>M acro − F 1</cell><cell>M icro − F 1</cell></row><row><cell>BCN + ELMo (2018)</cell><cell>0.405</cell><cell>0.588</cell></row><row><cell>Jurgens et al. (2018)</cell><cell>0.530</cell><cell>-</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 .</head><label>3</label><figDesc>F1 score reported by class for BCM and ELMo</figDesc><table><row><cell>Class</cell><cell>F1 score</cell><cell>Sample size</cell></row><row><cell>BACKGROUND</cell><cell>0.720</cell><cell>1000</cell></row><row><cell>MOTIVATION</cell><cell>0.306</cell><cell>185</cell></row><row><cell>USES</cell><cell>0.640</cell><cell>823</cell></row><row><cell>EXTENSION</cell><cell>0.103</cell><cell>152</cell></row><row><cell>COMPARISON OR CONTRAST</cell><cell>0.570</cell><cell>857</cell></row><row><cell>FUTURE</cell><cell>0.093</cell><cell>70</cell></row><row><cell>M icro − average</cell><cell>0.588</cell><cell>3087</cell></row><row><cell>M acro − average</cell><cell>0.405</cell><cell>3087</cell></row><row><cell>Random</cell><cell>0.138</cell><cell>3087</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_0">The source code of the approach presented here is available on GitHub : https://github.com/jperier/BIR2019_citationBCN</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Purpose and polarity of citation: Towards nlp-based bibliometrics</title>
		<author>
			<persName><forename type="first">A</forename><surname>Abu-Jbara</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ezra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Radev</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The 2013 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies (NAACL-HLT 2013)</title>
				<meeting><address><addrLine>Westin Peachtree Plaza Hotel Atlanta, Georgia, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2013-06">Jun 2013</date>
			<biblScope unit="page" from="596" to="606" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Mining the context of citations in scientific publications</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">R</forename><surname>Aljohani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Nawaz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Maturity and Innovation in Digital Libraries: 20 th International Conference on Asia-Pacific Digital Libraries</title>
				<meeting><address><addrLine>Hamilton, New Zealand</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">Nov. 19-22. 2018</date>
			<biblScope unit="page">316</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Towards automatic topical question generation</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Chali</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Hasan</surname></persName>
		</author>
		<ptr target="http://aclweb.org/anthology/C12-1030" />
	</analytic>
	<monogr>
		<title level="m">The COLING 2012 Organizing Committee</title>
				<imprint>
			<date type="published" when="2012">2012. 2012</date>
			<biblScope unit="page" from="475" to="492" />
		</imprint>
	</monogr>
	<note>Proceedings of COLING</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Rule-based information extraction is dead! long live rule-based information extraction systems!</title>
		<author>
			<persName><forename type="first">L</forename><surname>Chiticariu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">R</forename><surname>Reiss</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Empirical Methods in Natural Language Processing (EMNLP</title>
				<meeting>Empirical Methods in Natural Language Processing (EMNLP<address><addrLine>Grand Hyatt SeattleSeattle, Washington, USA</addrLine></address></meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2013-10">2013. Oct 2013</date>
			<biblScope unit="page" from="827" to="832" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Evaluating citation functions in cito: Cognitive issues</title>
		<author>
			<persName><forename type="first">P</forename><surname>Ciancarini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Di Iorio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">G</forename><surname>Nuzzolese</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Peroni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Vitali</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The 11 th conference proceedings for Semantic Evaluation Challenge 2014 (ESWC2014) -The Semantic Web: Trends and Challenges</title>
				<editor>
			<persName><forename type="first">V</forename><surname>Presutti</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">C</forename><surname>Amato</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><surname>Gandon</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>D'aquin</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><surname>Staab</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">A</forename><surname>Tordai</surname></persName>
		</editor>
		<meeting><address><addrLine>Anissaras, Crete, Greece</addrLine></address></meeting>
		<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2014-05">May 2014</date>
			<biblScope unit="page" from="580" to="594" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Allennlp: A deep semantic natural language processing platform</title>
		<author>
			<persName><forename type="first">M</forename><surname>Gardner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Grus</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Neumann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Tafjord</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Dasigi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Peters</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Schmitz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Zettlemoyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename></persName>
		</author>
		<ptr target="http://aclweb.org/anthology/W18-2501" />
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="1" to="6" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Can citation indexing be automated ? In: Statistical association methods for mechanized documentation</title>
		<author>
			<persName><forename type="first">E</forename><surname>Garfield</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="s">symposium proceedings</title>
		<imprint>
			<biblScope unit="volume">269</biblScope>
			<biblScope unit="page" from="189" to="192" />
			<date type="published" when="1965">1965</date>
			<publisher>National Bureau of Standards</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Measuring the evolution of a scientific field through citation frames</title>
		<author>
			<persName><forename type="first">D</forename><surname>Jurgens</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Hoover</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Mcfarland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Jurafsky</surname></persName>
		</author>
		<ptr target="https://transacl.org/ojs/index.php/tacl/article/view/1266" />
	</analytic>
	<monogr>
		<title level="j">Transactions of the Association for Computational Linguistics</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="page" from="391" to="406" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Automated classification of author&apos;s sentiments in citation using machine learning techniques: A preliminary study</title>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">C</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">R</forename><surname>Thoma</surname></persName>
		</author>
		<idno type="DOI">10.1109/CIBCB.2015.7300319</idno>
		<ptr target="https://doi.org/10.1109/CIBCB.2015.7300319" />
	</analytic>
	<monogr>
		<title level="m">IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)</title>
				<meeting><address><addrLine>Niagara Falls, Canada</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2015-08">2015. Aug 2015</date>
			<biblScope unit="page" from="1" to="7" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Patterns in citation context: the case of the field of scientometrics</title>
		<author>
			<persName><forename type="first">W</forename><surname>Lamers</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">J</forename><surname>Van Eck</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Waltman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Hoos</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">23rd International Conference on Science and Technology Indicators (STI 2018)</title>
				<meeting><address><addrLine>Leiden, The Netherlands</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2018">September 12-14, 2018. 2018</date>
		</imprint>
		<respStmt>
			<orgName>Centre for Science and Technology Studies (CWTS)</orgName>
		</respStmt>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Improve sentiment analysis of citations with author modelling</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Ma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Nam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Weihe</surname></persName>
		</author>
		<idno type="DOI">10.18653/v1/W16-0420</idno>
		<ptr target="http://aclweb.org/anthology/W16-0420" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 7 th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</title>
				<meeting>the 7 th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis</meeting>
		<imprint>
			<publisher>Association for Computational Linguistics</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="122" to="127" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Learned in Translation: Contextualized Word Vectors</title>
		<author>
			<persName><forename type="first">B</forename><surname>Mccann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bradbury</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Xiong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Socher</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The Thirty-first Annual Conference on Neural Information Processing Systems (NIPS)</title>
				<imprint>
			<date type="published" when="2017-08">Aug 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Automatic differentiation in pytorch</title>
		<author>
			<persName><forename type="first">A</forename><surname>Paszke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Gross</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chintala</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Chanan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Devito</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Lin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Desmaison</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Antiga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Lerer</surname></persName>
		</author>
		<ptr target="https://openreview.net/pdf?id=BJJsrmfCZ" />
	</analytic>
	<monogr>
		<title level="m">The Thirty-first Annual Conference on Neural Information Processing Systems (NeurIPS). The Neural Information Processing Systems Foundation</title>
				<meeting><address><addrLine>Long Beach Convention Center, CA, USA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2017-12">Dec 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Fabio and cito: ontologies for describing bibliographic resources and citations</title>
		<author>
			<persName><forename type="first">S</forename><surname>Peroni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Shotton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Web Semantics: Science, Services and Agents on the World Wide Web</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="page" from="33" to="43" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Deep contextualized word representations</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">E</forename><surname>Peters</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Neumann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Iyyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Gardner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Clark</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zettlemoyer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">The 16 th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT</title>
				<imprint>
			<date type="published" when="2018">2018. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">A new approach for scientific citation classification using cue phrases</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">B</forename><surname>Pham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Hoffmann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Artificial Intelligence</title>
				<editor>
			<persName><forename type="first">T</forename><forename type="middle">T D</forename><surname>Gedeon</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">L</forename><forename type="middle">C C</forename><surname>Fung</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin Heidelberg; Berlin, Heidelberg</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2003">2003. 2003</date>
			<biblScope unit="page" from="759" to="771" />
		</imprint>
	</monogr>
	<note>AI</note>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Automatic classification of citation function</title>
		<author>
			<persName><forename type="first">S</forename><surname>Teufel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Siddharthan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Tidhar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Empirical Methods in Natural Language Processing (EMNLP</title>
				<meeting>Empirical Methods in Natural Language Processing (EMNLP</meeting>
		<imprint>
			<date type="published" when="2006">2006. 2006</date>
			<biblScope unit="page" from="103" to="110" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
