<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Improving Scientific Article Visibility by Neural Title Simplification</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Alexander</forename><surname>Shvets</surname></persName>
							<email>alexander.shvets@upf.edu</email>
							<affiliation key="aff0">
								<orgName type="institution">Universitat Pompeu Fabra</orgName>
								<address>
									<postCode>08018</postCode>
									<settlement>Barcelona</settlement>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Improving Scientific Article Visibility by Neural Title Simplification</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">1AE38894E6C805A4C57E39686DF9A1BC</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T20:03+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Scientific Text Summarization</term>
					<term>Machine Translation</term>
					<term>Recommender Systems</term>
					<term>Personalized Simplification</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The rapidly growing amount of data that scientific content providers should deliver to a user makes them create effective recommendation tools. A title of an article is often the only shown element to attract people's attention. We offer an approach to automatic generating titles with various levels of informativeness to benefit from different categories of users. Statistics from Re-searchGate used to bias train datasets and specially designed post-processing step applied to neural sequence-to-sequence models allow reaching the desired variety of simplified titles to gain a trade-off between the attractiveness and transparency of recommendation.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>The amount of information scientific society produces on a daily basis results in the necessity of researchers to have proper guidance in a digital space. The function of the virtual assistance is performed by various scientometric systems, research paper recommender systems <ref type="bibr" target="#b0">(Haruna et al., 2017)</ref> and different kinds of search engines. <ref type="bibr" target="#b1">(Shvets et al., 2015)</ref> summarizes the most common types of systems for scientometric analysis. The recent trend in scientific paper delivery is purpose specific webresources, blogs, and e-journals often coupled with email subscriptions. They often provide personalized recommendations based on users' behavior and preferences.</p><p>The recommendation usually has a form of imprint often limited only by a title (as in the case with email subscriptions associated with limited space and lack of time to attract people's attention). Eventually, the success of recommendation depends on the informativeness of the title of an article subject to user's intentions and acknowledgment with a certain scientific field. This denotes the necessity of finding a way of varying the title of the same paper for different categories of users.</p><p>The focus of this paper is in developing models for creating a variety of simplified versions of the titles of scientific articles which would be condensed and informative enough and at the same time would correspond to the original topic of a paper to maintain users' loyalty. We aim at supporting two scenarios of personalized simplification: the first ensuring narrow focus on specific scientific concepts for goal-oriented experts and the second providing a general overview for researchers working on the edge of a topic willing to expand their horizons. The second case should not be treated as a generation of clickbaits (catchy short misleading headings) that are to be blocked with the use of efficient machine learning approaches <ref type="bibr" target="#b2">(Biyani et al., 2016)</ref>.</p><p>There is a variety of algorithms that could be used for title simplification which is a rapidly growing research area <ref type="bibr" target="#b3">(Bouayad-Agha et al., 2009;</ref><ref type="bibr" target="#b4">Saggion et al., 2015;</ref><ref type="bibr" target="#b6">Guo et al., 2018)</ref>. As long as the defined task is similar to text compression and abstractive summarization we made a choice towards encoder-decoder neural architectures <ref type="bibr">(Nallapati et.al, 2016</ref><ref type="bibr">, Nikolov et.al, 2018)</ref>.</p><p>The remainder of the paper is structured as follows. In Section 2, we propose a method for scientific title diversification and simplification. Section 3 is devoted to describing the datasets used for training. Section 4 denotes the experiment setup. Section 5 provides results of numerical experiments. Section 6 is devoted to human evaluation. In Section 7, finally, we discuss results and outline future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Method</head><p>Recent advances in natural machine translation (NMT) incite to solve the task in a supervised manner controlling the style of a title by conditioning training data. The method we propose comprises the following steps: a) selecting a subset from an abstract-to-title dataset to impose conditions that would force a model to generate hypotheses with desirable properties; b) training a sequence-to-sequence (seq2seq) model; c) applying a model to title-to-title generation; d) performing post-processing step to remove unnecessarily repeated tokens; e) filtering titles with improper structure. The remainder of this section describes each step in details.</p><p>To create titles of different styles for various categories of researchers several datasets should be used. The set of highly popular scientific titles may help to generate attractive headings for users with interests peripheral to the subject of a paper. The condition to have a multi-word noun phrase NPmw in a target text is to avoid producing overly shortened pointless titles. In case each training example contains a reference text Rt and a target text Tt that have similar NPmw (at least 2 common terms), a model might learn to preserve the most important concepts from original titles needed by experts. Figure <ref type="figure" target="#fig_0">1</ref> shows the training example with similar NPmw-s in Rt and Tt.</p><p>Input sequence (an abstract, lower case, tokenized, truncated)</p><p>Target sequence (a title, tokenized, lower case) effects of order of presentation on conditional reasoning the main goal of this research is to study whether or not the order of presentation of the premises in a logical argument form , such as a conditional reasoning task , could affect In particular, we used the realization in OpenNMT toolkit <ref type="bibr" target="#b11">(Klein et al., 2018)</ref> enabling pointer that allows copying tokens from the reference text. The trained model is to be applied to new unseen titles, which are, in opposite to abstracts (cut-off after 50 tokens in our experiments), not truncated. Since the task differs from general NMT task and summarization task by the absence of need in tracking alignment, traditional coverage mechanism <ref type="bibr" target="#b12">(Wu et al., 2016)</ref>, that discourage repetitions, is not included not to impose potentially harmful restrictions and not to overcomplicate the model. Instead, we introduce the postprocessing step PS as follows. Firstly, each repetition of a term is removed leaving the only occurrence closest to the beginning of a text. Secondly, all the auxiliary tokens without required terms in between or after them are eliminated. In the end, we iteratively remove the last token in a text if it is an adjective or auxiliary token and, in addition, capitalize the title.</p><p>The last step consists in filtering improper titles, i.e., generated sequences that have less than two NPmw-s similar to some NPmw-s of the source title. In those use cases when even potentially pointless output is required, this step should be skipped.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Datasets</head><p>We chose ResearchGate<ref type="foot" target="#foot_0">1</ref> platform as a source of data. It has a recommender system and therefore openly counts the number of times a page with a paper was visited to provide reasonable recommendations that motivates authors to be more visible.</p><p>We selected 150K imprints of articles on various topics using a wide list of general scientific words <ref type="bibr" target="#b13">(Osipov et al., 2014)</ref> as an entry point to the articles. Figure <ref type="figure" target="#fig_2">2</ref>   To detect noun phrases we used Spacy chunker (Honnibal and Montani, 2017) that we elaborated for detecting complex phrases, which map single concepts (e.g., "vertex energy of a graph" that is a lexical variation of the concept "graph energy").</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Experiment Setup</head><p>Selecting the first 75 characters of the reference text is generally used as a baseline in summarization tasks; cf., e.g., <ref type="bibr" target="#b15">(Rush et al., 2015)</ref>. We added subsequent cut-off after the last noun in a phrase. This improved baseline is referred to as MBase henceforth. Several seq2seq models (M1, M2, …) with the above-described architectures differed by a number of layers were applied to various datasets to bias the style of output text. They were then extended with post-processing PS (M1ps, M2ps, …) and filtering steps, which are novel for the best knowledge of the author; cf. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Results</head><p>The most of the basic models performed reasonably: produced titles were in general shorter than original, multiple-word noun phrases from reference title covered a significant part of the generated title (NPdiff-p = 0.68 on average). However, some models, especially M5, introduced many repetitions (for all checkpoints): the BLEU value reflected it being equal to 0.18 for M5 while the average value for the rest of models was equal to 0.35. Since BLEU depends on a number of same word occurrences, the increase of it by 24% on average due to PS attests usefulness of the step (cf. Table <ref type="table" target="#tab_1">2</ref>). Filtering step allowed dropping less informative titles so that one can take advantage even of poor models reducing a risk to present misleading picking-eye headings or generic topics to an user (cf. Figure <ref type="figure">3</ref> for examples of generated texts).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Fig. 3. Filtering step</head><p>The extension of basic models led to an increase of NPdiff-p by 9% and rouge-L-f by 11% on average. Table <ref type="table" target="#tab_2">3</ref> gives an idea of the variation of titles of different models in style and in compression rate. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Uninformative titles</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Final titles</head><p>It is worth noting that 1-layer models M1 and M2 trained on conditioned datasets reached higher values for the majority of measures in comparison to models M3 and M4 fed with generic data. This highlights rationality in pre-directing the training.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Human Evaluation</head><p>For human evaluation, we selected five papers of the NLP research group (TALN UPF) with titles longer than 93 characters (10-18 words). Their authors who own Ph.D. degrees were asked to rank output titles for these papers including original title by preference on clicking if they saw a title briefly in a daily email digest. To face different decision criteria assessors worked with papers of their authorship (for simulating expert behavior) and with papers of their colleagues (expanding horizons use case). If some titles in a set were the same or assessors did not have any preference between two similar titles they were allowed to rank them equally. The top models sorted by the average rank and examples of titles from one set are listed in Table <ref type="table" target="#tab_3">4</ref>. Noted final increase of NPdiff-p and rouge-L-f indicates that common subsequences became longer in relation to the length of titles meaning that offered post-processing step with filtering plays an important role in forming a fluent text. At the same time, the output should not have been just one of the original subsequences, therefore, we did not aim at reaching too high precision values. Pure state-of-the-art seq2seq models without post-processing step got low ranks on human evaluation. The models M1ps and M2ps have a higher average rank of 6. Their titles are well-formed and represent a combination of original multi-word expressions (cf. Table <ref type="table" target="#tab_2">3</ref> for relatively high scores of rouge-2-r), however, less corresponding to the topic that is partly reflected by comparatively lower values of rouge-L-p. The outputs of the models M3ps and M5ps were often preferred to original titles. Having 1.3 times shorter titles than M5ps, conditionally trained M6ps achieved almost the same average score. The baseline has the highest rank since it often better preserves the meaning although does not always form a complete phrase. The main drawback is that it usually only generalize a title to some extent (in case of well-turned subsequence) and miss details experts might need.</p><p>The close average ranks of models and rouge-L-f on the same level for all models denote an opportunity to overcome the general problem of lacking the variability in neural seq2seq generation. Different title styles give a possibility to reach a preferable trade-off between the conciseness of the title and its transparency.</p><p>For future work, we plan to gain value from methods of paraphrasing <ref type="bibr" target="#b18">(Cao et al., 2017)</ref>, advanced simplification (Zhang and Lapata, 2017; Štajner and Saggion, 2018) and surface realization for deep input representations <ref type="bibr" target="#b21">(Belz et al., 2018)</ref> to obtain diverse semantically close outputs differ from text reformulated with mostly the same words. Fake-paper detecting (Byrne and Labbé, 2017) and assessing the quality of scientific texts <ref type="bibr" target="#b23">(Shvets, 2015)</ref> will help to avoid training the models on misleading titles. Finally, pre-existing taxonomies (e.g., JEL codes in Economics, the ACM taxonomy in Computer Science, the Web of Science categories attached to journals), and meta information of papers such as authors' keywords or KeywordsPlus items inferred from the references cited <ref type="bibr" target="#b24">(Garfield and Sher, 1993)</ref> are to be used for preselecting the most relevant concepts to bias the training.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Training example with similar noun phrases in reference and target text</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>shows the correlation between the number of paper views Nv and the title lengths Lt (in characters) in the collection. The top-viewed articles along negative correlation formed the desired set of highly popular titles. The whole pool of imprints formed a generic dataset. Random split for training and validation (93/7) was carried out. The set of 1000 imprints with Nv = 1 and Lt &gt; 100 was used for testing the models.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Dynamics for titles in a professional social network ResearchGate (Nv &gt; 5, Lt &gt; 20).The texts were pre-processed on the fly applying language detection with langid.py 2 and sentence detection with tokenization from NLTK 3 . Cleaning of training and vali-</figDesc><graphic coords="3,124.68,492.36,345.96,116.40" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Table 1 for details. Distinctive details of basic and extended models</figDesc><table><row><cell cols="2">Model #layers</cell><cell>Dataset</cell></row><row><cell>M1 / M1ps</cell><cell>1</cell><cell>conditioned (Rt and Tt have at least 2 pairs of similar NPmw), 11K</cell></row><row><cell>M2 / M2ps</cell><cell>1</cell><cell>strongly conditioned (Rt and Tt have at least 1 pair of equal NPmw), 5.5K</cell></row><row><cell>M3 / M3ps</cell><cell>1</cell><cell>weakly conditioned (Rt and Tt have a common term), 66K</cell></row><row><cell>M4 / M4ps</cell><cell>1</cell><cell>top-views weakly conditioned (Rt and Tt have a common term), 18K</cell></row><row><cell>M5 / M5ps</cell><cell>2</cell><cell>weakly conditioned (Rt and Tt have a common term), 66K</cell></row><row><cell>M6 / M6ps</cell><cell>2</cell><cell>conditioned (Rt</cell></row></table><note>and Tt have at least 2 pairs of similar NPmw), 11KFor the final model assessment, we used measures BLEU<ref type="bibr" target="#b16">(Papineni et al., 2002)</ref>, ROUGE-1, ROUGE-2, ROUGE-L<ref type="bibr" target="#b17">(Lin, 2004)</ref>, and specially designed NPdiff-p, i.e., NPmw-based precision evaluated as rouge-L-p but considering only one occurrence of similar NPmw-s in a hypothesis.The intermediate models created at checkpoints during the training were assessed and the best by NPdiff-p were selected as resulting.</note></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Improvement of a title by post-processing step PS</figDesc><table><row><cell>Original title</cell><cell>A Study on Knowledge Management System for Knowledge Competitive-</cell></row><row><cell>(reference)</cell><cell>ness with One Stop Knowledge Service</cell></row><row><cell>Initial hypoth-esis before PS</cell><cell>knowledge management system for knowledge competitiveness with one stop knowledge service with one stop service with one stop service with one stop service with one stop service with one stop service with…</cell></row><row><cell cols="2">Resulting title Knowledge Management System for Competitiveness with One Stop Service</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 .</head><label>3</label><figDesc>ROUGE measures for inspected models</figDesc><table><row><cell></cell><cell></cell><cell cols="5">Addiction and the New Black?</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell cols="2">picking-eye</cell><cell></cell><cell cols="3">The Romans Know? Spain: a Focus</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell cols="8">Active Learning for Biomedical Data Classification</cell></row><row><cell></cell><cell></cell><cell></cell><cell cols="8">Access to Specialist Medical Services: a Pilot Study</cell></row><row><cell></cell><cell cols="3">generic topics</cell><cell cols="3">Consumer Loyalty Financial Cooperation</cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>model</cell><cell>rouge -1-r</cell><cell>rouge -1-p</cell><cell cols="2">rouge -1-f</cell><cell>rouge -2-r</cell><cell>rouge -2-p</cell><cell>rouge -2-f</cell><cell>rouge -L-r</cell><cell>rouge -L-p</cell><cell>rouge -L-f</cell><cell>rouge-L-f (basic Mn)</cell></row><row><cell>MBase</cell><cell cols="7">0.60 1.00 0.74 0.54 1.00 0.69</cell><cell>0.60</cell><cell cols="2">1.00 0.66</cell><cell>0.64</cell></row><row><cell>M1ps+F</cell><cell cols="7">0.59 0.99 0.73 0.41 0.76 0.52</cell><cell>0.52</cell><cell cols="2">0.88 0.57</cell><cell>0.55</cell></row><row><cell>M2ps+F</cell><cell cols="7">0.58 0.98 0.72 0.42 0.78 0.53</cell><cell>0.53</cell><cell cols="2">0.89 0.58</cell><cell>0.53</cell></row><row><cell>M3ps+F</cell><cell cols="7">0.50 0.99 0.65 0.36 0.83 0.49</cell><cell>0.48</cell><cell cols="2">0.95 0.53</cell><cell>0.43</cell></row><row><cell>M4ps+F</cell><cell cols="7">0.52 1.00 0.67 0.34 0.75 0.46</cell><cell>0.47</cell><cell cols="2">0.89 0.52</cell><cell>0.43</cell></row><row><cell>M5ps+F</cell><cell cols="7">0.65 0.99 0.77 0.51 0.84 0.62</cell><cell>0.62</cell><cell cols="2">0.94 0.67</cell><cell>0.64</cell></row><row><cell>M6ps+F</cell><cell cols="7">0.50 1.00 0.65 0.38 0.89 0.52</cell><cell>0.48</cell><cell cols="2">0.96 0.52</cell><cell>0.48</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4 .</head><label>4</label><figDesc>Top models according to the average rank given by assessors</figDesc><table><row><cell>Model</cell><cell>Final Title</cell><cell>RAVG</cell></row><row><cell>MBase</cell><cell cols="2">Multisensor: Development of Multimedia Content Integration Technologies 1.9</cell></row><row><cell>M3ps</cell><cell>Multimedia Content Integration Technologies for Journalism</cell><cell>3.7</cell></row><row><cell>M5ps</cell><cell>Development of Multimedia Content Integration for Journalism, Media and International Exporting and Decision Support</cell><cell>4.2</cell></row><row><cell>MOrig</cell><cell>Multisensor: Development of multimedia content integration technologies for journalism, media monitoring and international exporting decision support</cell><cell>4.3</cell></row><row><cell>M6ps</cell><cell>Multimedia Content Integration Technologies for Journalism, Media</cell><cell>4.4</cell></row><row><cell>M4ps</cell><cell>Multimedia Content Integration for Journalism</cell><cell>5.7</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>7 Discussion and Future Work</head><label></label><figDesc></figDesc><table /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://www.researchgate.net/ BIR</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2019" xml:id="foot_1">Workshop on Bibliometric-enhanced Information Retrieval</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_2">https://github.com/saffsd/langid.py</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_3">https://www.nltk.org</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>The presented work was supported by the European Commission under the contract numbers H2020-700024-RIA, H2020-700475-IA, H2020-779962-RIA, H2020-786731-RIA, and H2020-825079-RIA and by the Russian Foundation for Basic Research under the contract number 18-37-00198. Many thanks to the four anonymous reviewers for their valuable comments, and to the five postdoctoral researchers for their high responsiveness in the evaluation and insightful feedback.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">A collaborative approach for research paper recommender system</title>
		<author>
			<persName><forename type="first">K</forename><surname>Haruna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Ismail</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Damiasih</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sutopo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Herawan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PloS one</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<biblScope unit="issue">10</biblScope>
			<biblScope unit="page">e0184516</biblScope>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Detection of current research directions based on full-text clustering</title>
		<author>
			<persName><forename type="first">A</forename><surname>Shvets</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Devyatkin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Sochenkov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Tikhomirov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Popov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Yarygin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Science and Information Conference (SAI)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2015">2015. 2015</date>
			<biblScope unit="page" from="483" to="488" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Detecting Clickbaits in News Streams Using Article Informality</title>
		<author>
			<persName><forename type="first">P</forename><surname>Biyani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Tsioutsiouliklis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Blackmer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Thirtieth AAAI Conference on Artificial Intelligence</title>
				<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
	<note>8 Amazing Secrets for Getting More Clicks</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Simplification of patent claim sentences for their paraphrasing and summarization</title>
		<author>
			<persName><forename type="first">N</forename><surname>Bouayad-Agha</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Casamayor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Ferraro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Wanner</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">22 nd FLAIRS Conference</title>
				<imprint>
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Making it simplext: Implementation and evaluation of a text simplification system for spanish</title>
		<author>
			<persName><forename type="first">H</forename><surname>Saggion</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Štajner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Bott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mille</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Rello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Drndarevic</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Transactions on Accessible Computing (TACCESS)</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page">14</biblScope>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Abstractive text summarization using sequence-to-sequence rnns and beyond</title>
		<author>
			<persName><forename type="first">R</forename><surname>Nallapati</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Gulcehre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Xiang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">CoNLL</title>
		<imprint>
			<biblScope unit="page">280</biblScope>
			<date type="published" when="2016">2016. 2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Dynamic Multi-Level Multi-Task Learning for Sentence Simplification</title>
		<author>
			<persName><forename type="first">H</forename><surname>Guo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Pasunuru</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bansal</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 27th International Conference on Computational Linguistics</title>
				<meeting>the 27th International Conference on Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="462" to="476" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Data-driven Summarization of Scientific Articles</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">I</forename><surname>Nikolov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Pfeiffer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">H</forename><surname>Hahnloser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 7th International Workshop on Mining Scientific Publications</title>
				<meeting>of the 7th International Workshop on Mining Scientific Publications</meeting>
		<imprint>
			<publisher>LREC</publisher>
			<date type="published" when="2018">2018. 2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Effective approaches to attention-based neural machine translation</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Luong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Pham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">D</forename><surname>Manning</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing</title>
				<meeting>the 2015 Conference on Empirical Methods in Natural Language Processing</meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="1412" to="1421" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Incorporating copying mechanism in sequence-tosequence learning</title>
		<author>
			<persName><forename type="first">J</forename><surname>Gu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Lu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">O</forename><surname>Li</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics</title>
				<meeting>the 54th Annual Meeting of the Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="1631" to="1640" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Get to the point: Summarization with pointer-generator networks</title>
		<author>
			<persName><forename type="first">A</forename><surname>See</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">J</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">D</forename><surname>Manning</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics</title>
				<meeting>the 55th Annual Meeting of the Association for Computational Linguistics</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="1073" to="1083" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">OpenNMT: Neural Machine Translation Toolkit</title>
		<author>
			<persName><forename type="first">G</forename><surname>Klein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Deng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Nguyen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Senellart</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Rush</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 13th Conference of the Association for Machine Translation in the Americas</title>
				<meeting>the 13th Conference of the Association for Machine Translation in the Americas</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="177" to="184" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<monogr>
		<title level="m" type="main">Google&apos;s neural machine translation system: Bridging the gap between human and machine translation</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Schuster</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><forename type="middle">V</forename><surname>Le</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Norouzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Macherey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">.</forename><forename type="middle">.</forename><surname>Klingner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename></persName>
		</author>
		<idno type="arXiv">arXiv:1609.08144</idno>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Information retrieval for R&amp;D support</title>
		<author>
			<persName><forename type="first">G</forename><surname>Osipov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Smirnov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Tikhomirov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Sochenkov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Shelmanov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Shvets</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Professional search in the modern world</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2014">2014</date>
			<biblScope unit="volume">8830</biblScope>
			<biblScope unit="page" from="45" to="69" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Honnibal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Montani</surname></persName>
		</author>
		<title level="m">spacy 2: Natural language understanding with bloom embeddings, convolutional neural networks and incremental parsing</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Rush</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Chopra</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Weston</surname></persName>
		</author>
		<title level="m">Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing</title>
				<meeting>the 2015 Conference on Empirical Methods in Natural Language Processing</meeting>
		<imprint>
			<date type="published" when="2015">2015</date>
			<biblScope unit="page" from="379" to="389" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">BLEU: a method for automatic evaluation of machine translation</title>
		<author>
			<persName><forename type="first">K</forename><surname>Papineni</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Roukos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Ward</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">J</forename><surname>Zhu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 40th annual meeting on association for computational linguistics</title>
				<meeting>the 40th annual meeting on association for computational linguistics</meeting>
		<imprint>
			<date type="published" when="2002">2002</date>
			<biblScope unit="page" from="311" to="318" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Rouge: A package for automatic evaluation of summaries</title>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">Y</forename><surname>Lin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Text Summarization Branches Out</title>
				<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Joint copying and restricted generation for paraphrase</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Cao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Luo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Li</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Thirty-First AAAI Conference on Artificial Intelligence</title>
				<imprint>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Sentence Simplification with Deep Reinforcement Learning</title>
		<author>
			<persName><forename type="first">X</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lapata</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing</title>
				<meeting>the 2017 Conference on Empirical Methods in Natural Language Processing</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="584" to="594" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Data-Driven Text Simplification</title>
		<author>
			<persName><forename type="first">S</forename><surname>Štajner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Saggion</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 27th International Conference on Computational Linguistics: Tutorial Abstracts</title>
				<meeting>the 27th International Conference on Computational Linguistics: Tutorial Abstracts</meeting>
		<imprint>
			<date type="published" when="2018">2018</date>
			<biblScope unit="page" from="19" to="23" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Belz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Bohnet</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Pitler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Wanner</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Mille</surname></persName>
		</author>
		<title level="m">The First Multilingual Surface Realisation Shared Task (SR&apos;18): Overview and Evaluation Results</title>
				<imprint>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Striking similarities between publications from China describing single gene knockdown experiments in human cancer cell lines</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Byrne</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Labbé</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Scientometrics</title>
		<imprint>
			<biblScope unit="volume">110</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="1471" to="1493" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">A Method of Automatic Detection of Pseudoscientific Publications</title>
		<author>
			<persName><forename type="first">A</forename><surname>Shvets</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Intelligent Systems&apos;</title>
				<imprint>
			<date type="published" when="2014">2014. 2015</date>
			<biblScope unit="volume">323</biblScope>
			<biblScope unit="page" from="533" to="539" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Key Words Plus [TM]-Algorithmic Derivative Indexing</title>
		<author>
			<persName><forename type="first">E</forename><surname>Garfield</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">H</forename><surname>Sher</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal-American Society For Information Science</title>
		<imprint>
			<biblScope unit="volume">44</biblScope>
			<biblScope unit="page" from="298" to="298" />
			<date type="published" when="1993">1993</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
