<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Data Credit Distribution through Lineage ⋆</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Dennis</forename><surname>Dosso</surname></persName>
							<email>dosso@dei.unipd.it</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Information Engineering</orgName>
								<orgName type="institution">University of Padua</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Gianmaria</forename><surname>Silvello</surname></persName>
							<email>silvello@dei.unipd.it</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Information Engineering</orgName>
								<orgName type="institution">University of Padua</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Data Credit Distribution through Lineage ⋆</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">0FE08D99D7A0623DDB3FA615F9E87E0C</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T16:13+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Data Citation</term>
					<term>Data Credit</term>
					<term>Data Provenance</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Data are a fundamental asset in the current world of research. Data citation is becoming more common and supported by research databases, but it still presents many research challenges. This paper describes Data Credit, a new measure of value for data derived from data citation, that enables us to annotate databases with real values representing their importance. Credit, computed through the citations, can be used alongside them to better understand the importance of data. We introduce the task of Data Credit Distribution, the process by which credit produced by a citation is and assigned to the data in a database responsible for producing the output information being cited. We describe how this process can be performed and, through experiments, we show that credit can serve, among other things, to highlight "hotspots" in the database.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>It is widely accepted that citations are the "currency" of the scientific world, a fundamental method to perform dissemination of knowledge and foster scientific development <ref type="bibr" target="#b21">[22]</ref>. Scientific databases, "populated and updated with a great deal of human effort" <ref type="bibr" target="#b3">[4]</ref>, are numerous and at the core of the scientific research <ref type="bibr" target="#b4">[5]</ref>. It is globally accepted that data must be cited and citable <ref type="bibr" target="#b17">[18,</ref><ref type="bibr" target="#b6">7,</ref><ref type="bibr" target="#b9">10]</ref>.</p><p>Data citations should be, among other things, counted alongside traditional citations and contribute to bibliometrics indicators to reward scientific database curators for their effort <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b19">20]</ref>. Data citation is often considered in the current literature as a driving force to "facilitate giving scholars credit" <ref type="bibr" target="#b18">[19]</ref>. One of its central aspects is how to attribute credit to data creators and curators <ref type="bibr" target="#b5">[6]</ref>. Many data creators and curators still do not receive any form of reward for their work; this fosters the growth of detrimental phenomenons like the "reward dilemma", the fear from researchers to share their data, losing their competitive advantage without proper recognition of their work <ref type="bibr" target="#b13">[14]</ref>.</p><p>How to handle and count the credit generated by data citations and how it contributes to traditional and new bibliometrics are long-standing research issues <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b1">2]</ref>. However, even when correctly applied, data citations and the related bibliometrics do not always accurately reward data. Indeed, a query often uses more data than the one present in its output result set. The data being used but not visualized do not receive a citation, nor do their contributors.</p><p>To overcome this limitation, in recent years, the idea of crediting data emerged in the academic discussion through the concept of data credit, a real positive value describing the importance of data in a given context. We argue that credit can be used to address some of the limitations highlighted above. Credit is not atomic like a citation. Once computed, it can be divided into portions and assigned to all the data used by a query. Credit can be used as an annotation set at different granularity levels within a database to describe their importance.</p><p>In this work, we discuss the problem of data credit distribution, the issuance of credit generated by some query Q on a relational database instance I to the data in I responsible for the generation of Q(I). In particular, we discuss how the distribution is possible in relational databases through lineage, a form of data provenance <ref type="bibr" target="#b8">[9]</ref>. While data citation and credit distribution are not limited to relational databases, they are a good test bed for this first approach. In Section 2, we report the related work; Section 3 presents the methods used and the experimental results carried on a real scientific database, GtoPdb; Section 4 contains the conclusions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Related Work</head><p>Kats in <ref type="bibr" target="#b16">[17]</ref> suggests the need for a modified citation system that includes the idea of transient and fractional credit. Credit is defined as a "quantity" representing the importance of a research entity (a paper, software or data) mentioned in a citation, but these ideas are proposed without any formalism.</p><p>Fang in <ref type="bibr" target="#b12">[13]</ref> presents a framework to distribute credit generated by a paper to its authors and to the papers in its reference list in a transitive way. Each cited paper's quantity of credit depends on its impact/role in the citing paper. This theoretical framework works for a graph composed of only papers, but it can be extended to another graph model that includes data.</p><p>Zeng et al. in <ref type="bibr" target="#b20">[21]</ref> proposed the first method designed to compute credit within a network of papers citing data. This is the first step towards an automatic credit computation procedure. However, it is limited to assigning credit to the whole dataset without considering variable data granularity. Therefore, this is not a way to assign credit to a single research entity within a dataset.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Methods and Experiments</head><p>Methods. Data Credit is a non-negative real value representing the importance of data in a specific context. It can be computed with different strategies and rationales. In this paper's context, we consider credit as the product of a data citation; therefore, it is a quantity representing the importance of the data being cited in the citing paper. Ideally, the higher the impact of the cited data in the citing paper, the bigger the credit.</p><p>The task of Data Credit Distribution (DCD) consists of dividing this credit into portions and assigning it to the recipients in a database responsible for generating the cited data. Formally: Definition 1. Data Credit Distribution at tuple level (DCD) <ref type="bibr" target="#b11">[12]</ref> Given a database instance I, a query Q over I and the value k ∈ R &gt;0 , DCD is defined as the computation of the function</p><formula xml:id="formula_0">f I,Q : T upleLoc × R &gt;0 → R ≥0 such that f I,Q (t, k) = h where 0 ≤ h ≤ k and t∈T upleLoc f I,Q (t, k) = k.</formula><p>f is the Distribution Strategy (DS), it aims to annotate each tuple (thus we speak of DCD at tuple level) in I with a portion of the credit. Its only requirement is that it has to be conservative: no credit is generated or lost during the distribution. A DS can be defined in many different ways, but what we may prefer is a function that distributes credit coherently with the role of the input tuples as defined by Q. That is, only tuples that had some role in generating Q(I) should receive credit.</p><p>To do so, we propose one definition of DS that exploits the concept of lineage <ref type="bibr" target="#b10">[11]</ref>. Given a tuple t ∈ Q(I), its lineage is the set of all and only the tuples that have a role, whatever it is, in the generation of t. Definition 2. Lineage-based Distribution Strategy <ref type="bibr" target="#b11">[12]</ref> Let I be a database instance, Q a query over I, o ∈ Q(I) an output tuple and k the credit associated to o. Let L be the lineage of o and t be a generic tuple in I. t receives a credit equal to:</p><formula xml:id="formula_1">f I,Q (t, k) = 0 if t / ∈ L k |L| if t ∈ L</formula><p>As we see, this DS equally rewards the tuples of the lineage of a tuple. To perform the whole distribution on Q(I), it is simply necessary to apply this DS to each tuple o ∈ Q(I).</p><p>Evaluation. We considered the IUPHAR/BPS Guide to Pharmacology (GtoPdb) <ref type="bibr" target="#b15">[16]</ref>, a famous and highly cited medical database containing information about drugs, targets, and ligands. GtoPdb is maintained and curated by a consortium of 512 scientists collaborating with in-house curators, distributed in committees <ref type="bibr" target="#b2">[3]</ref>.</p><p>GtoPdb is relational in nature, and its information is also organized into webpages describing specific diseases, receptors, ligands, and families of these elements.</p><p>To gather data citations, we considered papers published in the British Journal of Pharmacology (BJP) that cite <ref type="bibr" target="#b15">[16]</ref>. <ref type="bibr" target="#b15">[16]</ref> is a recent version of a series of papers that the GtoPdb consortium releases every two years to describe the database and its evolutions. It works as a data journal that can be cited in place of the whole database <ref type="bibr" target="#b7">[8]</ref>. The papers published in BJP that refer to specific webpages of GtoPdb report the URL of the referenced page. It is possible from these URLs to reverse-engineer the SQL queries that compute the data contained in the webpages. A webpage is composed by different parts, each part created with data extracted from the GtoPdb through SQL queries. We use these queries to perform DCD. We focused only on queries referring to the so-called target families <ref type="foot" target="#foot_0">1</ref> .</p><p>Without any loss of generality, we assumed that each tuple present in the output of these queries contains credit equal to 1, and we performed credit distribution through lineage using these queries that we inferred from the BJP papers. We used the ∼900 BJP papers citing <ref type="bibr" target="#b15">[16]</ref> as of October 2020, and we extracted from them more than 1200 SQL queries to families of receptors.</p><p>The results of the distribution on the family table of GtoPdb, that contain information about the target families, are shown in the heat-map of Figure <ref type="figure" target="#fig_0">1</ref>. Each cell in the map is a tuple, and the intensity of the color represents the assigned quantity of credit. Interestingly, few tuples receive almost all the credit, following a Pareto distribution. This shows how credit distribution can highlight "hotspots", elements in the database that receive high values of credit. These are tuples that are used frequently by queries. Interestingly, these may also be tuples that are used but not visualized in the final output. This means that credit allows to rewards parts of the database that are used but not visualized, overcoming a limitation of traditional citations.</p><p>To better see how credit differs from traditional citations, consider Figure <ref type="figure" target="#fig_1">2</ref>. We reported two radar plots, presenting the top 10 authors citation-wise and credit-wise (values normalized between 0 and 1, and the authors were substituted with numbers for privacy reasons). To compute the citations, we proceeded as follows: each time a query identifies data curated by an author, that author receives one citation and equally shares the credit assigned to that data with the other co-authors of that data. As we see from Figure <ref type="figure" target="#fig_1">2</ref>.a, the top 10 authors, citation-wise, do not have the highest values of credit. Similarly, in Figure <ref type="figure" target="#fig_1">2</ref>.b, the authors with the higher values of credit do not also have the highest citation count. This shows that credit can reward authors whose data have a high impact in the research community, i.e., those data generated the highest quantity of credit, even if they received fewer citations than other authors. That is, specific citations are "more valuable", credit-wise. Since we assumed that each output tuple carries credit 1, the queries that return outputs with more tuples also generate more credit. In more complex scenarios, where different and more sophisticated techniques may be used to decide how to generate quantities of credit, credit distribution can help to understand how data and their corresponding authors impact the scientific environment.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusions</head><p>We showed how credit can highlight parts of the database that cover certain topics instead of others, as defined by queries. Credit and citations are correlated measures, but credit offers a new perspective to evaluate the impact of both data and curators. It can highlight parts of the database related to certain query topics, so-called "hotspots". It directly rewards the tuples, and corresponding authors, that contributed to the production of cited data, even those that are not in the output itself. Moreover, it proportionately rewards data and curators based on their impact in the context defined by the issued queries. This helps to reward authors that would otherwise remain unnoticed. In future works, credit can become the basis for new bibliometrics and applications based on its presence. For example, data pricing, that is the identification of the price of certain data in a database based on how much they are used by queries.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Heat-map of the distribution of credit to the family table. Each cell represents a tuple in the table.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Radar plots showing the top-10 authors of GtoPdb citation (a) and credit (b) with their normalized values of credit and citations.</figDesc><graphic coords="5,199.80,143.90,77.95,58.54" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">https://www.guidetopharmacology.org/targets.jsp</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>This work is partially supported by the ExaMode project, as part of the European Union Horizon 2020 program under Grant Agreement no. 825292.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Measuring the Value of Research Data: A Citation Analysis of Oceanographic Data Sets</title>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">W</forename><surname>Belter</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PLoS ONE</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page">e92590</biblScope>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Data Citation as a Bibliometric Oxymoron</title>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">L</forename><surname>Borgman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Theories of Informetrics and Scholarly Communication</title>
				<editor>
			<persName><forename type="first">C</forename><forename type="middle">R</forename><surname>Sugimoto</surname></persName>
		</editor>
		<imprint>
			<publisher>De Gruyter Mouton</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="93" to="116" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">How to cite curated databases and how to make them citable</title>
		<author>
			<persName><forename type="first">P</forename><surname>Buneman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">18th International Conference on Scientific and Statistical Database Management</title>
				<imprint>
			<publisher>IEEE Computer Society</publisher>
			<date type="published" when="2006">2006</date>
			<biblScope unit="page" from="195" to="203" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Curated Databases</title>
		<author>
			<persName><forename type="first">P</forename><surname>Buneman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Cheney</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">C</forename><surname>Tan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Vansummeren</surname></persName>
		</author>
		<idno type="DOI">10.1145/1376916.1376918</idno>
		<ptr target="https://doi.org/10.1145/1376916.1376918" />
	</analytic>
	<monogr>
		<title level="m">Proc. of the 27th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems</title>
				<meeting>of the 27th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems<address><addrLine>PODS</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2008">2008. 2008</date>
			<biblScope unit="page" from="1" to="12" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Why data citation is a computational problem</title>
		<author>
			<persName><forename type="first">P</forename><surname>Buneman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">B</forename><surname>Davidson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Frew</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Commun. ACM</title>
		<imprint>
			<biblScope unit="volume">59</biblScope>
			<biblScope unit="issue">9</biblScope>
			<biblScope unit="page" from="50" to="57" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Why data citation isn&apos;t working, and what to do about it</title>
		<author>
			<persName><forename type="first">P</forename><surname>Buneman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Christie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Davies</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Dimitrellou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">D</forename><surname>Harding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Pawson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">L</forename><surname>Sharman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Wu</surname></persName>
		</author>
		<idno type="DOI">10.1093/databa/baaa022</idno>
		<ptr target="https://doi.org/10.1093/databa/baaa022" />
	</analytic>
	<monogr>
		<title level="j">Database J. Biol. Databases Curation</title>
		<imprint>
			<date type="published" when="2020">2020. 2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Making Data a First Class Scientific Output: Data Citation and Publication by NERC&apos;s Environmental Data Centres</title>
		<author>
			<persName><forename type="first">S</forename><surname>Callaghan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Donegan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Pepler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Thorley</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Cunningham</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Kirsch</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Ault</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Bell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Bowie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Leadbetter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">K</forename><surname>Lowry</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Moncoiffé</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Harrison</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Smith-Haddon</surname></persName>
		</author>
		<author>
			<persName><surname>Weatherby</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Wright</surname></persName>
		</author>
		<idno type="DOI">10.2218/ijdc.v7i1.218</idno>
		<ptr target="http://dx.doi.org/10.2218/ijdc.v7i1.218" />
	</analytic>
	<monogr>
		<title level="j">International Journal of Digital Curation</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="107" to="113" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Data Journals: A Survey</title>
		<author>
			<persName><forename type="first">L</forename><surname>Candela</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Castelli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Manghi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Tani</surname></persName>
		</author>
		<idno type="DOI">10.1002/asi.23358</idno>
		<ptr target="http://dx.doi.org/10.1002/asi.23358" />
	</analytic>
	<monogr>
		<title level="j">Journal of the Association for Information Science and Technology</title>
		<imprint>
			<biblScope unit="volume">66</biblScope>
			<biblScope unit="issue">9</biblScope>
			<biblScope unit="page" from="1747" to="1762" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Provenance in databases: Why, how, and where</title>
		<author>
			<persName><forename type="first">J</forename><surname>Cheney</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Chiticariu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Tan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Foundations and Trends in Databases</title>
		<imprint>
			<biblScope unit="volume">1</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="379" to="474" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">CODATA-ICSTI Task Group on Data Citation Standards and Practices: Out of Cite, Out of Mind: The Current State of Practice</title>
		<idno type="DOI">10.2481/dsj.OSOM13-043</idno>
		<ptr target="https://doi.org/http://doi.org/10.2481/dsj.OSOM13-043" />
	</analytic>
	<monogr>
		<title level="j">Policy, and Technology for the Citation of Data</title>
		<imprint>
			<biblScope unit="volume">12</biblScope>
			<date type="published" when="2013-09">September 2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Tracing the lineage of view data in a warehousing ACM Trans</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Cui</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Widom</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">L</forename><surname>Wiener</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Database Syst</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="179" to="227" />
			<date type="published" when="2000">2000</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Data credit distribution: A new method to estimate databases impact</title>
		<author>
			<persName><forename type="first">D</forename><surname>Dosso</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Silvello</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Informetrics</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page">101080</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">A discussion of citations from the perspective of the contribution of the cited paper to the citing paper</title>
		<author>
			<persName><forename type="first">H</forename><surname>Fang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">JASIST</title>
		<imprint>
			<biblScope unit="volume">69</biblScope>
			<biblScope unit="issue">12</biblScope>
			<biblScope unit="page" from="1513" to="1520" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">Sharing research data</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">E</forename><surname>Fienberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">E</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Straf</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1985">1985</date>
			<publisher>National Academy Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Journal impact factor: a brief review</title>
		<author>
			<persName><forename type="first">E</forename><surname>Garfield</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Can. Med. Assoc</title>
		<imprint>
			<biblScope unit="page" from="979" to="980" />
			<date type="published" when="1999">1999</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Nc-Iuphar: The IUPHAR/BPS guide to PHARMACOLOGY in 2018: updates and expansion to encompass the new guide to IMMUNOPHARMACOLOGY</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">D</forename><surname>Harding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">L</forename><surname>Sharman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Faccenda</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Southan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J</forename><surname>Pawson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ireland</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">J G</forename><surname>Gray</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Bruce</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">P H</forename><surname>Alexander</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Anderton</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Bryant</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">P</forename><surname>Davenport</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Doerig</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Fabbro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Levi-Schaffer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Spedding</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">A</forename><surname>Davies</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nucleic Acids Research</title>
		<imprint>
			<biblScope unit="volume">46</biblScope>
			<biblScope unit="page" from="D1091" to="D1106" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Transitive credit as a means to address social and technological concerns stemming from citation and attribution of digital products</title>
		<author>
			<persName><forename type="first">D</forename><surname>Katz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Open Research Software</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="issue">1</biblScope>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Citation and Peer Review of Data: Moving Towards Formal Data Publication</title>
		<author>
			<persName><forename type="first">B</forename><surname>Lawrence</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Jones</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Matthews</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Pepler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Callaghan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Digital Curation</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="4" to="37" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<author>
			<persName><forename type="first">M</forename><surname>Martone</surname></persName>
		</author>
		<idno type="DOI">10.25490/a97f-egyk</idno>
		<ptr target="https://www.force11.org/datacitationprinciples" />
		<title level="m">Joint declaration of data citation principles</title>
				<meeting><address><addrLine>San Diego CA</addrLine></address></meeting>
		<imprint>
			<publisher>FORCE11</publisher>
			<date type="published" when="2014-09">2014. September 2020</date>
		</imprint>
	</monogr>
	<note>Data Citation Synthesis Group</note>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Research data explored: An extended analysis of citations and altmetrics</title>
		<author>
			<persName><forename type="first">I</forename><surname>Peters</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Kraker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Lex</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Gumpenberger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Gorraiz</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Scientometrics</title>
		<imprint>
			<biblScope unit="volume">107</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="723" to="744" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Assigning credit to scientific datasets using article citation networks</title>
		<author>
			<persName><forename type="first">T</forename><surname>Zeng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Wu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Bratt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">E</forename><surname>Acuna</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Informetrics</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="issue">2</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Quantifying the scientific output of new researchers using the zp-index</title>
		<author>
			<persName><forename type="first">C</forename><surname>Zou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">B</forename><surname>Peterson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Scientometrics</title>
		<imprint>
			<biblScope unit="volume">106</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="901" to="916" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
