<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">A graphical view of distance between rankings: The Point and Area measures</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Giorgio</forename><forename type="middle">Maria</forename><surname>Di</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">Department of Information Engineering</orgName>
								<orgName type="institution">University of Padua</orgName>
							</affiliation>
						</author>
						<author role="corresp">
							<persName><forename type="first">Gianmaria</forename><surname>Silvello</surname></persName>
							<email>silvello@dei.unipd.it</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Information Engineering</orgName>
								<orgName type="institution">University of Padua</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">A graphical view of distance between rankings: The Point and Area measures</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">A9538EEBB54CB3A6C68EF66D238908FF</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T20:46+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In Information Retrieval (IR), measuring the distance between rankings is a way for comparing evaluation measures and assess system rankings. In this paper, we present a variation of the Spearman foot rule which allows us to define two measures that have nice analytical and geometrical properties that can be effectively used to compare different rankings and to evaluate IR experiments. A Web application that shows how these measures behave from the graphical point of view is available at: https://gmdn.shinyapps.io/ShinyPointArea/</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Search engines effectiveness can be measured by analyzing their visible outcomes which are lists of documents ranked in descending order of relevance to a given topic. In this context, the correlation among rankings can be used to assess the search engines effectiveness; in fact, when one of the two rankings is the ideal one -i.e. the best obtainable result in a laboratory based evaluation -the correlation between two rankings becomes a measure of effectiveness. A high correlation between a ranking under evaluation and the ideal indicates a good behavior of the search engine being tested. Two standard de-facto measures of distance are the Spearman foot rule <ref type="bibr" target="#b5">[6]</ref> and the Kendall rank distance <ref type="bibr" target="#b4">[5]</ref>. Both measures are very easy to calculate and to interpret, but they lack some properties when it comes to evaluate search engines effectiveness. A review and a classification of ranking similarity measures (including extensions of the Spearman foot rule and the Kendall rank distance) has been presented by Webber et alii in <ref type="bibr" target="#b6">[7]</ref>. This classification is based on two main properties: conjointness and weightedness. Conjointness is the property of dealing with complete (conjoint) or partial (nonconjoint) rankings; weightedness is the property of being able (weighted) or not being able (unweighted) to weight misplacements at the top of the list more than at its bottom.</p><p>In this paper, we present a variation of the Spearman foot rule leading to the definition of two new measures: a point-wise (qualitative) measure and an area-wise (quantitative) measure which can be classified as non-conjoint and unweighted. <ref type="foot" target="#foot_0">1</ref> Point-wise and area-wise measures present analytical and geometrical properties that can be effectively used to compare different rankings and to evaluate IR experiments. Furthermore, the point-wise measure provides an intuitive and effective graphical interpretation that can be used for performing a qualitative analysis on rankings comparison. Whereas, the area-wise measure can be used for a quantitative analysis given that its normalized version offers a simple measure of correlation between rankings. Moreover, as a by-product of this approach, we also obtain an original reformulation of the Kendall rank distance computed at each rank.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Methodological Approach</head><p>Given a set of documents D = {d 1 , . . . , d i , . . . , d n }, we consider two rankings r α and r β as two permutations without repetitions of D. <ref type="foot" target="#foot_1">2</ref> We can use a method idx α (d i ) to extract the index of document d i within r α . In general, the problem is to find the index of a document of r α in the another ranking r β . To this purpose, we define the function F α,β (k) as:</p><formula xml:id="formula_0">F α,β (k) = idx β (r α (k)))<label>(1)</label></formula><p>This function translates the k-th document in r α and returns the index of that document in the ranking r β . For example, let</p><formula xml:id="formula_1">r a = [d 2 , d 1 , d 4 , d 3 ] and r b = [d 1 , d 4 , d 2 , d 3 ] be two instances of r α and r β . Then, for k = 1, F a,b (1) = idx b (r a (1)) = idx b (d 2 ) = 3.</formula><p>The definition of Spearman footrule can be rewritten using Eq. 1 as:</p><formula xml:id="formula_2">S α,β (i) = i k=1 |F α,β (k) − k| .<label>(2)</label></formula><p>which is the total element-wise misplacements between the two lists r α and r β .</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Point and Area Measures</head><p>The two measures we present in this paper derive from a variation of the Spearman foot rule. By removing the absolute value from the Spearman foot rule, we obtain the point-wise measure:</p><formula xml:id="formula_3">P α,β (i) = i k=1 (F α,β (k) − k) .<label>(3)</label></formula><p>As a result, we make a distinction between negative and positive misplacements: when F α,β (k) &gt; k, we obtain a positive error because the document at rank k should have been ranked higher in r β ; when F α,β (k) &lt; k, we obtain a negative value because we find a highly relevant document lower in the list, which means that we are recovering a misplacement occurred earlier. Ultimately, the pointwise goes to zero when the last element of the list is computed. When the pointwise measure returns a positive number at any rank i, it means that there is still some non-recovered misplacement in r b . On the other hand, every time P α,β (i) = 0 it means that the two rankings have retrieved the same elements at rank i. Compared to Spearman, the measure P α,β (i) gives us some additional information about the distance between a given ranking and the ideal one at rank i: we can tell how far we are from the ideal ranking. However, the point-wise measure does not tell how bad a ranking is before rank i.</p><p>The area-wise measure considers the area formed by the segments between two adjacent points P α,β (k −1) and P α,β (k) and the x-axis. As an approximation of the area to be measured, we use a linear interpolation between adjacent points and we define the A α,β (i) as the as a sum of all the trapezoids formed with the x-axis from rank 1 to i:</p><formula xml:id="formula_4">A α,β (i) = i k=1 P α,β (k − 1) + P α,β (k) 2 h (<label>4</label></formula><formula xml:id="formula_5">)</formula><p>where h is the height of each trapezoid, and P α,β (0) = 0 by assumption. The height h of each trapezoid can be a constant (for example h = 1), or it can be used as a tuning parameter for weighting errors differently according to the rank of the elements. The area-wise measure can be used effectively in two ways: to accumulate the information about misplacements until rank i, to compare two or more rankings given a reference ranking (for example, the ideal one). Moreover, we can also study when the same type of misplacements occur in one ranking or another (earlier or later in the ranking), for example when two adjacent relevance degrees are exchanged.</p><p>It may be convenient to normalise the value returned by the area-wise between 0 and 1. The normalisation can be done by dividing the area of a relevance list at rank i by the largest obtainable area given by the worst possible ranking, that is the ideal ranking taken in the inverse order. The normalisation of the area-wise measure is defined as:</p><formula xml:id="formula_6">nA α,β = A α,β A * α,β<label>(5)</label></formula><p>where A * α,β is the area of the worst possible ranking. The normalized area nA α,β defines the distance between β and α where nA α,β = 0 means that β and α are the same ranking and nA α,β = 1 means that they are inverse rankings one of the other. From this it is straightforward to derive the correlation coefficient as: A-corr α,β = 1 − nA α,β .</p><p>In this paper, we have introduced two new measures of distance among rankings: the point-wise and the area-wise measure. The point-wise measure was derived as a modification of the original Spearman foot rule, while the area-wise measure is built on top of the point-wise. The normalisation of the area-wise measure and the correlation coefficient derived from it -i.e. A-corr -is a measure of correlation between rankings. We have discussed some properties of these two measures in terms of qualitative analysis and quantitative analysis. For space reasons, we could not give a complete formalisation of the fact that both measures are metrics -i.e. the reflexivity, non-negativity, symmetry and triangle inequality properties hold. The area-wise measure already incorporates a parameter h that can be used to weight misplacements that happen at top ranks more than at low ranks; a possibility is to define h as the inverse of the index h(i) = 1/i. In this way, we would obtain a non-conjoint weighted distance measure. In addition, the parameter h can be used to model users search intents; for example, we could adjust the height h as a tuning parameter to represent a particular aspect of user behaviour such as the level of patience in the RBO measure <ref type="bibr" target="#b6">[7]</ref>. Furthermore, we want to investigate the use of the point-wise curve and the A-corr measure as a full-fledged effectiveness measures. The behavior of the point-wise curve resembles the Cumulative Relative Position (CRP) <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b3">4]</ref> one, but it presents several key differences such as the fact that the point-wise curve does not cross the x-axis whereas the CRP one does and that CRP is defined as an effort-oriented measure and not as a correlation one.</p></div>			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">The point-wise and area-wise measures already contain a parameter for weighting the top and the bottom of a ranking list differently. For the purpose of this paper, we present the unweighted version of the measures.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">There are important research areas in IR which require distance measures on incomplete permutations (see Webber et alii<ref type="bibr" target="#b6">[7]</ref>, Fagin et alii<ref type="bibr" target="#b2">[3]</ref> for Web search results and Angelini et alii<ref type="bibr" target="#b1">[2]</ref> for search engine failure analysis), also known as the property of conjointness of a distance measure. For space reasons, we cannot address this issue in this paper and leave the solution of this problem to future works.</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Cumulated Relative Position: A Metric for Ranking Evaluation</title>
		<author>
			<persName><forename type="first">M</forename><surname>Angelini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Ferro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Järvelin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Keskustalo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Pirkola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Santucci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Silvello</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 3rd Int. Conf. of the CLEF Initiative (CLEF 2012)</title>
				<meeting>of the 3rd Int. Conf. of the CLEF Initiative (CLEF 2012)<address><addrLine>Heidelberg, Germany</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2012">2012</date>
			<biblScope unit="volume">7488</biblScope>
			<biblScope unit="page" from="112" to="123" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">VIRTUE: A Visual Tool for Information Retrieval Performance Evaluation and Failure Analysis</title>
		<author>
			<persName><forename type="first">M</forename><surname>Angelini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Ferro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Santucci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Silvello</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Visual Languages &amp; Computing</title>
		<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Comparing Top k Lists</title>
		<author>
			<persName><forename type="first">R</forename><surname>Fagin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Kumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Sivakumar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the 14th annual ACM-SIAM symposium on Discrete algorithms, SODA &apos;03</title>
				<meeting>of the 14th annual ACM-SIAM symposium on Discrete algorithms, SODA &apos;03</meeting>
		<imprint>
			<publisher>Society for Industrial and Applied Mathematics</publisher>
			<date type="published" when="2003">2003</date>
			<biblScope unit="page" from="28" to="36" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">The Twist Measure for IR Evaluation: Taking User&apos;s Effort Into Account</title>
		<author>
			<persName><forename type="first">N</forename><surname>Ferro</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Silvello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Keskustalo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Pirkola</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Järvelin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the American Society for Information Science and Technology</title>
		<imprint>
			<publisher>JASIST</publisher>
		</imprint>
	</monogr>
	<note>in print</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">A New Measure of Rank Correlation</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">G</forename><surname>Kendall</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Biometrika</title>
		<imprint>
			<biblScope unit="volume">30</biblScope>
			<biblScope unit="issue">1/2</biblScope>
			<biblScope unit="page" from="81" to="93" />
			<date type="published" when="1938">1938</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">The Proof and Measurement of Association Between Two Things</title>
		<author>
			<persName><forename type="first">C</forename><surname>Spearman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">American Journal of Psychology</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="page" from="88" to="103" />
			<date type="published" when="1904">1904</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">A Similarity Measure for Indefinite Rankings</title>
		<author>
			<persName><forename type="first">W</forename><surname>Webber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Moffat</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zobel</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">ACM Trans. Inf. Syst</title>
		<imprint>
			<biblScope unit="volume">28</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page">20</biblScope>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
