<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Ambiguous Place Names on the Web</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author role="corresp">
							<persName><forename type="first">Davide</forename><surname>Buscaldi</surname></persName>
							<email>dbuscaldi@dsic.upv.es</email>
							<affiliation key="aff0">
								<orgName type="department">Dpto. de Sistemas Informáticos y Computación (DSIC)</orgName>
								<orgName type="laboratory" key="lab1">Natural Language Engineering Lab</orgName>
								<orgName type="laboratory" key="lab2">ELiRF Research Group</orgName>
								<orgName type="institution">Universidad Politécnica de Valencia</orgName>
								<address>
									<country key="ES">Spain</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Ambiguous Place Names on the Web</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">8A5AB3EE1A94341C2DDA68837FEB77A6</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T09:13+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Geographical information is becoming increasingly important on the World Wide Web. Every day, the number of users looking for geographically constrained information grows. Map-based services, such as Google Maps or Yahoo Maps, provide users with a graphical interface that visualizes results on maps. However, most of the geographical information contained in web documents is represented by means of toponyms, which in many cases are ambiguous. It is therefore important to disambiguate toponyms properly in order to improve the accuracy of web searches. The advent of the Semantic Web will make it possible to overcome this issue by labelling documents with geographical IDs. In this paper we discuss the problems of using toponyms in web documents instead of identifying places using tools such as Geonames RDF, focusing on the errors that affect a prototype geographical web search engine, Geooreka!, currently under development.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>Users' interest in geographically constrained information on the Web has increased over the past years, boosted by the availability of services such as Google Maps<note place="foot" n="1">http://maps.google.com</note>. Sanderson and Kohler <ref type="bibr" target="#b0">[1]</ref> showed that 18.6% of the queries submitted to the Excite search engine contained at least one geographic term, while Gan et al. <ref type="bibr" target="#b1">[2]</ref> estimated that 12.94% of the queries submitted to the AOL search engine expressed a geographically constrained information need. Most of the geographical information contained in the Web and in unstructured text consists of toponyms, or place names. Two main problems derive from using toponyms to represent geographical information. The first is the polysemy of toponyms, or toponym ambiguity: a toponym may be used to represent more than one place. For example, "Puebla" may indicate the city at 19°3′N, 98°12′W, the state in which it is contained, a suburb of Mexicali in the state of Baja California, or three more small towns in Mexico. The second problem is that the mere inclusion of a toponym in a document does not always mean that the document is geographically relevant with respect to the region or area represented by the toponym. In the first case, the solution is the Toponym Disambiguation (TD) task, also named toponym grounding or resolution; in the second case, the solution is to carry out Geographic Scope Resolution, which is also affected by the problem of toponym ambiguity <ref type="bibr" target="#b2">[3]</ref>.</p><p>The Geonames ontology<ref type="foot" target="#foot_0">2</ref> provides users with RDF descriptions of more than 6 million places. The use of this ontology would make it possible to include geospatial semantic information in the Web, eliminating the need for toponym disambiguation. Unfortunately, as noted by <ref type="bibr" target="#b3">[4]</ref>, in the Web "references to geographical locations remain unstructured and typically implicit in nature", determining a "lack of explicit spatial knowledge within the Web" which "makes it difficult to service user needs for location-specific information". In this paper, with the help of the Geooreka!<ref type="foot" target="#foot_1">3</ref> system <ref type="bibr" target="#b4">[5]</ref>, a prototype web search engine developed at the Universidad Politécnica de Valencia in Spain, we discuss the problems that users interested in geographically constrained information may encounter because of the ambiguity of toponyms in the Web.</p><p>We would like to thank the TIN2009-13391-C04-03 research project for partially supporting this work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Geooreka!: a Geographical Web Search Engine</head><p>Geooreka! is a search engine developed on the basis of our experiences at GeoCLEF<ref type="foot" target="#foot_2">4</ref> <ref type="bibr" target="#b5">[6,</ref><ref type="bibr" target="#b6">7]</ref>, which suggested to us that term-based queries may not be the optimal way to express a geographically constrained information need. For instance, it is common for users to employ vernacular names that have a vague spatial extent and do not correspond to the official administrative place name terminology. Another issue is the use of vague geographical constraints that are difficult to translate automatically from natural language into a precise query. For instance, the query "Cultivos de tabaco al este de Puebla" ("Tobacco plantations East of Puebla") presents a double problem: the place name is ambiguous, and the geographical constraint "East of" is vague (for instance, it does not specify whether the search should be constrained to Mexico or extend to other countries).</p><p>These issues are addressed in Geooreka! by allowing users to specify their geographical information needs using a map-based interface. The user writes a natural language query to represent the query theme (e.g., "Cultivos de tabaco") and selects a rectangular box on a map (Figure <ref type="figure" target="#fig_0">1</ref>), which represents the geographical footprint of the query. All toponyms in the box are retrieved using a PostGIS database, and then the Web is queried in order to check the maximum Mutual Information (MI) between the thematic part of the query and all the places retrieved. The complete architecture of the system is shown in Figure <ref type="figure">2</ref>. Web counts and MI are used to determine which theme-toponym combinations are most relevant with respect to the information need expressed by the user (Selection of Relevant Queries). In order to speed up the process, web counts are calculated using the static Google Web 1T database<ref type="foot" target="#foot_3">5</ref>, indexed using the jWeb1T interface <ref type="bibr" target="#b7">[8]</ref>, whereas Yahoo! Search is used to retrieve the results of the queries composed by combining a theme and a toponym.</p></div>
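<div xmlns="http://www.tei-c.org/ns/1.0"><p>The toponym retrieval step described above can be illustrated with a minimal Python sketch. It is not the Geooreka! implementation: the system queries a PostGIS database, whereas this sketch filters a small in-memory list. The names Place, places_in_box and build_queries, as well as the toy gazetteer and its approximate coordinates, are assumptions introduced only for illustration.</p>
<p>
```python
from typing import List, NamedTuple


class Place(NamedTuple):
    """A gazetteer entry (hypothetical structure, for illustration only)."""
    name: str
    lat: float  # latitude in decimal degrees
    lon: float  # longitude in decimal degrees


def places_in_box(places: List[Place], south: float, west: float,
                  north: float, east: float) -> List[Place]:
    """Return the places whose coordinates fall inside the bounding box.

    Assumes the box does not cross the antimeridian (west is not greater than east).
    """
    return [p for p in places
            if north >= p.lat >= south and east >= p.lon >= west]


def build_queries(theme: str, places: List[Place]) -> List[str]:
    """Combine the thematic part of the query with each toponym found in the box."""
    return [f'"{theme}" "{p.name}"' for p in places]


# Toy gazetteer; the coordinates are approximate and purely illustrative.
GAZETTEER = [
    Place("Puebla", 19.05, -98.2),
    Place("Veracruz", 19.19, -96.15),
    Place("Valencia", 39.47, -0.38),
]

selected = places_in_box(GAZETTEER, south=18.0, west=-99.0, north=20.0, east=-96.0)
queries = build_queries("Cultivos de tabaco", selected)
print(queries)
```
</p>
<p>With this toy box around central Mexico, only the entries for Puebla and Veracruz are kept, and one theme-toponym query is generated for each of them. In a real system the rectangular selection would be delegated to the spatial database.</p></div>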
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1">Model of Theme-Place Relevance</head><p>The key issue in the selection of the relevant queries is to obtain a relevance model that is able to select the theme-toponym pairs that are most likely to satisfy the user's information need. According to probability theory, the two components of a query, a theme T and a place G, are independent if conditioning on one does not change the probability of the other, i.e., p(T|G) = p(T) and p(G|T) = p(G), or, equivalently, if their joint probability is the product of their probabilities:</p><formula xml:id="formula_0">p(T \cap G) = p(G)\,p(T)<label>(1)</label></formula><p>If probabilities are calculated using page counts, that is, as the number of pages in which the term (or phrase) representing the theme or toponym appears, divided by F_max = 2,147,436,244, which is the maximum term frequency contained in the Google Web 1T database, then the product p(T)p(G) is the expected probability of co-occurrence of T and G in the same web page under independence. Co-occurrence is clearly only a rough estimate of the fact that T occurred in G, since the mere inclusion of G in a page where T is mentioned does not guarantee a semantic relation between G and T.</p><p>Given this model for the independence of theme and place, we can measure the divergence of the observed probability p(T ∩ G) from the expected probability p(T)p(G): the greater the divergence, the more informative the result of the query. The Kullback-Leibler measure <ref type="bibr" target="#b8">[9]</ref> is commonly used to determine the divergence of two probability distributions.</p><formula xml:id="formula_1">D_{KL}\bigl(p(T \cap G)\,\|\,p(T)\,p(G)\bigr) = p(T \cap G)\,\log\frac{p(T \cap G)}{p(T)\,p(G)}<label>(2)</label></formula><p>This formula is one of the formulations of the Mutual Information (MI) of T and G, usually denoted I(T; G).</p></div>
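<div xmlns="http://www.tei-c.org/ns/1.0"><p>The relevance score of Equation (2) can be sketched in Python as follows, assuming that probabilities are page counts divided by the maximum frequency reported in the text. The function names, the zero-count convention and all the counts in the example are assumptions made for illustration; they are not taken from the Geooreka! code or from the Web 1T data.</p>
<p>
```python
import math

# Maximum term frequency in the Google Web 1T database, as reported in the text.
F_MAX = 2_147_436_244


def probability(count: int, f_max: int = F_MAX) -> float:
    """Estimate a probability as a page count divided by the maximum frequency."""
    return count / f_max


def theme_place_score(count_t: int, count_g: int, count_tg: int,
                      f_max: int = F_MAX) -> float:
    """Score a theme-place pair as p(T,G) * log(p(T,G) / (p(T) * p(G))).

    Returns 0.0 when any count is zero, a convention chosen here to avoid log(0).
    """
    if count_t == 0 or count_g == 0 or count_tg == 0:
        return 0.0
    p_t = probability(count_t, f_max)
    p_g = probability(count_g, f_max)
    p_tg = probability(count_tg, f_max)
    return p_tg * math.log(p_tg / (p_t * p_g))


# Invented counts, for illustration only: (count of place, count of theme and place).
theme_count = 50_000
candidates = {"Puebla": (400_000, 900), "Veracruz": (600_000, 300)}
ranked = sorted(candidates,
                key=lambda g: theme_place_score(theme_count, *candidates[g]),
                reverse=True)
print(ranked)
```
</p>
<p>When the observed co-occurrence count matches what independence predicts, the logarithm is close to zero and so is the score; pairs that co-occur more often than expected receive higher scores and are ranked first.</p></div>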
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Evaluation</head><p>Geooreka! was evaluated on the GeoCLEF 2005 test set in order to compare the results obtained when the geographic footprint is specified by means of keywords with those obtained when it is defined through a map-based interface. In this setup, only the topic title was used as input for the thematic part of Geooreka!, while the area corresponding to the geographic scope of the topic was selected manually. Probabilities were calculated using the number of occurrences in the GeoCLEF collection. Occurrences of toponyms were calculated by taking into account only the geo index. The results were calculated over the 25 topics of GeoCLEF 2005, minus the queries in which the geographic footprint was composed of disjoint areas (for instance, "Europe" and "USA" or "California" and "Australia"), which could not be processed by Geooreka!. Mean Reciprocal Rank (MRR) was used as the measure of accuracy. The GIR system GeoWorSE, in which queries are specified by text, was used as a baseline <ref type="bibr" target="#b9">[10]</ref>. Table <ref type="table" target="#tab_0">1</ref> displays the results obtained. They show that the Web-based results are considerably worse than those obtained on the static collection. This is due primarily to two reasons. First, the topics were tailored to the GeoCLEF collection; therefore, some topics refer explicitly to events that are particularly relevant in the collection and are easier to retrieve. 
For instance, query GC-005 "Japanese Rice Imports" targets documents regarding the opening of the Japanese rice market to other countries for the first time; "Japan" and "Rice" appear together in the document collection only in such documents, so it is easier to retrieve the relevant documents when searching the GeoCLEF collection.</p><p>The second factor affecting the results of the Web-based system is the ambiguity of toponyms, which prevents a correct estimation of the probabilities for places. For instance, in the results obtained for topic GC-008 ("Milk Consumption in Europe"), the MI obtained for "Turkey" was abnormally high with respect to the expected value for this country. The reason is that in most documents the name "turkey" referred to the animal and not to the country. This kind of ambiguity is one of the most important issues when estimating the probability of occurrence of places. Ambiguity (or, better, the polysemy of toponyms) grows together with the size and the scope of the collection being searched. Moreover, the GeoCLEF collection was semantically tagged using WordNet and Geonames IDs to identify the places referenced by toponyms, while Web content is rarely tagged using precise IDs, which increases the chance of error in the estimation of probabilities for places that share the same name.</p><p>Three kinds of toponym ambiguity can be recognised (after the two main types identified by <ref type="bibr" target="#b10">[11]</ref>):</p><p>- Geo / Non-Geo ambiguity: a toponym is ambiguous with respect to another class of name (such as "Turkey", which may be the animal or the country);</p><p>- Geo / Geo ambiguity of different class: for instance, "Puebla" the city or the state;</p><p>- Same class Geo / Geo ambiguity.</p><p>The solution in all cases would be to use an ontology to precisely identify places in documents; the only difference is the amount of information that the ontology should include. 
For the first type of ambiguity, the only information needed is whether the name represents a place or not. In the second case, we would also need to know the class of the place. Finally, for same-class Geo / Geo ambiguity, we may differentiate places using their coordinates, the entity that contains them, or both. The Geonames ontology contains all this information and represents the best option for geographically tagging place names.</p></div>
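<div xmlns="http://www.tei-c.org/ns/1.0"><p>For reference, the Mean Reciprocal Rank used as the accuracy measure in this evaluation can be sketched as follows. This is a generic illustration of the measure, not the evaluation script used for the experiments; the function names and the document identifiers in the example are hypothetical.</p>
<p>
```python
from typing import List, Set


def reciprocal_rank(ranked_ids: List[str], relevant_ids: Set[str]) -> float:
    """Return 1/rank of the first relevant result, or 0.0 if none is retrieved."""
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(reciprocal_ranks: List[float]) -> float:
    """Average the per-topic reciprocal ranks."""
    if not reciprocal_ranks:
        raise ValueError("at least one topic is required")
    return sum(reciprocal_ranks) / len(reciprocal_ranks)


# Hypothetical ranking for one topic: the first relevant document is in third position.
rr = reciprocal_rank(["d7", "d2", "d9"], {"d9"})
print(rr)

# Averaging three made-up per-topic values.
print(mean_reciprocal_rank([1.0, 0.5, 0.0]))
```
</p>
<p>A per-topic value of 1.000 therefore means that the first result returned was relevant, while 0.000 means that no relevant result was retrieved for that topic.</p></div>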
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusions</head><p>The results obtained with Geooreka! over a static collection that is semantically labelled (at least from a geographical viewpoint), compared with the results obtained on the Web, showed that the imprecise identification of places is a problem for search engines intended for users who are interested in geographically constrained information. The use of precise semantic tagging schemes for toponyms, such as Geonames RDF, would allow these search engines to produce more reliable results. Spreading the use of geographical tagging for the Semantic Web would also allow users to mine information using geographical constraints more effectively. We would therefore like to encourage the use of Geonames in order to produce accurate geographically tagged Web content.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1.</head><label>1</label><figDesc>Fig. 1. Main page of Geooreka!</figDesc><graphic coords="3,169.35,115.84,276.66,150.93" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head>Fig. 2.</head><label>2</label><figDesc>Fig. 2. Architecture of Geooreka!</figDesc><graphic coords="4,169.35,115.83,276.66,207.50" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1.</head><label>1</label><figDesc>MRR obtained with Geooreka!, using GeoCLEF or the WWW as target collection, compared to the MRR obtained using the GeoWorSE system, Topic Only runs.</figDesc><table><row><cell>topic</cell><cell>GeoWorSE</cell><cell>Geooreka! (GeoCLEF collection)</cell><cell>Geooreka! (Web)</cell></row><row><cell>GC-002</cell><cell>0.250</cell><cell>1.000</cell><cell>0.083</cell></row><row><cell>GC-003</cell><cell>0.013</cell><cell>1.000</cell><cell>1.000</cell></row><row><cell>GC-005</cell><cell>1.000</cell><cell>1.000</cell><cell>0.000</cell></row><row><cell>GC-006</cell><cell>0.143</cell><cell>0.000</cell><cell>0.500</cell></row><row><cell>GC-007</cell><cell>1.000</cell><cell>1.000</cell><cell>0.125</cell></row><row><cell>GC-008</cell><cell>0.143</cell><cell>1.000</cell><cell>0.000</cell></row><row><cell>GC-009</cell><cell>1.000</cell><cell>1.000</cell><cell>0.067</cell></row><row><cell>GC-010</cell><cell>1.000</cell><cell>0.333</cell><cell>0.250</cell></row><row><cell>GC-012</cell><cell>0.500</cell><cell>1.000</cell><cell>0.000</cell></row><row><cell>GC-013</cell><cell>1.000</cell><cell>0.000</cell><cell>0.000</cell></row><row><cell>GC-014</cell><cell>1.000</cell><cell>0.500</cell><cell>0.091</cell></row><row><cell>GC-015</cell><cell>1.000</cell><cell>1.000</cell><cell>1.000</cell></row><row><cell>GC-016</cell><cell>0.000</cell><cell>0.000</cell><cell>1.000</cell></row><row><cell>GC-017</cell><cell>1.000</cell><cell>1.000</cell><cell>0.143</cell></row><row><cell>GC-018</cell><cell>1.000</cell><cell>0.333</cell><cell>0.500</cell></row><row><cell>GC-019</cell><cell>0.200</cell><cell>1.000</cell><cell>0.045</cell></row><row><cell>GC-020</cell><cell>0.500</cell><cell>1.000</cell><cell>0.090</cell></row><row><cell>GC-021</cell><cell>1.000</cell><cell>1.000</cell><cell>0.000</cell></row><row><cell>GC-022</cell><cell>0.333</cell><cell>1.000</cell><cell>0.076</cell></row><row><cell>GC-023</cell><cell>0.019</cell><cell>0.200</cell><cell>0.125</cell></row><row><cell>GC-024</cell><cell>0.250</cell><cell>1.000</cell><cell>1.000</cell></row><row><cell>GC-025</cell><cell>0.500</cell><cell>0.000</cell><cell>0.000</cell></row><row><cell>average</cell><cell>0.584</cell><cell>0.698</cell><cell>0.280</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_0">http://www.geonames.org/ontology/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_1">http://www.geooreka.eu</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_2">http://ir.shef.ac.uk/geoclef/</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_3">http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2006T13</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Analyzing geographic queries</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sanderson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kohler</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of Workshop on Geographic Information Retrieval (GIR04)</title>
				<meeting>Workshop on Geographic Information Retrieval (GIR04)</meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Analysis of geographic queries in a search engine log</title>
		<author>
			<persName><forename type="first">Q</forename><surname>Gan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Attenberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Markowetz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Suel</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">LOCWEB &apos;08: Proceedings of the first international workshop on Location and the web</title>
				<meeting><address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="49" to="56" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Geographically Constrained Information Retrieval</title>
		<author>
			<persName><forename type="first">G</forename><surname>Andogah</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
		<respStmt>
			<orgName>University of Groningen</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">PhD thesis</note>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Location and the web (locweb</title>
		<author>
			<persName><forename type="first">S</forename><surname>Boll</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Jones</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Kansa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Kishor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Naaman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Purves</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Scharl</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Wilde</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceeding of the 17th international conference on World Wide Web. WWW &apos;08</title>
				<meeting>Proceeding of the 17th international conference on World Wide Web. WWW &apos;08<address><addrLine>New York, NY, USA</addrLine></address></meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="1261" to="1262" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Geooreka: Enhancing Web Searches with Geographical Information</title>
		<author>
			<persName><forename type="first">D</forename><surname>Buscaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rosso</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. Italian Symposium on Advanced Database Systems SEBD-2009</title>
				<meeting>Italian Symposium on Advanced Database Systems SEBD-2009<address><addrLine>Camogli, Italy</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="205" to="212" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Using the WordNet Ontology in the GeoCLEF Geographical Information Retrieval Task</title>
		<author>
			<persName><forename type="first">D</forename><surname>Buscaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rosso</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Sanchis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Accessing Multilingual Information Repositories</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<editor>
			<persName><forename type="first">C</forename><surname>Peters</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">F</forename><forename type="middle">C</forename><surname>Gey</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Gonzalo</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">H</forename><surname>Müller</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">G</forename><forename type="middle">J</forename><surname>Jones</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>Kluck</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">B</forename><surname>Magnini</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">M</forename><surname>De Rijke</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Giampiccolo</surname></persName>
		</editor>
		<meeting><address><addrLine>Berlin</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2006">2006</date>
			<biblScope unit="volume">4022</biblScope>
			<biblScope unit="page" from="939" to="946" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">On the relative importance of toponyms in GeoCLEF</title>
		<author>
			<persName><forename type="first">D</forename><surname>Buscaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rosso</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Multilingual and Multimodal Information Retrieval, 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007</title>
				<meeting><address><addrLine>Budapest, Hungary</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2007">September 19-21, 2007. 2007</date>
			<biblScope unit="page" from="815" to="822" />
		</imprint>
	</monogr>
	<note>Revised Selected Papers</note>
</biblStruct>

<biblStruct xml:id="b7">
	<monogr>
		<title level="m" type="main">jWeb1T: a library for searching the Web 1T 5-gram corpus</title>
		<author>
			<persName><forename type="first">C</forename><surname>Giuliano</surname></persName>
		</author>
		<ptr target="http://tcc.itc.it/research/textec/tools-resources/jweb1t.html" />
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">On Information and Sufficiency</title>
		<author>
			<persName><forename type="first">S</forename><surname>Kullback</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">A</forename><surname>Leibler</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Annals of Mathematical Statistics</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="79" to="86" />
			<date type="published" when="1951">1951</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Using GeoWordNet for Geographical Information Retrieval</title>
		<author>
			<persName><forename type="first">D</forename><surname>Buscaldi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rosso</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Evaluating Systems for Multilingual and Multimodal Information Access, 9th Workshop of the Cross-Language Evaluation Forum, CLEF 2008</title>
				<meeting><address><addrLine>Aarhus, Denmark</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2008">September 17-19, 2008. 2009</date>
			<biblScope unit="page" from="863" to="866" />
		</imprint>
	</monogr>
	<note>Revised Selected Papers</note>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Web-a-where: Geotagging web content</title>
		<author>
			<persName><forename type="first">E</forename><surname>Amitay</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Harel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Sivan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Soffer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval</title>
				<meeting>the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval<address><addrLine>Sheffield, UK</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="273" to="280" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
