<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Making Sense of Location-based Micro-posts Using Stream Reasoning</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Irene</forename><surname>Celino</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">CEFRIEL -ICT Institute</orgName>
								<orgName type="institution">Politecnico of Milano</orgName>
								<address>
									<settlement>Milano</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Daniele</forename><surname>Dell'aglio</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">CEFRIEL -ICT Institute</orgName>
								<orgName type="institution">Politecnico of Milano</orgName>
								<address>
									<settlement>Milano</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Emanuele</forename><surname>Della Valle</surname></persName>
							<affiliation key="aff0">
								<orgName type="department">CEFRIEL -ICT Institute</orgName>
								<orgName type="institution">Politecnico of Milano</orgName>
								<address>
									<settlement>Milano</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="department">Dip. di Elettronica e dell&apos;Informazione</orgName>
								<orgName type="institution">Politecnico di Milano</orgName>
								<address>
									<settlement>Milano</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Yi</forename><surname>Huang</surname></persName>
							<affiliation key="aff2">
								<orgName type="institution" key="instit1">SIEMENS AG</orgName>
								<orgName type="institution" key="instit2">Corporate Technology</orgName>
								<address>
									<settlement>Muenchen</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Tony</forename><surname>Lee</surname></persName>
							<affiliation key="aff3">
								<orgName type="institution">Saltlux</orgName>
								<address>
									<settlement>Seoul</settlement>
									<country key="KR">Korea</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Stanley</forename><surname>Park</surname></persName>
							<affiliation key="aff3">
								<orgName type="institution">Saltlux</orgName>
								<address>
									<settlement>Seoul</settlement>
									<country key="KR">Korea</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Volker</forename><surname>Tresp</surname></persName>
							<affiliation key="aff2">
								<orgName type="institution" key="instit1">SIEMENS AG</orgName>
								<orgName type="institution" key="instit2">Corporate Technology</orgName>
								<address>
									<settlement>Muenchen</settlement>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Making Sense of Location-based Micro-posts Using Stream Reasoning</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">00D535FDC6AA5BDB03C343B9F8331429</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T00:25+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Consider an urban environment and think to its semi-public realms (e.g., shops, bars, visitors attractions, means of transportation). Who is the maven of a district? How fast and how broad can such maven influence the opinions of others? These are just few of the questions BOTTARI (our Location-based Social Media Analysis mobile app) is getting ready to answer. In this position paper, we recap our investigation on deductive and inductive stream reasoning for social media analysis, and we show how the results of this research form the underpinning of BOTTARI.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>In the last few years, we have been witnessing the increasing popularity and success of Location-based Services (LBS), especially of those with a Social Networking flavour. Twitter, Facebook Places, foursquare, Gowalla are only a few examples of applications; those services bring a wide range on useful information about tourist attractions, local businesses and points of interests (POIs) in the physical world.</p><p>Although these services are enormously popular, users still suffer from a number of shortcomings. The overwhelming information flow coming from those channels often confuses users; it is also very difficult to distinguish between a fair personal opinion and a malicious or opportunistic advice. This might be the reason why users primarily link to people they know personally since there is no clear way find out those who are trustable in an on-line social network.</p><p>In this paper, we present our collaborative effort to the design and development of the BOTTARI application, a Location-based Service for mobile users that exploit Social Media Analysis techniques to identify the "mavens" of a specific geographical area, who can be considered as experts of the POIs in this area. BOTTARI was conceived by Saltlux, a Korean Knowledge Communication Company. The application is still under development and it will be made available to Korean users in the Seoul area.</p><p>BOTTARI exploits hybrid Stream Reasoning both on heterogeneous social network data <ref type="bibr" target="#b0">[1]</ref> and geo-location data. The hybrid reasoning engine combines deductive and inductive techniques. Since the input data are huge and change in real-time, the reasoning engine works by processing streaming data. The hybrid reasoning engine is developed on top of the LarKC platform <ref type="bibr" target="#b1">[2]</ref>, a pluggable architecture to build applications with Semantic Web technologies.</p><p>The remainder of the paper is organised as follows. Section 2 explains the concept of stream reasoning and delineates the system architecture. Section 3 describes the BOTTARI app. Section 4 details some user questions in terms of queries to our stream reasoner. Finally, Section 5 concludes the paper.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">System Architecture</head><p>Continuous processing of information flows (i.e. data streams) has widely been investigated in the database community. <ref type="bibr" target="#b2">[3]</ref>. In contrast, continuous processing of data streams together with rich background knowledge requires semantic reasoners, but, so far, semantic technologies are still focusing on rather static data. We strongly believe that there is a need to close this gap between existing solutions for belief update and the actual need of supporting decision making based on data streams and rich background knowledge. We named this little explored, yet high-impact research area Stream Reasoning <ref type="bibr" target="#b3">[4]</ref>. The foundation for Stream Reasoning has been investigated by introducing technologies for wrapping and querying streams in the RDF data format (e.g., using C-SPARQL <ref type="bibr" target="#b4">[5]</ref>) and by supporting simple forms of reasoning <ref type="bibr" target="#b5">[6]</ref> or query rewriting <ref type="bibr" target="#b6">[7]</ref>.</p><p>We are developing the Stream Reasoning vision on top of LarKC <ref type="bibr" target="#b7">[8]</ref>. The LarKC platform is aimed to reason on massive heterogeneous information such as social media data. The platform consists of a framework to build workflows, i.e. sequences of connected components (plug-ins) able to consume and process data. Each plug-in exploits techniques and heuristics from diverse areas such as databases, machine learning and the Semantic Web. We built our Stream Reasoning system by embedding a deductive reasoner and an inductive reasoner within the LarKC architecture (see Figure <ref type="figure" target="#fig_0">1</ref>). First, BOT-TARI pre-processes the micro-posts by extracting information<ref type="foot" target="#foot_0">5</ref> whether a micropost expresses a positive or a negative feeling of its author about a certain POI. After BOTTARI data arrives to the stream reasoner as set of data streams, a selection plug-in extracts the relevant data in each input stream in form of windows. A second plug-in abstracts the window content from fine grain data streams into aggregated events and produces RDF streams. Then, a deductive reasoner plug-in is able to register C-SPARQL queries, whose results can be of immediate use (cf. Section 4) or can be processed by other two sub-workflows. Each sub-workflow is constituted by an abstracter and an inductive reasoner, which uses an extended version of SPARQL that supports probabilities <ref type="bibr" target="#b8">[9]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">The BOTTARI mobile app</head><p>The BOTTARI mobile app is a location-based service that exploits the social context to provide relevant contents to the user in a specific geographic location. The purpose of the BOTTARI service is to provide recommendations on local context information to users through an augmented reality interface. BOTTARI gives detailed information on local POIs, including trust or reputation information. In Figure <ref type="figure" target="#fig_1">2</ref> , we provide some sample screenshots on how the BOTTARI mobile application will look like once completed.</p><p>The input data for the BOTTARI service come from public social networks and location based services (Twitter, local blogs and Korean news), are converted in RDF streams and are then processed and analysed by the system described in Section 2. The RDF-ized data are modelled with respect to the ontology represented in Figure <ref type="figure" target="#fig_2">3</ref>, which is an extension to the SIOC vocabulary [?]. Our model takes into account the specific relations of Twitter (followers/following, reply/retweet); it adds the geographical perspective by modelling the POIs; it includes the "reputation" information by means of positive/negative reviews. The hybrid Stream Reasoning solutions we are developing is able to answer questions like: Who are the opinion makers (i.e., the users who are likely to influence the behaviour of their followers with regard to a certain POI)? How fast and how wide are opinions spreading? Who shall I follow to be informed about a given category of POIs in this neighbourhood?</p><p>In the rest of the section we show how to issue the three queries above using C-SPARQL and SPARQL with probabilities. Who are the opinion makers?</p><p>Lines 1 and 3 of the following listing tell the C-SPARQL engine to register the continuous query on the stream of micro-posts generated by BOTTARI considering a sliding window of 30 minutes that slides every 5 minutes. Line 2 tells the engine that it should generate an RDF stream as output reporting who are the opinion makers for a certain POI and if they are rating it positively or negatively. The basic triple pattern (BTP) at lines 5 and 6 matches micro-posts of the potential opinion makers with a POI. The variable opinion can match one of the properties talksAbout, talksAboutPositively, or talksAboutNegatively. The BTP at lines 7-8 looks up the followers of the opinion makers. The FILTER clause at line 9 checks whether the micro-posts of the followers, which talk about the same POI, occurs after those from the opinion makers. At line 10 the query filters out actions of type twd:talksAbout and concentrates on micro-posts clearly discussing a POI in a positive or negative way. Finally, at line 12 the clause HAVING promotes the true opinion makers which have at least ten followers who expressed the same opinion about the POI after them. How fast and wide opinions are getting spread?</p><p>Using the RDF stream computed by the previous query, the query in the following listing informs about how wide the micro-posts of an opinion maker are getting spread in half an hour. To do so, it considers the reply and re-tweet relationships among tweets (i.e., tweets linked by the discuss property in BOT-TARI data model). Being discuss a transitive property, the C-SPARQL engine uses the materialization technique presented in <ref type="bibr" target="#b5">[6]</ref> to incrementally compute the transitive closure of discuss. ?user a twd:opinionMaker ; 7.</p><p>twd:post ?opinionMakerTweet . 8.</p><p>{ ?aPositiveTweet a twd:Tweet ; 9.</p><p>twd:discuss ?opinionMakerTweet ; 10.</p><p>twd:talksAboutPositively ?poi . 11.</p><p>} UNION { 12.</p><p>?aNegativeTweet a twd:Tweet ; 13.</p><p>twd:discuss ?opinionMakerTweet ; 14.</p><p>twd:talksAboutNegatively ?poi . 15. } Lines 1, 3 and 4 tell the C-SPARQL engine to register the continuous query on the stream of micro-posts generated by BOTTARI and on the streaming results of the opinion makers query. In both cases, a sliding window of 30 minutes, which slides every 30 seconds, is considered. The BTP at lines 6-7 matches the microposts of the opinion makers. The BTP at lines 8-10 and the BTP at lines 12-14 look up other micro-posts that, respectively, positively and negatively discussed those of the opinion makers. Line 2 asks the engine to generate a variable binding reporting how many positive and negative micro-posts are discussing the microposts of the current opinion makers.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Who shall I follow?</head><p>Let us consider now a specific BOTTARI user named Giulia. In the following listing we show a query that asks for the mavens Giulia should follow to be informed about attractions for kids, even among people she does not know. The system uses the social network of Giulia and the last window in the stream (generated by the query in the first listing) to determine such predicted probability. The BGP at lines 4-6 matches the opinion makers that have been recently expressing positive opinions about attractions for kids. The triple patter at line 7 matches BOTTARI users that Giulia is following. Note that the following relationship may have not been asserted yet, the construct WITH PROB extends SPARQL by letting it query an inducted model. The variable ?prob assumes the value 1 for the user she follows already and assumes the estimated probabilities between 0.8 and 1 for users she may be recommended to follow (cf. line 8). The ORDER BY clause is used to return users sorted by decreasing probability. The query answer includes pairs of users and predicted likelihood (e.g. :Alice with probability 0.99, :Bob with probability 0.87).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Conclusions and Future Works</head><p>In this paper we presented BOTTARI, a location-based mobile application which is able to supply contents and personalized suggestions to the users. We explained the processing of new recommendations, based on the elaboration of data streams generated by microblogging platforms like Twitter and foursquare. The computation is defined as a workflow combining Semantic Web and machine learning techniques and it is executed on top of the LarKC platform.</p><p>Our future work will focus on the development of the first stable version of the BOTTARI application and its release as Android app. The initial release will focus on Korea and will be evaluated by following a user-centered approach: a set of users will try out the application, supplying us feedbacks via a survey with questions about the system and its accuracy in providing suggestions. This work was partially supported by the EU project LarKC (FP7-215535).</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Architecture of our Stream Reasoner</figDesc><graphic coords="2,151.98,318.03,311.39,100.01" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 2 .</head><label>2</label><figDesc>Fig. 2. Some screenshots of the BOTTARI Android application</figDesc><graphic coords="3,134.68,115.71,345.99,341.39" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Fig. 3 .</head><label>3</label><figDesc>Fig. 3. Ontology modelling of BOTTARI data</figDesc><graphic coords="4,134.68,115.70,345.99,185.38" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_0">This technology is a Saltlux trade secret. • #MSM2011 • 1st Workshop on Making Sense of Microposts •</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Deductive and Inductive Stream Reasoning for Semantic Social Media Analytics</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">F</forename><surname>Barbieri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Braga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ceri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Della Valle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Tresp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rettinger</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Wermser</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Intelligent Systems</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="32" to="41" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Large Knowledge Collider. A Service-oriented Platform for Large-scale Semantic Reasoning</title>
		<author>
			<persName><forename type="first">A</forename><surname>Cheptsov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of WIMS</title>
				<meeting>WIMS</meeting>
		<imprint>
			<date type="published" when="2011">2011. 2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Data Stream Management: Processing High-Speed Data Streams</title>
		<author>
			<persName><forename type="first">M</forename><surname>Garofalakis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Gehrke</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Rastogi</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2007">2007</date>
			<publisher>Springer-Verlag New York, Inc</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">It&apos;s a Streaming World! Reasoning upon Rapidly Changing Information</title>
		<author>
			<persName><forename type="first">Della</forename><surname>Valle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Ceri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Van Harmelen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Fensel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Intelligent Systems</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="83" to="89" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">C-SPARQL: a Continuous Query Language for RDF Data Streams</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">F</forename><surname>Barbieri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Braga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ceri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Della Valle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Grossniklaus</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Int. J. Semantic Computing</title>
		<imprint>
			<biblScope unit="volume">4</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="3" to="25" />
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Incremental Reasoning on Streams and Rich Background Knowledge</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">F</forename><surname>Barbieri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Braga</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Ceri</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Della Valle</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Grossniklaus</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of ESWC2010</title>
				<meeting>of ESWC2010</meeting>
		<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Towards Scalable Reasoning on Ontology Streams via Syntactic Approximation</title>
		<author>
			<persName><forename type="first">Y</forename><surname>Ren</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">Z</forename><surname>Pan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Zhao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of IWOD2010</title>
				<meeting>of IWOD2010</meeting>
		<imprint>
			<date type="published" when="2010">2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Towards LarKC: a Platform for Web-scale Reasoning</title>
		<author>
			<persName><forename type="first">D</forename><surname>Fensel</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of ICSC 2008</title>
				<meeting>of ICSC 2008</meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Materializing and querying learned knowledge</title>
		<author>
			<persName><forename type="first">V</forename><surname>Tresp</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Huang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bundschus</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Rettinger</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of IRMLeS</title>
				<meeting>of IRMLeS</meeting>
		<imprint>
			<date type="published" when="2009">2009. 2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">SIOC Core Ontology Specification</title>
		<author>
			<persName><forename type="first">D</forename><surname>Berrueta</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">W3C Member Submission</title>
				<meeting><address><addrLine>W3C</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2007">2007</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
