<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Introducing a Framework for Automatically Differentiating Witness Accounts of Events from Social Media</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Marie</forename><surname>Truelove</surname></persName>
							<email>truelove@student.unimelb.edu.au</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Infrastructure Engineering</orgName>
								<orgName type="institution">The University of Melbourne</orgName>
								<address>
									<country key="AU">Australia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Maria</forename><surname>Vasardani</surname></persName>
							<email>maria.vasardani@unimelb.edu.au</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Infrastructure Engineering</orgName>
								<orgName type="institution">The University of Melbourne</orgName>
								<address>
									<country key="AU">Australia</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Stephan</forename><surname>Winter</surname></persName>
							<email>winter@unimelb.edu.au</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Infrastructure Engineering</orgName>
								<orgName type="institution">The University of Melbourne</orgName>
								<address>
									<country key="AU">Australia</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Introducing a Framework for Automatically Differentiating Witness Accounts of Events from Social Media</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">A499B86DE009D204EEE1D02413BC0BE7</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T15:19+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Crowdsourcing</term>
					<term>Social Media</term>
					<term>Witness Accounts</term>
					<term>Supervised Machine Learning</term>
					<term>Dempster-Shafer Theory of Evidence</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Identifying Witnesses of events from social media is an opportunity to crowdsource real-time information to enhance numerous applications including emergency response in a crisis, filtering sources for journalism, and enhancing marketing services. Using a sporting event broadcast live to a proportionally much larger audience, this research demonstrates a significant increase in the number of Witnesses identified posting from the event venue, in comparison to the number identified from geotags alone. This is achieved by considering the text and image content of micro-blogs as additional evidence. This paper also reports progress towards the automatic categorisation of the additional text and image evidence, and modelling and testing this evidence for corroboration or conflict, using Dempster-Shafter Theory of Evidence.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>INTRODUCTION</head><p>Crowdsourcing information about events from social networks such as Twitter is recognised as an opportunity to harvest detailed real-time information, for example enhancing situational awareness for emergency response and management <ref type="bibr" target="#b17">[18]</ref> and creating news summaries of large sporting spectacles <ref type="bibr" target="#b18">[19]</ref>. However, these opportunities come with many problems to solve, including detecting the fraction of relevant micro-blogs, and assessing the credibility and location of the micro-bloggers who posted them. This research makes unique contributions by proposing a framework towards distinguishing those micro-blogs which are Witness Accounts (WA) of events. WA are defined as those micro-blogs which contain an observation of the event or its effects <ref type="bibr" target="#b16">[17]</ref>, for example a statement I see the bushfire smoke! or an image conveying the same information. The micro-blogger who posted the WA is considered a potential Witness to the event, and it can be inferred they are on-the-ground (OTG) <ref type="bibr" target="#b14">[15]</ref>, that is they in close proximity to the event <ref type="bibr" target="#b16">[17]</ref>. Impact Accounts (IA) are defined for those micro-blogs which do no contain an observation of the event, but from which it can also be inferred that the micro-blogger who posted it is OTG. IA statements may be as explicit as I'm being evacuated from my home due to the bushfire. Formally modelling the witnessing fundamentals of observation and spatial relationship separately enables a generic model for a range of event types including unpredicted natural disasters to scheduled events broadcast live from dedicated venues, such as the case study presented in this paper. All micro-bloggers who post observations of the event whether viewed direct from the grandstands or via television are by definition Witnesses. The research in this paper questions whether it is possible to differentiate those Witnesses which are physically at the event from those watching a broadcast. Such differentiation is supported by micro-blogs with geotags, but typically they are present in only a fraction of micro-blogs, for example 1% <ref type="bibr" target="#b0">[1]</ref>. This research demonstrates that including the text content and linked images as evidence, the sample of micro-blogs posted from the event location can be increased significantly from those identified by geotags alone. Additionally, this research questions whether text content and linked images can be automatically categorised, and used to test whether they corroborate the inference they were posted from the event.</p><p>In order to automatically differentiate those micro-blogs which are WA or IA, and test the Witness categorisation of the micro-bloggers who posted them, a framework is proposed with the following parts:</p><p>1. Machine learning approaches to categorise micro-blogs with text and linked images that are likely WA or IA;</p><p>2. Combine the evidence extracted for each individual micro-blog to determine those which can be ranked as containing corroborating or conflicting evidence;</p><p>3. For each micro-blogger found to have posted micro-blogs containing evidence, combine these to rank their likely status as a Witness OTG; and 4. For likely Witnesses, seek further evidence, for example from micro-blogging history posted during the event.</p><p>This paper presents progress to date on parts 1) and 2). To demonstrate part 1) supervised machine learning approaches are used to categorise the text and image content. A model of the micro-blog text, linked images and geotags using Dempster-Shafer Theory of Evidence <ref type="bibr" target="#b2">[3]</ref> is developed to demonstrate part 2). The results indicate a significant improvement on the recognition rate of micro-blogs posted from an event from geotags alone. And where multiple evidence is present for an individual micro-blog their combination does produce intuitive results, including identifying conflict due to GPS error. Enhancements and alternative approaches to those presented in this paper, as with parts 3) and 4) of the framework is the subject of future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>BACKGROUND</head><p>Communication technologies have been described as space-adjusting techniques <ref type="bibr" target="#b13">[14]</ref>, as they enable events to be witnessed by proportionally much larger audiences than the capacity of the venues in which they are held. In these scenarios, unlike previous case studies such as those in <ref type="bibr" target="#b15">[16]</ref>, it is not possible to infer a Witness is OTG for the dominating category of observations, that is of the play on the field <ref type="bibr" target="#b18">[19]</ref>. It has also been determined that the live broadcast delay of approximately 12 seconds cannot be detected in micro-blogs, ruling this feature out as a method to distinguish those witnessing via a broadcast <ref type="bibr" target="#b18">[19]</ref> . In addition to sport, differentiating Witnesses of crisis events has gained much interest from researchers. A journalistic approach describes extracting observation features from text to identify Witnesses <ref type="bibr" target="#b1">[2]</ref>, whereas spatial presence in the city of the event is the criterion in other work <ref type="bibr" target="#b9">[10]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Supervised Machine Learning for Categorisation</head><p>Natural language processing (NLP) using bag-of-words approaches from unigram, bigram and parts-ofspeech (POS) models, can be utilised as baseline text categorisation features <ref type="bibr" target="#b9">[10]</ref>  <ref type="bibr" target="#b17">[18]</ref>. These research report success, comparable in many scenarios to more sophisticated features <ref type="bibr" target="#b9">[10]</ref>  <ref type="bibr" target="#b17">[18]</ref>. A visual bag-ofwords approach to categorise images linked to micro-blogs has also been tested <ref type="bibr" target="#b8">[9]</ref>. The disadvantage of bag-of-words approaches is that although the methodology can be applied generically, the resulting model is not generic, for example, a model developed from training data for a football game cannot be used for a bushfire. Approaches which extract semantic meaning, for example locative expressions from text <ref type="bibr" target="#b6">[7]</ref> would enable a generic model, but their success to-date is limited in domains such as social media <ref type="bibr" target="#b6">[7]</ref>. Detecting micro-blogs posted from OTG is also recognised as a unbalanced class problem <ref type="bibr" target="#b14">[15]</ref>. Approaches taken to mitigate class imbalance typically involve balancing the data via sampling <ref type="bibr" target="#b9">[10]</ref> [5] <ref type="bibr" target="#b17">[18]</ref>, or algorithmically introducing a miss-classification cost to the under-represented class <ref type="bibr" target="#b14">[15]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Dempster-Shafer Theory of Evidence</head><p>Dempster-Shafters Theory of Evidence is one method that has found application in classifier fusion, and managing uncertainty and incomplete reasoning <ref type="bibr" target="#b2">[3]</ref>. The theory models the power set for the frame of discernment of the hypothesis <ref type="bibr" target="#b2">[3]</ref>. A mass function is assigned for each subset in the power set from which the belief interval can be derived <ref type="bibr" target="#b2">[3]</ref>. The mass function can be assigned from various classifier results, including the overall accuracy, class statistics or individual instances <ref type="bibr" target="#b11">[12]</ref>. The mass functions for independent evidence can then be combined <ref type="bibr" target="#b2">[3]</ref>. Dempster's Rule of Combination has been shown to produce uninituitve results in scenarios with conflict <ref type="bibr" target="#b12">[13]</ref>, resulting in many enhancements being proposed including PCR6 <ref type="bibr" target="#b12">[13]</ref> based on proportional conflict resolution.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">METHODOLOGY Data Collection and Training Set Creation</head><p>The case study event is an Australian Football League (AFL) match played at the Melbourne Cricket Ground (MCG) on the annual ANZAC Day public holiday. In 2015, this match attracted a near capacity crowd of 88,398<ref type="foot" target="#foot_1">1</ref> and television ratings of 1.298 million<ref type="foot" target="#foot_2">2</ref> . The corpus was collected using the AFL's promoted hashtag #afldonspies, utilising the Twitter Data Analystics software packages <ref type="bibr" target="#b5">[6]</ref>. Pre-processing samples the micro-blogs to those which can be identified as individual and original, that is not a retweet or posted by a non-individual such as the media <ref type="bibr" target="#b16">[17]</ref>. To collect a sample of linked images, all micro-blogs in the corpus with a URL to Twitter or Instagram were inspected as these are more likely to contain WA <ref type="bibr" target="#b15">[16]</ref>. To create the training set, two expert annotators coded the tweet text and linked images with one of three categories, examples for which are presented in Table <ref type="table" target="#tab_0">1</ref>. The three categories are:</p><p>1. No Evidence (NE) when no evidence of being posted from OTG or another place could be detected.</p><p>2. When evidence is detected, it is categorised as either evidence posted from OTG (E-OTG); 3. Or counter-evidence indicating that it is not posted from OTG (E-NOTG). Evidence OTG (E-OTG) Evidence not OTG (E-NOTG) Fletcher goes bang with a 60 metre monster! #AFLDon-sPies Not the best seats in the house but just glad to be here at @MCG #AFLDonsPies</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>In front of TV with chips for next 3 hours! #AFLDonsPies</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Supervised Machine Learning for Categorisation of Text and Images</head><p>Pre-processing of the text included word tokenisation and parts-of-speech tagging using Ark NLP <ref type="bibr" target="#b10">[11]</ref>. WEKA's <ref type="bibr" target="#b3">[4]</ref> default pre-processing filters were used to experiment with unigram and bigram models. WEKA default feature selection filters are utilised to reduce the number of redundant dimensions, and experiment with a range of classifiers indicated by previous research including Naive Bayes, Random Forest and Support Vector Machines (SVM). All experiments were completed with 10-fold cross validation. As expected, class imbalance was an issue in particular for the text corpus. Sampling to micro-blogs posted by micro-bloggers with at least one piece of evidence detected in the the training set was used to mitigate the imbalance. The classifier selected was that which maximises precision of E-OTG and E-NOTG classes, at the expense of recall if necessary, to minimise conflict due to miss-classification in the Dempster-Shafer modeling.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Categorisation of Geotags</head><p>Geotag evidence was cateogorised as E-OTG or E-NOTG based on whether it was contained within or in the immediate vicinity of the MCG, the place of the event. It was necessary to create a decision boundary for this categorisation, which was informed primarily by the boundaries of places bordering the MCG, for example train lines, roads and other venues.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Dempster-Shafer Modelling of Evidence Extracted from Micro-blogs</head><p>The frame of discernment is modelled as {E-OTG, E-NOTG} with power set (null, E-OTG, E-NOTG, {E-OTG, E-NOTG}). The categorisation of NE is not modelled in the frame of discernment. For example, if a micro-blog has a geotag categorised as E-OTG, and text and image categorised as NE, the text and image do not corroborate or produce conflict with the E-OTG categorisation provided by the geotag. For demonstraton mass functions are set manually, with derivation from classifier results left to future work. The mass functions assigned to geotags represent greater certainty than that assigned for images, which are greater than that assigned for text. The combination rule PCR6 implemented in Matlab <ref type="bibr" target="#b7">[8]</ref> is then used to compute the combinations for analysis, and again, decision algorithm testing left for future work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">RESULTS AND DISCUSSION</head><p>The corpus contained 3260 micro-blogs, 265 with linked images and 133 with geotags. Table <ref type="table" target="#tab_1">2</ref> presents the categorisation results, both training and predicted by classifiers. The annotator agreement for the text and image content was high with Cohen's Kappa of 0.895 and 0.929 respectively. Combining the three content sources, or evidence, from the training data indicates the number of micro-blogs categorised as E-OTG and E-NOTG can be increased significantly from those with geotags alone. The increase for E-OTG is from 21 to 176 micro-blogs, and the increase for E-NOTG is from 112 to 241 micro-blogs. This corresponds to an additional 125 potential Witnesses OTG from 16. 54 tweets had more than one piece of evidence which could be checked for conflict. Conflict did exist for a fraction of tweets, found to be due to GPS error. The geotag indicated the micro-blog was posted from a nearby venue, when the image and text indicated it was posted from the MCG.</p><p>The combined results correctly predicted are fewer than the training data, but still a significant increase from those with geotags alone. The number of micro-blogs categorised as E-OTG increased from 21 to 125, and the number of micro-blogs categorised as E-NOTG increased from 112 to 182. This corresponded to an additional 77 potential Witnesses posting from OTG and an additional 50 potential Witnesses via the broadcast. From the predicted results of the classifiers, 26 micro-blogs had more than one piece of evidence, with five in conflict. In addition to GPS error, these conflicts are now also attributed to miss-classification. From the classifier experimentation it was found the WEKA default SVM, feature selection filter, and a unigram model maximised precision of the E-OTG and E-NOTG classes for text content. Using the methodology described by <ref type="bibr" target="#b8">[9]</ref> the SVM classifier was additionally selected for the image content. The precision and recall results for each class are presented in Table <ref type="table" target="#tab_2">3</ref>. These results indicate the image categorisation failed for the E-NOTG class, which is attributed to the insufficient number of samples in the training data. For future experiments this category could be removed, or the sample increased from other events, both options are to be tested in future work. In comparison, the E-OTG category proved acceptable for both precision and recall. The better precision for text E-NOTG compared to E-OTG could in part be explained by the topics contained in these micro-blogs were dominated by explicit statements critiquing the television coverage or the medium via which the broadcast was being viewed, enabling a more representative unigram model. In comparison, there was not a dominate topic for E-OTG. More robust feature development based on previously identified witnessing characteristics <ref type="bibr" target="#b15">[16]</ref> is being developed in future work. Additionally, the results indicate improvements could be made if the class imbalance were further addressed.  <ref type="table" target="#tab_3">4</ref> presents example mass functions manually assigned for text and image evidence and combination mass functions corresponding to predicted combination results. Manual analysis concludes that these results appear intuitive, for example, when there are multiple evidence present supporting a categorisation, the increased values indicate corroboration. Additionally, when conflict exists the values reflect this, suggesting the conflict redistribution of the PCR6 algorithm is appropriate for the modelled scenario. Hybrid methods for deriving the mass functions which can model the uncertainty of the evidence in addition to the automatic classification results are in progress. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">CONCLUSION</head><p>This paper presented progress on a framework to automatically extract WA and IA of events from social media. Baseline supervised machine learning techniques to categorise text and images were demonstrated, enabling micro-blogs posted from OTG or via the broadcast to be identified in signficantly greater numbers than with geotags alone. Additionally, a method based on Dempster-Shafer Theory of Evidence was demonstrated to combine the extracted evidence to test corroboration or conflict in the categorisation of the micro-blogs. Many areas for enhancements are identified, including machine learning approaches that further mitigate class imbalance and enable generic model development. In addition to seeking these enhancements, future work will include modeling the combination of evidence from multiple micro-blogs to identify the status of potential Witnesses.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Example text and image content for each category. (Source: twitter.com, access date: 25-26/04/15.)</figDesc><table><row><cell>No Evidence (NE)</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Number of micro-blogs categorised for each content individually and in combination. The number of miss-classified micro-blogs in the class are presented in parenthesis.</figDesc><table><row><cell>Content</cell><cell>E-OTG</cell><cell></cell><cell>E-NOTG</cell><cell></cell><cell>NE</cell><cell></cell></row><row><cell></cell><cell cols="5">Training Predicted Training Predicted Training</cell><cell>Predicted</cell></row><row><cell>Geotag</cell><cell>21</cell><cell>-</cell><cell>112</cell><cell>-</cell><cell>-</cell><cell>-</cell></row><row><cell>Image</cell><cell>95</cell><cell>95 (11)</cell><cell>26</cell><cell>17 (10)</cell><cell>146</cell><cell>173 (27)</cell></row><row><cell>Text</cell><cell>99</cell><cell>34 (10)</cell><cell>129</cell><cell>66 (6)</cell><cell>3032</cell><cell>1088 (143)</cell></row><row><cell>Combination</cell><cell>176</cell><cell>125 (15)</cell><cell>241</cell><cell>182 (17)</cell><cell>2328</cell><cell>876 (132)</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 .</head><label>3</label><figDesc>Class precision and recall results for adopted classifier.</figDesc><table><row><cell>Corpus</cell><cell cols="3">Statistic E-OTG E-NOTG</cell><cell>NE</cell></row><row><cell cols="2">Text Precision</cell><cell>0.706</cell><cell>0.909</cell><cell>0.869</cell></row><row><cell></cell><cell>Recall</cell><cell>0.242</cell><cell>0.465</cell><cell>0.984</cell></row><row><cell cols="2">Image Precision</cell><cell>0.844</cell><cell>0.412</cell><cell>0.844</cell></row><row><cell></cell><cell>Recall</cell><cell>0.896</cell><cell>0.280</cell><cell>0.896</cell></row><row><cell>Table</cell><cell></cell><cell></cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4 .</head><label>4</label><figDesc>Example mass functions assigned or PCR6 combination results for evidence combinations detected in micro-blogs. (-indicates no data.)</figDesc><table><row><cell cols="2">Evidence Categorisat.</cell><cell></cell><cell cols="2">Mass Function</cell><cell></cell><cell>Comment</cell></row><row><cell>Text</cell><cell>Image</cell><cell>Geotag</cell><cell cols="3">E-OTG E-NOTG E-OTG,E-NOTG</cell><cell></cell></row><row><cell>E-OTG</cell><cell>-</cell><cell>-</cell><cell>0.70</cell><cell>0.15</cell><cell>0.15</cell><cell>Assigned Mass Funct.</cell></row><row><cell>E-OTG</cell><cell>-</cell><cell>E-OTG</cell><cell>0.95</cell><cell>0.04</cell><cell>0.01</cell><cell></cell></row><row><cell>E-OTG</cell><cell>E-OTG</cell><cell>E-OTG</cell><cell>0.97</cell><cell>0.02</cell><cell>0.01</cell><cell></cell></row><row><cell>E-OTG</cell><cell>E-OTG</cell><cell>E-NOTG</cell><cell>0.55</cell><cell>0.43</cell><cell>0.02</cell><cell>GPS error example</cell></row><row><cell>E-OTG</cell><cell>-</cell><cell>E-NOTG</cell><cell>0.35</cell><cell>0.64</cell><cell>0.01</cell><cell>Miss-classified Text</cell></row><row><cell>-</cell><cell>E-NOTG</cell><cell>-</cell><cell>0.10</cell><cell>0.80</cell><cell>0.10</cell><cell>Assigned Mass Funct.</cell></row><row><cell cols="2">E-NOTG E-NOTG</cell><cell>-</cell><cell>0.07</cell><cell>0.91</cell><cell>0.02</cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0">Proc. of the 3 rd Annual Conference of Research@Locate 14</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_1">twitter.com/MCG/status/591859347891748865</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_2">Proc. of the</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_3"> rd  Annual Conference of Research@Locate 15</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_4">Proc. of the 3 rd Annual Conference of Research@Locate 16</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_5">Proc. of the 3 rd Annual Conference of Research@Locate 17</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_6">Proc. of the 3 rd Annual Conference of Research@Locate 18</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">You are where you tweet: A content-based approach to geo-locating Twitter users</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Cheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Caverlee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 19th ACM International Conference on Information and Knowledge Management</title>
				<meeting>the 19th ACM International Conference on Information and Knowledge Management</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="759" to="768" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Finding and assessing social media information sources in the context of journalism</title>
		<author>
			<persName><forename type="first">N</forename><surname>Diakopoulos</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>De Choudhury</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Naaman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems</title>
				<meeting>the ACM SIGCHI Conference on Human Factors in Computing Systems</meeting>
		<imprint>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="2451" to="2460" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Multiple Classifier Systems: Theory, Application and Tools</title>
		<author>
			<persName><forename type="first">F</forename><surname>Gargiulo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Mazzariello</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Sansone</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2013">2013</date>
			<publisher>Spinger-Verlag</publisher>
			<biblScope unit="volume">10</biblScope>
			<biblScope unit="page" from="335" to="378" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">The WEKA data mining software: An update</title>
		<author>
			<persName><forename type="first">M</forename><surname>Hall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Frank</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Holmes</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Pfahringer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Reutemann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">H</forename><surname>Witten</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">SIGKDD Explorations</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="10" to="18" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">A behaviour analytics approach to identifying tweets from crisis regions</title>
		<author>
			<persName><forename type="first">S</forename><surname>Kumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Liu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 25th ACM Conference on Hypertext and Social Media</title>
				<meeting>the 25th ACM Conference on Hypertext and Social Media</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="255" to="260" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Twitter Data Analytics</title>
		<author>
			<persName><forename type="first">S</forename><surname>Kumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Morstatter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Liu</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2014">2014</date>
			<publisher>Springer</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Automatic identification of locative expressions from social media text: A comparative analysis</title>
		<author>
			<persName><forename type="first">F</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Vasardani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Baldwin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 4th International Workshop on Location and the Web (LocWeb)</title>
				<meeting>the 4th International Workshop on Location and the Web (LocWeb)</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="9" to="16" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Implementing general belief function framework with a practical codification for low complexity</title>
		<author>
			<persName><forename type="first">A</forename><surname>Martin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances and Application os DSmT for Information Fusion</title>
				<editor>
			<persName><forename type="first">F</forename><surname>Smarandache</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">J</forename><surname>Dezert</surname></persName>
		</editor>
		<imprint>
			<publisher>American Press Rehoboth</publisher>
			<date type="published" when="2009">2009</date>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="217" to="273" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<monogr>
		<title level="m" type="main">Identifying witness account in social media using imagery</title>
		<author>
			<persName><forename type="first">S</forename><surname>Mclean</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
		<respStmt>
			<orgName>The University of Melbourne</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Master&apos;s thesis</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Finding eyewitness tweets during crises</title>
		<author>
			<persName><forename type="first">F</forename><surname>Morstatter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Lubold</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Pon-Barry</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pfeffer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Liu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science</title>
				<meeting>the ACL 2014 Workshop on Language Technologies and Computational Social Science</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Improved part-of-speech tagging for online conversational text with word clusters</title>
		<author>
			<persName><forename type="first">O</forename><surname>Owoputi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Oconnor</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Dyer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Gimpel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Schneider</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">A</forename><surname>Smith</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of NAACL</title>
				<meeting>NAACL</meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page">2013</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Application of Dempster-Shafer theory in condition monitoring applications: a case study</title>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">R</forename><surname>Parikh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Pont</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">B</forename><surname>Jones</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Pattern Recognition Letters</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="page" from="777" to="785" />
			<date type="published" when="2001">2001</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">On the consistency of PCR6 with the averaging rule and its application to probability estimation</title>
		<author>
			<persName><forename type="first">F</forename><surname>Smarandache</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Dezert</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 16th International Conference on Information Fusion</title>
				<meeting>the 16th International Conference on Information Fusion</meeting>
		<imprint>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="1119" to="1126" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<title level="m" type="main">Cultural Geography</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">E</forename><surname>Spencer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">L J</forename><surname>Thomas</surname></persName>
		</author>
		<imprint>
			<date type="published" when="1969">1969</date>
			<publisher>John Wiley &amp; Sons, Inc</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Learning from the crowd: Collaborative filtering techniques for identifying on-the-ground Twitterers during mass disruptions</title>
		<author>
			<persName><forename type="first">K</forename><surname>Starbird</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Grace</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Leysia</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 9th International ISCRAM Conference</title>
				<meeting>the 9th International ISCRAM Conference</meeting>
		<imprint>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="1" to="10" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Testing a model of witness accounts in social media</title>
		<author>
			<persName><forename type="first">M</forename><surname>Truelove</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Vasardani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Winter</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 8th Workshop on Geographic Information Retrieval</title>
				<meeting>the 8th Workshop on Geographic Information Retrieval</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Towards credibility of micro-blogs: characterising witness accounts</title>
		<author>
			<persName><forename type="first">M</forename><surname>Truelove</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Vasardani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Winter</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">GeoJournal</title>
		<imprint>
			<biblScope unit="volume">80</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="339" to="359" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Natural language processing to the rescue? Extracting &quot;situational awareness&quot; tweets during mass emergency</title>
		<author>
			<persName><forename type="first">S</forename><surname>Verma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Vieweg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">J</forename><surname>Corvey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Palen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">H</forename><surname>Martin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Palmer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Schram</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">M</forename><surname>Anderson</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 5th International AAAI Conference on Weblogs and Social Media</title>
				<meeting>the 5th International AAAI Conference on Weblogs and Social Media</meeting>
		<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="385" to="392" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">of Twitter and sports games</title>
		<author>
			<persName><forename type="first">S</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Zhong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Wickramasuriya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Vasudevan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Human as real-time sensors of social and physical events: A case study</title>
				<imprint>
			<date type="published" when="2011">2011</date>
		</imprint>
		<respStmt>
			<orgName>Rice University and Motorola Labs</orgName>
		</respStmt>
	</monogr>
	<note type="report_type">Tech. rep.</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
