<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Monitoring Real-time Spatial Public Health Discussions in the Context of Vaccine Hesitancy</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><roleName>M.S.E</roleName><forename type="first">Michael</forename><forename type="middle">C</forename><surname>Smith</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">The George Washington University</orgName>
								<address>
									<settlement>Washington</settlement>
									<region>DC</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><roleName>Ph.D</roleName><forename type="first">Mark</forename><surname>Dredze</surname></persName>
							<affiliation key="aff1">
								<orgName type="institution">The Johns Hopkins Univer-sity</orgName>
							</affiliation>
						</author>
						<author>
							<persName><roleName>Ph.D</roleName><forename type="first">Sandra</forename><surname>Crouse Quinn</surname></persName>
							<affiliation key="aff2">
								<orgName type="institution">The University of Maryland</orgName>
								<address>
									<settlement>College Park</settlement>
									<region>Maryland</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><roleName>Ph.D</roleName><forename type="first">David</forename><forename type="middle">A</forename><surname>Broniatowski</surname></persName>
							<affiliation key="aff0">
								<orgName type="institution">The George Washington University</orgName>
								<address>
									<settlement>Washington</settlement>
									<region>DC</region>
									<country key="US">USA</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Monitoring Real-time Spatial Public Health Discussions in the Context of Vaccine Hesitancy</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">E9BE5DD66068200EE97D1129B41A1A89</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T20:08+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Social media provide the potential to keep up with public discussions more quickly, at lower cost, and at potentially higher granularity and scope than do traditional surveys <ref type="bibr" target="#b10">9</ref> . This paper details a preliminary system of real-time geographical monitoring and analysis using the context of the vaccine-hesitancy discussion across the United States, a valuable backdrop for such a system because of the diverse and impactful nature of the vaccination discussions as they appear, change, and influence the public <ref type="bibr" target="#b13">12,</ref><ref type="bibr" target="#b22">20</ref> . We combine various methods in machine learning to geolocate, categorize, and classify vaccination discussions on Twitter. As a proof of concept, we show analyses with a prominent anti-vaccine discussion that validate the system with results from traditional surveys, yet also provide valuable spatial statistical power on top of such surveys on maps of the United States. We detail limitations and future work, yet still conclude that the system and the answers it enables are important because they will allow for more targeted and effective communication and reaction to the discussion as a first step towards monitoring people's views.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Introduction</head><p>Achieving herd immunity is a critical element of a vaccine's global effectiveness because it is vital to limiting transmission and reducing incidence. In 2007, most vaccine-preventable diseases were at an all-time low incidence <ref type="bibr" target="#b30">28</ref> , but in recent years the spread of vaccine-preventable diseases has increased. For example, there were outbreaks of measles in Disneyland in 2014 <ref type="bibr" target="#b16">15</ref> , of pertussis in 2014 <ref type="bibr" target="#b36">34</ref> , and others. This urgent rise in such diseases has highlighted recent increases in vaccine hesitancy as prominent and controversial issues seen as potential threats to herd immunity <ref type="bibr" target="#b22">20</ref> .</p><p>Reasons for this vaccine hesitancy, defined by the WHO as "refer [ring] to delay in acceptance or refusal of vaccines despite availability of vaccination services...complex and context specific varying across time, place and vaccines...includ [ing] factors such as complacency, convenience and confidence" <ref type="bibr" target="#b17">16</ref> , are varied. Not only are there myriad drivers of these decisions that vary spatially <ref type="bibr" target="#b13">12</ref> , but no "strategies intended to address vaccine hesitancy", were universally effective, if effective at all <ref type="bibr" target="#b12">11</ref> . Because of the potential impact of these decisions, the public health community needs to track and understand these rationales as they appear, change, and influence the public. This paper details a preliminary system of real-time geographical monitoring and analysis in the context of the vaccine hesitancy discussion across the United States. The system and the answers it enables, with accompanying theoretical advantages of using social media and survey data together, are important because they will allow a first step towards more targeted and effective communication and reaction to widespread discussion, critical to reducing such hesitancy.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Literature Review</head><p>This paper aims to produce an efficient, broad monitoring system for the discussions that exist in the United States about vaccine hesitancy. We detail the reasons for the system's potential for improvement over survey methods, and pertinent survey research as an option for validating the system.</p><p>Exploring new factors relevant to people's vaccination decisions across the nation would be slow and expensive using traditional survey methodsregardless of methods chosen, it would be difficult to keep up with new outbreaks of hesitancy or disease. Enter social media, "which provide unprecedented, realtime access to the attitudes, beliefs, and behaviors of people from across demographic groups" <ref type="bibr" target="#b10">9</ref> . Social media is already being used to effectively, quickly, and cheaply track health issues such as disease incidence <ref type="bibr" target="#b1">2,</ref><ref type="bibr" target="#b8">8,</ref><ref type="bibr" target="#b24">22,</ref><ref type="bibr" target="#b31">29</ref> . Not only do social media successfully track disease, but studies have shown that it may also track public opinion related to medical and health issues <ref type="bibr" target="#b21">19,</ref><ref type="bibr" target="#b34">32</ref> . Vaccination decisions are opinions of sections of the public; therefore, social media can and should be used to track vaccine hesitancy <ref type="bibr" target="#b10">9</ref> . This work aims to spatially track discussion about vaccination using Twitter messages as a starting point towards capturing and validating opinions; to our knowledge no system exists with such spatial granularity and capability in this context.</p><p>To validate the system one can look at ground truth of how people make decisions in similar contexts; Quinn and colleagues have produced a body of work with such aims <ref type="bibr" target="#b14">13,</ref><ref type="bibr" target="#b26">24,</ref><ref type="bibr" target="#b27">25,</ref><ref type="bibr" target="#b29">27</ref> . Overall, they showed via surveys that such factors as public trust, demographics, risk perception, and social norms influence vaccine decision making <ref type="bibr" target="#b27">25,</ref><ref type="bibr" target="#b29">27</ref> . For example, in a qualitative study, Quinn et al. showed that dimensions of public trust affect medical decisions in a study about postal workers' reactions during the 2001 anthrax attacks <ref type="bibr" target="#b29">27</ref> . These "attitudinal and experience variables [and] demographic characteristics" <ref type="bibr" target="#b14">13</ref> provide insight into how rationales about vaccination decisions may vary. They provide a means of validating the system, a starting point for exploring the spatial and sociodemographic variability in vaccination decisions, and an opportunity to confirm that hypotheses established in limited survey environments hold in wider contexts. For example, is there a significant pocket of people in a certain area who are hesitant to vaccinate because they do not trust the government?</p><p>The monitoring system able to broadly, cheaply, and quickly test such survey and spatial hypotheses in this context is the novel contribution of this work. Specifically, it is A) a processing system for classifying messages and their sentiment that B) integrates existing analyses for topic and location and C) provides an extensible framework for statistically testing spatial hypotheses about vaccine hesitancy given the generated messages and metadata on social media. This system involves using targeted methodologies and leverages theoretical advantages from using social media data in concert with survey data. How might such a system shed light on vaccine-hesitancy discussions across the USA, what are its limitations, and how could it be used as a first stepping stone to augment survey methods?</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Methods</head><p>Given our context of vaccination discussion, our approach to the system is the following combination of natural language processing and geospatial techniques: we collect and classify social media posts on Twitter related to vaccination; then categorize these posts by their sentiment, location and topic; then interpret the topics related to vaccine refusal and hesitancy; then spatially join and aggregate the results. This process enables evaluation of our survey hypotheses using the spatial topic clusters, as well as spatial examination of new discussions as it can be re-run over time. What follows are descriptions of each of the system's sub-processes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Methods -Data</head><p>The data related to vaccines for our context, also described briefly by Dredze et al. <ref type="bibr" target="#b10">9</ref> , are Twitter posts (tweets) from the USA that we began collecting around the aforementioned measles outbreak in Disneyland. The system collects the data and classifies for relevance and sentiment. Initially we flagged tweets by keyword using the Twitter Streaming API i , specifying more than fifty hand-chosen keywords such as 'vaccine', 'shot', and 'immunization' similar to and validated by common practice <ref type="bibr" target="#b1">2,</ref><ref type="bibr" target="#b6">6,</ref><ref type="bibr" target="#b33">31</ref> .</p><p>Table <ref type="table">1</ref>: Keywords used to filter Twitter data vaccine,vaccines,shot,mmr,tdap,flushot,hpv,polio,rotavirus,chickenpox,smallpox,hepatitis,hepa,hepb,dtap,meningitis,shingles,vaccinate,vaccinated,vaccine,vaccines,vacine,vacines,tetanus,diptheria,pertussis,whoopingcough,dtp,dtwp,chickenpox,measles,mumps,rubella,varicella,diphtheria,haemophilus,papillomavirus,meningococcal,pneumococcal,rabies,tuberculosis,typhoid,yellowfever,immunizations,immunization,imunization,immune,imune,cholera,globulin,encephalitis,lyme</p><p>The Twitter Streaming API selects all tweets based on a given search (or a random 1% sample if a size threshold is exceeded); we obtained all matching tweets in the US during this time period, on the order of millions of tweets. See Dredze et al., for further details on these data. 9</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Methods -Relevance and Sentiment</head><p>We trained supervised machine learning algorithms that, as part of the system, automatically classify tweets for relevance to vaccination and for sentiment, as sentiment analysis produces a measure of the expressed opinion in messages <ref type="bibr" target="#b23">21</ref> . We obtained labeled training data using Amazon Mechanical Turk ii on randomly chosen subsets of our tweets to 1) tag them as being relevant to the topic of vaccines or not; 2) of those relevant, randomly choose and tag as having sentiment toward vaccines (neutral or non-neutral); 3) and of those that bear sentiment, randomly choose and tag as having positive or negative sentiment. While training the classifiers we conducted cross-validation and maximized precision and recall given tunable parameters. iii See Dredze et al., for further details on these classifiers. <ref type="bibr" target="#b10">9</ref> Given these classifiers and the ability to run them over any tweets (our dataset and those incoming in real-time), we thus have the first part of the system, namely a real-time categorization of vaccine-related Twitter posts: those relevant to vaccines; of those relevant, those that bear sentiment; and of those that bear sentiment, their sentiment polarity.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Methods -Location Classification</head><p>The next part of the system involves location classification. We use the Carmen Geolocation Toolkit 10 to automatically classify a tweet's location; such geolocation has been shown to be appropriate and effective in other public health studies <ref type="bibr" target="#b11">10</ref> . Carmen improves upon information provided by the Streaming API, and returns location information at the country, state, county, and latitude/longitude levels if able.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Methods -Topic Classification</head><p>The system uses topic modeling to determine the content discussed in our relevant tweets. Latent Dirichlet Allocation (LDA) is a commonly-used machine learning algorithm that automatically determines topics in collections of text <ref type="bibr" target="#b0">1</ref> , common practice to automatically extract patterns and groups in text collections, of which social media data is a prime example. LDA assumes words in documents co-locate near other words (possibly across documents) because they are related, and the algorithm collects and reports groups of such related words, with the groups representing topics. Using the MAchine Learning for LanguagE Toolkit (MALLET) <ref type="bibr" target="#b20">18</ref> , the system involves running LDA over our tweet dataset (the documents labeled as relevant to vaccination) to evaluate topics relevant to vaccine hesitancy. This produces an overall list of topics, and a parameterization of each tweet by topic (which is roughly proportional to relative composition by topic). Note that LDA is unsupervised; in general there is no guarantee that the algorithm will return a specific topic, and it is up to the analyst to determine topics' relevance and substance by analyzing the words and groups returned <ref type="bibr" target="#b2">3</ref> . We leverage public health researchers' domain expertise to make such determinations. We note that in our context, LDA will show relevant topics because we have collected and categorized the tweets to fit a specific meta-topic (that of vaccines). By contrast, the substance of the relevant topics will be outputs of the system enabling hypothesis testing of our ground truth factors and exploration beyond.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Methods -Joins and Aggregations</head><p>The system enables nonspatial aggregation and analysis on the tweets by sentiment and topic. More central to this paper, however: the tweets also have location data, which one may spatially join and aggregate using ArcMap (version 10.3), part of ArcGIS. ArcGIS is a geographic information system software that can generate maps of aggregated data and can calculate and display spatial statistics on those maps. One such statistic is the Getis-Ord Gi* statistic for hotspot analysis <ref type="bibr" target="#b15">14</ref> , valuable to the system because it indicates statistically significant high (low) point data if a point and its neighbors are high (low) in terms of some common variable. iv Using these maps and statistics, one may spatially analyze where vaccine tweets (our point data) occur, where sentiment occurs, and where topics occur (our common variables), with notions of how often they occur and whether statistically significant differences exist. Accordingly, the system provides a geographic result to accompany our topic-substance result concerning the survey results.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results</head><p>Running the topic model over all tweets in our dataset, we obtained information about topics and their distribution over our tweets. The system also produces classification results for each tweet in terms of its relevance, sentiment, and location. We may use a tweet's ID (e.g. "532385146419560448") to link its sentiment, location, and topic distribution. Given locations of relevant messages, we may filter by classification category and weight by topic distribution to find hotspots for discussion of a given discussion.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results -Topics</head><p>iii Relevance classifier (recall .91, precision .96); if relevant, whether contains sentiment about vaccines (recall .28, precision .63); if contains sentiment, is it positive vs negative (recall .85, precision .75). We chose to maximize precision in the second case because we were relying on the precision of our results in the positive/negative classifier. Such low recall is not an issue given the size of our dataset. iv The definition of 'neighbor' is variable; what is appropriate depends highly on the input data. Some of many possible options for our topic proportions and tweet data are k-nearest-neighbors (weighting influence such that all points have k neighbors) or weighting influence based on inverse Euclidian. We chose the former due to ease of interpretation and calculation. Specifically, the topic information consists of a relative weighting parameter for each topic for each tweet (roughly proportional to the proportion of each topic in the tweet), so one can get messages most representative of each topic. We ran the topic model on all messages, filtered by the regular expression *vacc* to prune irrelevant / noisy topics in advance, and qualitatively interpreted the topics. Needing to specify the number of topics, we chose 50 to capture enough variability in our large dataset. As a proof of concept, we considered topic 46 in our further analyses. Topic 46 pertains to the California government's bill eliminating exemptions from vaccinations in schoolchildren. Below are example messages from this topic; our domain experts who performed identification and validation looked at both the tokens in the topic and representative messages when doing so, as is good practice 5 .</p><p>• "california governor signs strict school vaccine legislation gov jerry brown signs california bill imposing..."</p><p>• "jim carrey brands governor 'fascist' over vaccine law jim carrey called california gov jerry brown" • "ahf criticizes dumb amp dumber star jim carrey for calling gov brown a fascist" • "calif gov jerry brown launching frosted mercury flakes children's cereal to accompany vaccine mandate"</p><p>We chose this topic for two reasons: it is an arguably prominent anti-vaccination discussion in our data, and it is pertinent to a hypothesis validated by Quinn's previous work that "public trust / trust in government" affects such attitudes about medical decisions as vaccination, a common thread for validation. The analysis steps are the same regardless of topic chosen.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Results -Hotspots for Topics</head><p>To identify hotbeds of these vaccine hesitancy discussions, we used the "Hot Spot Analysis" tool in ArcMap, which calculates the Gi* statistic. We continued the proof of concept by considering only the contiguous United States, but the analysis is identical using different geographical boundaries (e.g. an individual state or a different country). We also limited our hot spot analysis only to the tweet messages classified as having negative sentiment about vaccines since our chosen topic was 46. As the definition of a neighborhood may vary depending on input data, we chose to spatially weight our input data via the k-nearest-neighbor (KNN) algorithm (using the default value of 8 neighbors suggested by ArcMap) to elegantly allow for such variations. This yielded the following map. The hot-spot map of topic 46 shows statistically significant areas in the contiguous USA where the highest proportion of discussion of topic 46 is occurring in negative-sentiment vaccine messages on Twitter. For example, topic 46 is often discussed near LA and in the northern Appalachian region, among other areas. Such maps may be created for any permutation of classification and topic, and would yield any statistically significant results to be found among the spatial data for each permutation. Note that this statistic does not merely highlight points that contain a lot of messages, but highlights points with statistically significant differences of message totals compared to neighboring points. Such significant results would (and do in the case of topic 46) yield convergent findings with survey data. Future work will more rigorously relate and apply this mixed methods approach.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Discussion</head><p>The results outlined above yielded statistically significant geographic hot and cold spots in terms of individual topics in negative-sentiment vaccine messages on Twitter as a proof of concept. Such hotspots in a topic correspond to a discussion being statistically prevalent, and more prevalent in certain areas than others. That discussions pertinent to the trust in government results from Quinn's surveys (topic 46) are statistically significant in the first place both validates our approach and supports Quinn's findings on a larger scale. The fact that no significant cold spots are found among the topic 46 negative-sentiment map also validates our approach, as one would expect only hotspots in such topics pertaining to anti-vaccine discussions. This proof of concept showed that social media contains valuable information that is more granular and available more cheaply and quickly than through traditional survey methods. With further refinement, this information may be leveraged to replicate and compare with survey results.</p><p>In addition, such hot spot information is immediately actionable from a public health perspective, a valuable quality in the context of vaccine hesitancy. For example, one may target messages towards public policy think tanks in Arkansas to foster a more balanced approach to the debate about the government mandates on vaccination. Identification of such geolocated issues is valuable to public health officials as it provides low hanging fruit to address if interventions are known. For example, officials might value being able to reach all of Arkansas in a messaging campaign by only messaging Little Rock (if that were the only hotspot). The other side of the coin is also valuable, however, as evidence-based interventions may not yet exist. Officials may have been unaware of a specific geographic area and its opinions on a sub-issue of vaccine hesitancy, as hesitancy itself has been shown to vary across regions and within countries without a successful strategy. <ref type="bibr" target="#b12">11</ref> Thus the system's analysis of its real-time sentiment-topic data allowed us to identify individual discussions from the aggregate meta-topic, suggested the ability to verify survey hypotheses relating to those discussions, and suggested spatial targets for more effective use of public health resources. With expertise in both content and data analysis to fully understand and leverage the social media data, the system provides a promising opportunity to monitor real-time views.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Discussion -Limitations</head><p>However, this system and its underlying approach may be improved. For example, the ability of Carmen 10 to augment location information could be increased such that it identifies information at a more granular level in more messages. This would affect the geospatial hotspot analysis, as one could improve results by grouping by levels of granularity with more and better location information. In addition, this proof of concept topic analysis returned 50 topics, but sensitivity analysis on this number as up-or downsizing could reduce noise. Thirdly, the open debate of social media analysis applies as well: whether social media discussions are a valid and accurate proxy for the rationales of the population at large. This applies both in terms of users' demographics (see below) and in terms of the potential for fake users, which recent research may be used to filter <ref type="bibr" target="#b3">4</ref> . Fourthly, one should be cognizant of the (limited but nonzero) amount of technical supervision required: the system requires computational capacity and server administration, and it requires creating machine learning classifiers. 9</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Discussion -Future Work</head><p>An additional limitation is that the topic models in LDA are subjective; there are alternative models and means of interpretation associated with them that could be employed. Paul and Dredze created an elegant framework for supervised topic models <ref type="bibr" target="#b18">17,</ref><ref type="bibr" target="#b25">23</ref> , which could be adapted to our system, that would return topics seeded by specific a priori values (i.e., those in Quinn's survey results). Such seeded topics would remove subjectivity of topic interpretation, quantifiably associating topics with pre-determined results. Secondly, LDA is merely a long-running industry standard; an alternative is Linguistic Inquiry and Word Count (LIWC) <ref type="bibr" target="#b35">33</ref> . In contrast to LDA which returns words that are co-located, LIWC counts psychologically relevant words into categories, producing output along dimensions such as "negative emotion words" or "tentative language". These categories and their relative frequencies paint a picture of how the word user(s) consider their subject matter, in this case discussions about vaccines. Using LIWC would provide an alternative viewpoint that may be more easily interpreted using the framework of psychology.</p><p>Another aim of future work involves more explicit relations to traditional survey methods. One immediate improvement would be to aggregate tweets by user, which will enable user demographic classification <ref type="bibr" target="#b7">7,</ref><ref type="bibr" target="#b32">30</ref> and other user-level statistics such as comparisons to known outbreaks of disease or to news coverage. With this information, and analysis related to retweets and news mentions, one might operationalize survey questions to individuals as different slices of our dataset, which for example would allow exploring and validating if demographics are related to one's rationales and opinions, especially those opinions relating to trust in government, as previous work has suggested <ref type="bibr" target="#b27">25,</ref><ref type="bibr" target="#b28">26</ref> . Aggregating information by user would also allow the system to further the question of whether social media may be used as a proxy for the population at large, both in terms of demographics and in terms of coverage of topic discussion. The representativeness of social media users is an open question, whether relating to pro-or anti-vaccination communities or to the population as a whole. The analyses in this paper combined with demographic classification would allow us to determine how representative our social media users are of our target population(s).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Conclusions</head><p>Given the problem of tracking and understanding discussion in a population and the context of vaccine hesitancy, we have as a first step created a pipeline of natural language processing and geospatial techniques that enable real-time statistical analysis of different discussions in a population across space. This system showed statistically significant spatial hotspots of discussion in the USA that provide actionable insights for the time-sensitive context. Given the financial and computational ease of gathering and processing swaths of social media data, this system can be used to monitor real-time views, and, easily extensible, suggests the ability to verify traditional survey methods in broader spatial contexts.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Hotspots of the proportion of discussion of Topic 46 in the contiguous USA</figDesc><graphic coords="4,148.50,407.13,314.41,232.25" type="bitmap" /></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0">i https://dev.twitter.com/streaming/overview ii https://www.mturk.com/mturk/welcome</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgements</head><p>Thank you to Amelia Jamison for her helpful feedback and topic analysis.</p><p>Dr. Dredze has received consulting fees from Directing Medicine LLC and Sickweather LLC, who use social media for public health surveillance.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Latent Dirichlet Allocation</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Blei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">Y</forename><surname>Ng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">I</forename><surname>Jordan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">J Mach Learn Res</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="page" from="993" to="1022" />
			<date type="published" when="2003-03">2003 Mar</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Broniatowski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Paul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dredze</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PLOS ONE</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">12</biblScope>
			<biblScope unit="page">e83672</biblScope>
			<date type="published" when="2009">2013 Dec 9</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Reading tea leaves: How humans interpret topic models</title>
		<author>
			<persName><forename type="first">J</forename><surname>Chang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">L</forename><surname>Boyd-Graber</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Gerrish</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Blei</surname></persName>
		</author>
		<ptr target="https://papers.nips.cc/paper/3700-reading-tea-leaves-how-humans-interpret-topic-models.pdf" />
	</analytic>
	<monogr>
		<title level="m">Nips</title>
				<imprint>
			<date type="published" when="2009">2009</date>
			<biblScope unit="page" from="1" to="9" />
		</imprint>
	</monogr>
	<note>cited 2017 Mar 8</note>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Antisocial Behavior in Online Discussion Communities</title>
		<author>
			<persName><forename type="first">J</forename><surname>Cheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Danescu-Niculescu-Mizil</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Leskovec</surname></persName>
		</author>
		<idno>arXiv:150400680</idno>
		<ptr target="http://arxiv.org/abs/1504.00680" />
		<imprint>
			<date type="published" when="2015-04-02">2015 Apr 2. 2016 Dec 8</date>
		</imprint>
	</monogr>
	<note>cs, stat</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Termite: Visualization Techniques for Assessing Textual Topic Models</title>
		<author>
			<persName><forename type="first">J</forename><surname>Chuang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">D</forename><surname>Manning</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Heer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the International Working Conference on Advanced Visual Interfaces</title>
				<meeting>the International Working Conference on Advanced Visual Interfaces</meeting>
		<imprint/>
	</monogr>
	<note>Internet</note>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title/>
		<idno type="DOI">10.1145/2254556.2254572</idno>
		<ptr target="http://doi.acm.org/10.1145/2254556.2254572" />
		<imprint>
			<date type="published" when="2012">2012. 2017 Mar 8</date>
			<publisher>ACM</publisher>
			<biblScope unit="page" from="74" to="77" />
			<pubPlace>New York, NY, USA</pubPlace>
		</imprint>
	</monogr>
	<note>AVI &apos;12</note>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Predicting the Political Alignment of Twitter Users</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">D</forename><surname>Conover</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Goncalves</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Ratkiewicz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Flammini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Menczer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2011 IEEE Third International Conference on Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom)</title>
				<imprint>
			<date type="published" when="2011">2011</date>
			<biblScope unit="page" from="192" to="199" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Predicting Twitter User Demographics using Distant Supervision from Website Traffic Data</title>
		<author>
			<persName><forename type="first">A</forename><surname>Culotta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">K</forename><surname>Ravi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Cutler</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Artificial Intelligence Research</title>
		<imprint>
			<biblScope unit="volume">55</biblScope>
			<biblScope unit="page" from="389" to="408" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Towards Detecting Influenza Epidemics by Analyzing Twitter Messages</title>
		<author>
			<persName><forename type="first">A</forename><surname>Culotta</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the First Workshop on Social Media Analytics</title>
				<meeting>the First Workshop on Social Media Analytics</meeting>
		<imprint/>
	</monogr>
	<note>Internet</note>
</biblStruct>

<biblStruct xml:id="b9">
	<monogr>
		<title/>
		<idno type="DOI">10.1145/1964858.1964874</idno>
		<ptr target="http://doi.acm.org/10.1145/1964858.1964874" />
		<imprint>
			<date type="published" when="2010">2010. 2016 Mar 10</date>
			<publisher>ACM</publisher>
			<biblScope unit="page" from="115" to="122" />
			<pubPlace>New York, NY, USA</pubPlace>
		</imprint>
	</monogr>
	<note>SOMA &apos;10</note>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Understanding Vaccine Refusal: Why We Need Social Media Now</title>
		<author>
			<persName><forename type="first">M</forename><surname>Dredze</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Broniatowski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">C</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">M</forename><surname>Hilyard</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">American Journal of Preventive Medicine</title>
		<imprint>
			<biblScope unit="volume">50</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="550" to="552" />
			<date type="published" when="2016-04">2016 Apr</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Carmen: A twitter geolocation system with applications to public health</title>
		<author>
			<persName><forename type="first">M</forename><surname>Dredze</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Paul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Bergsma</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Tran</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">AAAI Workshop on Expanding the Boundaries of Health Informatics Using AI (HIAI)</title>
				<imprint>
			<publisher>Citeseer</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="20" to="24" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Strategies intended to address vaccine hesitancy: Review of published reviews</title>
		<author>
			<persName><forename type="first">E</forename><surname>Dubé</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Gagnon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">E</forename><surname>Macdonald</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Vaccine</title>
		<imprint>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="issue">34</biblScope>
			<biblScope unit="page" from="4191" to="4203" />
			<date type="published" when="2015-08-14">2015 Aug 14</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Mapping vaccine hesitancy-Country-specific characteristics of a global phenomenon</title>
		<author>
			<persName><forename type="first">E</forename><surname>Dubé</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Gagnon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Nickels</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Jeram</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Schuster</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Vaccine</title>
		<imprint>
			<biblScope unit="volume">32</biblScope>
			<biblScope unit="issue">49</biblScope>
			<biblScope unit="page" from="6649" to="6654" />
			<date type="published" when="2014-11-20">2014 Nov 20</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Trust during the early stages of the 2009 H1N1 pandemic</title>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">S</forename><surname>Freimuth</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Musa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hilyard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">C</forename><surname>Quinn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Kim</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of health communication</title>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="321" to="339" />
			<date type="published" when="2014">2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">The Analysis of Spatial Association by Use of Distance Statistics</title>
		<author>
			<persName><forename type="first">A</forename><surname>Getis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">K</forename><surname>Ord</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Geographical Analysis</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="189" to="206" />
			<date type="published" when="1992-07-01">1992 Jul 1</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Measles at Disneyland, a Problem for All AgesMeasles at Disneyland</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">A</forename><surname>Halsey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Salmon</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Ann Intern Med</title>
		<imprint>
			<biblScope unit="volume">162</biblScope>
			<biblScope unit="issue">9</biblScope>
			<biblScope unit="page" from="655" to="656" />
			<date type="published" when="2015-05-05">2015 May 5</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Vaccine hesitancy: Definition, scope and determinants</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">E</forename><surname>Macdonald</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Vaccine</title>
		<imprint>
			<biblScope unit="volume">33</biblScope>
			<biblScope unit="issue">34</biblScope>
			<biblScope unit="page" from="4161" to="4164" />
			<date type="published" when="2015-08-14">2015 Aug 14</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Supervised Topic Models</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Mcauliffe</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Blei</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Advances in Neural Information Processing Systems 20</title>
				<editor>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Platt</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">D</forename><surname>Koller</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">Y</forename><surname>Singer</surname></persName>
		</editor>
		<editor>
			<persName><forename type="first">S</forename><forename type="middle">T</forename><surname>Roweis</surname></persName>
		</editor>
		<imprint/>
	</monogr>
	<note>Internet</note>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<title/>
		<author>
			<persName><forename type="first">Inc</forename><surname>Curran Associates</surname></persName>
		</author>
		<ptr target="http://papers.nips.cc/paper/3328-supervised-topic-models.pdf" />
		<imprint>
			<date type="published" when="2008">2008. 2016 Mar 17</date>
			<biblScope unit="page" from="121" to="128" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<title level="m" type="main">MALLET: A Machine Learning for Language Toolkit</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">K</forename><surname>Mccallum</surname></persName>
		</author>
		<ptr target="http://mallet.cs.umass.edu" />
		<imprint>
			<date type="published" when="2002">2002</date>
		</imprint>
	</monogr>
	<note>Internet</note>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">Disease Detection or Public Opinion Reflection? Content Analysis of Tweets, Other Social Media, and Online Newspapers During the Measles Outbreak in the Netherlands in 2013</title>
		<author>
			<persName><forename type="first">L</forename><surname>Mollema</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><forename type="middle">A</forename><surname>Harmsen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Broekhuizen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Clijnk</surname></persName>
		</author>
		<author>
			<persName><forename type="first">De</forename><surname>Melker</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Paulussen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename></persName>
		</author>
		<ptr target="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4468573/" />
	</analytic>
	<monogr>
		<title level="j">J Med Internet Res</title>
		<imprint>
			<biblScope unit="volume">17</biblScope>
			<biblScope unit="issue">5</biblScope>
			<date type="published" when="2015-05-26">2015 May 26. 2016 Mar 4</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Vaccine Refusal, Mandatory Immunization, and the Risks of Vaccine-Preventable Diseases</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">B</forename><surname>Omer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Salmon</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">A</forename><surname>Orenstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">P</forename><surname>Dehart</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Halsey</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">New England Journal of Medicine</title>
		<imprint>
			<biblScope unit="volume">360</biblScope>
			<biblScope unit="issue">19</biblScope>
			<biblScope unit="page" from="1981" to="1988" />
			<date type="published" when="2009-05-07">2009 May 7</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Opinion Mining and Sentiment Analysis</title>
		<author>
			<persName><forename type="first">B</forename><surname>Pang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Lee</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Found Trends Inf Retr</title>
		<imprint>
			<biblScope unit="volume">2</biblScope>
			<biblScope unit="issue">1-2</biblScope>
			<biblScope unit="page" from="1" to="135" />
			<date type="published" when="2008-01">2008 Jan</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<analytic>
		<title level="a" type="main">Twitter Improves Influenza Forecasting</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Paul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dredze</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Broniatowski</surname></persName>
		</author>
		<ptr target="http://currents.plos.org/outbreaks/?p=39911" />
	</analytic>
	<monogr>
		<title level="j">PLoS Currents</title>
		<imprint>
			<date type="published" when="2014">2014. 2016 Mar 5</date>
		</imprint>
	</monogr>
	<note>Internet</note>
</biblStruct>

<biblStruct xml:id="b25">
	<analytic>
		<title level="a" type="main">SPRITE: Generalizing Topic Models with Structured Priors</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Paul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dredze</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Transactions of the Association for Computational Linguistics</title>
		<imprint>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="43" to="57" />
			<date type="published" when="2015-01-20">2015 Jan 20</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b26">
	<analytic>
		<title level="a" type="main">Public acceptance of peramivir during the 2009 H1N1 influenza pandemic: implications for other drugs or vaccines under emergency use authorizations</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">C</forename><surname>Quinn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Hilyard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Castaneda-Angarita</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">S</forename><surname>Freimuth</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Disaster Med Public Health Prep</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="166" to="174" />
			<date type="published" when="2015-04">2015 Apr</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b27">
	<analytic>
		<title level="a" type="main">Public willingness to take a vaccine or drug under Emergency Use Authorization during the 2009 H1N1 pandemic. Biosecurity and bioterrorism: biodefense strategy, practice</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">C</forename><surname>Quinn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kumar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">S</forename><surname>Freimuth</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Kidwell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Musa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">and science</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="275" to="290" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b28">
	<analytic>
		<title level="a" type="main">Exploring communication, trust in government, and vaccination intention later in the 2009 H1N1 pandemic: results of a national survey. Biosecurity and bioterrorism: biodefense strategy, practice</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">C</forename><surname>Quinn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Parmer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><forename type="middle">S</forename><surname>Freimuth</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">M</forename><surname>Hilyard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Musa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">H</forename><surname>Kim</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">and science</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="96" to="106" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b29">
	<analytic>
		<title level="a" type="main">The Anthrax Vaccine and Research: Reactions from Postal Workers and Public Health Professionals</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">C</forename><surname>Quinn</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Thomas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Kumar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Biosecur Bioterror</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="321" to="333" />
			<date type="published" when="2008-12">2008 Dec</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b30">
	<analytic>
		<title level="a" type="main">Disease Table Working Group a. HIstorical comparisons of morbidity and mortality for vaccine-preventable diseases in the united states</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">W</forename><surname>Roush</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">V</forename><surname>Murphy</surname></persName>
		</author>
		<author>
			<persName><surname>Vaccine-Preventable</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">JAMA</title>
		<imprint>
			<biblScope unit="volume">298</biblScope>
			<biblScope unit="issue">18</biblScope>
			<biblScope unit="page" from="2155" to="2163" />
			<date type="published" when="2007-11-14">2007 Nov 14</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b31">
	<analytic>
		<title level="a" type="main">Influenza A (H7N9) and the Importance of Digital Epidemiology</title>
		<author>
			<persName><forename type="first">M</forename><surname>Salathé</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">C</forename><surname>Freifeld</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">R</forename><surname>Mekaru</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">F</forename><surname>Tomasulo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">S</forename><surname>Brownstein</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">New England Journal of Medicine</title>
		<imprint>
			<biblScope unit="volume">369</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page" from="401" to="404" />
			<date type="published" when="2013-08-01">2013 Aug 1</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b32">
	<analytic>
		<title level="a" type="main">Developing Age and Gender Predictive Lexica over Social Media</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sap</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Park</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Eichstaedt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">L</forename><surname>Kern</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">J</forename><surname>Stillwell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Kosinski</surname></persName>
		</author>
		<ptr target="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.672.9851" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) [Internet</title>
				<meeting>the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) [Internet</meeting>
		<imprint>
			<date type="published" when="2014">2014. 2016 Jun 8</date>
			<biblScope unit="page" from="1146" to="1151" />
		</imprint>
	</monogr>
	<note>Association for Computational Linguistics</note>
</biblStruct>

<biblStruct xml:id="b33">
	<analytic>
		<title level="a" type="main">The Use of Twitter to Track Levels of Disease Activity and Public Concern in the U.S. during the Influenza A H1N1 Pandemic</title>
		<author>
			<persName><forename type="first">A</forename><surname>Signorini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">M</forename><surname>Segre</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">M</forename><surname>Polgreen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PLOS ONE</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">5</biblScope>
			<biblScope unit="page">e19467</biblScope>
			<date type="published" when="2011-05-04">2011 May 4</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b34">
	<analytic>
		<title level="a" type="main">Towards Real-Time Measurement of Public Epidemic Awareness: Monitoring Influenza Awareness through Twitter</title>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">C</forename><surname>Smith</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Broniatowski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">J</forename><surname>Paul</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Dredze</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">AAAI Spring Symposium on Observational Studies through Social Media and Other Human-Generated Content</title>
				<meeting><address><addrLine>Stanford, CA</addrLine></address></meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b35">
	<analytic>
		<title level="a" type="main">The Psychological Meaning of Words: LIWC and Computerized Text Analysis Methods</title>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">R</forename><surname>Tausczik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">W</forename><surname>Pennebaker</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Language and Social Psychology</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="24" to="54" />
			<date type="published" when="2010-03-01">2010 Mar 1</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b36">
	<monogr>
		<title level="m" type="main">Pertussis Epidemic -California</title>
		<author>
			<persName><forename type="first">K</forename><surname>Winter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Carol</forename><surname>Glaser</surname></persName>
		</author>
		<author>
			<persName><forename type="first">James</forename><surname>Watt</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kathleen</forename><surname>Harriman</surname></persName>
		</author>
		<ptr target="https://www.cdc.gov/mmwr/preview/mmwrhtml/mm6348a2.htm" />
		<imprint>
			<date type="published" when="2014">2014. 2014</date>
		</imprint>
	</monogr>
	<note>Internet. cited 2017 Mar 8</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
