<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Visualization of Clandestine Labs from Seizure Reports: Thematic Mapping and Data Mining Research Directions</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">William</forename><forename type="middle">H</forename><surname>Hsu</surname></persName>
							<email>bhsu@ksu.edu</email>
							<affiliation key="aff0">
								<orgName type="department">Department of Computing and Information Sciences</orgName>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Max</forename><surname>Lu</surname></persName>
							<email>maxlu@ksu.edu</email>
							<affiliation key="aff1">
								<orgName type="department">Department of Geography</orgName>
								<orgName type="institution">Kansas State University Manhattan</orgName>
								<address>
									<postCode>66506</postCode>
									<region>KS</region>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Visualization of Clandestine Labs from Seizure Reports: Thematic Mapping and Data Mining Research Directions</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">C5BB35E8392214A0D356EE4D086CAB92</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-23T21:55+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>H.3.3 [Information Storage and Retrieval]: Information search and retrieval -clustering</term>
					<term>relevance feedback</term>
					<term>selection process; H.2.8 [Database Management]: Database applications -data mining</term>
					<term>spatial databases and GIS Algorithms</term>
					<term>Experimentation</term>
					<term>Human Factors information extraction</term>
					<term>information visualization</term>
					<term>event extraction</term>
					<term>topic modeling</term>
					<term>geoinformatics</term>
					<term>spatiotemporal information retrieval</term>
					<term>data mining</term>
					<term>machine learning</term>
					<term>time series</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The problem of spatiotemporal event visualization based on reports entails subtasks ranging from named entity recognition to relationship extraction and mapping of events. We present an approach to event extraction that is driven by data mining and visualization goals, particularly thematic mapping and trend analysis. This paper focuses on bridging the information extraction and visualization tasks and investigates topic modeling approaches. We develop a static, finite topic model and examine the potential benefits and feasibility of extending this to dynamic topic modeling with a large number of topics and continuous tome. We describe an experimental test bed for event mapping that uses this end-to-end information retrieval system, and report preliminary results on a geoinformatics problem: tracking of methamphetamine lab seizure events across time and space.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">INTRODUCTION</head><p>In this paper, we address the problem of event visualization based on structured data, in the form of time-referenced and georeferenced relational tuples, and on unstructured data, in the form of free text. Information extraction systems based on named entity recognition (NER) and relationship extraction have enabled detection of events mentioned in free text and extraction of structured tuples describing the location, time, along with other attributes of an event. Identifying hotspots and trends, however, remains an open problem. One limitation is the absence of ground truth for high event activity. In some cases this is due to a lack of well-defined criteria for activity and relevance, while in some it is due to limitations in existing annotation interfaces.</p><p>We first present a basic approach to event visualization. Our general framework makes use of mapping tools such as Google Maps <ref type="bibr" target="#b1">[1]</ref>, the Google web toolkit, and timeline visualization tools such as MIT SIMILE <ref type="bibr" target="#b2">[2]</ref>. It also builds upon previous work on gazetteer-based event recognition and syntactic patterns for semantic relationship detection. Next, we show how a system developed originally for visualization of animal disease outbreaks reported in online news documents can be adapted to display reports of methamphetamine lab seizures compiled by regional law enforcement. We briefly outline the development of a domain-specific data description language for increased portability and ease of information integration. We then discuss the role of topic modeling and information retrieval approaches in filtering and ranking events.</p><p>A key technical contribution of this work is the application of topic modeling algorithms in order to compute the posterior probability of a particular spatial location, time unit, or combination given the type of event, which is treated as a topic. This allows the data to be interrogated systematically in order to display geographic regions that are more prone to events of interest. A potential application of this is to construct a time composite map of administrative divisions within a state or province, or a spatial composite time series by month or year, showing active regions. These can be visualized using a choropleth map: a map in which regions (geographic regions in this case) are coded by colors or grayscale intensity levels. These represent a variable of interest -in this case, event frequency. Finally, the ability to estimate marginal likelihoods over locations and times given the event type parameters can also be used to filter events, to display only those that fall within a specified frequency range. For example, the system can be configured to search for seizures of methaphetamine production labs in counties or districts where they are common or rare.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">EVENT VISUALIZATION TASKS 2.1 Spatiotemporal Event Extraction</head><p>The goal of event extraction is to identify phenomena related to specific actions, occurrences, relationships, or entities. For example, a positive test for a contagious animal disease on a farm is an event that may be tied to an epidemic and identified post hoc as indicating an outbreak. The seizure of equipment from a methamphetamine production facility, or of waste products from a dump site, is an example of an event in the domain of drug enforcement. Events that can be localized in space and time form the basis of spatiotemporal event extraction.</p><p>In the domains of veterinary epidemiology and drug enforcement, decision support systems are typically based on spatiotemporal event extraction and visualization. When events are already available in structured form, they are usually compiled manually from investigative reports by local or national authorities: animal health agencies in the case of veterinary epidemiology and state of national bureaus of investigation in the case of drug seizures. By contrast, unstructured data often comes into the decision support system as the result of a web crawl based on domain-specific resource identifiers: seeds (URLs) and search terms. Federated displays and user interfaces for these decision support systems often combine event data from structured data repositories with data extracted from free text. This entails data integration challenges such as disambiguation, deduplication and identity uncertainty for entities and events; expansion of existing named entity sets from gazetteers (lists of known entities); and inference of attributes for relationships representing events and entities representing actors and objects. These topics are beyond the scope of this paper; we refer the interested reader to existing literature on the current state of the field in domain-focused relationship extraction.</p><p>Instead, we suppose here that some preliminary classification step has already taken place to identify the entity that serves as anchor point for an event, and that further classification or inference has identified a putative time and location for the event. Whether this is accomplished through supervised inductive learning from text corpora or as a result of basic pattern matching, our starting point is a candidate tuple to be analyzed, considered for presentation to a decision-maker or search user, and if selected, visualized in the context of events of interest.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2">Georeferencing and Map Visualization</head><p>Mapping out spatially-referenced events, even using structured data sources, entails a straighforward but data-intensive georeferencing task: looking up the coordinates (latitude and longitude) of street addresses and postal codes where events are reported to have occurred.</p><p>The resulting coordinates are placed into a spatial database management system (SDBMS) for visualization using software libraries and services such as Google Maps, as shown in Figure <ref type="figure" target="#fig_0">1</ref>. For this purpose, we developed two alternative access layers with a unified representation and geographic information system (GIS) data model. The first layer is based on Google's Keyhole Markup Language (KML) and a file-based application programmer interface (API), while the second layer is based on a PHP interface to a MySQL database implementing the KML schema. Our front-end application, TimeMap, can be configured to use either layer.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3">Timeline Visualization</head><p>Figure <ref type="figure" target="#fig_0">1</ref> depicts the data integration between the map and timeline visualization subsystems. The seizure event in April, 2010 is represented on the map by a pop-up note, on the monthly scale timeline (upper right) by a circled dot, and on the yearly scale timeline (lower right) by a circled point.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4">Thematic Mapping</head><p>The object of thematic mapping is to depict phenomena and trends in a geospatial context. Toward this end, we have added a mapping overlay to the TimeMap framework that allows map transformation such as superimposition of data and transparency layers to be applied using the Google Maps API. This includes choropleth maps with dynamically computable color palettes.  More importantly for information retrieval applications, it does not provide any drill-down interface cf. HealthMap <ref type="bibr" target="#b4">[4]</ref> or similar event visualization services. One of the reasons for the development of the geospatial visualization components of TimeMap was to facilitate information retrieval and multimodal information access using well-established visualization techniques such as thematic mapping and small multiples.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.5">Spatial Time Series Prediction</head><p>A final rationale for the TimeMap visualization framework arises from domain-specific data mining objectives in epidemiology and criminology. Governmental agencies devoted to agriculture, public health, and law enforcement often encounter a need for predictive analytics tools to assist with decision-making in both public policy and intervention, and with civic outreach. In the domain of public health, tools such as HealthMap <ref type="bibr" target="#b4">[4]</ref> have begun to do for individual citizens what more general crime-mapping systems are intended do for search users: provide relevance filters based on criteria related to incident frequency, corroborative reporting, and significance.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">TOPIC MODELING</head><p>As mentioned in Section 2.1, named entity recognition combined with date and location can provide a means of extracting a stream of events and updates from news stories. This also holds for microblogs and other social media. In order to classify new events and detect the emergence or revival of new event-related topics, however, a mechanism for monitoring update streams is needed. This requires a more flexible topic model than the fixed or expandable sets of named entities used for structured information extraction. Furthermore, the frequency and semantic heterogeneity of event reporting from multiple media outlets, even on the web alone, may require enrichment of the parameters beyond those used in classical generative models for information retrieval. We now examine possible extensions to these document clustering models.  </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1">Static Topic Models</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2">Dynamic Topic Models</head><p>As our preliminary experiments with historical data on both epizootic disease outbreaks and meth lab seizures showed, news flashes do not admit the kind of stationarity assumed in Figure <ref type="figure" target="#fig_2">3</ref>. Specifically, the latent variables of our topic model change over time as a result of concept drift and the arrival of new topics, which we can think of as a birth-death process tied to observable events. <ref type="bibr" target="#b5">Blei and Lafferty (2006)</ref> proposed a dynamic topic model with fixed topic count K in which each topic's word distribution and popularity are linked over time. <ref type="bibr" target="#b5">[5]</ref> Meanwhile, older topic modeling algorithms such as Latent Semantic Analysis (LSA) <ref type="bibr" target="#b6">[6]</ref> that permit K to vary suffer from problems such as proximity of different senses of a polysemous word, while variants Probabilistic Latent Semantic Analysis (PLSA) <ref type="bibr" target="#b7">[7]</ref> exhibit parameter growth linear in the number of documents D.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.1">Discrete Time, Infinite Topic</head><p>Ahmed and Xing (2010) proposed a partial solution to this problem by introducing an infnite Dynamic Topic Model (iDTM) that allows for an unbounded number of topics and an evolving representation of topics according to a Markovian dynamics. <ref type="bibr">[8]</ref> They analyzed the birth and evolution of topics in the neural computation community based on the Neural Information Processing Systems (NIPS) conference proceedings. Their model evolved topics over discrete epochs (time units). All proceedings of a conference meeting fall into the same epoch. This model does not suit less "bursty topic" applications such as meth lab seizures or disease outbreaks, which are asynchronous whether reported in the news or in local law enforcement records.</p><p>For topic modeling applications such as event visualization, such discrete time model may be too brittle. An extension to continuous time will give it the needed flexibility to account for variability and change in temporal granularity.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.2">Continuous Time, Finite Topic</head><p>Meanwhile, <ref type="bibr" target="#b9">Wang et al. (2008)</ref> proposed a continuous time dynamic topic model that uses Brownian motion to simulate the evolution of topics over time. <ref type="bibr" target="#b9">[9]</ref> Although this model uses a novel, sparse variational Kalman filtering algorithm for fast inference, the number of topics it samples from is bounded, which severely limits its application in news feed storyline creation and article aggregation. When the number of topics covered by the news feed is fewer than the pre-tuned number of topics K specified in the model, similar stories will appear under different headlines. On the other hand, if the number of topics covered becomes greater than the preset number of topics, topics and headlines will get conflated.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.3">Proposed: Continuous Time, Infinite Topic</head><p>To accommodate the needs of non-bursty news updates in domains such as our event visualization domains, we propose a hybridization of the infinite topic model and the continuous time model that combines a hierarchical Dirichlet process (DP) for dynamic topic abstraction and refinement with Brownian motion to capture stochastic topic drift. <ref type="bibr" target="#b10">[10]</ref> We can use a variational inference algorithm such as variational Kalman filtering to factorize the variational distribution over latent variables:</p><p>where  is the word distribution over topics and 1: , 1: ,1: is the word distribution over topics for time 1:T, topic z 1:T and word index 1:N, where N is the size of the document lexicon. Volkova &amp; Hsu (2010) describes earlier work on computational information and knowledge management (CIKM) and information extraction. <ref type="bibr" target="#b11">[11]</ref> This research, motivated by a need to visualize digests of news articles on animal disease outbreaks as shown in Figure <ref type="figure" target="#fig_4">4</ref>, led to the earliest prototype of our event visualization system, implemented using Google Maps and MIT SIMILE. This system used syntactic detectors for semantic equivalence assertions <ref type="bibr">(Volkova et al., 2010)</ref>. <ref type="bibr" target="#b12">[12]</ref> </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">APPLICATION TEST BED 4.1 Prior Work: Veterinary Epidemiology</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.2">Kansas Meth Lab Seizures</head><p>A more recent version of the event visualization system is represented in the meth lab application described in Sections 1 and 2. This system forms the test bed for both the visualization techniques described in Section 2 and the topic modeling techniques intended to raise the precision of the federated system by improving relevance filtering and ranking.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">EXPERIMENTAL EVALUATION</head><p>Figure <ref type="figure" target="#fig_5">5</ref> shows some baseline descriptive statistics for meth lab seizures from the test bed discussed in Sections 2 and 4.2. In our topic model, topics are event types, which are 50 different types of methamphetamine lab seizures. Given the text of a lab seizure report or a news story of the event with its date and location as a prior, we can use the topic model shown in Figure <ref type="figure" target="#fig_2">3</ref> to evaluate the likelihood of the event given the prior. Events with likelihood value above a threshold are considered highly likely to occur given the location and time of the event. Given a collection of police lab seizure reports with date and location, we ran the topic model on it and evaluated the topic composition of each report in the collection. Table <ref type="table" target="#tab_0">1</ref> shows the topic "Abandoned dump site" proportion per seizure reports for four Kansas counties over six years. If we set a likelihood threshold value of 0.02, then for year 2000 the event will be marked on the map for Cowley and Reno counties. The event will not get marked on the map for year 2002 for any of these four counties, and for year 2004 will be marked only in Reno County, and so on.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6.">CONTINUING AND FUTURE WORK</head><p>In continuing work, we are validating the baseline LDA output by using it to filter and rank search results the seizure database represented in Figure <ref type="figure" target="#fig_5">5</ref>  </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 .</head><label>1</label><figDesc>Figure 1. Map and timeline visualization of meth lab seizure events (2004-2011) using Google Maps and MIT SIMILE. Seizures from the first half of 2010 are depicted, with one event selected.</figDesc><graphic coords="2,317.88,273.60,240.00,186.00" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 2 .Figure 2</head><label>22</label><figDesc>Figure 2. Choropleth map of 2004 meth lab seizures in the USA. (Associated Press, 2005)Figure2is a choropleth map depicting meth lab seizures by state in 2004.<ref type="bibr" target="#b3">[3]</ref> This map, which was published by the Associated Press and featured on the ABC News web site in 2005, does not take state population or intrastate population distribution into account. More importantly for information retrieval applications, it does not provide any drill-down interface cf. HealthMap<ref type="bibr" target="#b4">[4]</ref> or similar event visualization services. One of the reasons for the development of the geospatial visualization components of TimeMap was to facilitate information retrieval and multimodal information access using well-established visualization techniques such as thematic mapping and small multiples.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head>Figure 3 .</head><label>3</label><figDesc>Figure 3. Plate model for Latent Dirichlet Allocation (LDA) in a system with an N-word lexicon, D documents, and K topics.</figDesc><graphic coords="3,54.00,257.22,240.00,55.50" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 3</head><label>3</label><figDesc>Figure 3 illustrates the kind of generative Bayesian topic model widely used to cluster static collections of documents. Here,  is a topic distribution for a document, while Z is the topic sampled from  for word W.  is a Markov matrix giving the word distribution per topic, and  is the Dirichlet prior parameter used in generating that matrix.</figDesc><graphic coords="3,335.16,358.08,204.78,87.72" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 4 .</head><label>4</label><figDesc>Figure 4. Kansas Information Integration and Analysis Center (KIIAC) for epizootic diseases.</figDesc><graphic coords="3,317.88,521.76,239.22,164.22" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_5"><head>Figure 5 .</head><label>5</label><figDesc>Figure 5. Column graph of the 4942 total meth lab seizures in Kansas, 2000 -2011, by county (104 with seizures).</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 . Topic "Abandoned dump site" proportion per seizure reports for four Kansas counties.</head><label>1</label><figDesc></figDesc><table><row><cell></cell><cell>2000</cell><cell>2002</cell><cell>2004</cell><cell>2006</cell><cell>2008</cell><cell>2010</cell></row><row><cell>Cowley</cell><cell>0.0345</cell><cell>0.0188</cell><cell>0.0188</cell><cell>0.0182</cell><cell>0.0001</cell><cell>0.0175</cell></row><row><cell cols="2">Crawford 0.0185</cell><cell>0.0172</cell><cell>0.0188</cell><cell>0.0185</cell><cell>0.0182</cell><cell>0.0188</cell></row><row><cell cols="2">Cherokee 0.0185</cell><cell>0.0175</cell><cell>0.0175</cell><cell>0.0178</cell><cell>0.0181</cell><cell>0.0333</cell></row><row><cell>Reno</cell><cell>0.0350</cell><cell>0.0172</cell><cell>0.0344</cell><cell>0.0166</cell><cell>0.0527</cell><cell>0.0172</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head></head><label></label><figDesc>and Table1. Relevance feedback from multiple subject matter experts is being used to evaluate both LDA and DP-based topic models.<ref type="bibr" target="#b11">[11]</ref> </figDesc><table><row><cell>450</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>400</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>350</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>300</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>250</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>200</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>150</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>100</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>50</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>0</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>EDWARDS</cell><cell>GRANT</cell><cell>WICHITA</cell><cell>LOGAN</cell><cell>WALLACE</cell><cell>ELK</cell><cell>WOODSON</cell><cell>ELLSWORTH</cell><cell>RILEY</cell><cell>OTTAWA</cell><cell>SUMNER</cell><cell>PAWNEE</cell><cell>DOUGLAS</cell><cell>JACKSON</cell><cell>SEDGWICK</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" xml:id="foot_0">Presented at EuroHCIR2012. Copyright © 2012 for the individual papers by the papers' authors. Copying permitted only for private and academic purposes. This volume is published and copyrighted by its editors.</note>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="7.">ACKNOWLEDGMENTS</head><p>Thanks to Loretta Wyrick-Severin (Kansas Bureau of Investigation) for assistance with public information requests and to Surya Teja Kallumadi, Tim Weninger, Svitlana Volkova, John Drouhard, Landon Fowles, and Andrew Berggren for development work on TimeMap.</p></div>
			</div>

			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<title/>
		<author>
			<persName><surname>References</surname></persName>
		</author>
		<imprint/>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<monogr>
		<title level="m" type="main">Google Maps</title>
		<author>
			<persName><surname>Google</surname></persName>
		</author>
		<ptr target="https://maps.google.com" />
		<imprint>
			<date type="published" when="2010-06-29">2010. June 29, 2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<ptr target="http://simile.mit.edu/wiki/Main_Page" />
		<title level="m">Semantic Interoperability of Metadata and Information in unLike Environments</title>
				<imprint>
			<date type="published" when="2008-06-29">2008. June 29, 2012</date>
		</imprint>
		<respStmt>
			<orgName>Massachusetts Institute of Technology</orgName>
		</respStmt>
	</monogr>
	<note>from SIMILE</note>
</biblStruct>

<biblStruct xml:id="b3">
	<monogr>
		<title level="m" type="main">Meth Stats</title>
		<ptr target="http://abcn.ws/NYggv5" />
		<imprint>
			<date type="published" when="2005-11-01">2005. November 1. June 29, 2012</date>
			<publisher>Associated Press</publisher>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<monogr>
		<title level="m" type="main">HealthMap -Global Disease Alert Mapping System</title>
		<author>
			<persName><forename type="first">J</forename><surname>Brownstein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Feifeld</surname></persName>
		</author>
		<ptr target="http://www.healthmap.org" />
		<imprint>
			<date type="published" when="2007-01-25">2007. January 25, 2010</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Dynamic topic models</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Blei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Lafferty</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of ICML 2006</title>
				<meeting>ICML 2006</meeting>
		<imprint>
			<publisher>ACM Press</publisher>
			<date type="published" when="2006">2006</date>
			<biblScope unit="page" from="113" to="120" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">An Introduction to Latent Semantic Analysis</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">K</forename><surname>Landauer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><forename type="middle">W</forename><surname>Foltz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Laham</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Discourse Processes</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="page" from="259" to="284" />
			<date type="published" when="1998">1998</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Unsupervised Learning by Probabilistic Latent Semantic Analysis</title>
		<author>
			<persName><forename type="first">T</forename><surname>Hofmann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Machine Learning</title>
				<imprint>
			<date type="published" when="2001">2001</date>
			<biblScope unit="volume">42</biblScope>
			<biblScope unit="page" from="177" to="196" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Timeline: A dynamic hierarchical dirichlet process model for recovering birth/death and evolution of topics in text stream</title>
		<author>
			<persName><forename type="first">A</forename><surname>Ahmed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">P</forename><surname>Xing</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of UAI 2010</title>
				<meeting>UAI 2010</meeting>
		<imprint>
			<publisher>AUAI Press</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="20" to="29" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Continuous time dynamic topic models</title>
		<author>
			<persName><forename type="first">C</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Blei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Heckerman</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of UAI 2008</title>
				<meeting>UAI 2008</meeting>
		<imprint>
			<publisher>AUAI Press</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="579" to="586" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Introduction to probabilistic topic models</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Blei</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Communications of the ACM</title>
		<imprint>
			<biblScope unit="volume">55</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="77" to="84" />
			<date type="published" when="2012-04">2012. April</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Computational knowledge and information management in veterinary epidemiology</title>
		<author>
			<persName><forename type="first">S</forename><surname>Volkova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">H</forename><surname>Hsu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of ISI 2010</title>
				<meeting>ISI 2010</meeting>
		<imprint>
			<publisher>IEEE Press</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="120" to="125" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Boosting Biomedical Entity Extraction by using Syntactic Patterns for Semantic Relation Discovery</title>
		<author>
			<persName><forename type="first">S</forename><surname>Volkova</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Caragea</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">H</forename><surname>Hsu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Drouhard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Fowles</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of WI-IAT</title>
				<meeting>WI-IAT</meeting>
		<imprint>
			<publisher>IEEE Press</publisher>
			<date type="published" when="2010">2010. 2010</date>
			<biblScope unit="page" from="272" to="278" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
