<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Topic-aware Network Visualisation to Explore Large Email Corpora</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Tim</forename><surname>Repke</surname></persName>
							<email>tim.repke@hpi.de</email>
							<affiliation key="aff0">
								<orgName type="institution">Hasso Plattner Institute Potsdam</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Ralf</forename><surname>Krestel</surname></persName>
							<email>ralf.krestel@hpi.de</email>
							<affiliation key="aff1">
								<orgName type="institution">Hasso Plattner Institute Potsdam</orgName>
								<address>
									<country key="DE">Germany</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Topic-aware Network Visualisation to Explore Large Email Corpora</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073)</idno>
					</monogr>
					<idno type="MD5">A5FC0045A7F94DCEC276EE550950D781</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T23:14+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Nowadays, more and more large datasets exhibit an intrinsic graph structure. While there exist special graph databases to handle ever increasing amounts of nodes and edges, visualising this data becomes infeasible quickly with growing data. In addition, looking at its structure is not sufficient to get an overview of a graph dataset. Indeed, visualising additional information about nodes or edges without cluttering the screen is essential. In this paper, we propose an interactive visualisation for social networks that positions individuals (nodes) on a two-dimensional canvas such that communities defined by social links (edges) are easily recognisable. Furthermore, we visualise topical relatedness between individuals by analysing information about social links, in our case email communication. To this end, we utilise document embeddings, which project the content of an email message into a high dimensional semantic space and graph embeddings, which project nodes in a network graph into a latent space reflecting their relatedness.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">INTRODUCTION</head><p>In our modern information society we produce substantial amounts of data each day. A large portion of it comes from the communication on social media platforms or through emails. Special graph databases enable the efficient storage of these large communication networks and provide interfaces to query or analyse the data. Visualising networks in their entirety on the other hand is a very challenging task. Users investigating a communication network want to find information about when does who communicate with whom about what. These kind of networks can be found in many different shapes. Modern social networks, such as Twitter or Facebook exhibit similar structures as classic, offline social networks <ref type="bibr" target="#b19">[20]</ref>. We investigate another type of social network: a collection of emails.</p><p>Given the communication data over a year or more, it is practically impossible to gain an overview or quick insights into the latent network structure with a basic approach as shown in 1. Also, in such a traditional network visualisation, information about the content of messages sent between individuals is lost. Besides these traditional systems, more exotic approaches use the metaphor of geographical maps <ref type="bibr" target="#b16">[17]</ref> to visualise networks, for example using topology to reflect connectivity of densely connected social communities. The map analogy can also be used to visualise the contents of documents by embedding them into a high dimensional semantic space <ref type="bibr" target="#b14">[15]</ref> and projecting it on the map as a document landscape. In order to highlight how relationships form and change based on the interactions, the metaphor of a growing tree ca be used (ContactTrees <ref type="bibr" target="#b17">[18]</ref>). Although this reflects temporal aspects of dynamic networks well, it focuses on one person as the root, thus an overview of the entire network is lost. CactusTrees <ref type="bibr" target="#b5">[6]</ref> on the other hand represent hierarchical structures with the goal of untangling overlaid bundles of intersecting edges, making distant connections more apparent. As higher order dependencies may get lost in traditional visualisations, HoNVis <ref type="bibr" target="#b20">[21]</ref> adds nodes to encode dependencies in chains of interactions. Usually, a communication network has many nodes and overlapping connections already, so Yang et al. <ref type="bibr" target="#b22">[23]</ref> rather focus on discovering overlapping cores to improve the identification of community boundaries to highlight global latent structures. Similarly, Gronemann et al. <ref type="bibr" target="#b10">[11]</ref> use the metaphor of islands and hills to visualise clustered graphs, making densely connected communities clearly noticeable. The edges are bundled and follow valleys of the resulting topology, thus making relationships between other communities hard to follow. MapSets <ref type="bibr" target="#b6">[7]</ref> assume a graph that was laid out using embeddings reflecting communities. An algorithm then draws regions around clusters of nodes, such that the bounding shapes are contiguous and non-overlapping, but yet abstract. Another approach to visualise networks at full scale is to aggregate nodes based on their spatial distribution and thereby allowing for a simple exploration with contour lines and heatmap overlays to emphasise latent structures as proposed by Hildenbrand et al. <ref type="bibr" target="#b12">[13]</ref>.</p><p>Document visualisation aims to visualise the content, such that users gain quick insights into topics, latent phrases, or trends. Tiara <ref type="bibr" target="#b21">[22]</ref> extracts topics and derives time-sensitive keywords to depict evolving subjects over time as stacked plots. Other approaches project documents into a latent space, using topic models or embeddings. Creating scatter-plots of embedded documents of a large corpus may result in a very dense and unclear layout, so Chen et al. <ref type="bibr" target="#b3">[4]</ref> developed an algorithm to reduce overfull visualisations by picking representative documents. A different approach is taken by Fortuna et al. <ref type="bibr" target="#b7">[8]</ref>, who do not show documents directly, but generate a heatmap of the populated canvas and overlay it with salient phrases at more densely populated areas from the underlying documents in that region. Friedl et al. <ref type="bibr" target="#b9">[10]</ref> extend that concept by drawing clear lines between regions and colouring them. They also add edges between salient phrases based on co-occurrences in the texts. Most recently Cartograph <ref type="bibr" target="#b18">[19]</ref> was proposed, which is visually very similar to previous approaches, but uses pre-rendered information of different resolution and map technology to enable a responsive interactive visualisation. Regions are coloured based on underlying ontologies from a knowledge-base.</p><p>Our goal is to merge approaches for network and document visualisations in one interactive user interface. This means to integrate multiple dimensions of email datasets including time, interactions, users and topics into a 2D map representation. Giving an overview over latent structures and topics in one map may significantly improve the exploration of a corpus by users A specific application scenario that could benefit from such integrated, interactive visualisations is the analysis of large, unstructured, heterogeneous data collections. Data-driven journalism <ref type="bibr" target="#b4">[5]</ref> often has to deal with leaked, unstructured, very heterogeneous data, e.g. in the context of the Panama Papers, where journalists needed to untangle and order huge amounts of information, search entities, and visualise found patterns <ref type="bibr" target="#b2">[3]</ref>. Similar datasets are of interest in the context of computational forensics <ref type="bibr" target="#b8">[9]</ref>. Auditing firms and law enforcement need to sift through huge amounts of data to gather evidence of criminal activity, often involving communication networks and documents <ref type="bibr" target="#b13">[14]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">INTERACTIVE VISUALISATION</head><p>Systems for document exploration largely vary in what they display and how users interact with them. This depends partly on the available raw data, but also on information extracted from pre-processing or enrichment with external sources. 1 shows a basic visualisation of the network graph extracted from an email corpus. Although it is an improvement over only listing connections, large densely connected graphs quickly become hard to read and information about the email contents is lost.</p><p>Exploring document collections can be seen as a top-down approach, where the system provides abstract overviews of the entire document collection and users incrementally refine the search, narrowing the results to just a few documents of interest. Such a top-down approach may help users without prior knowledge to get a sense for the data by visualising high level latent structures of communication networks or the topical distributions.</p><p>In the scope of this work we primarily consider documents to be emails or data attached to them. The sender, recipients, time, and content can directly be extracted from the raw data. We call these -and results from further processingdimensions that can be visualised. From the contents one may infer named entities, topics, embeddings, or salient phrases, while the communication network spanned by sender-recipient pairs can be used to detect salient structures and hierarchies. The temporal information enables the previously mentioned data to be analysed over time to detect evolving or changing patterns.</p><p>There are numerous ways to visualise each dimension on its own or in combination with others. The requirement of a dimension and its priority in a visualisation is dictated by the system objective. From the wide range of possibilities, we strive for a system which supports the exploration of a large collection of documents without any prior knowledge about its content and individuals involved.</p><p>In our system, we use the names and email addresses of senders and recipients (individuals), communication network, semantic vector representations of email contents, and as part of an overlay the timestamps of emails and propose a graph layout over a document landscape that visually describes who talks with whom about what at a given time period.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">SYSTEM ARCHITECTURE</head><p>Visualising communication networks in a topic-aware fashion to explore documents and salient structures is not straightforward Different layout objectives may produce contradicting results and the challenges of processing big data need to be addressed <ref type="bibr" target="#b1">[2]</ref>. In this section, we describe algorithmic approaches behind the system we are working on. For a discussion of engineering aspects on how to store, serve, and render the map-like data, we refer to the Cartograph stack <ref type="bibr" target="#b18">[19]</ref>, as we will focus on the process how to get the information that the map is generated from.</p><p>We visualise the embedded emails as dots in a two-dimensional landscape in which individuals are placed as nodes connected by edges. All emails between two individuals are reduced into one edge reducing visual complexity and making it easier to detect salient structures. However, that comes with the trade-off that nodes and edges cannot be perfectly placed in the landscape to cover all semantic aspects of the communication between them, but rather an estimate. Our very early prototype placed some individuals with no dominant topic in a crowded area in the centre of the landscape as shown in 2, where colours of opaque dots for emails correspond to that of the sender. Although the network visualisation at this point does not make connections more clear than in 1, users can already distinguish individuals with similar or unrelated topics.</p><p>Our proposed algorithm to find a stable network layout has three stages, namely an (i) initialisation phase which creates the landscape and roughly places nodes and connections, an (ii) update phase which iteratively updates the node placement towards a better fit, and finally a (iii) post processing phase where edges become splines to make latent structures more clear and a map topology is added.</p><p>Initialising the Landscape. To generate the document landscape, we first process the network graph to roughly determine regions, where documents will be placed. Therefore we apply node2vec <ref type="bibr" target="#b11">[12]</ref> to the communication network and embed each individual's node. We separate the graph into communities P i ⊂ P using the kernel density of the resulting populated space at Figure 2: Rendered prototype output after landscape initialisation without prior community segmentation threshold κ, where a higher κ results in more, but smaller communities. For each community P i , pairwise neighbourhood similarities are calculated using euclidean distance between nodes, forming the triangular matrix S i , where s kl is the similarity between p k , p l ∈ P i .</p><p>Furthermore, we train document embeddings <ref type="bibr" target="#b14">[15,</ref><ref type="bibr" target="#b24">25]</ref> on all emails and use them to infer high dimensional semantic vector representations. Let M i be the set of emails that originated in community P i . For each email m ∈ M i , the dimensionality is reduced using t-SNE <ref type="bibr" target="#b15">[16]</ref>, which retains possible semantic clusterings of documents in the higher dimensional space. The resulting two-dimensional vectors are then placed as dots on the map using the centre of embedded network communities as the respective origin, whereas the size is determined by the number of related individuals.</p><p>We also initialise communication network's layout. Thereby, the staring position of a node representing an individual is determined by the normalised sum of two-dimensional vectors of all emails he or she has sent or received. This way, we implicitly group semantically related individuals into communities as frequent communication biases this normalised sum. Straight edges are added between the nodes if the respective individuals exchanged emails. Note, that many edges may only represent a small number of emails. Applying a variable threshold σ can reduce the computational load in later stages, as these edges will not impact the overall layout very much. They can be added again as the user requests a detailed visualisation by zooming in or through other interactions.</p><p>In the algorithm's second stage, we iteratively try to improve the layout of the communication network by finding a balance between the closeness of nodes to semantic context and densely connected neighbourhoods a node belongs to. Therefore, for each individual p j ∈ P we use linear regression to fit a line m p j though all two-dimensional vectors of emails he or she has sent or received. As a node is placed near this line, it remains in a semantically good position.</p><p>Adjusting the Network Layout. The first stage of our proposed algorithm produces a fixed document landscape and roughly fits the communication network on top. We now aim to incrementally adapt the layout of the graph to better reflect salient structures in the network while keeping each individual's node close to the reflective semantic area in the landscape.</p><p>Therefore we define a score quantifying how well the current layout fits these objectives:</p><formula xml:id="formula_0">p i ∈P η d (p i , m p i ) + p j ∈P θ s i j − d (p i , p j )<label>(1)</label></formula><p>where d (•, •) is the distance between two nodes (zero if no connection exists) or shortest distance from a node to its ideal line.</p><p>To adjust the layout towards either a better semantic or structural fit, we introduce parameters θ and η.</p><p>In order to minimise 1, we use stochastic gradient descent. In each iteration step, we can derive the direction and magnitude each node should be moved towards a better semantic fit and closer proximity to it's neighbourhood in the network.</p><p>The semantic gradient for p j ∈ P is defined by</p><formula xml:id="formula_1">⃗ δ s j := (p m j − p j ) ∥p m j − p j ∥<label>(2)</label></formula><p>where p m j is the closest point on m p j to p j and ∥•∥ denotes the euclidean norm, while neighbourhood gradient is defined by</p><formula xml:id="formula_2">⃗ δ n j := p k ∈P p j ∼p k (p j − p k ) ∥p j − p k ∥ − s jk<label>(3)</label></formula><p>where p j ∼ p k denotes that an edge exists between p j and p k . With the definitions in 2 and 3, we can formulate the update vector ⃗ δ j for node p j ∈ P as</p><formula xml:id="formula_3">⃗ δ j := ξ θ ⃗ δ n j + η ⃗ δ s j (<label>4</label></formula><formula xml:id="formula_4">)</formula><p>where ξ is the learning rate and θ, η as before parameters to weight between a better semantic or neighbourhood fit. Most likely, complex network structures might prevent the stochastic gradient descent to find a stable minimum, so the score of the objective function should be monitored or intermediate layouts be visually evaluated to determine a satisfactory result.</p><p>Post Processing. Lastly, we use the post processing stage to enhance the readability of our visualisation. Densely connected communities in the graph are potentially hard to read, thus we apply edge bundling <ref type="bibr" target="#b0">[1]</ref> to visually clear latent structures. We also apply MapSets <ref type="bibr" target="#b6">[7]</ref> to separate the regions for each community. Since semantically similar emails may appear in different communities, we apply colouring based on clusters in the original global document embedding space to retain this aspect. Choosing the colours depends on the number of latent topics that should be depicted <ref type="bibr" target="#b23">[24]</ref>. If the topic number exceeds 25-30 topics, grouping topics and allowing for zooming within a two-level topichierarchy ensures distinguishable colors for up to 10 subtopics (25 × 10 = 25). In order to represent temporal aspects of the data, we calculate the kernel density of the document landscape for fixed time-intervals, which can be used to add heat-map overlays that users can select later on. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">CONCLUSION AND VISION</head><p>In this paper, we described an algorithm to lay out a communication network on top of a landscape of semantically embedded emails. This is still work in progress, thus 3 shows only a manually drawn mock-up of the visualisation we envision. In it, individuals are represented as nodes positioned such that densely connected communities are visually clustered. Edges describe the email traffic, where the opacity and thickness is used to indicate the frequency of messages between the nodes they connect.</p><p>The semantic representations of emails are used to place dots on a background layer which we call the document landscape. This landscape is used as additional input to the graph layout algorithm, aiming to place a node within corresponding semantic regions. The colouring of regions in the landscape is derived from densely connected communities in the communication graph. Optionally, representative words are selected for densely populated areas in the landscape, so that users get a rough idea about subjects in that area. The aforementioned timestamps of emails can be used to generate a heatmap overlay to show the activity in a certain time interval which is controlled by a slider. Similar to modern geographical maps, zooming into a region reveals more details. In our case, less prominent individuals and their connections are shown along with additional salient phrases from the document landscape. Selecting a node will not only highlight connected edges but may also temporarily show more edges which were previously hidden at that zoom level. The user will also be able to retrieve documents with the help of a selection rectangle or clicking dots in the document landscape.</p><p>In future work, we hope to evaluate this system using fullscale real-world data as well as practitioners from journalism and auditing. It may also be interesting to experiment with embedding methods, which take both the emails and the network graph as input and directly project the inferred representations into the two-dimensional landscape to simplify the proposed algorithm.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Traditional basic visualisation of a communication graph from 2000 emails using force layout</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Semantic landscape of email contents and dominant communication patterns (drawn mockup)</figDesc></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Towards unambiguous edge bundling: Investigating confluent drawings for network visualization</title>
		<author>
			<persName><forename type="first">Benjamin</forename><surname>Bach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nathalie</forename><surname>Henry Riche</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Christophe</forename><surname>Hurter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kim</forename><surname>Marriott</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tim</forename><surname>Dwyer</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Transactions on Visualization and Computer Graphics</title>
		<imprint>
			<biblScope unit="volume">23</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="541" to="550" />
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Exploration and visualization in the web of big linked data: A survey of the state of the art</title>
		<author>
			<persName><forename type="first">Nikos</forename><surname>Bikakis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Timos</forename><surname>Sellis</surname></persName>
		</author>
		<ptr target="CEUR-WS.org" />
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Workshops of the EDBT/ICDT 2016 Joint Conference</title>
				<meeting>the Workshops of the EDBT/ICDT 2016 Joint Conference</meeting>
		<imprint>
			<date type="published" when="2016">2016</date>
			<biblScope unit="volume">1558</biblScope>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Panama papers: a case study for records management?</title>
		<author>
			<persName><forename type="first">Marie-Anne</forename><surname>Chabin</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Brazilian Journal of Information Science: Research Trends</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page">4</biblScope>
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Exemplar-based visualization of large document corpus</title>
		<author>
			<persName><forename type="first">Yanhua</forename><surname>Chen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lijun</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ming</forename><surname>Dong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jing</forename><surname>Hua</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Transactions on Visualization and Computer Graphics</title>
		<imprint>
			<biblScope unit="volume">15</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="1161" to="1168" />
			<date type="published" when="2009">2009. 2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Clarifying journalismâĂŹs quantitative turn: A typology for evaluating data journalism, computational journalism, and computerassisted reporting</title>
		<author>
			<persName><forename type="first">Mark</forename><surname>Coddington</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Digital Journalism</title>
		<imprint>
			<biblScope unit="volume">3</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="331" to="348" />
			<date type="published" when="2015">2015. 2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">CactusTree: A tree drawing approach for hierarchical edge bundling</title>
		<author>
			<persName><forename type="first">Tommy</forename><surname>Dang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Angus</forename><surname>Forbes</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the Pacific Visualization Symposium. IEEE</title>
				<meeting>of the Pacific Visualization Symposium. IEEE</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="210" to="214" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">MapSets: Visualizing Embedded and Clustered Graphs</title>
		<author>
			<persName><forename type="first">Alon</forename><surname>Efrat</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yifan</forename><surname>Hu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stephen</forename><forename type="middle">G</forename><surname>Kobourov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sergey</forename><surname>Pupyrev</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Graph Algorithms and Applications</title>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="571" to="593" />
			<date type="published" when="2015">2015. 2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Visualization of text document corpus</title>
		<author>
			<persName><forename type="first">Blaz</forename><surname>Fortuna</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Marko</forename><surname>Grobelnik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Dunja</forename><surname>Mladenic</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Informatica</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">4</biblScope>
			<biblScope unit="page" from="497" to="502" />
			<date type="published" when="2005">2005. 2005</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Computational forensics: Towards hybrid-intelligent crime investigation</title>
		<author>
			<persName><forename type="first">Katrin</forename><surname>Franke</surname></persName>
		</author>
		<author>
			<persName><surname>Sargur</surname></persName>
		</author>
		<author>
			<persName><surname>Srihari</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Symposium on Information Assurance and Security. IEEE</title>
				<imprint>
			<date type="published" when="2007">2007</date>
			<biblScope unit="page" from="383" to="386" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Maps of computer science</title>
		<author>
			<persName><forename type="first">Daniel</forename><surname>Fried</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stephen</forename><forename type="middle">G</forename><surname>Kobourov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the Pacific Visualization Symposium. IEEE</title>
				<meeting>of the Pacific Visualization Symposium. IEEE</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="113" to="120" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Drawing clustered graphs as topographic maps</title>
		<author>
			<persName><forename type="first">Martin</forename><surname>Gronemann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michael</forename><surname>Jünger</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the Symposium on Graph Drawing and Network Visualization</title>
				<meeting>of the Symposium on Graph Drawing and Network Visualization</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2012">2012</date>
			<biblScope unit="page" from="426" to="438" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">node2vec: Scalable feature learning for networks</title>
		<author>
			<persName><forename type="first">Aditya</forename><surname>Grover</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jure</forename><surname>Leskovec</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the Conference on Knowledge Discovery and Data Mining</title>
				<meeting>of the Conference on Knowledge Discovery and Data Mining</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="855" to="864" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Flexible Level-of-Detail Rendering for Large Graphs</title>
		<author>
			<persName><forename type="first">Jan</forename><surname>Hildenbrand</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Arlind</forename><surname>Nocaj</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ulrik</forename><surname>Brandes</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the Symposium on Graph Drawing and Network Visualization</title>
				<meeting>of the Symposium on Graph Drawing and Network Visualization</meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2016">2016</date>
			<biblScope unit="page" from="625" to="627" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">An intelligent system for semantic information retrieval information from textual web documents</title>
		<author>
			<persName><forename type="first">Mukundan</forename><surname>Karthik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Mariappan</forename><surname>Marikkannan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Arputharaj</forename><surname>Kannan</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Workshop on Computational Forensics</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="135" to="146" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Distributed representations of sentences and documents</title>
		<author>
			<persName><forename type="first">Quoc</forename><surname>Le</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Tomas</forename><surname>Mikolov</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the International Conference on Machine Learning. PMLR</title>
				<meeting>of the International Conference on Machine Learning. PMLR</meeting>
		<imprint>
			<date type="published" when="2014">2014</date>
			<biblScope unit="page" from="1188" to="1196" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Visualizing data using t-SNE</title>
		<author>
			<persName><forename type="first">Laurens</forename><surname>Van Der Maaten</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Geoffrey</forename><surname>Hinton</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Machine Learning Research</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="page" from="2579" to="2605" />
			<date type="published" when="2008">2008. 2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Creating realistic map-like visualisations: Results from user studies</title>
		<author>
			<persName><forename type="first">Patrick Cheong-Iao</forename><surname>Pang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Robert</forename><forename type="middle">P</forename><surname>Biuk-Aghai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Muye</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bin</forename><surname>Pang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Visual Languages and Computing</title>
		<imprint>
			<biblScope unit="volume">43</biblScope>
			<biblScope unit="page" from="60" to="70" />
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<analytic>
		<title level="a" type="main">Contact trees: Network visualization beyond nodes and edges</title>
		<author>
			<persName><forename type="first">Arnaud</forename><surname>Sallaberry</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yang-Chih</forename><surname>Fu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Hwai-Chung</forename><surname>Ho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kwan-Liu</forename><surname>Ma</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PLOS ONE</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="1" to="23" />
			<date type="published" when="2016">2016. 2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<analytic>
		<title level="a" type="main">Cartograph: Unlocking Spatial Visualization Through Semantic Enhancement</title>
		<author>
			<persName><forename type="first">Shilad</forename><surname>Sen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Anja</forename><forename type="middle">Beth</forename><surname>Swoap</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Qisheng</forename><surname>Li</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Brooke</forename><surname>Boatman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Ilse</forename><surname>Dippenaar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Rebecca</forename><surname>Gold</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Monica</forename><surname>Ngo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Sarah</forename><surname>Pujol</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Bret</forename><surname>Jackson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Brent</forename><surname>Hecht</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of Conference on Intelligence User Interfaces</title>
				<meeting>of Conference on Intelligence User Interfaces</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="179" to="190" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Online and offline social networks: Use of social networking sites by emerging adults</title>
		<author>
			<persName><forename type="first">Kaveri</forename><surname>Subrahmanyam</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stephanie</forename><forename type="middle">M</forename><surname>Reich</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Natalia</forename><surname>Waechter</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Guadalupe</forename><surname>Espinoza</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Applied Developmental Psychology</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="420" to="433" />
			<date type="published" when="2008">2008. 2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">HoNVis: Visualizing and Exploring Higher-Order Networks</title>
		<author>
			<persName><forename type="first">Jun</forename><surname>Tao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jian</forename><surname>Xu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Chaoli</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Nitesh V</forename><surname>Chawla</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the Pacific Visualization Symposium. IEEE</title>
				<meeting>of the Pacific Visualization Symposium. IEEE</meeting>
		<imprint>
			<date type="published" when="2017">2017</date>
			<biblScope unit="page" from="1" to="10" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<analytic>
		<title level="a" type="main">TIARA: a visual exploratory text analytic system</title>
		<author>
			<persName><forename type="first">Furu</forename><surname>Wei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Shixia</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Yangqiu</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Shimei</forename><surname>Pan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Michelle</forename><forename type="middle">X</forename><surname>Zhou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Weihong</forename><surname>Qian</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Lei</forename><surname>Shi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Li</forename><surname>Tan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Qiang</forename><surname>Zhang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proc. of the Conference on Knowledge Discovery and Data Mining</title>
				<meeting>of the Conference on Knowledge Discovery and Data Mining</meeting>
		<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2010">2010</date>
			<biblScope unit="page" from="153" to="162" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">Overlapping communities explain core-periphery publisher of networks</title>
		<author>
			<persName><forename type="first">Jaewon</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Jure</forename><surname>Leskovec</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Proc. IEEE</title>
		<imprint>
			<biblScope unit="volume">102</biblScope>
			<biblScope unit="page" from="1892" to="1902" />
			<date type="published" when="2014">2014. 2014</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Escaping RGBland: selecting colors for statistical graphics</title>
		<author>
			<persName><forename type="first">Achim</forename><surname>Zeileis</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kurt</forename><surname>Hornik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Paul</forename><surname>Murrell</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Computational Statistics &amp; Data Analysis</title>
		<imprint>
			<biblScope unit="volume">53</biblScope>
			<biblScope unit="issue">9</biblScope>
			<biblScope unit="page" from="3259" to="3270" />
			<date type="published" when="2009">2009. 2009</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b24">
	<monogr>
		<title level="m" type="main">Context Aware Document Embedding</title>
		<author>
			<persName><forename type="first">Zhaocheng</forename><surname>Zhu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Junfeng</forename><surname>Hu</surname></persName>
		</author>
		<idno>CoRR abs/1707.01521</idno>
		<imprint>
			<date type="published" when="2017">2017. 2017</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
