<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>TopicLens: An Interactive Recommender System based on Topical and Social Connections</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Laura Devendorf</string-name>
          <email>ldevendorf@berkeley.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>John O'Donovan</string-name>
          <email>jod@cs.ucsb.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Tobias Höllerer</string-name>
          <email>holl@cs.ucsb.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>University of California</institution>
          ,
          <addr-line>Berkeley, CA 94720</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>University of California</institution>
          ,
          <addr-line>Santa Barbara, CA 93106</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <fpage>41</fpage>
      <lpage>47</lpage>
      <abstract>
        <p>This paper describes TopicLens, an interactive tool for exploring and recommending items within large corpora, based on both social metadata and topical associations. The system uses a hybrid visualization model that represents topics and content items side by side, allowing the user to actively explore recommendations rather than passively viewing them. The approach provides insight into the composition of relevant topics as they relate to the meta-data of underlying texts. We describe a novel approach to sorting and filtering, which can be topic or document-driven, and two novel interaction styles termed “view inversion” and “human-review”, each of which enable novel perspectives on topic modeled sets of documents. To evaluate the system, three use cases are presented to highlight interesting insights across three different data sets using our novel recommendation interface. Interface@RecSys'12, September 13, 2012, Dublin, Ireland. Paper presented at the Workshop on Interfaces for Recommender Systems 2012, in conjunction with the 6th ACM conference on Recommender Systems. Copyright c 2012 for the individual papers by the papers' authors. This volume is published and copyrighted by its editors. .</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>INTRODUCTION</title>
      <p>
        Recommender systems attempt to ease the information overload
problem by providing the right information to the right person at
the right time [
        <xref ref-type="bibr" rid="ref19 ref31 ref33">31, 19, 33</xref>
        ]. However, presentation mechanisms for
these systems are becoming increasingly important, as they are
applied to increasingly more diverse data on the social web. For
example, Herlocker’s early experiments on the value of explaining
recommendations [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ] have informed and influenced many of
today’s recommender system designs. Tintarev and Masthoff [
        <xref ref-type="bibr" rid="ref38">38</xref>
        ]
survey the role of explanation as an integral part of the
recommendation process and outline seven distinct advantages of
providing explanation. More recent efforts to analyse the effect of
“inspectability and control” [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ], interactive visual feedback [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], and
dynamic critiquing [
        <xref ref-type="bibr" rid="ref10 ref30">30, 10</xref>
        ] clearly show that the interface
components play an important role in a user’s acceptance and overall trust
in a recommendation.
      </p>
      <p>In this paper we focus on one specific interface design (Figure 1
for exploration of recommendations which have been derived from
a topic modeling algorithm. Topic modeling is a statistical method
for extracting relevant topics from a large corpus of text.
Visualization of connections formed through topic modeling can enable
users to quickly identify trends and other insightful details from a
large data set. Successful visualizations are especially effective at
highlighting patterns within high dimensional data. Such
visualizations may also allow the user to navigate and dynamically filter
information in order to extract specific and relevant items. Example
use cases are:
• To augment the users ability, beyond keyword based search
and navigation, to discover topical composition and
interrelationships in texts (i.e. recommendation via topic
associations).
• To highlight popular trends and conversations within social
networks.
• To compare bodies of text, visually exploring similarities,
differences and patterns in the underlying texts for better
personalized result sets.</p>
      <p>The focus of this paper is largely on the UI design and on novel
interaction techniques to represent connections formed over large
text datasets using topic modeling or other automated text analysis
algorithms. The key elements in our visual representations include:
• Recommendable Item: An abstract entity which can translate
to either a text document or a user within a social network.
These are conceptually grouped because they are both
represented by collections of terms. For example, in the Twitter
data set, a user is represented as a collection of Tweets.
• Topic: Multinomial distributions over a set of terms, which
can be associated with content items.</p>
      <p>
        While established representations, such as word clouds and tree
maps [
        <xref ref-type="bibr" rid="ref35">35</xref>
        ] can be useful for visualizing frequency in topic-item
relationships, we describe a model that also preserves and represents
relationships at the meta-data level. This allows users not only to
see which topics arise, but also how they arose and under what
conditions. The approach enables more informed reasoning about
documents a user wishes to investigate, while highlighting trends over
a number of different types of networks with respect to a particular
investigation.
      </p>
      <p>
        Microsoft’s "Twahpic” [
        <xref ref-type="bibr" rid="ref29">29</xref>
        ] approach to visualizing topics in
conjunction with meta-data leverages a composite view that optimizes
its visualization strategy for each different facet of the data. This
strategy is effective for illustrating and highlighting the multifaceted
nature of the data, but is difficult to navigate due to the separation
of each frame and the segregation of the data networks. In short, the
interaction model helps a user form impressions of the data rather
than supporting investigations into the data.
      </p>
      <p>
        Work by Cao et al. in [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] shows a benefit of using multiple
approaches to visualizing the different facets of the data, and in this
paper, we will present a model that takes a hybrid approach rather
than a segregated approach in order to facilitate navigation and
interaction with the data. The key features of the proposed technique
are as follows:
• Presents a choice of view modes, sorting parameters and
controls for navigation and dynamic filtering.
• Enables a user to filter topics in relation to the pre-existing
networks in the data.
• Allows for human oversight of algorithmically generated
results.
• Enables exploration of dataset as a map, traversing and
isolating regions of particular interest in order to extract relevant
items.
• Caters to diverse topic modeling scenarios, including
additional data such as social and information networks.
      </p>
      <p>In the remaining sections, we will discuss the related research
and provide a brief background of topic modeling before
describing in detail the design decisions made when developing the
TopicLens interface. The design decisions include those related to
overall structure and the mapping of formal elements to relational
information. Novel aspects of the interface are also discussed,
particularly new techniques that we have termed ï£¡view inversionï£¡ and
ï£¡human review.ï£¡ We will then present three applications of the
system, one of which uses data that does not contain topic-based
relations, thus highlighting a more generalized application of the
design.</p>
    </sec>
    <sec id="sec-2">
      <title>RELATED WORK</title>
      <p>
        Due to the proliferation of data available on the web, there is
an increasing need for better techniques for exploration of large
amounts of text data. This is commonly known as addressing an
information overload problem [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ]. Ongoing research has produced
theorem lemma proof follow constant bound exist definition
software process tool project development design system developer
protein genes expression network motif interaction pathway genome
political social policy economic china law government national
business firm services customer technology management market product
flow velocity wall fluid turbulence reynold pressure channel
proactive, query-based solutions in the fields of search [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] and
reactive or filter-based approaches in the field of recommendation
[
        <xref ref-type="bibr" rid="ref20 ref7">20, 7</xref>
        ]. In the context of this work, we are especially interested
in approaches that employ visual and interactive methods to tailor
an information space to a user’s individual needs. The novel
approach presented in this paper employs a statistical method known
as Latent Dirichlet Analysis (LDA) or “topic modeling” [
        <xref ref-type="bibr" rid="ref3 ref4">4, 3</xref>
        ] to
discover useful linkages between documents upon which
visualizations are built.
      </p>
      <p>
        While there has been a significant amount of research in this
domain from a variety of perspectives, from early approaches such
as [
        <xref ref-type="bibr" rid="ref18 ref27 ref40">27, 40, 18</xref>
        ] to more recent work in [
        <xref ref-type="bibr" rid="ref22 ref25 ref36 ref37 ref39">36, 37, 39, 22, 25</xref>
        ], visual
techniques for exploring large sets of documents have not yet been
widely adopted.
2.1
      </p>
    </sec>
    <sec id="sec-3">
      <title>Topic Modeling</title>
      <p>
        LDA or “topic modeling” is a statistical technique introduced by
Blei et al. [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] that computes focused probability distributions over
the words in a set of documents. The algorithm functions by
mapping documents onto a smaller number of “topics”. In this sense, a
topic consists of a multinomial distribution over words or stemmed
terms in a document set. For example, as p(w|t), for t ∈ 1 . . . T ,
where T is the number of topics [
        <xref ref-type="bibr" rid="ref16 ref4">4, 16</xref>
        ]. In many cases, topics are
displayed as a list of the top n words with the highest probability
in the set. Table 1 from [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] shows some example topics produced
by an LDA algorithm. In this case, the words “theorem, lemma,
proof, follow, constant...” seem to relate to the topic “Mathematical
Theory”. Recent research in [
        <xref ref-type="bibr" rid="ref26 ref9">9, 26</xref>
        ] has shown that although LDA
topics can be misinterpreted, they are generally well understood by
users. Techniques for the automatic labeling of topics have been
presented in [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
      </p>
      <p>In TopicLens, topics are leveraged to form associations among
items in a large corpus, and these associations are used to produce
informative and highly flexible representations of the broader
content item space, using novel layout and interaction techniques.
Before describing our approach to visualizing a topic space, we now
present a discussion of existing approaches to visualization of large
document sets.</p>
      <p>
        Many approaches in the literature dealing with the representation
of large text collections, ranging from traditional static
representations, e.g. [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], to more recent and highly interactive
representations which use advanced methods to relate documents together,
e.g. [?]. They can rely on pre-existing meta data, or can
compute relations on the fly. In this paper, we present a novel
interactive design and layout for exploring topic based and social network
relations in large document sets. Before presenting the prototype
system in detail, the following section provides a brief account of
the design choices for using a combination of river and graph-like
visual representations in the system.
2.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>The Need for a Hybrid Model</title>
      <p>
        As shown in Figure 2, we are supporting exploration of
multifaceted data in a variety of ways. Specifically, examples are
demonstrated on three different network types: social network data with
unidirectional edges (followers and followees) from Twitter;
augmented with topic relations, and a topic modeled network of news
articles from the New York Times; and social network data with
bidirectional connections from Facebook. Across all examples, the
goal is to use simple interaction and novel layouts to facilitate user
comprehension of complex data, particularly to communicate the
“credibility’ factor of peers in a network with respect to particular
topics of interest. This complexity would be inherently difficult to
communicate with a single visualization technique such as a river
or graph visualization. Accordingly, we have opted for a hybrid
approach which uses a graph-like mechanism similar to TopicNets
[
        <xref ref-type="bibr" rid="ref15">15</xref>
        ] for highlighting relations between document and topic nodes,
and a river-like view similar to ThemeRiver [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ] overlaid to
communicate frequency or “credibility” of different sets of peers within
the context of a topic selection. This approach has been successful
in applications such as Freire’s ManyNets [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ].
      </p>
    </sec>
    <sec id="sec-5">
      <title>DESIGN CONSIDERATIONS</title>
      <p>At the core, the TopicLens interface seeks to empower the user
to explore a large datasets based on a number of factors. We
designed the interface with the idea that potential users would benefit
most from learning and engaging in the system rather than making
sense of the data at a glance. We see applications of our system
being beneficial for any researcher who is looking to glean insights
into a large body of text. This includes analysts of social networks
as well as scholars in the humanities who may want to use
TopicLens to explore trends in the bodies of work by a single author
or works belonging to a single or set of genres. We provide
functionality with the goal of avoiding a crowded interface and we took
great measures to ensure clarity and consistency across multiple
view modes. In our informal tests and observations of interactions
with the system, we have found it easy to learn and that users take
quickly to the dynamic filtering and sorting tools we provide.
4.</p>
    </sec>
    <sec id="sec-6">
      <title>VISUALIZATION DESIGN</title>
      <p>The most prominent feature of the visualization, shown in Figure
2, is its use of the wheel to structure information. Using a
circular structure allows us accommodate variability in the size of the
datasets. The wheel dynamically expands to fit the data and
contracts upon filtering. Zooming and font size are adjusted in order to
keep information present within the visualization space, regardless
of how much there is to display.</p>
      <p>The visualization is designed to fit within a rectangular window
with width larger than height. The exact dimensions can vary and in
our examples, we found it most effective to use a full screen view
on a high-resolution display (1280x1024 and higher), especially
when dealing with large sets. The left side of the screen contains
the controls and legends and the wheel rotates on an axis in the
center of the screen. A static camera is also positioned at the center
of the screen, allowing the user to zoom in towards and away from
the center. The river is positioned along the outer edge of the wheel
and protrudes in different directions depending on the current data
selection.
4.1</p>
    </sec>
    <sec id="sec-7">
      <title>Organization</title>
      <p>In order to support the user in exploring the data at varying levels
of detail, the organization of the visualization needs to clearly
distinguish the different relationships that are represented. We classify
those relationships into three types: primary, secondary and ternary.
The data we collect has pre-existing relationships as formed through
meta-data (primary relations), the topic modeling algorithm
provides information about relationships between items and topics
(secondary relationships), and we found it helpful to further analyze
the topics in relation to items and item meta-data (ternary
relationships). By dividing the wheel into three concentric regions,
we were able to map each type relationship to its own location
on the wheel. As you travel from the center out, the information
represented reflects a increasing number of factors. The wheel,
combined with zooming, was intended to give the user the idea
that zooming out will provide them with a big picture, birds-eye
overview of the data and zooming in closer will focus on the finer
detailed relationships. The following paragraphs provide a detailed
explanation of the relationship types and the regions they map to.
4.1.1</p>
      <sec id="sec-7-1">
        <title>Primary Relations: Center</title>
        <p>Primary relations are formed though associations in item
metadata. In the analysis of Twitter networks, a single item represents
a Twitter user. Item meta-data includes, but is not limited to, a list
of followers of this user and a list of other Twitter users that this
user is following. In the case of topic modeling run over New York
Times articles, primary relationships would be formed between two
or more articles that share the same author formed by two articles.
Primary relationships are mapped to the center so these
relationships can be viewed in a local space. Figure 2 shows primary
relations through coloring in the view on the right. In the view on the
left, topics are featured in the center. Since primary relations don’t
exist within topics, no explicit color mapping is represented.
4.1.2</p>
      </sec>
      <sec id="sec-7-2">
        <title>Secondary Relations: Center &amp; Inner Ring</title>
        <p>Secondary relationships occur as a result of the topic modeling
and define the relationships between topic and item nodes. Each
of these relationships occurs with a given probability as defined by
the LDA algorithm. These relationships as well as their respective
probabilities are represented by interactions between the center and
inner ring. While the nodes in the center are not bound to any axis
or predetermined path, the nodes in the inner ring are equidistantly
laid out in a circle. This is primarily because the inner ring also
functions as the axis points for the river visualization but also
reinforces simplicity by defining only one type of data to be related
spatially. On the left side of Figure 2, highlighting Wikileaks changed
the opacity of the nodes on the inner ring in order to indicate how
related each item is to this topic. On the right, highlighting User
16 changed the opacity of the topics in the inner ring, similarly
showing the strength of the connection.
4.1.3</p>
      </sec>
      <sec id="sec-7-3">
        <title>Ternary Relations: Outer Ring / River</title>
        <p>Ternary relationships are formed between the topic modeled
results and the meta-information of the items related to those results.
Using the river visualization to graph these relationships allows us
to see an overall frequency of the node in addition to the
metainformation frequencies within the same space. Depending on the
data and filtering, the river model can be customized to show any
particular facet of the meta-information. Figure 2 is showing
average probabilities over each facet of item meta-data in relation to
the selected item. The colors in the river match the colors of the
meta-data in the center, reinforcing this relationship.
4.2</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Visual Mappings</title>
      <p>Because the TopicLens visualization needs to encode a rich
variety of data, we took care to make the visual encoding of different
relationships and concepts distinct. In order to maintain simplicity
we map objects and relationships to specific formal elements.
Depending on the underlying dataset, visual features may be turned
on and off in order to keep the visual complexity to a minimum.</p>
      <p>At the root, our information display consists of two basic entities:
topics and content items. Items are mapped to circles and topics are
mapped to rectangles with the text label of the topic in the center.
We made these entities distinct in order to visually and conceptually
separate them. The topic text is always visible but the item text is
only present on demand. Similarly, the circular shape of the item is
always visible but the rectangular shape of the topic is only visible
on selection.</p>
      <p>Color is used to visually group items based on meta-data. For
instance, if there is meta-information about item categories, each
category type would map to a unique color. This mapping was
chosen partly because it enables a quick visual grouping of items and
extends to a large number of categorizations. Another reason for
choosing color, was its ability to support a visual connection
between the meta-data of the individual item and the corresponding
meta-data represented in the river. This offers the user two levels
of understanding by illustrating how the meta information is
connected to the item as well as the topic.</p>
      <p>Opacity is used to illustrate secondary relationships,
relationships between topics and content items. These relationships occur
with a probability specified by the LDA algorithm. Opacity is an
effective means of illustrating these connections as it indicates
relative strength. Darker nodes have strong probabilities of relation,
lighter have weaker ones. If a node is unrelated, it is removed from
the space. Secondary relations are highlighted upon interaction as
the user must specify a single item or topic in order to view its
connections. If multiple items or topics are selected, then the opacity
value is determined by the average probability from all nodes in the
selected set.</p>
      <p>Position and order are used in conjunction to highlight patterns in
the data. Patterns are exposed by using the ordering of the items or
topics on the inner ring to position the items or topics in the center.
Each value begins in the center of the circle and is pulled towards
all of its related nodes in the inner ring. The strength of attraction
depends on the probability of the connection between the item and
the topic. The result is a spatial grouping of items or topics that
share similar relationships. A number of interaction techniques for
positioning items on the inner ring will be discussed in the
following sections.</p>
      <p>Size is used to illustrate measures of numerical magnitude such
as frequency or number of relations. Similar to position and
ordering, some mappings of attributes to size can be more informative
than others. For this reason, we allow the user to indentify the node
attribute that determines node size.</p>
    </sec>
    <sec id="sec-9">
      <title>IMPLEMENTATION</title>
      <p>This visualization evolved through a number of design iterations.
Using Processing to program the design and interaction allowed us
to easily explore changes in the design and instantly see the
results. The Processing framework also made it simple to program
animations and transitions between states. A number of libraries
were used to extend the scope and flexibility ofProcessing. The
PeasyCam library provided the basic virtual viewpoint control, the
ControlP5 library was used to implement text boxes, range sliders
and list boxes and an OpenGL library was used to add custom
functionality into the system such as smoothing and alpha blending.</p>
      <p>The TopicLens application creates node and edge objects by
parsing configuration and data files on load. During the execution of the
program, nodes and edge objects are referenced in order to create
dynamic links. Links are the elements that are drawn to the
canvas and much of the code is devoted to maintaining those links and
dynamically updating their values to indicate relationships. The
smooth transitions were created using an integrator class that allows
the user to specify characteristics such as mass, position, damping
and attraction. When a link targets a given position, the integrator
dynamically updates its position depending on its physical
characteristics.</p>
    </sec>
    <sec id="sec-10">
      <title>USE CASES AND DISCUSSION</title>
      <p>In order to showcase the flexible applicability of our visual model,
we present three use cases that explore different dynamic datasets.
Each use-case will discuss the design decision made to cater to the
specific data domain as well as a usage scenario to illustrate its
potential for a variety of applications.
6.1</p>
    </sec>
    <sec id="sec-11">
      <title>Recommending Credible Information In</title>
    </sec>
    <sec id="sec-12">
      <title>Twitter</title>
      <p>Preserving social network relations in topic modeled systems
allows us to glean insights into the networks and salient topics
therein. This example is catered specifically as an attempt to
visualize credibility in Twitter networks. Our definition of
“credibility” relates to the probability by which a user is connected with a
particular topic, based on LDA analysis over a bag of words
representation of all of that user’s tweets. In analyzing credibility, we
also examine that user’s followers and followees and their
respective associations with the given topic.</p>
      <p>In this visualization, which is represented earlier in Figure 2,
each topic node contains a label that represents the list of words
in a mined topic. Primary relationships are formed between a user
and their followers and followees. Secondary relationships occur
between users and extracted keywords and the ternary relationships
represent probabilities over the meta data. In this example, the user
meta data contains a list of other users following this user and a list
of the users this user is following. One type of ternary relationship
in this example is the relationship between a topic and the average
probability that the users friends are discussing that topic.</p>
      <p>As noted in Section 4.1, ternary relations are mapped to the river
and the nodes in the inner ring form the axis points. The river
displays specific information depending on the organization of the
nodes within the space. This approach affords the user an
opportunity to uncover potentially interesting relations in the following 6
view configurations.</p>
      <p>With topic nodes in the center, and user nodes on the outer ring:
• Upon selection of an individual user, the river view shows
that user, their friends and their followers’ probabilistic
association with each topic on the outer ring.
• When no user is selected, the river shows the average
probability for each topic across all users.
• When a topic is selected, the river shows each user’s
association with that topic.</p>
      <p>With user/item nodes in the center, and topic nodes on the outer
ring:
3 • When a topic is selected, the user’s friends and follower’s
opacity is varied to represent association with that topic.
• When a user is selected, their association with each topic on
the outer ring is shown in the river view.
• When no user is selected, the probability of each topic in the
global space is shown on the outer ring</p>
      <p>For this scenario, the river represents three probabilities for each
node, the average probability of the user using the topic, the average
probability of the user’s followers using the topic and the average
probability that the people following this user are using the topic.
Since topics are represented along the inner ring, this information
is available for every topic. Each of the probabilities is represented
on the river, using color matching to indicate the group or single
user it applies to. To further explain what the river is visualizing,
a legend on the bottom left of the interface dynamically updates,
explaining the current model. In this case mode visibility is of
particular importance as the river maps different values through the
life of the visualization.</p>
      <p>When a Twitter user is highlighted in the space, interactions take
place at each of the three levels. In the center, the primary
relationships are presented through colors. All users who don’t belong to
this user’s network are removed and the remaining users are color
coordinated to indicate whether they are a follower of the selected
user, or someone the selected user is following. Spatially, each user
is attracted to the topic nodes in the inner ring by the positioning
algorithm mentioned above. Topics related to the selected node vary
by opacity in order to indicate the strength of connection.</p>
      <p>The probability mappings were specifically designed to
investigate credibility or trust. The top left of Figure 2 shows a network
with two people selected. All of the nodes in the set represent both
of the selected people’s networks. On the outer river, one can see
the probability distributes for this network over each topic in the
network. From the river you can conclude that these two users are
using the topic "Crowley" quite a bit, however their friends and
followers are not. For this reason, they may not be a trusted source
for this topic since their followers do not appear to be interested in
similar topics. On the other side of the visualization is the topic
"asylum" which is being used largely by the network and not so
much by these users.</p>
      <p>Drawing firm conclusions at this level is not necessarily reliable
but better information can be introduced by selecting the right
network. For instance, if you know the terms "Assange", "Julian" and
"Wikileaks" are all terms related to Wikileaks, then you could
select those terms from the visualization and view the results over the
given network of users associated with those terms. By
investigating the probability of these three words occurring together across
the social network you may be able to visualize trends about who
is followed, by whom and for what reasons.
6.2</p>
    </sec>
    <sec id="sec-13">
      <title>Recommending New York Times Articles</title>
      <p>In the example shown on the left of Figure 3, topic modeling was
performed on a set of New York Times articles and is used for
investigation and discovery of related articles that may not have been
discovered through traditional search models. Each document node
represents an article and topic nodes represent the topics extracted
from those articles. Each article contains information about the
section of the paper which it belonged to, such as opinion, world or
national news.</p>
      <p>Two unique design features were included in this interface to
improve the functionality in relation to the underlying data. The
first one is colored rectangles on topics. These colors are used to
reinforce ternary connections though the use of color averaging.
The color of the rectangle is determined by the category of each of
the articles associated with it. Should a color tend heavily towards
a single category’s color, on could deduce that the topic tends to
appear most frequently within that category. The actual distribution
of the categories is explicitly represented in the river.</p>
      <p>The second unique feature is the use of lines. When hovering
over a topic, darkened lines extend form the topic itself to all
related documents. Lighter lines then extend from each of those
related documents to all of the other topics they are related to. This
conveys information to the user about other topics related to their
selection. The user is able to specifically locate the documents that
contributed to this relationship by following the lines or selecting
multiple topics and browsing the filtered document space.</p>
      <p>The lines are particularly useful for illustrating how two topics
are related to each other and upon what criteria. This is helpful
when browsing for articles associated with a given theme. Let’s say
a researcher is looking for references on "peak oil." Searching for
and selecting "peak oil" from the space would show the researcher
other related topics as well as articles specifically contain the
relation. If one of the related articles contains a topic that is also
of interest to him or presents a particularly interesting comparison,
he or she can easily isolate and obtain information about the
articles containing both topics by filtering the space and hovering over
the document node, revealing information about the article such as
title, date and author.</p>
      <p>TopicLens could also be used visualize trends associated with a
subset of articles. Say a user read a few articles in the Times
Opinion section and they would like to find other articles about similar
and related subjects. This could be accomplished by typing each
article name into the search field. This would in turn select each
of the corresponding articles in the space and illustrate the topics
associated with them. In order to remove outliers, they would
adjust the slider to specify the amount of documents that need to be
associate a topic in order for it to appear in the space. After this
filtering step, they are presented with a number of related topics,
the most popular being the largest and darkest. By selecting that
topic, the space is reorganized to show all the the articles related
to that topic. The user can now visually browse these articles and
quickly identify which one appeared in the opinion section, based
on color. Hovering the mouse over a document node would reveal
its specific information and provide access to the full text..
6.3</p>
    </sec>
    <sec id="sec-14">
      <title>Recommending Movies via Facebook API</title>
      <p>In this example, shown in the right of Figure 3, we use the
proposed framework to visualize data that is not topic modeled in
order to show how the interface also operates on similarly structured
datasets. Reinterpreting the definitions of recommendable item and
topic allows us to use the existing visual model for this dataset. In
this example, a single Facebook user takes the place of a
recommendable item and a movie takes the place of a topic. Since movies
can be related to any number of Facebook users and Facebook
users can be associated with any number of movies, this dataset
can function similarly to the topic model examples. Each
itemtopic, or rather user-movie, combination is assigned the probability
of 1 since the user has specified explicitly that they like the given
movie. This visualization is able to provide exploratory views of
the most popular movies within a Facebook friend network as well
as the least popular movies. It can also isolate pockets of users that
are fans of these most or least popular movies. Essentially this view
is a visual representation of a social collaborative filtering process,
since items which are popular among Facebook friends are
promoted for a single target user receiving the recommendation.</p>
    </sec>
    <sec id="sec-15">
      <title>CONCLUSION</title>
      <p>In summary, this paper has presented a novel interactive interface
for recommending interesting topics and documents from within a
large corpus. The design is a hybrid which combines river and
graph-like representations of recommended items and can be
easily adapted and customized by the end user for different use cases.
We have also introduced novel interaction methods that support
human skills in the exploration of topic modeled data sets. In doing
so, we have extended the efficacy of both the system and the
algorithm, allowing the user to navigate large datasets and uncover
patterns. Details of our design choices and methodology have been
discussed, and demonstrated over three example applications,
including social network data from Twitter augmented with topic
modeling over users’ tweets, a topic modeled set of New York
Times news articles, and social network data from Facebook,
including item preferences. In each example case, we have discussed
ways in which the approach facilitates discovery of relevant
information which may go undiscovered in traditional analysis tools.
We have also demonstrated TopicLens’ ability to act as a flexible
interaction layer, supporting exploration of multiple application
domains.</p>
    </sec>
    <sec id="sec-16">
      <title>ACKNOWLEDGMENT</title>
      <p>The authors would like to thank Peter Pirolli and Bongwon Suh
for use of their Twitter data set. This work was partially supported
by the U.S. Army Research Laboratory under Cooperative
Agreement No. W911NF-09-2-0053; by NSF grant IIS-1058132; and
by the U.S. Army Research Laboratory under MURI grant No.
W911NF-09-1-0553; The views and conclusions contained in this
document are those of the authors and should not be interpreted
as representing the official policies, either expressed or implied, of
ARL, NSF, or the U.S. Government. The U.S. Government is
authorized to reproduce and distribute reprints for Government
purposes notwithstanding any copyright notation here on.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Analyzer</surname>
          </string-name>
          .
          <article-title>Matheo analyzer, database analysis and information mapping</article-title>
          . http://www.matheo-analyzer.com/,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K.</given-names>
            <surname>Andrews</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Kienreich</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Sabol</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Becker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Droschl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Kappe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Granitzer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Auer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Tochtermann</surname>
          </string-name>
          .
          <article-title>The infosky visual explorer: exploiting hierarchical structure and document similarities</article-title>
          .
          <source>Information Visualization</source>
          ,
          <volume>1</volume>
          :
          <fpage>166</fpage>
          -
          <lpage>181</lpage>
          ,
          <year>December 2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>D.</given-names>
            <surname>Blei</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Lafferty</surname>
          </string-name>
          .
          <article-title>Correlated topic models</article-title>
          .
          <source>In Advances in NIPS 18</source>
          , pages
          <fpage>147</fpage>
          -
          <lpage>154</lpage>
          . MIT Press, Cambridge, MA,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D. M.</given-names>
            <surname>Blei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Ng</surname>
          </string-name>
          , and
          <string-name>
            <given-names>M. I.</given-names>
            <surname>Jordan</surname>
          </string-name>
          .
          <article-title>Latent Dirichlet allocation</article-title>
          .
          <source>Journal of Machine Learning Research</source>
          ,
          <volume>3</volume>
          :
          <fpage>993</fpage>
          -
          <lpage>1022</lpage>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>I.</given-names>
            <surname>Borg</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Groenen</surname>
          </string-name>
          .
          <source>Modern Multidimensional Scaling: Theory and Applications</source>
          . Springer,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Bostandjiev</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. O'Donovan</surname>
            , , and
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Hollerer</surname>
          </string-name>
          . Tasteweights:
          <article-title>An interactive hybrid recommender system</article-title>
          .
          <source>In 6th ACM Conference on Recommender Systems</source>
          , Dublin, Ireland, September 9 to 13th,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>R.</given-names>
            <surname>Burke</surname>
          </string-name>
          .
          <article-title>Knowledge-based recommender systems</article-title>
          .
          <source>In Encyclopedia of Library and Information Systems</source>
          , volume
          <volume>69</volume>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>N.</given-names>
            <surname>Cao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sun</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.-R.</given-names>
            <surname>Lin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Gotz</surname>
          </string-name>
          , S. Liu, and
          <string-name>
            <given-names>H.</given-names>
            <surname>Qu</surname>
          </string-name>
          . Facetatlas:
          <article-title>Multifaceted visualization for rich text corpora</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          ,
          <volume>16</volume>
          :
          <fpage>1172</fpage>
          -
          <lpage>1181</lpage>
          ,
          <year>November 2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Chang</surname>
          </string-name>
          . Reading Tea Leaves: How Humans Interpret Topic Models.
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>L.</given-names>
            <surname>Chen</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Pu</surname>
          </string-name>
          .
          <article-title>Critiquing-based recommenders: survey and emerging trends. User Model. User-Adapt</article-title>
          . Interact.,
          <volume>22</volume>
          (
          <issue>1-2</issue>
          ):
          <fpage>125</fpage>
          -
          <lpage>150</lpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>D. R.</given-names>
            <surname>Cutting</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. R.</given-names>
            <surname>Karger</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. O.</given-names>
            <surname>Pedersen</surname>
          </string-name>
          .
          <article-title>Constant interaction-time scatter/gather browsing of very large document collections</article-title>
          .
          <source>In Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval</source>
          ,
          <source>SIGIR '93</source>
          , pages
          <fpage>126</fpage>
          -
          <lpage>134</lpage>
          , New York, NY, USA,
          <year>1993</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>B. D.</given-names>
            <surname>Davison</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Suel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Craswell</surname>
          </string-name>
          , and B. Liu, editors.
          <source>Proceedings of the Third International Conference on Web Search and Web Data Mining, WSDM</source>
          <year>2010</year>
          , New York, NY, USA, February 4-
          <issue>6</issue>
          ,
          <year>2010</year>
          . ACM,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>M.</given-names>
            <surname>Freire</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Plaisant</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Shneiderman</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Golbeck</surname>
          </string-name>
          .
          <article-title>Manynets: an interface for multiple network analysis and visualization</article-title>
          .
          <source>In CHI '10: Proceedings of the 28th international conference on Human factors in computing systems</source>
          , pages
          <fpage>213</fpage>
          -
          <lpage>222</lpage>
          , New York, NY, USA,
          <year>2010</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>B.</given-names>
            <surname>Fry</surname>
          </string-name>
          .
          <article-title>Visualizing data - exploring and explaining data with the processing environment</article-title>
          .
          <source>O'Reilly</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>B.</given-names>
            <surname>Gretarsson</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. O'Donovan</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Asuncion</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Newman</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Bostandjiev</surname>
            , T. Hï£¡llerer, and
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Smyth</surname>
          </string-name>
          . Topicnets:
          <article-title>Visual analysis of large text corpora with topic modeling</article-title>
          .
          <source>ACM Transactions on the Web: Special Issue on Intelligent Text Visualization</source>
          ,
          <volume>16</volume>
          :
          <fpage>1172</fpage>
          -
          <lpage>1181</lpage>
          ,
          <year>November 2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>T. L.</given-names>
            <surname>Griffiths</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Steyvers</surname>
          </string-name>
          .
          <article-title>Finding scientific topics</article-title>
          .
          <source>PNAS</source>
          ,
          <volume>101</volume>
          (
          <issue>Suppl 1</issue>
          ):
          <fpage>5228</fpage>
          -
          <lpage>5235</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>S.</given-names>
            <surname>Havre</surname>
          </string-name>
          , E. Hetzler,
          <string-name>
            <given-names>P.</given-names>
            <surname>Whitney</surname>
          </string-name>
          , and
          <string-name>
            <given-names>L.</given-names>
            <surname>Nowell</surname>
          </string-name>
          . Themeriver:
          <article-title>Visualizing thematic changes in large document collections</article-title>
          .
          <source>IEEE Transactions on Visualization and Computer Graphics</source>
          ,
          <volume>8</volume>
          (
          <issue>1</issue>
          ):
          <fpage>9</fpage>
          -
          <lpage>20</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Hearst</surname>
          </string-name>
          .
          <article-title>Tilebars: visualization of term distribution information in full text information access</article-title>
          .
          <source>In CHI '95: Proc. of the SIGCHI Conf.</source>
          , pages
          <fpage>59</fpage>
          -
          <lpage>66</lpage>
          , New York, NY, USA,
          <year>1995</year>
          . ACM Press/Addison-Wesley Publishing Co.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Herlocker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          .
          <article-title>Explaining collaborative filtering recommendations</article-title>
          .
          <source>In CSCW</source>
          , pages
          <fpage>241</fpage>
          -
          <lpage>250</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Herlocker</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Konstan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. G.</given-names>
            <surname>Terveen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Riedl</surname>
          </string-name>
          .
          <article-title>Evaluating collaborative filtering recommender systems</article-title>
          .
          <source>ACM Trans. Inf</source>
          . Syst.,
          <volume>22</volume>
          :
          <fpage>5</fpage>
          -
          <lpage>53</lpage>
          ,
          <year>January 2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>B.</given-names>
            <surname>Knijnenburg</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. O'Donovan</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Bostandjiev</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Kobsa</surname>
          </string-name>
          .
          <article-title>Inspectability and control in social recommenders</article-title>
          .
          <source>In 6th ACM Conference on Recommender Systems</source>
          , Dublin, Ireland, September 9 to 13th,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>S.</given-names>
            <surname>Koch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Bosch</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Ertl</surname>
          </string-name>
          .
          <article-title>Towards content-oriented patent document processing</article-title>
          .
          <source>IEEE Symposium on Visual Analytics, Science and Technology</source>
          , pages
          <fpage>203</fpage>
          -
          <lpage>210</lpage>
          ,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Mei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Shen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>C.</given-names>
            <surname>Zhai</surname>
          </string-name>
          .
          <article-title>Automatic labeling of multinomial topic models</article-title>
          .
          <source>In KDD</source>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <string-name>
            <given-names>N. E.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Chung Wong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Brewster</surname>
          </string-name>
          , and
          <string-name>
            <given-names>H.</given-names>
            <surname>Foote</surname>
          </string-name>
          .
          <article-title>Topic islands - a wavelet-based text visualization system</article-title>
          .
          <source>In Proceedings of the conference on Visualization '98</source>
          , VIS '
          <volume>98</volume>
          , pages
          <fpage>189</fpage>
          -
          <lpage>196</lpage>
          , Los Alamitos, CA, USA,
          <year>1998</year>
          . IEEE Computer Society Press.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>D.</given-names>
            <surname>Newman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Baldwin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Cavedon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Karimi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Martinez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Scholer</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Zobel</surname>
          </string-name>
          .
          <article-title>Visualizing search results and document collections using topic maps</article-title>
          .
          <source>Web Semantics: Science, Services and Agents on the World Wide Web</source>
          ,
          <volume>8</volume>
          (
          <issue>2</issue>
          -3):
          <fpage>169</fpage>
          -
          <lpage>175</lpage>
          ,
          <year>July 2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          [26]
          <string-name>
            <given-names>D.</given-names>
            <surname>Newman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Noh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Talley</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Karimi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Baldwin</surname>
          </string-name>
          .
          <article-title>Evaluating topic models for digital libraries</article-title>
          .
          <source>In JCDL '10: Proceedings of the 10th annual joint conference on Digital libraries</source>
          , pages
          <fpage>215</fpage>
          -
          <lpage>224</lpage>
          , New York, NY, USA,
          <year>2010</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          [27]
          <string-name>
            <given-names>K. A.</given-names>
            <surname>Olsen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. R.</given-names>
            <surname>Korfhage</surname>
          </string-name>
          ,
          <string-name>
            <surname>K. M. Sochats</surname>
            ,
            <given-names>M. B.</given-names>
          </string-name>
          <string-name>
            <surname>Spring</surname>
            , and
            <given-names>J. G.</given-names>
          </string-name>
          <string-name>
            <surname>Williams</surname>
          </string-name>
          .
          <article-title>Visualization of a document collection: the vibe system</article-title>
          .
          <source>Inf. Process. Manage.</source>
          ,
          <volume>29</volume>
          (
          <issue>1</issue>
          ):
          <fpage>69</fpage>
          -
          <lpage>81</lpage>
          ,
          <year>1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          [28]
          <string-name>
            <surname>Questel</surname>
          </string-name>
          .
          <article-title>Qpat, intellectual property patent and trademark searching</article-title>
          . http://www.qpat.com/,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          [29]
          <string-name>
            <given-names>D.</given-names>
            <surname>Ramage</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dumais</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Liebling</surname>
          </string-name>
          .
          <article-title>Characterizing microblogs with topic models</article-title>
          . In A. Cohn, editor,
          <source>Proceedings of the Fourth International on Weblogs and Social Media</source>
          , pages
          <fpage>130</fpage>
          -
          <lpage>137</lpage>
          . AAAI Press,
          <fpage>23</fpage>
          -
          <lpage>26</lpage>
          May
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          [30]
          <string-name>
            <given-names>J.</given-names>
            <surname>Reilly</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>McCarthy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>McGinty</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B.</given-names>
            <surname>Smyth</surname>
          </string-name>
          .
          <article-title>Dynamic critiquing</article-title>
          .
          <source>In ECCBR</source>
          , pages
          <fpage>763</fpage>
          -
          <lpage>777</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          [31]
          <string-name>
            <given-names>P.</given-names>
            <surname>Resnick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Iacovou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Suchak</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bergstrom</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Riedl</surname>
          </string-name>
          . Grouplens:
          <article-title>An open architecture for collaborative filtering of netnews</article-title>
          .
          <source>In CSCW</source>
          , pages
          <fpage>175</fpage>
          -
          <lpage>186</lpage>
          ,
          <year>1994</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref32">
        <mixed-citation>
          [32]
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Rushall</surname>
          </string-name>
          and
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Ilgen</surname>
          </string-name>
          . Depict:
          <article-title>Documents evaluated as pictures. visualizing information using context vectors and self-organizing maps</article-title>
          .
          <source>In Proceedings of the 1996 IEEE Symposium on Information Visualization (INFOVIS '96)</source>
          ,
          <source>INFOVIS '96</source>
          , pages
          <fpage>100</fpage>
          -, Washington, DC, USA,
          <year>1996</year>
          . IEEE Computer Society.
        </mixed-citation>
      </ref>
      <ref id="ref33">
        <mixed-citation>
          [33]
          <string-name>
            <given-names>U.</given-names>
            <surname>Shardanand</surname>
          </string-name>
          and
          <string-name>
            <given-names>P.</given-names>
            <surname>Maes</surname>
          </string-name>
          .
          <article-title>Social information filtering: Algorithms for automating "word of mouth"</article-title>
          .
          <source>In CHI</source>
          , pages
          <fpage>210</fpage>
          -
          <lpage>217</lpage>
          ,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref34">
        <mixed-citation>
          [34]
          <string-name>
            <given-names>B.</given-names>
            <surname>Shneiderman</surname>
          </string-name>
          .
          <article-title>The eyes have it: A task by data type taxonomy for information visualizations</article-title>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref35">
        <mixed-citation>
          [35]
          <string-name>
            <given-names>B.</given-names>
            <surname>Shneiderman</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Wattenberg</surname>
          </string-name>
          .
          <article-title>Ordered treemap layouts</article-title>
          .
          <source>Information Visualization</source>
          , IEEE Symposium on,
          <volume>0</volume>
          :
          <fpage>73</fpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref36">
        <mixed-citation>
          [36]
          <string-name>
            <given-names>W.</given-names>
            <surname>Spangler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kreulen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Lessler</surname>
          </string-name>
          . Mindmap:
          <article-title>Utilizing multiple taxonomies and visualization to understand a document collection</article-title>
          .
          <source>Hawaii International Conference on System Sciences</source>
          ,
          <volume>4</volume>
          :
          <fpage>102</fpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref37">
        <mixed-citation>
          [37]
          <string-name>
            <given-names>J. T.</given-names>
            <surname>Stasko</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Görg</surname>
          </string-name>
          , and
          <string-name>
            <given-names>Z.</given-names>
            <surname>Liu</surname>
          </string-name>
          .
          <article-title>Jigsaw: supporting investigative analysis through interactive visualization</article-title>
          .
          <source>Information Visualization</source>
          ,
          <volume>7</volume>
          (
          <issue>2</issue>
          ):
          <fpage>118</fpage>
          -
          <lpage>132</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref38">
        <mixed-citation>
          [38]
          <string-name>
            <given-names>N.</given-names>
            <surname>Tintarev</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Masthoff</surname>
          </string-name>
          .
          <article-title>A survey of explanations in recommender systems</article-title>
          .
          <source>In Data Engineering Workshop</source>
          , 2007 IEEE 23rd International Conference on, pages
          <fpage>801</fpage>
          -
          <lpage>810</lpage>
          . IEEE,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref39">
        <mixed-citation>
          [39]
          <string-name>
            <surname>Wanner</surname>
          </string-name>
          .
          <article-title>Towards content-oriented patent document processing</article-title>
          .
          <source>World Patent Information</source>
          ,
          <volume>30</volume>
          (
          <issue>1</issue>
          ):
          <fpage>21</fpage>
          -
          <lpage>33</lpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref40">
        <mixed-citation>
          [40]
          <string-name>
            <given-names>J. A.</given-names>
            <surname>Wise</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. J.</given-names>
            <surname>Thomas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Pennock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Lantrip</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pottier</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Schur</surname>
          </string-name>
          , and
          <string-name>
            <given-names>V.</given-names>
            <surname>Crow</surname>
          </string-name>
          .
          <article-title>Visualizing the non-visual: Spatial analysis and interaction with information from text documents</article-title>
          . In N. D. Gershon and S. Eick, editors,
          <source>IEEE Information Visualization '95</source>
          , pages
          <fpage>51</fpage>
          -
          <lpage>58</lpage>
          . IEEE Computer Soc. Press,
          <fpage>30</fpage>
          -
          <lpage>31</lpage>
          Oct.
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref41">
        <mixed-citation>
          [41]
          <string-name>
            <given-names>P. C.</given-names>
            <surname>Wong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Hetzler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Posse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Whiting</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Havre</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Cramer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Shah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Singhal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Turner</surname>
          </string-name>
          , and
          <string-name>
            <surname>J. Thomas.</surname>
          </string-name>
          <article-title>In-spire infovis 2004 contest entry</article-title>
          .
          <source>Information Visualization, IEEE Symposium on, 0:r2</source>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>