=Paper= {{Paper |id=Vol-2068/esida1 |storemode=property |title=PIHVI: Online Forum Posting Analysis with Interactive Hierarchical Visualisation |pdfUrl=https://ceur-ws.org/Vol-2068/esida1.pdf |volume=Vol-2068 |authors=Jaakko Peltonen,Ziyuan Lin,Kalervo Jaäarvelin,Jyrki Nummenmaa |dblpUrl=https://dblp.org/rec/conf/iui/PeltonenLJN18 }} ==PIHVI: Online Forum Posting Analysis with Interactive Hierarchical Visualisation == https://ceur-ws.org/Vol-2068/esida1.pdf
       PIHVI: Online Forum Posting Analysis with Interactive
                     Hierarchical Visualization

                Jaakko Peltonen∗ , Ziyuan Lin∗ , Kalervo Järvelin∗ , and Jyrki Nummenmaa∗
          jaakko.peltonen@uta.fi, ziyuan.lin@uta.fi, kalervo.jarvelin@uta.fi, jyrki.nummenmaa@uta.fi
ABSTRACT                                                                or administrators of the venue. Online forums are big data:
We introduce PIHVI: a novel interactive system for visualizing          as a case study we use a popular Finnish discussion forum
and exploring a large hierarchical text corpus of online forum          “Suomi24” (www.suomi24.fi) spanning 16 years and 6.5 mil-
postings. The main view of the visual interface shows a large-          lion threads. Hierarchically organized discussion occurs also
scale scatter plot, created by flexible nonlinear dimensionality        for example in user commentaries on news sites whose news
reduction based on text contents of the postings, and we couple         have a hierarchy, in online reviews of products that have a hi-
it with a coloring optimized by a second dimensionality reduc-          erarchy, and so on; our work is applicable to such hierarchies
tion to represent the forum hierarchy. We exploit the hierarchy         as well but we focus on the discussion forum case.
to provide data-driven summaries of plot areas at multiple lev-
                                                                        In discussion forums, sections and their hierarchy are created
els of detail, allowing the user to quickly see and compare both
                                                                        by administrators as a simplified division of discussions in-
the content-based similarity of groups of posts and how near
                                                                        tended to represent a subset of prototypical user interests, and
they arise in the forum hierarchy. A user can move between
                                                                        such a division does not suffice to describe the true variety of
hierarchy levels, mark posts or spots of interest, filter posts by
                                                                        semantic content in online discussion. Our target users are ad-
content similarity and by location within the hierarchy, and
                                                                        ministrators and analysts of the forums: if the hierarchy does
inspect post contents. Experiments show the interface can
                                                                        not support the content, forum owners can add subsections or
reveal hidden semantic relationships between postings that
                                                                        shortcuts to promote content, aiming to increase user engage-
would be hard to find based on the known hierarchy alone.
                                                                        ment with the venue. An important task in data analytics of
ACM Classification Keywords                                             online forums is to perform visual analytics of how discus-
D.2.2 Software Engineering: Design Tools and Techniques                 sions and their underlying topics spread across the hierarchy.
User interfaces; H.5.2 Information Interfaces and Presenta-             When the data and the hierarchy are large this becomes a hard
tion: User Interfaces User-centered design; I.2.7 Artificial            task, as exhaustive inspection of all data or comparisons across
Intelligence: Natural Language Processing Text analysis                 all sections become unfeasible: computational support is then
                                                                        needed. However, most visual analytics tools for social media
Author Keywords                                                         discussion focus on other aspects such as temporal evolution
user interaction; visual analytics; text analysis.                      or networks of conversation participants.
                                                                        We propose a new system for analysis of online conversation in
INTRODUCTION                                                            hierarchical forums in a large scale, by an interactive interface
Online discussion often takes place in venues that have a hier-         that scales to large data (Figure 1; details in later sections). We
archical organization, such as large message boards organized           focus on analysis of content variation and its relationship to the
in sections, subsection, sub-sub-sections and so on. While              hierarchical sections; we do not focus on e.g. temporal filter-
some venues such as specialized message boards may use a                ing or other such aspects which are extensively studied in other
relatively flat organization having a small number of sections          systems and can easily be integrated as additional views for
directly under a root, other venues such as general-interest            our system. Main novelties: first work to use dimensionality
message boards have an extensive multilevel hierarchy form-             reduction based linking of views through optimized coloring;
ing a deep tree of sections. Users post new threads, and post           among the first to use large-scale dimensionality reduction in
comments and replies under threads, but the hierarchy of sec-           visual analytics of discussion venues; novel multi-level inspec-
tions is typically designed by owners of the discussion venue           tion of a large-scale scatterplot; novel use case - analysis of
* All the authors are affiliated with University of Tampere, Finland.   suitability of existing hierarchy in a venue to describe content
The work was supported by Academy of Finland decisions 295694           of the venue. We next discuss related work, introduce the
and 313748 and by Aalto Science-IT.                                     system, denoted PIHVI - Posting Analysis with Interactive Hi-
                                                                        erarchical Visualization, then describe our machine learning
                                                                        solution, describe a user study and discuss conclusions.

                                                                        RELATED WORK
                                                                        Visual analytics for social media has been an active research
                                                                        topic, as social media data are rich and heterogeneous in struc-
©2018. Copyright for the individual papers remains with the authors.    ture and thus difficult to model. A recent survey [2] taxono-
Copying permitted for private and academic purposes.                    mized methodologies on the field by Data sources – Twitter,
ESIDA ’18, March 11, 2018, Tokyo, Japan
Facebook, blogs, online forums, etc.; Entities of data – net-       various visual analytics for inspecting online forums. Sim-
work, geographic information, or text content; Goal of visual-      ple text search does not suffice when users aim to broadly
ization – pattern extraction, visual monitor, anomaly detection,    comprehend the information space with multiple exploratory
etc.; Visualization techniques – scatter plots, trees, node-link    searches and information needs [17]. For example, a set of
diagrams, flows, etc.; Target users – general public or analysts;   visualization designs showing different aspects from MOOC
Applications – journalism, politics, finance, sports, advertise-    forums, including posts, users, and threads at different scales
ments, etc. Our PIHVI system can be categorized as using            is considered, with an aim to support online education with the
interactive hierarchical scatter plots to analyze text content of   forums [5]. A recent work [19] focuses on revealing anoma-
general online forums for analysts, with the goals of extracting    lous information spreading occurring in social media along the
interesting patterns of threads, detecting outliers of threads      time-line with coordinated views, which particularly contains
like spam or novel topics, and advertising, to name a few. Be-      a scatter plot of threads created from Multidimensional scaling
low we summarize related work on using other visualization          (MDS) [1] without the hierarchical functionality. We provide
techniques to analyze aspects of online discussions.                views that help gain comprehensive understanding of ongoing
                                                                    discussion and how it relates to a multi-level hierarchy of the
Semantic features of threads in online forums. Bag-of-              whole discussion venue, rather than on relationships within
word (BoW) vectors like TF-IDF are typical to represent text        individual threads; moreover our focus is on mostly anony-
content for analysis [9, 13]. Topic facets to extract sentence      mous discussion where most users are unregistered so user or
level sub-topics are considered [6], The facets are shown to        user-group views are less useful.
reflect programming concepts for questions in programming
discussion forums, visualized in a prototypical system. An
alternative hierarchical semantic model for sentences incor-        OVERVIEW OF THE SYSTEM: PIHVI - POSTING ANALYSIS
porating discrete Fourier transform with a mixture of topic         WITH INTERACTIVE HIERARCHICAL VISUALIZATION
models was proposed [4]. In this first publication on PIHVI,        We now describe the proposed system PIHVI - Posting Analy-
we focus on the main goal of studying content relationships         sis with Interactive Hierarchical Visualization, on an overview
to the known hierarchy within the venue, and in order not to        level, describing the design principles and how the system
muddle that focus we use simple BoW-based features which            operates. In the next section we describe how the system is
already work well in capturing thread similarities in our experi-   implemented through dimensionality reduction methods.
ments. Other derived features like topical features are possible,
                                                                    Design Principles. The design principles of the PIHVI sys-
the rest of our method works for any choice of features.
                                                                    tem are: 1. The system should illustrate, in a compact way,
Showing posts from individual threads. In PIHVI, we em-             the overall variety of discussion over the online forum, both
phasize exploration based on the content-based hierarchical         in terms of semantic content, and the hierarchical variety of
scatter plot and the related tasks, thus we use a simple detail     discussion sections where threads can be posted. This answers
view showing plain texts from selected threads. However, we         the question: “Which kinds of topics are users discussing,
are aware of works that visually display thread content. This       and in which sections is the discussion happening?” 2. The
line of research usually focuses on visualizing the structure       system should let analysts efficiently browse semantic content
of the thread under inspection. For example, thread structures      of threads across the forum, on an overall level of trends and
can be encoded with indented rectangles [3], where each in-         on more detailed levels of individual thread groups, without
dentation corresponds to one level of nested reply to a post.       restricting browsing to boundaries of specified sections. This
Such thread visualization is orthogonal to PIHVI novelties and      answers the question: “Which discussion trends are individual
can be easily integrated into PIHVI.                                threads of interest related to?” 3. The system should compactly
                                                                    show relationships of threads in terms of semantic content and
Dimensionality reduction in visual analytics. Dimension-            position in the hierarchy. This answers the question: “What
ality reduction has been brought into the pipeline of visual        other similar threads exist for a thread of interest?”
analytics by providing a 2D representation of threads, to help      The above design principles should hold even when the num-
users to gain an overview. It is related to semantic feature con-   ber of threads, the number of sections in the hierarchy, and the
struction discussed above, if obtained features are given to di-    depth of the hierarchy, all become large. The resulting system
mensionality reduction algorithms. For example, t-distributed       for visual analytics should support key tasks in Shneiderman’s
Stochastic Neighbor Embedding (t-SNE) [7] has been consid-          [15] taxonomy of tasks for information visualizations, includ-
ered [16]. In PIHVI, we build our hierarchical exploration          ing the visual information seeking mantra overview first, zoom
functionality on top of a t-SNE based embedding. See Section        and filter, details on demand, and relate.
4 for details. Another branch of studies in this line is to let a
user personalize the representation by interaction. See, e.g.,      System Components. Following the design principles in Sec-
our work in [11, 10, 12] and references in [14]. We point out       tion 3, we devise the following components for large-scale
the novel goal of PIHVI is to study relationships of large-scale    interactive analysis on hierarchical forum posts, and to support
content to a large-scale hierarchy, hence the novelty is in in-     our user tasks in PIHVI. Screen shots of the system are shown
corporation of the hierarchy into the content visualization and     in Figure 1. The system has five linked views, where views
in hierarchical exploration of content through multiple views.      1-3 below are always shown and 4-5 are on demand:
                                                                    1. Content-based map: An interactive scatter plot of the en-
Forum analysis system combined with different visual an-            tire collection of threads from the online forum, which can
alytics. There have been integrated systems with views of           be zoomed to several detail levels. The map shows a spatial
                                                                   tion distributions: higher opacity reflects a higher density of
                                                                   threads and vice versa; color mixing suggests mixtures of dom-
                                                                   inating sections sharing similar content. Both patterns may
                                                                   interest a forum analyst/administrator. For interactivity, each
                                                                   dot, from different zoom levels, can be inspected by clicking
                                                                   to show a pop-up: the pop-up may contain either the thread
                                                                   title and the name of its section, if the dot represents a single
                                                                   thread, or the name of the dominating common parent section,
                                                                   if the dot represent a group of threads. To avoid occlusions of
                                                                   multiple pop-ups, the analyst can bring a pop-up to front or
                                                                   back by clicking or dragging it; also, when a pop-up is mouse-
                                                                   overed, the remaining pop-ups without the cursor hovered will
                                                                   turn half-transparent. Buttons on this view allow the analyst
                                                                   to drag a rectangle over an area of the map: all threads in the
                                                                   area will be described in the Threads and Sections panel.
                                                                   2. Thumbnail view of the map: This view shows the entire
                                                                   collection and a rectangle denoting which portion of the map
                                                                   an analyst is currently viewing; this lets the analyst maintain
                                                                   focus and context while browsing the collection.
                                                                   3. Thumbnail view of the thread hierarchy: This view shows
                                                                   a radial plot of the tree hierarchy of discussion sections, and
                                                                   shows how optimized colors of the sections vary across the
                                                                   hierarchy. Due to space constraint, showing all 2434 section
                                                                   names at once is not feasible: we show section names on de-
                                                                   mand when the analyst mouse-overs the tree nodes of sections.
                                                                   The window size can be enlarged by standard zoom controls
                                                                   to help mouse-over the nodes. This view allows the analyst
                                                                   to relate where each thread or group of threads seen in the
                                                                   content view arises from in the hierarchy. Buttons on this view
                                                                   allow the analyst to drag a rectangle over a subset of sections:
                                                                   those sections will be highlighted in the content-based map.
                                                                   4. Threads and Sections panel: When the analyst has selected
                                                                   an area of the content map, all threads in that area will be listed
                                                                   here as a scrollable list. The panel has two tabs: the Thread
                                                                   tab lists the threads by title and section, and the Sections tab
                                                                   lists all unique sections from which the threads arise.
Figure 1: The PIHVI interface. Top: the initial state, where       5. Thread Information panel: When the analyst clicks any
the content-view zoomed out to show an overview of the entire      individual thread in the content-based map (individual dot that
thread collection. Middle: section-filtering state. An analyst     represents a single thread in the scatter plot) or clicks the title
selects a subset of sections in the section panel, circles of      of any individual thread in the Thread tab of the Threads and
corresponding sections are then highlighted. Bottom: zoomed-       Sections panel, the full text of that thread is shown here.
in state for an analyst exploring a cluster of hearing-related
threads from different sections across the forum; the analyst      Walkthrough example. Here we depict a typical usage of
has selected a group of threads with a bounding box and is         PIHVI for thread discovery. Suppose an analyst is interested
viewing their titles and content of an example thread.             in sport and fitness related discussion but does not yet know
                                                                   how such discussion is related to other discussion within the
                                                                   Suomi24 forum. This can be a possible sequence of findings
organization of threads by content similarity: similar threads     and interactions from the analyst when discovering the threads
are shown nearby. The map is created by machine learning           of interest: 1. Initially the analyst views the content view
based dimensionality reduction. The map is overlaid with           zoomed out to show all Suomi24, and notices several clusters
colors representing sections and their similarity, again created   of color, indicating discussion similar in content and arising
by dimensionality reduction. At detailed zoom levels, each         from tight branches of the section hierarchy.
thread is shown as a dot, and colors of the dots indicate the      2. The analyst clicks some of these clusters to find a starting
section of the threads. At less detailed zoom levels, groups of    point for analysis. After a time the analyst find a green cluster
threads are shown as dots with diameters scaling with the num-     (at the bottom in Figure 1 top) whose pop-up label shows the
bers of threads in the groups, and colors of the dots indicate a   dominating section in the cluster is Sport and Fitness (“Urheilu
dominating common parent section of threads in the group, as       ja kuntoilu”). Alternatively, the analyst could have browsed
detailed later. We draw the different-sized dots with moderate     the section tree to find a Sport and Fitness section.
transparency, yielding color blending when there are overlaps      3. The analyst then zooms in to view individual threads within
between the dots, which approximates the content and sec-          the cluster, and draws a rectangle around them to quickly view
                                                                    Properties of the System. As desired in Design Principles,
                                                                    the proposed system directly supports several key tasks in
                                                                    Shneiderman’s [15] taxonomy, in particular the visual infor-
                                                                    mation seeking mantra is supported as follows:
                                                                    Overview: The two linked views (content-based map and the
                                                                    tree of the section hierarchy) together show an at-a-glance
                                                                    overview of threads in the entire online forum. The radial tree
                                                                    view shows an at-a-glance overview of the section organiza-
                                                                    tion: sections with numerous child sections are shown as wide
                                                                    branches, and sections with multiple levels of subsections un-
                                                                    der them are shown as long branches. On the other hand the
Figure 2: Possible interactions when an analyst is discovering      content-based map shows an at-a-glance overview of seman-
threads of interest. Left: the analyst has used section filtering   tic content across the whole forum, organized so that similar
to discover several locations where Sport and Fitness (“Urheilu     content is shown nearby in the map, and groups of similar
ja kuntoilu”) is discussed (green highlighted clusters). Right:     threads appear as clusters; moreover, section colors overlaid
the analyst zoomed in towards the middle-right green cluster,       on the map provide an easy overview of which sections have
and inspected nearby other clusters: near the green Health and      similar content, which sections have varied content appearing
Fitness cluster (“Urheilu ja kuntoilu”) are two clusters, Health    in multiple places on the map, and what the lowest common
(“Terveys”; violet color) and Dogs (“Koirat”; red color).           hierarchy section is for some cluster of content.
                                                                    Zoom and filter: the analyst can zoom into the levels of detail
                                                                    in the content-based map with the mouse wheel, and can filter
                                                                    threads from both linked views with selection boxes, e.g. to
                                                                    select threads in a particular location in the content map and
                                                                    highlight threads from a particular branch of the section tree.
their titles. The analyst discovers many of them are about          Details on demand: clicking on threads in the content based
ice hockey, a popular sport in Finland, such as threads about       map shows their content, clicking on thread-clusters gives the
Finnish ice hockey teams and leagues and about related sports       name of their lowest common section, and creating a selection
like “Jääpallo” (Bandy in Finnish).                                 box over an area of the content map opens a linked view listing
4. The analyst draws a rectangle around the Sport and Fit-          all threads in the area and all sections of those threads; and
ness section and its nearby sections, to detect where else such     clicking any thread in that list shows the thread content.
threads occur in the content map: Figure 2 (Left) shows that        Relate: the content-based map relates threads by their con-
besides the ice hockey related cluster at the bottom, there are     tent similarity by placing similar threads nearby in the map;
small isolated clusters (light green dots) at edges of the map,     the section tree relates sections by showing their parent-child
and a large cluster at center right in the map. Thus the analyst    relationships; and the overlay of section colors onto the map
discovered a new cluster of Sport and Fitness discussion.           relates similarity of thread content to similarity of their sec-
5. The analyst removes the section filtering since he/she is in-    tions. Inspecting these two linked views lets analysts quickly
terested in all sections in the area. The analyst zooms towards     study how threads and sections at different branches in the hi-
the discovered cluster, and notices it is located at the border     erarchy relate to each other in terms of content, and study how
of two larger clusters (Figure 2 Right): clicking the clusters      a cluster of semantically similar content is related to multiple
reveals their dominating sections as Health (“Terveys”; violet      sections where discussion of the content happens in the forum.
colored cluster) and Dogs (“Koirat”; red colored cluster).
6. Since the three clusters are close-by, this indicates they may   As noted above, PIHVI includes filtering/highlighting of the
have common topics. The analyst is then interested what com-        content map by selected sections. Highlighting could also be
monalities there are in threads arising from Sport and Fitness      done by results of text lookup searches; we do not focus on
and in threads arising from Health, so the analyst zooms in         simple text search for several reasons: 1. Our focus is not on
more on the plot, and draw a rectangle around the boundary          searching individual documents but on overall comprehension
between “Terveys” and “Urheilu ja kuntoilu”.                        of the forum where information needs of the user evolve as
7. The topics of the selected threads are listed in the detail      he/she learns about the data; such comprehension would not
panel; the analyst finds several threads on weight-loss in the      be well served by individual text lookups. 2. Naive text search
information panel.                                                  does not take into account complexity of natural language and
8. The analyst could continue investigating Sport and Fitness       could miss related threads if they did not share search terms
(e.g. moving to view some of the other small clusters of Sport      (due to synonyms etc.), whereas the content map is organized
and Fitness threads), or could e.g. investigate commonalities       by overall similarity of all words in the documents, and serves
between Health and Dogs in the same window–this would turn          exploration of discussion content better than simple search
out to be discussion of allergies when living with dogs.            would. Nevertheless, where text lookup is needed it is trivial
The above described interaction can go on as long as desired        to highlight lookup results like we do to selected sections.
by the analyst. For an analyst, e.g., in the forum administration
                                                                    We will compare PIHVI to a baseline system we implemented
team, such exploration may help understand the distribution
                                                                    to represent a more traditional interactive section-based brows-
of the threads, which can inspire, for example, reorganization
                                                                    ing interface, described in the User Experiment section.
of forum sections, or a digest on hot discussion topics.
DIMENSIONALITY REDUCTION FOR THE PIHVI SYSTEM                      other in the section tree) get close-by colors. This color en-
We describe the dimensionality reduction solution for creating     coding is a visual cue helping users distinguish sections based
the two main views – the content-based scatter plot and its        on their locations in the hierarchy. We do not require users to
color-based link to the section tree view, and then describe how   perceive section distances exactly from colors; it is enough to
zooming is implemented efficiently and using the hierarchy         notice similar colors (close-by sections) when browsing the
for summarization of thread groups.                                content. We take path lengths between sections in the section
                                                                   tree as input distances, and give weights to atomic paths –
Content-based Representation of Textual Data. PIHVI is             paths connecting a section to its direct parent or child section
initiated by a content-based two-dimensional scatter plot of       with wi↔ j = (Ci +C j )/2, where Ci is the number of child sec-
threads on the online forum (here Suomi24), followed by            tions of section i. Intuitively, if one end of the path has a large
other interface components facilitating exploratory data anal-     number of child sections, the parent is not connected to any
ysis on the forum. In PIHVI, dots are discussion threads           particular one of its child sections, implying a looser connec-
in the “content-based” scatter plot. By “content-based”, we        tion with the other end. The distance between two sections is
mean to pursue a two-dimensional representation such that          then the sum of weights of atomic paths connecting them. The
1. if two threads are similar in content, they should be placed    distances are reduced to coordinates in a 3D output space by
nearby; and 2. if two threads are dissimilar in content, they      multi-dimensional scaling (MDS): the 3D output coordinates
should be placed faraway. We use BoW models to capture             are then normalized to the unit cube where they can be used as
(dis)similarities between content of threads: the more words       RGB color coordinates (more sophisticated color encodings
two threads share, the more similar they are in content. We ex-    from perception theory are possible [8]). As a result, sections
tract thread features based on BoW as follows, and perform a       close to the root get mild colors, with deepening saturation
nonlinear dimensionality reduction with neighbor embedding.        for more specialized lower-level sections. We empirically find
BoW-based features in high-dimensional space. We                   MDS can yield a more continuous coloring than non-distance-
start with lemmatized tokens of Suomi24, provided by the           preserving dimensionality reduction algorithms like t-SNE.
Finnish Language Bank service (“Kielipankki”, http://www.          Zooming in the Content-based Plot. At the most detailed
kielipankki.fi). The resulting data set contains the lemma-
                                                                   level in the content plot, threads are individual dots and the
tized version of each token in each thread, along with other       user can easily see their content similarities from their spatial
meta-data of the token. We first calculate the TF-IDFs for each    relationships and their section similarities from their colors.
token, constituting raw features of each thread. We further        When the user zooms out, he/she should similarly be able to
process the raw features as follows: 1. Remove stop-words          see content and section similarities of larger groups of threads.
based on a Finnish stop-word list; 2. Only keep tokens that        However, naive zooming of a large scatter plot would not ac-
appear in at least k threads, to discard incidental tokens that    complish this: firstly, in large discussion forums it would not
do not carry shared semantic content, such as rare URLs or         be feasible to redraw huge numbers of threads into a scatter
non-word character strings. 3. For each thread, keep only l        plot quickly; secondly, in areas where threads arise from mul-
tokens with highest TF-IDF values. With stop-words already         tiple sections the zoomed-out view would become a clump of
removed, we assume these kept tokens are more informative          different-colored dots, making it hard to notice overall simi-
than others in each thread. 4. Normalize each obtained feature     larities in sections of such threads. We solve both problems
vector by dividing vector entries with their sum. We feed          by an intelligent zoom. The plot area is divided into a grid
the final features into t-SNE to obtain the two-dimensional        (the higher the zoom-out factor, the rougher the grid). Instead
content-based representation for PIHVI. Considering the size       of plotting individual threads we plot one circle per grid cell,
of our corpus, t-SNE is a suitable choice since 1. t-SNE has       summarizing all threads in it: the circle radius indicates num-
been shown to work well with high dimensional data; 2. scal-       ber of threads in the cell, and circles are drawn with slight
able t-SNE variants exist. We use a tree-based implementation      translucency to allow smooth overlap instead of occlusion.
[18] to achieve a O(N log N) complexity for N threads.             Grid cells are indexed by Hilbert curves in each zoom level
Section Distance based Coloring for Linking the Views. In          for better performance in querying points in cells. For each
the thumbnail view of the section hierarchy, we show a radial      grid cell the color of the circle must be chosen intelligently
tree plot of the hierarchy and draw section nodes in differ-       as the grid cell contains threads from multiple sections: we
ent colors so the analyst can distinguish the sections and link    compute the color by starting from each thread and travel-
them with the content-based map. There are a huge number           ing up the section hierarchy until we find the lowest section
of sections, and giving each a completely different color is       covering at least 40% of threads in the grid cell, denoted the
not possible, thus assigning colors to successfully link the two   lowest dominating section. Thus the color of a circle indicates
main views is nontrivial. We assume the analyst wishes to          diversity of content in it: the less saturated (i.e., the closer to
notice similar content from far-apart sections of the hierarchy,   the color of the root section), the more diverse the content.
and highly different content from close-by in the hierarchy,       Such adaptive coloring lets the user quickly browse forum
hence color differences should correspond to distances in the      content at multiple detail levels. At each level, circles, their
section hierarchy. We use dimensionality reduction to opti-        colors, and their overlap yield an at-a-glance view of content
mize colors so that the most important differences are encoded     and section variation. As the user zooms out, areas of mixed
with the most different colors, and close-by sections in the       sections become large diffuse circles colored ever closer to
hierarchy (having small path length from one section to the        the root color; as the user zooms in, such circles break into
subclusters with more vibrant colors, indicating thread groups       but lacks the dimensionality reduction based content view and
arising from tight branches of the section hierarchy.                focuses on the section hierarchy only, allowing the user to
                                                                     browse it in a large resolution. This baseline represents a
USER EXPERIMENT AND RESULTS                                          system that would be readily implementable without having
We perform a user experiment to show the advantage of PIHVI          access to a dimensionality reduction based visualization of
compared to a baseline, in controlled information seeking sce-       thread content and thread relationships. In the baseline system
narios representing typical analytics subtasks. We describe the      there are four linked views: 1. Scrollable large-scale radial
experiment design (the forum and thread corpus, task design,         plot of the section tree hierarchy: each node in the tree corre-
baseline system we designed&implemented, participants, and           sponds to one section of the forum.
evaluation criteria) and then the results.                           2. A thumbnail of the whole tree, with a square control for
                                                                     navigating inside the tree; moving the square over the thumb-
Details of the Online Forum and Corpus. We run our user
                                                                     nail changes the corresponding position in the large-scale plot
experiment on Suomi24, one of Finland’s largest online com-
                                                                     of the tree. This thumbnail view is similar to the one in PIHVI,
munities: it is mostly in Finnish with several Swedish sections;
                                                                     but focuses on navigation instead of filtering a content-based
in 2015 it had 0.8 million unique visitors per week1 . We con-
                                                                     plot which is not available in the baseline system.
sider the realistic scenario where analysts are analyzing data
                                                                     3. A panel that appears when the user clicks on any section in
of a specific time frame: here we filter data to the subset of
                                                                     the tree. It shows the thread titles from the selected section.
thread from 2005; our methods can naturally be used for data
                                                                     This panel is similar to the threads tab in the PIHVI system.
of any other years or for all of Suomi24. We set k = 20 and
                                                                     4. A window for full-text content of threads; clicking any
l = 200 in the preprocessing told in BoW-based features in
                                                                     thread in the title panel shows the full-text content, again simi-
high-dimensional space. We then have 314871 threads and
                                                                     lar to the corresponding panel in PIHVI.
80123 tokens in the corpus, arising from 2434 sections.
                                                                     The thumbnail window and the window containing the thread-
Task Design. In order to measure the success of our inter-           list panel and full-text panel are movable by dragging. Figure
face for visual analytics of hierarchical online discussion, we      3 shows a screen shot of the baseline system. Essentially, the
evaluate performance in a typical subtask that would often be        baseline corresponds to PIHVI without a content map, thread
present as part of an analytics session: given a thread of in-       content of sections is then naturally accessed through the sec-
terest, find semantically related threads across the online          tion hierarchy tree. This makes the two systems comparable.
forum. We point out that given a thread of interest, it is trivial
to browse other threads in the same section, and therefore we
focus on the nontrivial task of finding related content from
other sections across the online forum. The ability to find
related content comprehensively across an online forum is
crucial for comprehensive analytics of online discussion and
how topics of interest are spread across the forum.
We choose four cases where discussion on a semantic topic
spreads across several Suomi24 sections: for each task, we
indicate to the user one thread as a starting point, and ask to
find threads on the same topic from other sections. Users carry
out a training task to learn the interfaces: Kuulo (“hearing” in
Finnish; the task is to find threads about hearing outside of
the discussion section dedicated to hearing). We pick three          Figure 3: A state of the baseline system. Components are
tasks for the actual experiments: 1. Matti Nykänen (a Finnish        marked with labels: the tree of the section hierarchy, a thumb-
ski jumper; the task is to find threads about him outside of the     nail of the full tree with a draggable control for navigation, and
discussion section dedicated to him), 2. Svenskt (“Swedish”          a thread information panel showing thread titles and content.
in Swedish; the task is to find discussions in Swedish out-          In the figure, the threads in the information panel are from the
side of discussion sections dedicated to Swedish-language            selected section “Häävalmistelut” (“Wedding preparation” in
discussion), and 3. Hiukset (“hair” in Finnish; the task is to       Finnish, marked in black in the section tree).
find discussion about hair/hair loss/hair care/hair fashion out-
side of the section dedicated to hair). Tasks involve several
words of interest due to informal names, loose language, etc.,       Participants. We perform a user study with 12 participants,
and are not fully solvable by naive text search. All the tasks       including 1 female and 11 males; all participants are computer
correspond to the research questions we ask in Section 3.            science PhD students or post-doctoral researchers who know
                                                                     both Finnish and Swedish. Each participant first performs a
The Baseline System. We compare the proposed system in               training stage on both systems by using the systems to perform
terms of task performance and usability against a baseline           the training task. Next, the participant performs all three
system. The baseline is an interactive system for browsing           experiment tasks on both systems (PIHVI and baseline). The
threads by interacting with a radial visualization of the section    order of the task and system combinations (which order of the
hierarchy. It is comparable to the proposed PIHVI system,            three tasks; which system first in each task) was randomized
1 As reported in https://goo.gl/Ukt49K.                              and counterbalanced over the participants to avoid bias.
Evaluation Measures. We evaluate the experiment results in                                     Baseline            PIHVI
four ways. First, we inspect the results qualitatively: is the                Mean (Std)     1.045 (1.222)      4.137 (1.359)
PIHVI system generally better in the opinions of the partici-
pants? Secondly, we quantitatively measure task performance:         Table 1: Means (over tasks and users) and standard deviations
how many relevant threads was the user able to find in each          of quality measures R calculated from retrieved threads. A
task, compared to the amount of time spent? We measure this          larger mean suggests better relevance.
as a proportion of total relevant threads to time spent, averaged
over users, for each task. Thirdly, we measure user experi-                         System        Participant     System×Participant
ence for both systems by a questionnaire asking users to rate         p-value    6.82 × 10−13       0.938               0.440
their agreement to statements about the system on a 5-point
Likert scale from strong disagreement to strong agreement;           Table 2: Two-way ANOVA for relevance of retrieved threads.
the questions are based on the standard ResQue questionnaire         We have p < 0.05 in the system effect; the 95% confidence
with some modification for the online forum domain, and are          interval of the PIHVI system coefficient is [0.379, 4.819], fully
listed in Table 3. Fourthly, after the tasks we also ask each        above zero. Thus PIHVI performs significantly better than the
user which system they preferred overall.                            baseline in terms of relevance of retrieved threads.
Results
Qualitative analysis. We collect qualitative descriptions on         results are in Table 2. We observe the system effect is statis-
the systems by allowing participants to give free-form written       tically significant in this ANOVA, suggesting that PIHVI is
or oral feedbacks. Overall, the participants agree the tasks         significantly better than the baseline system when measuring
are clear. Besides good quantitative performance in tasks,           the relevance of the retrieved threads over time spent.
participants made discoveries, hence PIHVI helped them com-
prehend the discussion venue. One participant stated he dis-         User responses to post-task questionnaire. Table 3 shows
covered Swedish is sometimes used for trolling in Suomi24            the distribution of answers to the post-task questionnaire. The
when he was doing the task “Svenskt” on PIHVI. In task               questions can be divided into two groups, 13 positive questions
“Matti Nykänen”, one participant noted PIHVI can put threads         (higher user agreement is better) and 5 negative questions
on Matti Nykänen together, though people in the threads call         (lower user agreement is better). Roughly, positive questions
him different names. Another participant said discussions on         cover overall user experience and ability to perform tasks with
Matti Nykänen in marriage related sections reminded him of           the system, negative questions focus on ease of use. Half of the
                                                                     18 questions yielded statistically significant differences. All
the ski jumper’s series of short-lived marriages. Participants
                                                                     statistically significant differences are in favor of the PIHVI
commented PIHVI can give them threads from diverse sec-
                                                                     system. Table 3 lists in detail the p-values showing statistical
tions, compared to the baseline. Participants remarked that the
                                                                     significance (Q1, Q8, Q9, Q13–Q18) at the level of 0.05.
baseline system is reasonable, and one pointed out it is similar
to how people usually browse the forum; the only difference          Users wanted to use PIHVI frequently more than the baseline
is that in the baseline we organize sections radially, not lin-      (Q1), found PIHVI less cumbersome than the baseline (Q8),
early. As for the section filter, one participant stated it helps    felt more confident using PIHVI than the baseline (Q9), found
understand section distributions when the map is zoomed-out.         it more easy with PIHVI to find what they were interested in
Lastly, though some of them agreed that PIHVI is slightly            (Q13), found it more easy to find a thread to read with PIHVI
more complicated than the baseline, the learning curve of            (Q14), would use PIHVI more than the baseline to find threads
PIHVI is not steep, so they can learn to use it fairly quickly.      to read (Q15), would use PIHVI again more than the baseline
                                                                     (Q16), would use PIHVI frequently more than the baseline
Task performance. We evaluate task performance of partici-           (Q17), and would prefer PIHVI over other analytics systems
pants in terms of relevance of retrieved threads and time spent.     more than they would prefer the baseline over other systems
To evaluate relevance of threads retrieved by the participants,      (Q18). Thus overall, participants report they can find what
a native Finnish speaker assessed all retrieved threads without      they are interested in easily with PIHVI, and would like to use
knowing which system they were from, rating them on a three-         it to do exploratory data analysis frequently (Q1, Q13-Q18).
point scale from 0 to 2 for non-relevance, partial relevance,        Also, PIHVI is easier to use and gives the participant more
and full relevance. For each task on each system performed by        confidence during the exploration compared with the baseline
each participant, we use the quantity R = ∑i ri /T as the qual-      (Q8-Q9). The confidence may arise because PIHVI’s layout
ity measure for this (task, system, participant) combination,        reflects thread similarity better so that in PIHVI the relevance
where T is the number of minutes taken to finish the task, ri is     between the found threads and the tasks is clearer than in
the rated relevance of the i-th thread, with i going from 1 to the   the baseline. For brevity we simply list the questions that
number of retrieved threads in the combination. The measure          did not yield significant differences: Q2 I found the system
R can be interpreted as the amount of “retrieved relevance in        unnecessarily complex; Q3 I thought the system is easy to use;
unit time”. A larger R indicates that a larger number of more        Q4 I think that I would need the support of an analytic person
relevant threads are retrieved in a shorter time. We collect the     to be able to use this system; Q5 I found the various functions
R values from all results of the participants on the two sys-        in this system are well integrated; Q6 I thought there was
tems, and perform a two-way analysis of variance (two-way            too much inconsistency in this system; Q7 I would imagine
ANOVA; “participant”× “system”). Means and standard devi-            that most people would learn to use this system very quickly;
ations of the R values are shown in Table 1. And the ANOVA           Q10 I needed to learn a lot of things before I could get going
                                                                                   Baseline          PIHVI          p (Welch’s
 P/N    ID                                  Question                              mean (std)       mean (std)          t-test)
  +     Q1     I think that I would like to use this system frequently           1.917 (0.862)    3.750 (0.433)    9.866 × 10−6
  -     Q8     I found the system very cumbersome to use                         3.167 (1.280)    2.000 (0.816)       0.0198
  +     Q9     I felt very confident using the system                            3.083 (1.187)    4.000 (0.816)       0.0480
  +     Q13    Using this system to find what I am interested in is easy         1.750 (0.829)    4.083 (1.115)    1.780 × 10−5
  +     Q14    Finding a thread to read with the help of this system is easy     2.750 (1.090)    4.083 (1.037)    7.604 × 10−3
  +     Q15    I will use this system to find threads to read                    2.083 0.954)     3.333 (0.850)    3.757 × 10−3
  +     Q16    I will use this system again                                      2.083 (0.862)    3.167 (1.143)       0.0206
  +     Q17    I will use this system frequently                                 1.833 (0.799)    2.917 (1.115)       0.0165
               I will use this system rather than other systems of exploratory
   +    Q18                                                                      2.167 (0.898)    3.583 (0.640)    3.852 × 10−4
               data analysis for online forums

Table 3: The questions yielding statistical significance in the post-questionnaire, with mean sand standard deviations of user
agreement with the questions. For “positive” questions (Q1, Q3, Q5, Q7, Q9, Q11, Q12, Q13, Q14, Q15, Q16, Q17, Q18) higher
agreement is better, for “negative” questions (Q2, Q4, Q6, Q8, Q10) lower agreement is better.


with this system; Q11 The layout of this system interface is        8. T. Munzner and E. Maguire. Visualization analysis and
attractive; Q12 The layout of this system is adequate.                 design. CRC Press, 2015.
User preference. Users were also asked which system they            9. A. I. Obasa, N. Salim, and A. Khan. Hybridization of
overall prefer: all users preferred the proposed PIHVI system.         bag-of-words and forum metadata for web forum
                                                                       question post detection. Indjst, 8(32), 2016.
CONCLUSIONS
We presented PIHVI – Posting Analysis with Interactive Hi-         10. J. Peltonen, K. Belorustceva, and T. Ruotsalo.
erarchical Visualization, a novel visual analytics system for          Topic-relevance map: Visualization for improving search
hierarchical online forums based on multiple linked views.             result comprehension. In ACM IUI 2017, 2017.
The main view (interactive thread content scatterplot) and         11. J. Peltonen, M. Sandholm, and S. Kaski. Information
the coloring that links it to another prominent view (radial           retrieval perspective to interactive data visualization. In
tree of sections) are created by nonlinear dimensionality re-          Eurovis 2013 short papers, 2013.
duction from high-dimensional similarities between threads
and between sections, respectively. A user study on a large        12. J. Peltonen, J. Strahl, and P. Floreen. Negative relevance
Finnish online forum shows significant advantages compared             feedback for exploratory search with visual interactive
to a baseline without dimensionality reduction based views:            intent modeling. In ACM IUI 2017, 2017.
users found more relevant results with PIHVI, and preferred        13. Z. Qu and Y. Liu. Finding problem solving threads in
it over the baseline, with statistical significance. The system        online forum. In IJCNLP 2011.
helps analysts find and relate discussion content across large
hierarchies, an important need as forums keep growing.             14. D. Sacha, L. Zhang, M. Sedlmair, J. A. Lee, J. Peltonen,
                                                                       D. Weiskopf, S. North, and D. A. Keim. Visual
REFERENCES                                                             interaction with dimensionality reduction: a structured
 1. I. Borg and P. J. F. Groenen. Modern Multidimensional              literature analysis. IEEE TVCG, 23(1):241–250, 2016.
    Scaling: Theory and Applications. Springer, 2005.              15. B. Shneiderman. The eyes have it: A task by data type
 2. S. Chen, L. Lin, and X. Yuan. Social media visual                  taxonomy for information visualizations. In IEEE VL
    analytics. Computer Graphics Forum, 36(3), 2017.                   1996.
 3. K. Dave. Flash forums and forumreader: Navigating a            16. A. J. Soto, R. Kiros, V. Kešelj, and E. Milios. Exploratory
    new kind of large-scale online discussion. In CSCW 2004.           visual analysis and interactive pattern extraction from
                                                                       semi-structured data. ACM TiiS, 5(3):16:1–16:36, 2015.
 4. J. Duan, J. Zeng, and S. Zhang. Hierarchical semantic
    model for objectionable web text content detection. In         17. J. Teevan, C. Alvarado, M. S. Ackerman, and D. R.
    IEEE ASID 2012.                                                    Karger. The perfect search engine is not enough: A study
 5. S. Fu, J. Zhao, W. Cui, and H. Qu. Visual analysis of              of orienteering behavior in directed search. In CHI 2004.
    mooc forums with iforum. IEEE TVCG, 23:201–210,                18. Z. Yang, J. Peltonen, and S. Kaski. Scalable optimization
    2017.                                                              of neighbor embedding for visualization. In ICML 2013.
 6. I.-H. Hsiao and P. Awasthi. Topic facet modeling:              19. J. Zhao, N. Cao, Z. Wen, Y. Song, Y.-R. Lin, and
    Semantic visual analytics for online discussion forums. In         C. Collins. # fluxflow: Visual analysis of anomalous
    LAK 2015. ACM.                                                     information spreading on social media. IEEE TVCG,
 7. L. v. d. Maaten and G. Hinton. Visualizing data using              20(12):1773–1782, 2014.
    t-sne. JMLR, 9(Nov):2579–2605, 2008.