=Paper= {{Paper |id=Vol-1311/paper6 |storemode=property |title=How Can Heat Maps of Indexing Vocabularies be Utilized for Information Seeking purposes? |pdfUrl=https://ceur-ws.org/Vol-1311/paper6.pdf |volume=Vol-1311 }} ==How Can Heat Maps of Indexing Vocabularies be Utilized for Information Seeking purposes?== https://ceur-ws.org/Vol-1311/paper6.pdf
    How can heat maps of indexing vocabularies be utilized
             for information seeking purposes?

                      Peter Mutschke, Karima Haddou ou Moussa

            GESIS – Leibniz Institute for the Social Sciences, Cologne, Germany
             {peter.mutschke,karima.haddououmoussa}@gesis.org



       Abstract. The ability to browse an information space in a structured way by
       exploiting similarities and dissimilarities between information objects is crucial
       for knowledge discovery. Knowledge maps use visualizations to gain insights
       into the structure of large-scale information spaces, but are still far away from
       being applicable for searching. The paper proposes a use case for enhancing
       search term recommendations by heat map visualizations of co-word relation-
       ships taken from indexing vocabulary. By contrasting areas of different “heat”
       the user is enabled to indicate mainstream areas of the field in question more
       easily.

       Keywords. Knowledge Maps, Interactive Information Retrieval, Heat Maps,
       Information Seeking


1      Introduction

During the past decade interactive search interfaces [1] have emerged as important in
Information Retrieval (IR) research. The insight that the success of information seek-
ing mainly depends on the ability of an information system to properly support inter-
action between user and system has led to the establishment of Interactive Infor-
mation Retrieval (IIR) as a specific discipline within IR research. Corresponding to
this, whole-session retrieval issues [2] as well as search interfaces [3,4,5] became a
focal point in research, in particular with respect to exploratory searching [4,6]. Due
to the enormous increase of information spaces bibliometric enhanced IR models
addressing non-textual attributes of the domain under study became also more and
more important at the same time. This is particular true for the case of scholarly
searching [7,8]. Moreover, studies in interactive information seeking behavior have
confirmed that the ability to browse an information space in a structured way by ex-
ploiting similarities and dissimilarities between information objects is crucial for
knowledge discovery [9,10].
   Knowledge maps, on the other hand, use visualizations to gain insights into the
structure of large-scale information spaces. They can take very different forms such as
network visualizations, heat maps, tree maps or geographic map like arrangements of
information spaces [11-17]. However, knowledge maps are still far away from being
applicable as search interfaces for Digital Libraries. Most maps are static visualiza-
tions made for special purposes, and hence neither interactive [18] nor dynamic, i.e.
they do not adapt to the change of user perspective to an information space during
interaction. Thus, combining knowledge mapping with IR is still a challenging re-
search issue. There are just a few examples where visual concepts also used in
knowledge mapping have been applied to information systems, such as DANSEasy1,
which visualizes an archive’s category structure and its content in form of a dendro-
gram and a tree map to be used as a navigation tool through the information space.
Another example is PepBank2 which uses heat maps for visualizing and refining
search results.
   Interface studies have shown that a simple spatial interface layout performs better
than complex ones [3]. Heat maps are simple visualizations of data in a color-coded
2-dimensional matrix where cells have a particular color index indicating remarkable
values of the matrix. To the best of our knowledge, heat maps have not been used for
the query formulation process so far. This position paper discusses a use case for us-
ing heat map visualizations of relationships between indexing terms to enhance search
term recommendations.


2      Using Heat Maps of Co-Word Relationships for Searching

A particular point of failure of current information systems is the vagueness between
user search terms and the terms used for indexing the documents to be retrieved, i.e.
the indexing terms which are usually based on a controlled vocabulary such as a the-
saurus [19]. Search term recommenders (STRs) provide models that map a user’s
search term to more appropriate terms. The model proposed by [7], for instance, maps
search terms to indexing terms on the basis of a co-word analysis and recommends
indexing terms that strongly co-occur with the search term. The expectation here is
that retrieval quality will increase when indexing terms are used for searching. In-
deed, a retrieval evaluation showed that the use of STRs relying on controlled vo-
cabulary has a great potential to improve the precision of a search significantly [7].
    However, search term recommendations usually appear in forms which do not as-
sist the user in locating the information need within the wider information space. The
STR provided by the Social Science literature portal sowiport3, for instance, displays
recommended indexing terms in a drill down menu (see Fig. 1, cp. [7]). From that list
the user can select more appropriate terms for searching. The major problem however
is that the user’s choice is not facilitated by further potentially helpful information, in
particular structural information about the semantic contexts in which a recommended
term appears.




1
  http://www.drasticdata.nl/ProjectDANSEasy/indexMultipleAssignments.htm
2
  http://pepbank.mgh.harvard.edu/
3
  http://sowiport.gesis.org/
    Fig. 1. Search Term Recommendations in Sowiport. Recommended indexing terms strongly
      co-occurring with the search term are displayed in a one-dimensional drilldown menu4.

Our approach therefore is to extend the initial list of term suggestions by further in-
dexing terms that frequently co-occur with the search term as well as with the initially
suggested terms. This yields a two-dimensional space of co-word relationships where
the initially recommended terms represent one dimension (first order terms) and the
indexing terms co-occurring with the search term as well as with first order terms
represent the other dimension (second order terms). A natural way to visualize such a
matrix of co-word relationships is a heat map visualization (see Fig. 2) which displays
the strength of each relationship (its “heat”) by a color on a scale ranging from red
(indicating high values) to blue (indicating low values). To keep it simple just the
frequency of co-occurrences was taken to indicate the strength of a co-word relation-
ship (see numbers in the cells of Figure 25). The background color code of the cells of
the map is calculated according to the ratio of the individual frequencies to the maxi-
mum and minimum values of the matrix. By this color-coded visualization the heat
map is divided into “hot” (red), “warm” (yellow, green) and “cold” (blue) areas. Thus,
red cells point to topic combinations which appear most frequently, blue cells corre-
spond to areas which appear least frequently in the map. Red cells therefore represent
issues which are more heavily discussed in the research field (“hot” topics of the
fields). Accordingly, blue cells represent issues which are less heavily discussed
(compared to “hot” fields). Hence, for red areas the user can expect to find more doc-
uments than for blue areas.

4
  Here, the search term “violence” is mapped to the indexing terms “Gewalt” (“Violence”),
   “Gewaltbereitschaft” (“Propensity to violence”), “Jugendlicher” (“Adolescent”), “Krieg”
   (“War”), and “Rechtsradikalismus” (“Right-wing radicalism”), based on the thesaurus for
   the Social Sciences provided by GESIS.
5
  The map displays the frequency of documents containing the respective term combination
   (see numbers in the cells as well as numbers next to the terms). Thus, the total number of
   documents containing the search term “Violence” is 9718; the total number of documents
   containing the search term “Violence” and the first order term “Adolescent” is 1934; the to-
   tal number of documents containing the search term “Violence” and the second order term
   “Right-wing radicalism” is 846; and the total number of documents containing the search
   term “Violence”, the first order term “Adolescent” as well as the second order term “Right-
   wing radicalism”is 405.
   The goal of the heat map visualization is to enable the user to indicate mainstream
areas of the research field more easily. This is relevant for the case where a user starts
with a search term and needs an overview of main issues of the field in question. Fig-
ure 2 displays, for the example from Figure 1, a heat map of indexing terms that are
closely related to the search term “Violence”. The column headings of the heat map
show the 10 first order terms that most frequently co-occur with the search term. The
terms are displayed in descending order of the frequency of their co-occurrence with
the search term, i.e. “Adolescent”, followed by “Developing country”, “Propensity to
violence” and so on. The row headings of the heat map display the top three second
order terms, i.e. the three indexing terms that most frequently co-occur with the re-
spective first order term as well as with the search term6. By this, the heat map visual-
izes tuples of indexing terms as well as their “heat” with respect to the search term.




Fig. 2. Heat Map Visualization of Co-Word Relationships. The map is divided into “hot” (red),
“warm” (yellow, green) and “cold” (blue) areas according to the frequency of co-occurrences.
 Red cells point to topic combinations which appear most frequently, blue cells correspond to
                 topic combinations which appear least frequently in the map.

The heat map displayed in Figure 2 clearly indicates the combination “Developing
country”+“Asia” and “Developing country”+“Africa” as “hot” areas of research in the
context of „Violence“, followed by “Adolescent”+“Right-wing radicalism”, “Adoles-
cent”+“Child”, “Developing country”+“Latin America”, “Right-wing radical-
ism”+“Xenophobia” and “Propensity to violence”+”Adolescent”. By clicking a term
or into a cell the user can browse the documents related to topic combination in ques-
tion.

6
    The list of second order terms is reduced to a disjoint set. For this example, this yields a list of
     19 terms (instead of 10x3=30 terms).
   It is important to note that instead of visualizing the entire information space by a
heat map, the approach here is to reduce the information space to the fraction that
matches the information need of the user. The basic idea is to dynamically adapt the
heat map of term recommendations when the user clicks into the map or changes
search terms, i.e. throughout the entire retrieval process.


3      Conclusion and Future Work

The paper proposes a use case for using heat map visualizations of term co-
occurrence matrices which can be used as a visual navigation tool through an infor-
mation space. The approach is to provide a “big picture” view of the relevant fraction
of the information space that can be adapted dynamically during a search session.
Contrasting areas of different “heat” within a set of co-word relationships highly as-
sociated with the user’s search term enables the user to indicate main issues of the
field in question more easily. We assume that this approach might also help the user
to better locate a particular information need within a larger information space. The
paper may also provide a principle idea of how knowledge maps of information spac-
es can be integrated in information seeking processes.
   For future work we intend to evaluate the proposed approach with real users on the
basis of an evaluation panel [20], apply the approach also to result sets, find more
suitable metrics and color codes for calculating and displaying heat maps, and – final-
ly – develop a generic model for the use of heat map visualizations of term sugges-
tions in interactive retrieval systems.


References
 1. Ruthven, I., Kelly, D. (eds.): Interactive Information Seeking, Behaviour and Retrieval.
    London:Facet Publishing (2011)
 2. Belkin, N., Dumais, S., Kando, N. and Sanderson, M.: NII Shonan meeting on Whole-
    Session Evaluation of Interactive Information Retrieval Systems, October 2012,
    http://www.nii.ac.jp/shonan/seminar020 (2012)
 3. Hearst, M.A.: Search User Interfaces. Cambridge University Press. New York (2009)
 4. Wilson, M.L., Kules, B.: From Keyword Search to Exploration: Designing Future Search
    Interfaces for the Web. Foundations and Trends in Web Science 2(1), 1-97 (2010)
 5. Dix, A.: Introduction to Information Visualization. In: Agosti, M. et al. (eds.): Information
    retrieval meets Information Visualization. PROMISE Winter School 2012. LNCS 7757,
    Springer, Berlin-Heidelberg, 1-27 (2013)
 6. Marchionini, G.: Exploratory Search: from Finding to Understanding. Communications of
    the ACM 49(4), 41-46 (2006)
 7. Mutschke, P., Mayr, P., Schaer, P., Sure, Y.: Science models as value-added services for
    scholarly information systems. In: Scientometrics, 89 (1), 349-364 (2011)
 8. Mayr, P., Scharnhorst, A., Larsen, B., Schaer, P., Mutschke, P.: Bibliometric-enhanced in-
    formation retrieval. In: de Rijke, M. et al. (eds.): Advances in information retrieval: 36th
    European Conference on IR Research, ECIR 2014, Amsterdam, The Netherlands, April
    13-16. LNCS vol. 8416, Berlin: Springer, 798–801 (2014)
 9. Nicholas, D. et.al.: Reappraising Information Seeking Behaviour in a Digital Environment:
    bouncers, checkers, returners, and the like, Journal of Documentation, 60(1), 24-43 (2004)
10. Westerman, S.J.; Collins, J.; Cribbin, T.: Browsing a Document Collection Represented in
    Two- and Three-dimensional Virtual Information Space, International Journal of Human-
    Computer Studies, 62(6), 713-736 (2005)
11. Börner, K.; Chen, C.; Boyack, K. W.: Visualizing knowledge domains. Annual Review of
    Information Science and Technology, 37, 179-255 (2003)
12. Shiffrin, R.; Börner, Katy: Mapping Knowledge Domains. Proceedings of the National
    Academy of Sciences of the United States of America, Vol. 101(Suppl. 1) (2004)
13. Börner, K.: The Atlas of Science. MIT Press (2010)
14. Klavans, R.; Boyack, K. W.: Toward an objective, reliable, and accurate method for meas-
    uring research leadership. Scientometrics, 82(3), 539-553 (2010)
15. Skupin, A.; Biberstine, J.R.; Börner, K.: Visualizing the Topical Structure of the Medical
    Sciences: A Self-Organizing Map Approach. PLoS ONE 8 (3): e58779 (2013)
16. Sahal, A.A.; Wyatt, S.; Passi, S.; Scharnhorst, A.: Mapping EINS - An exercise in map-
    ping the Network of Excellence in Internet Science. Conference Proceedings of the First
    International Conference on Internet Science, Brussels, 75-78 (2013)
17. Boyack, K. W.; Klavans, R.: Creation of a highly detailed, dynamic, global model and map
    of science. Journal of the American Society for Information Science and Technology
    (2013, forthcoming)
18. Akdag Salah, A.A.; Scharnhorst, A.; Ten Bosch, O.; Doorn, P.; Manovich, L.; Salah, A.A.;
    Chow, J.: Significance of Visual Interfaces in Institutional and User-Generated Databases
    with Category Structures. Proceedings of the second international ACM workshop on
    “Personalized access to cultural heritage (PATCH)”. ACM Multimedia Conference, Nara,
    Japan (2012)
19. Petras, V.: Translating Dialects in Search: Mapping between Specialized Languages of
    Discourse and Documentary Languages. University of California, Berkley (2006)
20. Kern, D.; Mutschke, P.; Mayr, P.: Establishing an Online Access Panel for Interactive In-
    formation Retrieval Research. In: 2014 IEEE/ACM Joint Conference on Digital Libraries,
    473–474. London, United Kingdom: IEEE. doi:10.1109/JCDL.2014.6970231 (2014).