=Paper=
{{Paper
|id=Vol-1338/paper6
|storemode=property
|title=Just Looking Around: Supporting Casual Users Initial Encounters with Digital Cultural Heritage
|pdfUrl=https://ceur-ws.org/Vol-1338/paper_6.pdf
|volume=Vol-1338
|dblpUrl=https://dblp.org/rec/conf/ecir/WalshH15
}}
==Just Looking Around: Supporting Casual Users Initial Encounters with Digital Cultural Heritage==
Just looking around: Supporting casual users initial
encounters with Digital Cultural Heritage
David Walsh Mark M. Hall
Department of Computing Department of Computing
Edge Hill University Edge Hill University
St Helens Road St Helens Road
Ormskirk, L39 4QP Ormskirk, L39 4QP
United Kingdom United Kingdom
walshd@edgehill.ac.uk mark.hall@edgehill.ac.uk
ABSTRACT in storage rather than on display. To address this limitation
Cultural Heritage institutions have developed numerous ways CH institutions have been engaged in a massive digitisation
of supporting visitors who have simply wandered in through process, making large parts of their holdings available to the
the front door. However, for their digital collections, the CH public on their websites.
institutions mostly provide a simple search-box, which sup- Access to these collections is primarily provided through
ports the expert user, but which does not support the casual the search box. This works very well for expert users who
user who has just stumbled across the collection. These ca- know exactly what they want and which keywords they need
sual users frequently have no goal or topic in mind, but just to use to find what they are looking for [25]. However, CH
want to have a look around what is available in the collec- institutions also have to support the casual user who has just
tion. For these users the blank search-box presents a signif- stumbled across their collection in the same way that they
icant obstacle, as without a goal or topic it is very difficult would wander into the CH institution’s physical space. For
to formulate an appropriate query. In this paper we pro- these casual users, who have no specific goal in mind when
pose extending current exploratory search and information they come to the collection, the blank search-box presents
seeking models to support the initial interaction between the an almost insurmountable problem [30]. To access anything
casual user and the collection. in the collection, they are required to know at least one
keyword and these users are not provided with any alter-
native way to find what might be available and develop an
Keywords overview of the collection [15, 23, 14]. This is illustrated by
information seeking, information retrieval, casual user, ex- the following quote from a casual user of such a collection:
ploration
So what use are the digital libraries, if all they
do is put digitally unusable information on the
1. INTRODUCTION web. [5]
Cultural Heritage (CH) institutions such as museums and
galleries are accustomed to dealing with visitors who arrive There is thus a clear need to support these casual users
at their doors with no knowledge of what they could see or in their use of the collection. However, in the information
even what they would like to see. At the moment of ar- seeking (IS) and information retrieval (IR) domains, the fo-
rival the only goal these visitors have is spending some time cus has always been on users with a more or less clearly
in the museum or gallery. CH institutions have developed defined goal, as the lack of goal breaks the fundamental as-
a number of successful strategies for supporting these visi- sumptions in all current IS and IR models [12, 31]. Much
tors by providing them with floor plans, fliers, guide-books, work has been done on supporting exploration and discovery
audio-guides, and guided tours. when the user has at least a very vague goal [13, 17, 7], but
The major limitation that CH institutions face is that there remains a gap in our understanding around that first,
the physical space available for displaying the institution’s casual interaction between the user and the collection.
artefacts and the time required to curate the various intro-
ductory guides severely limit the number of artefacts that
can be shown to visitors, with the majority of artefacts held
2. BACKGROUND
Opening the CH institutions’ archives to the world through
digitisation makes them available to a much wider audience
than just the museum curators and CH researchers [15].
With this expanded audience comes the requirement of sup-
porting users outside the core group of experts, who have
previously explored the CH institutions archives, to include
Copyright c 2015 for the individual papers by the papers’ authors. Copy- the casual user [9, 27]. This presents a problem to current
ing permitted for private and academic purposes. This volume is published information retrieval models and techniques, as these are
and copyrighted by its editors.
ECIR Supporting Complex Search Task Workshop ’15 Vienna, Austria generally built around the concept that the user has some
Published on CEUR-WS: http://ceur-ws.org/Vol-1338/. kind of goal, however vague, in mind.
The reasons users turn to digital information systems cover the user is interested in. Clustering and visualization ap-
the whole spectrum from (re-)locating a piece of information proaches can deal with the amount of data, but the resulting
they know exists to exploring an unknown topic to develop visualizations tend to suffer from information overload and
an understanding [6]. These interactions with digital infor- do not provide a usable overview over the large collections.
mation systems can roughly be classified as known-item or Similarly, faceted interfaces can process the amount of data
known-topic searches, where the user knows what they are available, but DCH collections are very heterogeneous [15]
looking for and what they expect to see, and exploratory and showing the most frequent 20 or 30 keywords does not
search interactions, where the goal is to explore, learn, in- give the user an overview and access to more than a very
terpret, synthesize, and understand [18, 28, 30]. small fraction of the total content.
The traditional IR model describes a simple loop consist- More importantly, however, is that all these theories and
ing of problem identification, query formulation, and result interfaces start with the assumption that the user has at
evaluation [25], which successfully supports the known-item least a very vague goal in mind. They do not model or
and known-topic search tasks [29]. To support the more support the completely undirected casual user.
open-ended exploration interactions, this basic model has
been expanded to create exploratory search models [18, 20],
which have much wider scope, complexity, and duration [19, 3. SUPPORTING THE CASUAL USER
3, 28].
To support the casual user in their initial interaction with
These models all treat the search process as if it is com-
the collection, the major change we propose is to let go of
pleted in a single session. However, the process of satisfying
the concept of the “information need” as the reason for in-
an information need will often extend over multiple search
teracting with an information system. The casual user has
sessions as the users slowly develop and refine their precise
a motivation for coming to the CH institution’s site, but
understanding of what they are looking for. A number of
this it not necessarily a need for information, they might
models of this extended process have been created to de-
just want to procrastinate . The focus for supporting the
scribe this information seeking journey [16, 26]. These mod-
casual user has to shift from supporting them in exploring
els all describe a process in which the user starts with a
and finding what they are looking for to supporting them in
very vague notion of what they are looking for and what the
understanding what is available in the collection and where
journey’s end-point will be. Then, as the user interacts with
they might start browsing.
the search system, they develop a clearer understanding of
We envision a number of different interfaces that could
their information need and their searches become evermore
enable such access. One approach would be to generate tex-
focused until they develop the final queries that satisfy their
tual summaries that describe the type of content available
information need.
from the collection. Such an approach would need to analyse
The final phases of the information seeking journey are
the individual items meta-data using an algorithm such as
generally well supported by the traditional search model
LDA [4], then combine that with a textual resource such as
and interfaces. For the earlier, more open-ended stages,
Wikipedia to generalise the topics, and finally generate tex-
a number of exploratory interfaces have been developed.
tual descriptions such as “The collection contains historical
Hierarchical systems [10] were intended to help organize
artefacts from ancient Egypt, space exploration, horology,
large sets of documents into groups or categories [8] en-
and a modern collection of oceanographic specimens.” The
abling searchers to perform more sophisticated browsing tac-
user could then click on any of the topics to get a summary
tics such as traversing and exploring nearest neighbour cat-
of the content in the selected area of the collection, enabling
egories [2, 30]. Clustering approaches [7] group together
them to freely explore.
related documents to give the user an overview over the
An alternative would be to use the topic structure to gen-
“topics” in their search results. Faceted classification [24,
erate an exploratory semantic map, that the users can inter-
18, 13, 21] generates a list of the most frequent keywords
act with and explore like they would a physical map. An-
for the collection (or search result) and shows these to the
other approach could be to look at developing an automatic
user. The user can then explore the collection by clicking
measure for the “interestingness” of items in the collection.
on the keywords, rather than having to type them into the
This could then be used to sample items from the collection
search box. Tag-clouds provide a similar visualization of the
to show the casual user the “highlights” of the collection.
most frequent keywords. Socially curated systems [22, 11,
The investigation of potential interfaces will have to be
1] allow users to curate their own mini-exhibitions and then
accompanied by a series of user studies that investigate how
share these with other users, providing the new and casual
casual users develop a topic they are interested in, when
user with a starting point for exploring the collection.
confronted with a new collection. This will enable us to
These approaches all suffer from a number of technical
extend the existing models for exploratory search and infor-
limitations, primarily around the difficulty of scaling to the
mation seeking by providing a more detailed understanding
massive amount of information that is available in modern
of the initial phase in which the user develops their informa-
Digital Cultural Heritage collections. The manual processes
tion need. This extension will enable information systems
that create hierarchical systems cannot deal with the mil-
to support the complete information journey, from the de-
lions of items that exist in modern big-data DCH collections
velopment of the information need to its final fulfilment.
and that need to be classified. Socially curated systems suf-
Finally, while in DCH this issue is particularly prevalent,
fer from the same lack-of-manpower issue and additionally
understanding the casual user who has no immediate need
to provide a comprehensive overview over a collection, they
could also have significant impact in the area of E-commerce.
would require so many mini-exhibitions that they simply
E-commerce is a major growth area, but currently does not
replace the problem of finding an item that the user is inter-
support browsing the available things in the same way that
ested in with the problem of finding a mini-exhibition that
you can browse through a shop.
4. REFERENCES Basic books, 1988.
[20] P. Pirolli. Powers of 10: Modeling complex
[1] E. Agirre, N. Aletras, P. Clough, S. Fernando, information-seeking systems at multiple scales.
P. Goodale, M. Hall, A. Soroa, and M. Stevenson. Computer, 42(3):33–40, 2009.
Paths: A system for accessing cultural heritage [21] P. L. Schmitz and M. T. Black. The delphi toolkit:
collections. In ACL (Conference System
Enabling semantic search for museum collections. In
Demonstrations), pages 151–156. Citeseer, 2013.
Museums and the Web 2008: the international
[2] M. J. Bates. Information search tactics. Journal of the conference for culture and heritage on-line, 2008.
American Society for information Science, [22] F. M. Shipman, R. Furuta, D. Brenner, C.-C. Chung,
30(4):205–214, 1979. and H.-w. Hsieh. Guided paths through web-based
[3] M. J. Bates. The design of browsing and berrypicking collections: Design, experiences, and adaptations.
techniques for the online search interface. Online Journal of the American Society for Information
Information Review, 13(5):407–424, 1989. Science, 51(3):260–272, 2000.
[4] D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent [23] M. Skov and P. Ingwersen. Exploring information
dirichlet allocation. the Journal of machine Learning seeking behaviour in a digital museum context. In
research, 3:993–1022, 2003. Proceedings of the second international symposium on
[5] C. L. Borgman. The digital future is now: A call to Information interaction in context, pages 110–115.
action for the humanities. Digital humanities ACM, 2008.
quarterly, 3(4), 2009. [24] D. A. Smith, A. Owens, A. Russell, C. Harris,
[6] K. Byström and K. Järvelin. Task complexity affects M. Wilson, et al. The evolving mspace platform:
information seeking and use. Information processing & leveraging the semantic web on the trail of the memex.
management, 31(2):191–213, 1995. In Proceedings of the sixteenth ACM conference on
[7] C. Carpineto, S. Osiński, G. Romano, and D. Weiss. A Hypertext and hypermedia, pages 174–183. ACM, 2005.
survey of web clustering engines. ACM Computing [25] A. Sutcliffe and M. Ennis. Towards a cognitive theory
Surveys (CSUR), 41(3):17, 2009. of information retrieval. Interacting with computers,
[8] M. Chen, M. Hearst, J. Hong, and J. Lin. Cha-cha: A 10(3):321–351, 1998.
system for organising intranet search results. 1999. [26] P. Vakkari, M. Pennanen, and S. Serola. Changes of
[9] A. S. Cifter and H. Dong. User characteristics: search terms and tactics while writing a research
Professional vs. lay users, 2009. proposal: A longitudinal case study. Information
[10] E. Clarkson, K. Desai, and J. D. Foley. Resultmaps: processing & management, 39(3):445–463, 2003.
Visualization for search interfaces. Visualization and [27] P. Vilar and A. Šauperl. Archival literacy: Different
Computer Graphics, IEEE Transactions on, users, different information needs, behaviour and
15(6):1057–1064, 2009. skills. In Information Literacy. Lifelong Learning and
[11] M. Hall, P. Goodale, P. Clough, and M. Stevenson. Digital Citizenship in the 21st Century, pages
The paths system for exploring digital cultural 149–159. Springer, 2014.
heritage’. In Clare Mills, Michael Pidd and Esther [28] R. W. White and S. M. Drucker. Investigating
Ward. Proceedings of the Digital Humanities Congress, behavioral variability in web search. In Proceedings of
2012. the 16th international conference on World Wide Web,
[12] M. Harvey, M. Wilson, and K. Church. Workshop on pages 21–30. ACM, 2007.
searching for fun 2014. In Proceedings of the 5th [29] R. W. White and R. A. Roth. Exploratory search:
Information Interaction in Context Symposium, IIiX Beyond the query-response paradigm. Synthesis
’14, pages 6–6, New York, NY, USA, 2014. ACM. Lectures on Information Concepts, Retrieval, and
[13] M. A. Hearst. Clustering versus faceted categories for Services, 1(1):1–98, 2009.
information exploration. Communications of the [30] M. L. Wilson and D. Elsweiler. Casual-leisure
ACM, 49(4):59–61, 2006. searching: the exploratory search scenarios that break
[14] K. Hornbæk and M. Hertzum. The notion of overview our current models. In Proceedings of HCIR, pages
in information visualization. International Journal of 28–31, 2010.
Human-Computer Studies, 69(7-8):509 – 525, 2011. [31] C. Ye and M. Wilson. The characteristics of casual
[15] A. Johnson. Users, use and context: supporting sessions in search behaviour logs, 2014.
interaction between users and digital archives. What
Are Archives?: Cultural and Theoretical Perspectives:
A Reader, pages 145–64, 2008.
[16] C. C. Kuhlthau. Inside the search process:
Information seeking from the user’s perspective.
JASIS, 42(5):361–371, 1991.
[17] C. D. Manning, P. Raghavan, and H. Schütze.
Introduction to information retrieval, volume 1.
Cambridge university press Cambridge, 2008.
[18] G. Marchionini. Exploratory search: from finding to
understanding. Communications of the ACM,
49(4):41–46, 2006.
[19] D. A. Norman. The psychology of everyday things.