=Paper=
{{Paper
|id=Vol-2816/short7
|storemode=property
|title=Labelling on Academic library Websites
|pdfUrl=https://ceur-ws.org/Vol-2816/short7.pdf
|volume=Vol-2816
|authors=Tanja Svarre
|dblpUrl=https://dblp.org/rec/conf/ircdl/Svarre21
}}
==Labelling on Academic library Websites==
Labelling on Academic Library Websites
Tanja Svarre [0000-0002-5468-0406]
Aalborg University, 9000 Aalborg, Denmark
tanjasj@hum.aau.dk
Abstract. This paper studies labels on Danish academic library websites. Labels
are one amongst several elements that can support user interaction with library
websites and their related content and thus add to a reduction of the vocabulary
problem. A total of 2075 labels used on the websites of 21 academic libraries
with special obligations were analysed using a combination of content analysis
and clustering analysis. The findings show not only large variety in the use of
labels amongst the libraries but also a large concordance of labels used across
library domains and purposes. A cluster analysis of the labels reveals that some
libraries with similar purposes and functions also tend to be similar in their use
of labels, which indicates a shared terminology within domains, sectors and pur-
poses. The findings add to our understanding of the characteristics and variety of
recent labelling across libraries in the academic library sector.
Keywords: Academic libraries, Websites, Labelling
1 Introduction
Library websites increasingly serve as the point of contact between the library and its
users [1]. An academic library website represents the online portal to the many re-
sources, both digital and analogue, that are offered to students, researchers and other
users [2, 3]. Being able to locate relevant information on research library websites is
crucial for students, researchers and other users when approaching the library website.
Thus, the library website is the gateway to the online information and resources that are
necessary when acting within the academic world [4, 5].
Labelling is one of several elements that ensure the usability of academic library
websites [3]. It has previously been shown how terminology and labelling on library
websites challenge usability [3, 6–8]. The Danish National Statistics Office lists 22
Danish research libraries with special obligations. Most are connected to higher educa-
tion institutions, such as universities and university colleges. The remaining libraries
are associated with national museums for history and various aspects of the arts, except
for one, which serves as the national library of D enmark and a university library for
several universities across the country. The aim of this paper is to study the use of labels
on Danish academic library websites. It sheds light on the characteristics of academic
library labels and on how libraries differ in their use of labels in terms of communi-
cating with their users.
——————
Copyright © 2021 for this paper by its authors. Use permitted under Creative
Commons License Attribution 4.0 International (CC BY 4.0). This volume is
published and copyrighted by its editors. IRCDL 2021, February 18-19, 2021, Padua,
Italy.
2 Theory
Labelling is one of four systems within information architecture outlined by Morville,
Rosenfeld and Arango [9]. In web environments, labelling is used to represent under-
lying information and guide users towards relevant content. Labelling is considered a
representation of knowledge organising systems [9]. Thus, labelling follows the char-
acteristics of knowledge organising systems as being controlled or free [10], simple or
complex [11] and narrow or broad [12].
Users can experience a variety of challenges when interacting with an information
system. One of these is known as the vocabulary problem [13, 14]. The vocabulary
problem addresses how the same content can be described and identified in an endless
number of ways, depending on the user’s viewpoint. Several empirical studies of library
websites illustrate this phenomenon. For example, Dougan and Fulton [15], in their
usability study of an academic library website, found issues both with the specificity of
terms and general problems in understanding the website terminology. General confu-
sion in terms of terminology was also identified in Tidal’s [16] study of an academic
library website. Knowledge organising systems play an important role in reducing the
vocabulary problem and explicating the content that vocabulary represents, but, clearly,
they should be developed with an understanding of user terminology to support users
in their interaction with the websites and the content.
Morville et al. [9] distinguish between four types of labels: 1) contextual links, 2)
headings, 3) navigation system choices and 4) index terms. In this paper, the focus is
on navigation system elements, as the analysis is based on the labels extracted from the
websites under study.
3 Methodology
The Danish National Statistics Office lists 22 libraries as research libraries with special
responsibilities. We based the selection of libraries on the 2019 statistics. One library,
the Danish School of Media and Journalism Library, could not be crawled by the web
crawler due to technical issues. This left 21 libraries for the empirical part of this paper.
The libraries with abbreviations are listed in Table 1. To ensure the most extensive
versions of the websites, we used the Danish version for the current analysis. Most of
the websites offer English counterparts, but they usually contain limited information
compared to the Danish equivalents.
We collected the data on 3 September 2020. We used an open access web menu
crawler (http://webscompare.com/) to crawl the selected websites. The crawler scrapes
category labels using Xpath [17] on the basis of one or several web addresses entered
into the interface. The output is a csv file with a flat, alphabetical list of the labels found
on the websites. After the data collection, the data was cleaned and prepared for data
analysis. The scraper is not capable of handling Danish special letters (Æ, Ø, Å), which
are commonly used across the websites in this study, and it replaces the special letter
with a blank space. The blanks were therefore manually replaced with the correct spe-
cial letters.
The data analysis consisted of two elements. First, we carried out a quantitative con-
tent analysis of the web categories [18, 19], analysing per library the number of cate-
gories, the average length of categories and the average number of terms in the catego-
ries. Subsequently, we used the text analysis functionality of the analysis software
NVivo (version 12 Pro) (https://www.qsrinternational.com/nvivo-qualitative-data-
analysis-software/home) for analyses at the term level. Specifically, we used the word
frequency functionality and analysed the 1000 most frequent terms in the dataset. We
used grouping with synonyms to consider similar terms as one. The analysis was carried
out using the Danish language, and the most frequent categories were subsequently
translated into English for reporting in the current paper. Subsequently, we carried out
a cluster analysis on the basis of word similarity to identify similar libraries.
Table 1. Included libraries
Abbreviation Library name Type of library
AARCH Aarhus School of Architecture Library UNI
ABSAL University College Absalon Library UC
ARBMUS The Workers’ Museum Library OTHER
AUB Aalborg University Library UNI
CBS Copenhagen Business School Library UNI
CINEMA Danish Film Institute Library OTHER
DESMUS Design Museum Denmark Library OTHER
DIIS Danish Institute for International Studies Library OTHER
DKDM The Royal Danish Academy of Music Library UNI
DST Statistics Denmark Library OTHER
DTU Technical University of Denmark Library UNI
FB Royal Danish Defence Academy Library OTHER
KADK The Royal Danish Academy of Fine Arts Library UNI
KB The Danish Royal Library UNI/OTHER
PHB Copenhagen University College Library UC
POLAR Polar Library UNI
SDUB University of Southern Denmark Library UNI
UCLB UCL University College Library UC
UCNB UCN University College Library UC
UCSB University College South Denmark Library UC
VIAB VIA University College Library UC
Legend: The libraries are divided into three categories by purpose: UNI (serving a university or
university-like institution), UC (serving a university college) and OTHER (serving other types
of institutions like museums or organisations).
4 Results
The libraries included for analysis serve different purposes (Table 1). Six libraries qual-
ify as university college libraries, eight are located at universities or university-like in-
stitutions, and six libraries serve different organisations or museums. The Royal Library
has a role both as the national library of Denmark and the university library for several
Danish universities, resulting in two labels in the table: university library and other.
9
8
7
6
5
4
3 No
2 Yes
1
0
Figure 1. Number of libraries incorporated in the host institution website
Yes=incorporated; No=not incorporated
In many of the library cases, their organisational relations appear from the placement
of the library website in the web structure. All the UNI and OTHER libraries (14 of the
21 libraries) are incorporated into their host institution websites. The remaining seven
libraries, meaning all the UC libraries and the Danish Royal Library, have independent
websites with no institutional connection to their host organisation (Figure 1).
4.1 Term Distribution
Crawling the 21 research libraries, we found the distribution of categories that appears
in Table 2. The table shows a large variation in the number of labels on the library
websites and an average of 98.76 labels per website. A standard deviation of 84.97
illustrates the large variation between the libraries.
Table 2. Number of labels, average length of labels and average number of label terms
Mean Minimum Maximum Standard deviation
Number of labels 98.76 29 378 84.97
Length of labels (characters) 16.66 2 43 8.57
Number of terms 2.25 1 9 1.37
We used the independent sample T-test to test if the variation was related to whether
the library website is incorporated into the host organisation’s website or has its own
website. With no significant difference identified for the number of labels, the average
length of labels or the average number of terms in labels, this does not appear to be the
case.
4.2 Term Frequency
We used the term frequency functionality in NVivo to identify high-frequency terms in
the data set. The results of the analysis appear in Figure 2. “Search” and variations of
“Library” are the most frequent terms along with other library-related terms like “Ar-
chive”, “Books”, “Materials” and “Journals”. Another category of high-frequency
terms relates to the library as a service function. This category is exemplified by terms
like “Way” (representing street names and wayfinding in Danish), “Contact”, “Book-
ing”, and “Opening”, inviting users to use the library and library services like assistance
from a trained librarian. The organisational attachment of many of the libraries is also
reflected in the most frequent terms in labels. Here we see the university abbreviations
(“SDU”, “DTU” and “CBS”) and “Research”, “Education” and “Student”.
120
100
80
60
40
20
0
SDU
Way
Movie
Booking
Search
Books
Research
Contact
Journals
CBS
Denmark
Archive
Knowledge
Materials
Collaborate
Student
Education
News
Library
Statistics
Organization
Opening
DTU
Figure 2. Most frequent terms in labels
Furthermore, we analysed the number of libraries in which the high-frequency terms
occurred (Figure 3). This figure, to some extent, changes the impression of the most
frequent terms. As would be expected, the institution-specific terms only appear in the
related libraries, whereas general library-specific terms are used more generally across
the included libraries, with “Library”, “Search” and “Contact” as the most used terms.
25
20
15
10
5
0
Way
Booking
Search
Contact
Research
Journals
Movie
SDU
Denmark
CBS
Books
Materials
Collaborate
Knowledge
Archive
Education
Statistics
Library
Student
Organization
News
Opening
DTU
Figure 3. Number of libraries using the most frequent terms in labels
4.3 Cluster Analysis
We used the clustering functionality in NVivo to analyse the similarity between the
selected libraries. The results of the analysis appear in Figure 4, which illustrates the
two main clusters that evolved from the analysis.
Figure 4. Cluster analysis based on word frequency
The upper cluster consists of all the university college libraries in the population. It
is interesting to identify how the purpose and the target group of the library actually
influences the choice of terminology at these libraries. The lower and larger cluster
consists of a combination of university libraries and the category “other”. Here, the
picture is a bit more muddled than the upper cluster of the figure, but still some obser-
vations can be made. For instance, the Danish Film Institute (CINEMA) and the Design
Museum (DESMUS) libraries are so similar that they end up in the same cluster. If the
next level of the cluster is considered, they are also connected to another museum, the
Royal Danish Defence Academy Library (FB). Likewise, the Aarhus School of Archi-
tecture Library (AARCH) and the Royal Danish Academy of Fine Arts Library
(KADK), which amongst others are connected to architecture education in Copenha-
gen, are also so similar that they share a cluster in the figure.
5 Discussion and Concluding Remarks
Our analysis shows that the academic libraries with special obligations in Denmark
serve various institutions and purposes. They represent a large variation in their use of
labels, both in numbers and variety. Previous research has identified challenges with
library jargon on academic library websites [e.g. 16]. Considering the most frequent
labels used in the current study, the deliberate use of terminology does not appear to be
prevalent. However, users should be involved in further studies to obtain a deeper un-
derstanding of this issue.
The cluster analysis revealed that academic libraries with similar purposes also tend
to be similar in their use of labelling. Independent sample T-tests did not reveal that
this can be explained by whether the libraries are incorporated into their host institu-
tions’ websites and thereby have institution labels as part of their pool of labels. Instead,
it seems that the specific use of labels is similar, for instance, between university college
libraries, between some museum libraries and between the two Danish schools of ar-
chitecture. The findings indicate that the libraries in their labelling draw on a shared
terminology within their domains, which is a step towards reducing the vocabulary
problem in information interaction. Further studies with users within the specific do-
mains can further elaborate on how they understand and experience the vocabulary
problem within their domains.
References
1. Guay, S., Rudin, L., Reynolds, S.: Testing, testing: a usability case study at University of
Toronto Scarborough Library. Library Management. 40, 88–97 (2019).
https://doi.org/10.1108/LM-10-2017-0107.
2. Million, A.J.: Help Needed: Best Practices, Collaborative Advantage, and Library Websites.
International Information & Library review. 50, 312–318 (2018).
https://doi.org/10.1080/10572317.2018.1526851.
3. Silvis, I.M., Bothma, T.J.D., de Beer, K.J.W.: Evaluating the usability of the information
architecture of academic library websites. Library Hi Tech. 37, 566–590 (2019).
https://doi.org/10.1108/LHT-07-2017-0151.
4. Kim, Y.-M.: Users’ perceptions of university library websites: A unifying view. Library &
Information Science Research. 33, 63–72 (2011).
5. Hu, C.-P., Hu, Y., Yan, W.: An empirical study of factors influencing user perception of
university digital libraries in China. Library & Information Science Research. 36, 225–233
(2014).
6. Pant, A.: Usability evaluation of an academic library website: Experience with the Central
Science Library, University of Delhi. The Electronic Library. 33, 896–915 (2015).
https://doi.org/10.1108/EL-04-2014-0067.
7. Gillis, R.: “Watch Your Language!”: Word Choice in Library Website Usability. Partnership :
the Canadian Journal of Library and Information Practice and Research. 12, (2017).
https://doi.org/10.21083/partnership.v12i1.3918.
8. Kous, K., Pušnik, M., Heričko, M., Polančič, G.: Usability evaluation of a library website
with different end user groups. Journal of Librarianship and Information Science. 52, 75–90
(2020). https://doi.org/10.1177/0961000618773133.
9. Louis Rosenfeld, Peter Morville, Jorge Arango: Information architecture, For the Web and
Beyond. O`reilly Media, Inc., Sebastopol (2015).
10. Dubois, C.P.R.: Free text versus controlled vocabulary. Online Review. 11, 243–253 (1987).
11. Zeng, M.: Knowledge organization systems (KOS). Knowledge Organization. 35, 160–182
(2008).
12. Soergel, D.: Indexing and retrieval performance: The logical evidence. Journal of the Amer-
ican Society for Information Science. 45, 589–599 (1994).
https://doi.org/10.1002/(SICI)1097-4571(199409)45:8<589::AID-ASI14>3.0.CO;2-E.
13. Furnas, G.W., Landauer, T.K., Gomez, L.M., Dumais, S.T.: The Vocabulary Problem in Hu-
man-system Communication. Commun. ACM. 30, 964–971 (1987).
https://doi.org/10.1145/32206.32212.
14. Hearst, M.: Search User Interfaces. Cambridge University Press, Cambridge (2009).
15. Dougan, K., Fulton, C.: Side by Side: What a Comparative Usability Study Told Us About a
Web Site Redesign. Journal of Web Librarianship. 3, 217–237 (2009).
https://doi.org/10.1080/19322900903113407.
16. Tidal, J.: Creating a user‐centered library homepage: a case study. OCLC Systems & Ser-
vices: International digital library perspectives. 28, 90–100 (2012).
https://doi.org/10.1108/10650751211236631.
17. Luthfiyanto, A., Kusumo, D.S.: Extraction of Website Navigation Label Using A Multiple
Web Crawler: A Case Study on 14 University Websites in Indonesia. In: 2020 International
Conference on Data Science and Its Applications (ICoDSA). pp. 1–7. IEEE, New York
(2020).
18. Krippendorff, K.: Content Analysis: An Introduction to Its Methodology. SAGE, Thousand
Oaks (2004).
19. White, M.D., Marsh, E.E.: Content Analysis: A Flexible Methodology. Library Trends. 55,
22–45 (2006).