=Paper= {{Paper |id=Vol-2084/short16 |storemode=property |title=Challenges and Perspectives on the Use of Open Cultural Heritage Data Across Four Different User Types: Researchers, Students, App Developers and Hackers |pdfUrl=https://ceur-ws.org/Vol-2084/short16.pdf |volume=Vol-2084 |authors=Ditte Laursen,Henriette Roued-Cunliffe,Stig Svenningsen |dblpUrl=https://dblp.org/rec/conf/dhn/LaursenRS18 }} ==Challenges and Perspectives on the Use of Open Cultural Heritage Data Across Four Different User Types: Researchers, Students, App Developers and Hackers== https://ceur-ws.org/Vol-2084/short16.pdf
 Challenges and perspectives on the use of open cultural
 heritage data across four different user types: Research-
        ers, students, app developers and hackers
Laursen, Ditte1[0000-0002-07386562], Roued-Cunliffe, Henriette 2[1111-2222-3333-4444] and Sven-
                                ningsen, Stig1[0000-0001-7949-0740]
                         1
                             Head of Department, Royal Danish Library
                                   2
                                     University of Copenhagen
                                         dila@kb.dk



       Abstract. In this paper, we analyse and discuss from a user perspective and from
       an organisational perspective the challenges and perspectives of the use of open
       cultural heritage data. We base our study on empirical evidence gathered through
       four cases where we have interacted with four different user groups: 1) research-
       ers, 2) students, 3) app developers and 4) hackers. Our own role in these cases
       was to engage with these users as teachers, organizers and/or data providers. The
       cultural heritage data we provided were accessible as curated data sets or through
       API's. Our findings show that successful use of open heritage data is highly de-
       pendent on organisations' ability to calibrate and curate the data differently ac-
       cording to contexts and settings. More specifically, we show what different needs
       and motivations different user types have for using open cultural heritage data,
       and we discuss how this can be met by teachers, organizers and data providers.


1      Introduction

Open data in the heritage sector is a combination of the idea of openness in heritage
(such as public engagement and access) and the practicality of heritage data (in other
words the increasing amount of digitised heritage material as well as the dissemination
of this). GLAM institutions (Galleries, Libraries, Archives, and Museums) throughout
the Nordic countries (as well as in many parts of the world) have made a considerable
investment in digitising the documents and photographs held in their physical collec-
tions and then subsequently providing access to these digital materials to a global au-
dience [1]. This online access to digital heritage material has meant a massive step to-
wards more openness in heritage across the world. However, the extent to which these
datasets are open for reuse on platforms and circumstances other than those provided
by the institution is limited by organisational policies as well as technological con-
straints. In addition, little is known about how the data are utilised by different users.
           One of the biggest challenges when venturing into providing open heritage data
is to strike a good balance between data and interface. Not only in terms of how heritage
material is presented to the public but also in terms of how open data is presented to
potential end-users through the API. API or Application Programming Interface is an
interface that enables data access for the use in application programming. As with any
2


interface design an API comes in many shapes and sizes and each of the cases show
different needs in terms of interface access to heritage data. Another challenge is bal-
ancing the needs of end-users against the needs of institutions and data providers. Each
has a different perspective and the cases will illustrate some of the needs and potential
uses of end-users and in each case how this corresponds to the organisational perspec-
tive.
          In this paper, we analyse and discuss from a user perspective and from an or-
ganisational perspective four cases where we as teachers, organizers and/or data pro-
viders have interacted with four different user groups: 1) researchers, 2) students, 3)
app developers and 4) hackers. We deal specifically with the challenges and perspec-
tives of the use of open cultural heritage data available as structured data as curated
data sets or through API's. The cases we will present illustrate the range of challenges
and perspectives that are currently a large part of heritage institutions work with open
data. Each case provides a different view of the challenge of balancing datasets vs.
interfaces as well as end-user perspective vs. organisational perspective. The cases
range across four different potential user groups of heritage data and will show how the
institution interacts with these different end-users as well as the challenges involved
with each.


2      User cases

2.1    Researchers

In spring 2016, the Royal Danish Library invited researchers to join a series of data
sprints in the exploration of digitized material related to the former Danish colonies,
eg. photographs, maps, census records, newspapers, and toll registrations. The library's
university lab helped organize the sprints and used their university networks for recruit-
ing approx. 45 participants. The participants had different skills, for instance some were
subject experts in history, others had programming skills, and some just had an interest
in combining and learning about using digital data in new ways. Most participants came
from humanities, fewer had a technical background and even fewer were from social
science. In turn, we as data providers and curators brought a variety of competences
(eg. in the archival material and in GIS and programming).
               Some researchers came with specific research questions in mind, and oth-
ers with more explorative approaches. In both cases, the data was critically examined
and questions related to the data's reliability and trustworthiness sparked a continuous
dialogue with curators and data providers. In several cases, research questions were
reframed in an iterative process according to newly gained insights about the data. In
the same way, tools were tested and discussed among researchers and curators, eg.
methodological implications were discussed when analysing data with the tool Open-
Refine to reduce the complexity of data by clustering and merging almost similar data.
Researchers also had specific needs for working with the data, for instance some wanted
to work with high resolution files for visual exploration, so this was facilitated by the
library. The researchers emphasised the need to be able to search for material across
collections through the API. Furthermore, the data sprint showed the need to improve
                                                                                         3


the API documentation and interface. This motivated the library to further develop the
API with a how-to guide. In summary, the case illustrates researcher's continuous need
for knowledge about data and tools during a research process and the challenge for the
organization to meet unique and specific demands for original research questions while
at the same time providing a generic service.


2.2    Students

In spring 2017 the Department of Information Studies, University of Copenhagen ran
an elective course on open data (not only in heritage) for second year BA students. The
course included an introduction to political and ethical issues through for example cop-
yright and data protection legislation as well as practical exercises in using HTML,
CSS, PHP and JavaScript to transform open datasets into new applications. The stu-
dents met with open data providers, such as the Royal Danish Library, in order to gain
a deeper understanding of institutional issues in providing open heritage data. Further-
more, the students were invited to join a boot camp organised in order to kickstart the
idea generation and development process.
         The course resulted in a handful of projects and prototypes developed in groups
which presented different types of open data in various applications. In terms of herit-
age material one project, for example, used data from the Danish register of listed build-
ings provided by the Danish Agency for Culture and Palaces in order to build an app
prototype for that uses principles of serendipity to lead users through the city of Copen-
hagen. The brief for the students was to use existing datasets to build new interfaces
and visualisations of these data. They were instructed to clearly define a target audience
for their app and could also include knowledge on system evaluation from other
courses. One of the biggest challenges faced by the students was not so much using the
data but rather understanding the content of the datasets through the API. The data in-
terfaces they wanted to use were generally speaking not very well documented, with
the exception of the Flickr API.
         The students’ overall aim, in this case, was to pass the course exam. In order
to do this they had to produce a low or high fidelity prototype and discuss their use of
open data. However, they were not reliant on being able to access a well-documented
API as the discussion element of the course allowed them to also reflect over the chal-
lenges of using a particular dataset.
       This case shows in particular the value of presenting open data in an interface that
provides well documented content. Flickr is a good example of this with very specific
methods for accessing data (fx. Flickr.photos.getinfo which returns information about
specific photos if it receives the photo id). At this point in time with open heritage data
at this early stage one of the best outcomes of this case was to get the students in the
same room with data providers. Furthermore, getting students to test and discuss open
heritage datasets is a prime opportunity to get feedback to data providers on the useful-
ness and usability of their APIs and data documentation.
4


2.3    App developers

This case reports on the use of digitised historical aerial imagery from the Royal Danish
Library by app developers, retrieved through an API. The collection holds more than
900.000 historical aerial photographs, which have been digitised and geo-located
through a crowdsourcing platform using volunteered geographical information (VGI)
approach, where users can geo-locate the aerial images. The analogue collection has
historically been intensively used by consultancy firms and government authorities to
screen for potential environmental hazards, such as soil pollution and landfills. How-
ever, such use has been expensive due to the high cost of manual retrieval of the infor-
mation. The first phase of the project concentrated on the island of Fyn. By 2015 more
than 95% of the aerial photographs from Fyn were geo-located, thus constituting a com-
prehensive dataset of historical aerial imagery for this area [2].
      The collection is accessible through an API in addition to the crowdsourcing plat-
form. In January 2017 the Royal Danish Library was contacted by Geo Fyn A/S, a
company established by the municipalities on Fyn, to develop and maintain a jointly
operated Spatial Data Infrastructure (SDI) for administrative purposes among the local
authorities on Fyn (http://www.geofyn.dk/). Key personnel from Geo Fyn A/S used the
aerial photo API to harvest the coordinates and metadata, thus creating a GIS dataset of
points with metadata and links to the specific imagery. This layer was then embedded
on the Geo Fyn A/S data portal as well as a data layer in different systems of the various
municipalities, making the historical information instant available on screen. Although
only 5000 records could be retrieved pr. server request, this was easily navigated by
Geofyn by developing a small script. Geo Fyn A/S plans to make yearly reviews of the
data due to the potential adding of new metadata and adjustment of the geo-location by
the library's volunteers. The Geofyn programmers did not have any issues accessing
and using the API and did not need further documentation than was already provided.
      This case illustrate that open heritage data, with sufficient metadata and accessible
through an API, can provide app developers from industry and authorities with access
to datasets, which can be used at low cost for developing new systems or adding new
layers to existing decision support systems. App developers from these sectors, gener-
ally have the technical skills to extract data through an API and thus probably do not
need a high degree of documentation in order to do so. However, their interested in
heritage datasets is probably limited to those that hold an instant and easily accessible
value for their operations, such as easily extractable location points and limited
metadata.


2.4    Hackers

Since 2012, the Royal Danish Library has been co-organiser of the Danish national
event Hack4dk. The event is an annual heritage hackathon with about 50 – 150 partic-
ipants in average, organised by major GLAM institutions in Denmark. Participants are
data providers, programmers, interaction designers, curators, etc. The outcomes of the
hackathons are usually prototypes, products or concepts. The event is free and takes
place during a weekend in autumn, from Friday afternoon till Sunday afternoon.
                                                                                            5


   The goal of the event from the point of view of the organisers and data providers can
broadly be defined as exploring the possibilities and usability of digital heritage collec-
tions, as well as testing the limitations and digital infrastructure of heritage institutions.
At the same time, organisers break their monopoly of on creative use of their digital
resources. Participants typically engage in the event for networking, learning and hav-
ing fun. In their own words: "I'm here to have fun and do cool stuff", "I'm here to meet
fellow geeks and talk about common interests", "I'm here to network and meet new
people" [3].
   While the process is facilitated by organisers, participants self-organise and work on
their own projects, and in ways in which they prefer to work. Two participants explain
the process in this way: "We are back this year because we have time again, and the
past years have been really, really fun. It's great meeting new people and work on fun
projects. We recently started our company with us two and another friend of ours and
we came the three of us together and most likely we will end up working together, but
we are open to ideas. At the moment we are just kind of exploring. First we are going
get some inspiration also known as beer and then we are going to improvise something
I guess." (video interview, hack4dk's facebook page).
      The hackers in this case work on projects that do not necessarily have to end in a
product and engage in activities that test the limits of skill, imagination and wits. The
resulting products often are for fun. For instance, the winner project from 2017 was
connecting Tin toys from KULMUS with AR and QR codes. By using the QR codes
on the small blocks it is possible play with the toys without actually touching the objects
and making stop-motion-movies (see presentation video at goo.gl/mwRmio).
      This case shows that sometimes the process, the challenge and the collaboration
is what counts. It shows in particular that from organizational point of view the data
must be easy accessible, self-explanatory and inviting – and even so, whether the data
will be used or not, is at the total mercy of the participants, dependent on their personal
feelings, tastes, or opinions.


3      Conclusion

In this paper, we have analysed and discussed four cases where we as teachers, organ-
isers and/or data providers have interacted with four different user groups: researchers,
students, app developers and hackers. The four cases illustrate the range of challenges
and perspectives that are part of heritage institutions work with open data. More spe-
cifically, we show that each of the four user groups have different motivations and skills
for using open cultural heritage data. These findings can be summarized in this table,
showing the user perspective, ie. users' motivation and users' technical skills:

          Table 1. User motivation and technical skills among different user groups

                        Users' motivation                             Users'      technical
                                                                      skills
 Researchers            Gaining new knowledge                         Varies
6


 Students                 Learning, pass the course                      Varies
 App developers           Make profit, development of own ser-           Highly skilled
                          vices
 Hackers                  Have fun, be challenged                        Varies
 Researchers              Gaining new knowledge                          Varies

   In turn, different user groups' motivations and technical skills require the cultural
heritage organisation to curate its data according to specific user groups. These findings
can be summarized in this table, showing the organisational perspective, ie. data content
requirements and data accessibility:


       Table 2. Data content requirements and data accessibility among different user groups

                          Data content requirements                  Data accessibility
 Researchers              Subject specific, unique data, trustworthy Varies      (technical
                          data                                       skills), Flexible (re-
                                                                     search questions)
 Students                 Data as training sets                      Varies, depending
                                                                     on skills and learn-
                                                                     ing goals
 App developers           Subject specific, unique data              "Help to self-help"
 Hackers                  Subjective data/dependent on personal Data should be easy
                          feelings, tastes, or opinions              accessible/data as
                                                                     mean to something
                                                                     else
                          Data content requirements                  Data accessibility

   Thus, successful use of open cultural heritage data is highly dependent on organisa-
tions' ability to calibrate and curate the data differently according to contexts and set-
tings.


4         Reference
    1. Tanner, S., & Deegan, M. (2013, October). Measuring the impact of digitized resources: The
       Balanced Value Model. In Digital Heritage International Congress (DigitalHeritage), 2013
       (Vol. 2, pp. 15-19). IEEE
    2. Svenningsen, S. R., Brandt, J., Christensen, A. A., Dahl, M. C., & Dupont, H. (2015). His-
       torical oblique aerial photographs as a powerful tool for communicating landscape changes.
       Land Use Policy, 43, 82-95. DOI: 10.1016/j.landusepol.2014.10.021
    3. Wang, Jacob R. (2017): Hack4dk: Hacking our heritage 2012-2017, presentation at
       Hack4dk, Enigma, Copenhagen, Sept. 29.