<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>AFEL: Towards Measuring Online Activities Contributions to Self-Directed Learning</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Mathieu d'Aquin</string-name>
          <email>mathieu.daquin@insight-centre.org</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alessandro Adamou</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Dietze</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Besnik Fetahu</string-name>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ujwal Gadiraju</string-name>
          <email>gadirajug@l3s.de</email>
          <xref ref-type="aff" rid="aff4">4</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ilire Hasani-Mavriqi</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Peter Holtz</string-name>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Joachim Kimmerle</string-name>
          <email>j.kimmerleg@iwm-tuebingen.de</email>
          <xref ref-type="aff" rid="aff5">5</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dominik Kowald</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Elisabeth Lex</string-name>
          <email>elisabeth.lex@tugraz.at</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Susana Lopez Sola</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ricardo A. Maturana</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vedran Sabol</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Pinelopi Troullinou</string-name>
          <email>pinelopi.troullinoug@open.ac.uk</email>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Eduardo Veas</string-name>
          <email>eduveasg@know-center.at</email>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>GNOSS</institution>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Insight Centre for Data Analytics, National University of Ireland</institution>
          ,
          <addr-line>Galway</addr-line>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Know-Center Graz, University of Technology Graz</institution>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Knowledge Media Institute, The Open University</institution>
          ,
          <country country="UK">UK</country>
        </aff>
        <aff id="aff4">
          <label>4</label>
          <institution>L3S Research Center, Leibniz University Hanover</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff5">
          <label>5</label>
          <institution>The Leibniz-Insititut fur Wissensmedien</institution>
          ,
          <addr-line>Tubingen</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>More and more learning activities take place online in a selfdirected manner. Therefore, just as the idea of self-tracking activities for tness purposes has gained momentum in the past few years, tools and methods for awareness and self-re ection on one's own online learning behavior appear as an emerging need for both formal and informal learners. Addressing this need is one of the key objectives of the AFEL (Analytics for Everyday Learning) project. In this paper, we discuss the di erent aspects of what needs to be put in place in order to enable awareness and self-re ection in online learning. We start by describing a scenario that guides the work done. We then investigate the theoretical, technical and support aspects that are required to enable this scenario, as well as the current state of the research in each aspect within the AFEL project. We conclude with a discussion of the ongoing plans from the project to develop learner-facing tools that enable awareness and selfre ection for online, self-directed learners. We also elucidate the need to establish further research programs on facets of self-tracking for learning that are necessarily going to emerge in the near future, especially regarding privacy and ethics.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>a university program) and where online activities happen through a provided
eLearning system. However, whether or not they are formally engaged in
learning, more and more learners are now using a large variety of online platforms
and resources which are not necessarily connected with their learning
environment or with each other. Such use of online resources tends to be self-directed
in the sense that learners make their own choices as to which resource to employ
and which activity to realize amongst the wide choice o ered to them (MOOCs,
tutorials, open educational resources, etc). With such practices becoming more
common, there is therefore value in researching the way in which to support such
choices.</p>
      <p>
        In several other areas than learning where self-directed activities are
prominent (e.g. tness), there has been a trend in recent years following the
technological development of tools for self-tracking [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. Those tools quantify a speci c
user's activities with respect to a certain goal (e.g. being physically t) to enable
self-awareness and re ection, with the purpose of turning them into behavioral
changes. While the actual bene ts of self-tracking in those areas are still
debatable, our understanding of how such approaches could bene t learning behaviors
as they become more self-directed remains very limited.
      </p>
      <p>
        AFEL7 (Analytics for Everyday Learning) is an European Horizon 2020
project which aim is to address both the theoretical and technological
challenges arising from applying learning analytics [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] in the context of online, social
learning. The pillars of the project are the technologies to capture large scale,
heterogeneous data about learner's online activities across multiple platforms
(including social media) and the operationalization of theoretical cognitive
models of learning to measure and assess those online learning activities. One of the
key planned outcomes of the project is therefore a set of tools enabling
selftracking on online learning by a wide range of potential learners to enable them
to re ect and ultimately improve the way they focus their learning.
      </p>
      <p>In this paper, we discuss the research and development challenges
associated with achieving those goals and describe initial results obtained by the
project in three key areas: theory (through cognitive models of learning),
technology (through data capture, processing and enrichment systems) and support
(through the features provided to users for visualizing, exploring and drawing
conclusions from their learning activities). We start by describing a motivating
scenario of an online, self-directed learner to clarify our objective.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Motivating Scenario</title>
      <p>Below is a speci c scenario considering a learner not formally engaged in a
speci c study program, but who is, in a self-directed and explicit way, engaged
in online learning. The objective is to describe in a simple way how the envisioned
AFEL tools could be used for self-awareness and re ection, but also to explore
what the expected bene ts of enabling this for users/learners are:</p>
      <sec id="sec-2-1">
        <title>7 http://afel-project.eu</title>
        <p>Jane is 37 and works as an administrative assistant in a local
mediumsized company. As hobbies, she enjoys sewing and cycling in the local
forests. She is also interested in business management, and is
considering either developing in her current job to a more senior level or making
a career change. Jane spends a lot of time online at home and at her
job. She has friends on Facebook with whom she shares and discusses
local places to go cycling, and others with whom she discusses sewing
techniques and possible projects, often through sharing YouTube videos.
Jane also follows MOOCs and forums related to business management,
on di erent topics. She often uses online resources such as Wikipedia
and online magazines. At school, she was not very interested in maths,
which is needed if she wants to progress in her job. She is therefore
registered on Didactalia8, connecting to resources and communities on maths,
especially statistics.</p>
        <p>Jane has decided to take her learning seriously: She has registered to
use the AFEL dashboard through the Didactalia interface. She has also
installed the AFEL browser extension to include her browsing history,
as well as the Facebook app. She has not included in her dashboard her
emails, as they are mostly related to her current job, or Twitter, since
she rarely uses it.</p>
        <p>Jane looks at the dashboard more or less once a day, as she is prompted by
a noti cation from the AFEL smart phone application or from the
Facebook app, to see how she has been doing the previous day in her online
social learning. It might for example say \It looks like you progressed well
with sewing yesterday! See how you are doing on other topics..." Jane,
as she looks at the dashboard, realizes that she has been focusing a lot on
her hobbies and procrastinated on the topics she enjoys less, especially
statistics. Looking speci cally at statistics, she realizes that she almost
only works on it on Friday evenings, because she feels guilty of not
having done much during the week. She also sees that she is not putting
as much e ort into her learning of statistics as other learners, and not
making as much progress. She therefore makes a conscious decision to
put more focus on it. She adds new goals on the dashboard of the form
\Work on statistics during my lunch break every week day" or \Have
achieved a 10% progress compared to now by the same time next week".
The dashboard will remind her of how she is doing against those goals as
she goes about her usual online social learning activities. She also gets
recommendations of things to do on Didactalia and Facebook based on
the indicators shown on the dashboard and her stated goals.</p>
        <p>While this is obviously a ctitious scenario, which is very much simpli ed, it
shows the way tools for awareness and self-re ection can support online
selfdirected learning, and it provides a basis to investigate the challenges to address
in order to enable the development of tools of the kind that are described, as
discussed in the rest of this paper.</p>
      </sec>
      <sec id="sec-2-2">
        <title>8 http://didactalia.net</title>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Theoretical Challenge: Measuring Self-Directed</title>
    </sec>
    <sec id="sec-4">
      <title>Learning</title>
      <p>One result of the advent of the Internet as a mass phenomenon was a slight
change in our understanding of constructs such as \knowledge" and \learning".
In such contexts as described above, it is by no means a trivial task to identify
and to assess learning. Indeed, in order to understand how learning emerges from
a collection of disparate online activities, we need to get back to fundamental,
cognitive models of learning, as we cannot make the assumption that usual ways
to test the results of learning are available.</p>
      <p>
        Traditionally, the acquisition metaphor was frequently used to describe
learning processes [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]: From this perspective, learning consists in the accumulation of
\basic units of knowledge" within the \container" (p. 5) of the human mind.
Already before the digital age, there was also an alternative, more socially oriented
understanding of learning, which is endowed in the participation metaphor: Here,
knowing is equaled to taking up and getting used to the customs and habits of
a community of practice [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], into which a learner is socialized. Over the last
two decades however, the knowledge construction metaphor has emerged [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ]
as a third important metaphor of learning. Building upon a constructivist
understanding of learning, the focus lies here on the constant creation and
recreation of knowledge within knowledge construction communities. Knowledge
is no longer thought of as a rather static entity in form of a \justi ed true
belief"; instead, knowledge is constantly re-negotiated and evolves in a dynamic
way [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ]. In this tradition, the co-evolution model of learning and knowledge
construction [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] treats learning on the side of individuals and knowledge
construction on the side of communities as two structurally coupled processes (see
Figure 1). Irritations of a learner's cognitive system in form of new or
unexpected information that has to be integrated into existing cognitive structures
can lead to learning processes in the form of changes in the learner's
cognitive schemas, behavioral scripts, and other cognitive structures. In turn, such
learning processes may trigger communication acts by learners within
knowledge construction communities and stimulate further communication processes
that lead to the construction of new knowledge. In this model, shared artifacts,
for example in form of digital texts such as contributions to wikis or social
media messages, mediate between the two coupled systems of individual minds and
communicating communities [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        When talking about learning in digital environments, we can consequently
de ne learning as the activity of learners encountering at least partly new
information in form of digital artifacts. In principle, every single interaction between
a learner and an artifact can entail learning processes. Learning can either
happen occasionally and accidentally or in the course of planned and at least partly
structured learning activities [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Planned and structured learning activities can
either be self-organized or follow to a certain degree a pre-de ned curriculum of
learning activities [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. In both cases, the related activities will constitute a
certain learning trajectory [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ] which comprises of \the learning goal, the learning
activities, and the thinking and learning in which the students might engage"
(p. 133). Successful learning will result in increases in the learner's abilities and
competencies; for example, successful learners will be able to solve increasingly
di cult tasks or to process increasingly complex learning materials [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ].
      </p>
      <p>Based on these theoretical considerations, the challenge in building tools
for self-tracking of online, self-directed learning is to recognize to what extent
encountering and processing a certain artifact (a resource) induced learning. In
the co-evolution model, we assume that what we can measure is the friction (or
irritation) which triggers internalization processes, i.e. what does the artifact
bring to the cognitive system that leads to its evolution. At the moment, we
distinguish three forms of \frictions", leading to three categories of indicators of
learning:
{ New concepts and topics: The simplest way in which we can think about
how an artifact could lead to learning is through its introduction of new
knowledge unknown to the learner. This is consistent with the traditional
acquisition metaphor. In our scenario, this kind of friction happens for
example when Jane watches a video about a sewing technique previously unknown
to her.
{ Increased complexity: While not necessarily introducing new concepts, an
artifact might relate to known concepts in a more complex way, where
complexity might relate to the granularity, speci city or interrelatedness with
which those concepts are treated in the artifact. In a social system, the
assumption of the co-evolution model is that the interaction between
individuals might enable such increases in understanding of the concepts being
considered through iteratively re ning them. In our scenario, this kind of
friction happens for example when Jane follows a statistics course which is
more advanced than the ones she had encountered before.
{ New views and opinions: Similarly, known concepts might be introduced \in
a di erent light", through varying points of views and opinions enabling a
re nement of the understanding of the concepts treated. This is consistent
with the co-evolution model in the sense that it can be seen either as a
widening of the social system in which the learner is involved, or as the integration
into di erent social systems. In our scenario, this kind of friction happens
for example when Jane reads a critical review of a business management
methodology she has been studying.</p>
      <p>What appears evident from confronting the co-evolution model and the types
of indicators described above with the scenario of the previous section is that
such indicators and models should be considered within distinct \domains" of
learning. Indeed, Jane in the scenario would relate to di erent social systems for
example for her interest in sewing, cycling, business management and statistics.
The concepts that are relevant, the levels of complexity to consider and the views
that can be expressed are also di erent from each other in those domains.</p>
      <p>We call those domains of learning learning scopes. In the remainder of this
paper, we will therefore consider a learning scope to be an area or theme of
interest to a learner (sewing, business, etc.) to which are attached (consciously
or not) speci c learning goals, as well as a speci c set of concepts, topics and
activities.
4</p>
    </sec>
    <sec id="sec-5">
      <title>Technical Challenge: Making-Sense of Masses of</title>
    </sec>
    <sec id="sec-6">
      <title>Heterogeneous Activity Data</title>
      <p>Considering the conclusions from the previous section, the key challenge at the
intersection of theory and technology for self-tracking of online, self-directed
learning is to devise ways to compute the kind of indicators that are useful
to identify and approximate some quanti cation of the three types of frictions
within (implicit/emerging) learning scopes. Before that, however, we have to
face more basic technical challenges to set in place the mechanisms to collect,
integrate, enrich and process the data necessary to compute those indicators.
4.1</p>
      <sec id="sec-6-1">
        <title>Data capture, integration and enrichment</title>
        <p>
          The AFEL project aims at identifying the features that characterize learning
activities within online contexts across multiple platforms. With that, we
contribute to the eld of Social Learning Analytics that is based on the idea that
new ideas and skills are not only individual achievements, but also the results of
interaction and collaboration [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ]. With the rise of the Social Web, online social
learning has been facilitated due to the participatory and collaborative nature
of the Social Web. This has posed several challenges for Learning Analytics: The
(online) environments where learning activities and related features are to be
detected are largely heterogeneous and tend to generate enormous amounts of
data concerning user activities that may or may not relate to learning, and even
when they do, the relation is not guaranteed to be explicit. A key issue is that,
even with an emerging theoretical model, there is no established model for
representing the data for learning that can span across all the types of activities that
might occur in online environments. With respect to data capture, it may be
hard to track all relevant learning traces and some indicators such as readership
data may be misleading due to switches between the online and o ine world [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
        </p>
        <p>Therefore, AFEL adopted an approach to identify reliable data sources and
to structure their capture process, which is based on an e ort to classify data
sources, rather than the data themselves. Such an exercise in classi cation is
important as it is the result of an e ort to understand what dimensions of the
activities through the Web should be captured, before setting out to detect
speci c learning activity factors. The resulting taxonomy revolves around a core of
seven types of entities that a candidate data source has a potential for
describing; these are further speci ed into sub-categories that capture a speci c set of
dimensions, some of which are common to users and communities (e.g. learning
statements), or to users (e.g. indicators of expertise) and learning resources (e.g.
indicators of popularity). Those categories are at the core of the proposed AFEL
Core Data Model9, an RDF vocabulary largely based on schema.org and which
is, amongst other things, used to aggregate the datasets that AFEL makes
publicly available10.</p>
        <p>
          The following challenge for AFEL is to integrate data from a large number
of sources into a shared platform, using the core data model to integrate and
make them processable. The approach taken is to create a \data space", which
keeps most of the data sources intact at the time of on-boarding and being
integrated at query time through a smart API, following the principles set out in [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
Using this platform, the project has already created a number of tools, called
extractors, which can extract data about user activities from several di erent
platforms, creating a consistent and processable data space for each AFEL user
who can choose to enable some of those tools. At the time of writing those
extractors include browser extensions for extracting browsing history, applications
for Facebook and Twitter, as well as analytics extractors for the Didactalia
portal from AFEL partner GNOSS.11 We also integrate resource metadata from
several open sources related to learning.
        </p>
        <p>
          Beyond data storage and integration, the key to enable extracting the
features necessary to compute the kind of indicators mentioned in the previous
section is to connect those datasets at a semantic level, i.e. to enrich the raw data
into a more complete \Knowledge Graph". In other words, connecting the di
erent entities with each other and extracting from unstructured or semi-structured
sources entities of interest that can connect the data from a wide range of places.
In AFEL, we use entity linking approaches [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] as well as natural language
pro9 http://data.afel-project.eu/catalogue/dataset/afel-core-data-model/
10 http://data.afel-project.eu/catalogue/learning-analytics-dataset-v1/
11 http://gnoss.com
cessing [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ] and speci c feature extraction approaches to turn a user data space
into such a semantically enriched knowledge graph. Examples for such feature
extraction approaches are computing the complexity of a resource [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ],
determining the semantic stability of a resource [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ], or to assess in uencing factors in
consensus building processes in online collaboration scenarios [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>Additionally, AFEL provides a methodology to determine the characteristic
features, which allow learning activities to be detected and described, and
consequently the attributes that instantiate them, in di erent data sources identi ed
within the project. This methodology facilitates an initial speci cation of the
features relevant to learning activities by presenting an instantiation of them on
some of key data sources. Furthermore, with our methodology, we also outline a
top-down perspective of feature engineering indicating that features identi ed in
AFEL are applicable in di erent use cases, in general online contexts and that
they can be extracted from our data basis.
4.2</p>
      </sec>
      <sec id="sec-6-2">
        <title>An example: Learning scopes and topic-based indicator in browsing history</title>
        <p>In this section, we present a short pilot experiment in which we implemented
an initial version of showing indicators based on topics included in the learning
activities of a user (consistently with what described in Section 3). This relies
on some of the technical aspects described above, including data capture and
enrichment.</p>
        <p>The data: We use approximately 6 weeks of browsing history data for a user,
obtained through the AFEL browser extension12, which pushes this information
as the user is browsing the web. Each activity is described as an instance of the
concept BrowsingActivity in the AFEL Core Data Model, with as properties
the URL of the page accessed and the time at which it was accessed. In our
illustrative example, this corresponds to 42 707 activities, making reference to
12 738 URLs of webpages.</p>
        <p>Topic Extraction: The rst step to extracting the learning scopes from the
activity data is to extract the topics of each resource (webpage). For this, we rst
use DBpedia Spotlight13 to extract the entities referred to in the text in the
form of Linked Data entities in the DBpedia dataset14. DBpedia is a Linked
Data version of Wikipedia, where each entity is described according to various
properties, including the categories in which the entity has been classi ed in
Wikipedia. We therefore query DBpedia to obtain up to 20 categories from the
ones directly connected to the entities, or their broader categories in DBpedia's
category taxonomy.
12 https://github.com/afel-project/browsing-history-webext
13 http://spotlight.dbpedia.org
14 http://dbpedia.org</p>
        <p>For example, assume the learner views a YouTube video titled LMMS
Tutorial | Getting VST Instruments.15 When mining the extracted text (stripped
of HTML markup), DBpedia Spotlight detects that the description of this video
mentions entities such as &lt;http://dbpedia.org/resource/LMMS&gt; (dbp:LMMS
for short - a digital audio software suite) or dbp:Virtual Studio Technology.
Querying DBpedia reveals subject categories for dbp:LMMS, such as
&lt;http://dbpedia.org/resource/Category:Free audio editors&gt; (or short,
dbc:Free audio editors) or dbc:Software drum machines. The detected
category dbc:Free audio editors is in turn declared in DBpedia to have broader
categories such as dbc:Audio editors or dbc:Free audio software. All of
these elements are included in the description of the activity that corresponds
to watching the above video, to be used in the next step of clustering activities.</p>
        <p>On our browsing history data, running the resources through DBpedia
Spotlight extracted 20 876 distinct entities, each being added 20 categories on average.
To give an idea of the scale, the nal description of the 6 weeks of activities of
this one learner takes approximately 1.1GB of space and took between 1 and 15
seconds to compute for each activity (depending on the size of the original text,
using a modern laptop with a good internet connection).</p>
        <p>Clustering activities: In the next step, we use the description of the activities
as produced through the process described above in order to detect candidate
learning scopes, i.e. groups of topics and activities that seem to relate to the same
broader theme. To do this, we consider the set of entities and categories obtained
before similarly to the text of documents and apply a common document
clustering process on them (i.e. TFIDF vectorization and k-Means clustering). We
obtain from this a set of k clusters (with k being a given) that group activities
based on the overlap they have in the topics (entities and categories) they cover.
We label each cluster based on the entity or category that best characterizes it
in terms of F-Measure (i.e. that covers the maximum number of activities in the
cluster, and the minimum number of activities outside the cluster), representing
the target of the topic scope.</p>
        <p>The clustering technique we applied (k-Means) requires to x the number of
clusters to be obtained in advance. We experimented with numbers between 6
and 100, to see which could best represent the width and breadth of interests of
this particular learner. Here, we used 50 as it appeared to lead to good results (as
future work, we will integrate ways to automatically discover the ideal number
of clusters for a learner). Figure 2 shows the clusters obtained and their size.
The gray line describes all activities in the topic scope, i.e. all activities that
have been included in the cluster. As can be seen, the clusters are unbalanced
between the ones with a large number of activities (Google, Web Programming)
with thousands of activities, and the ones representing only a few hundreds of
activities.</p>
        <p>Topic-based indicator: In the initial scenario we are considering here, we focus on
a topic-based indicator which consist in checking whether an activity introduces
15 https://www.youtube.com/watch?v=aZKra7rNspU
new topics (entities or categories) into the learning scope (cluster) in which it
is included. We therefore \play back" the sequence of browsing activities from
the learner's history, checking at each time how many new topics are being
introduced that were not present in the previous activities of the learner in this
scope.</p>
        <p>Looking again at Figure 2, it is interesting to look at the di erence between
the gray line (number of activities in the topic scope) and the black line,
representing the number of activities that have integrated new topics in the scope
and can therefore be considered learning activities. For example, since the user
uses many Google services for basic tasks (such as Gmail for emails), it is not
surprising that the Google scope, while being the largest in activities, does not
actually include much detected learning activities. What is obvious however is
that the balance is much di erent for other clusters that can be clearly identi ed
for including large amounts of learning activities.</p>
        <p>Indeed, we can see the value of the process here by comparing the learning
trajectories of the learner according to the de nition of contributions to di erent
learning scopes considered. For example, the scope on Digital Technology,
representing the largest number of learning activities, can be seen in Figure 3 (top)
as a broad topic on which the learner is constantly (almost everyday) learning
new things. In contrast, the learning scope on Web Programming, although very
related, is one where we can assume the learner already has some familiarity and
only makes a signi cant increment in their learning punctually, as can be seen
by the jump around 08 September in Figure 3 (bottom).
The current state in the implementation of the aforementioned aspects takes the
form of a prototype learner dashboard, available from the Didactalia platform.
The dashboard illustrated in Figure 4 includes initial placeholder indicators for
the kind of frictions identi ed in Section 3 and is implemented on the
technologies described above. It is however a preliminary result, showing the ability to
technically integrate the di erent AFEL components into an rst product. It will
be further evolved in order to truly address the scenario of Section 2, including
user feedback and more accurate indicators.</p>
        <p>
          A key aspect to achieve the goal in our everyday learning scenario is that
the user should have control over what is being monitored. Indeed, the learner
should be able to decide what area of the data should be displayed, according
to which indicator and which dimension of the data (e.g. speci c topics, times,
resources or platforms). Our approach here is to rely on a framework for exible
dashboards based on visualization recommendation, implemented through the
VizRec tool [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ]. At the root of VizRec lies a visualization engine that extracts
the basic features of the data and guiding the user in choosing appropriate ways
to visualise them. Hereby, a learning expert may design a dashboard with an
initial view of set of learning indicators, but VizRec also empowers the user in
choosing what area of the data to show. This includes the ability to add new
charts to the dashboard that can be selected based on the characteristics of the
data (e.g. show a map for geo-graphical data). The tool can learn the user
preferences, and therefore show a personalized dashboard which is always consistent
with the visualization choices made by the user. Figure 5 shows an example of
VizRec displaying multidimensional learning data. A scatterplot correlates the
number of previous attempts with studied credits, showing that the number of
previous attempts is smaller when studied credits is high. The grouped bar chart
displays the number of previous attempts for female (right) and male (left)
students, with genders being further subdivided by the highest level of education
(encoded by color). It is obvious that education level has a very similar e ect for
both females and males. Notice that in the VisPicker (shown on right) only some
visualizations are enabled, which is a direct consequence of the data dimensions
which were chosen by the user: gender, highest education, number of previous
attempts (shown on left). The user is free to choose only the enabled,
meaningful visualizations, with the optional possibility of the system recommending
the optimal representation based on previous user behavior. As the title of this
section calls, it is important to move from metrics to action and consider what
the learner should do, having seen her status.
        </p>
        <p>
          One way to move the learner to action is via recommending learning resources
that appear to be relevant considering the current state of the learner [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. Here,
the monitoring of learning activities has a direct bene t in supplying
recommendations to the learner. The current implementation of such a recommender
system is based on two well-known approaches: (i) Content-based ltering, which
recommends similar resources based on the content of a given resource, and (ii)
Collaborative ltering, which recommends resources of similar users based on
the learning activities of a given user [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ].
        </p>
        <p>However, an important aspect, which is still missing, is how such measures
of similarity can be based on metrics that are relevant to learning rather than
on basic content or pro le similarity. Indeed, the objective here would be to
recommend learning resources (or even learning resource paths) that have already
been helpful for other users with a similar learning goal and a similar learning
state (in terms of the concepts, complexities and views already encountered). In
other words, the recommendations can be based on a meaningful view of what
the suggested resources might contribute to learning.
6</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>Discussion: Towards Wide-Availability, Ethical Tools for Self-Tracking of Online Learning</title>
      <p>In the previous section, we discussed how to theoretically and technically
implement tools for self-awareness targeted at self-directed online learning. Those
tools are currently at early stages of development. Beyond those aspects however,
other challenges will be faced by the AFEL consortium. One of them includes
facilitating the adoption of these tools by a wide variety of users. Indeed, the
actual usefulness and value of such personal analytics dashboards and learning
assistant technologies have not been formally assessed and the participation of
the learner community in their development is necessary in order to ensure that
they reach their potential. The approach taken by AFEL here is to start with
the community of learners in the Didactalia platform, enabling the dashboard
for them and through that, supporting them in integrating data from other
platforms. With a large number of users, we will be able to collect enough data to
understand how such monitoring can truly support users in reaching awareness of
their learning behavior, and how this can help them take decisions with respect
to their own learning.</p>
      <p>Another aspect which is not discussed in this paper is the ethical implications
of realizing such tools and reaching a wide-adoption. As mentioned above, each
of the learners is assigned their own data space on the AFEL platform, which
is only accessible by them. However, as mentioned in the scenario of Section 2,
support to the learner might be better achieved by enabling them to compare
their own behavior with others, and we aim to make some aggregated data
available to others for research purposes. Proper anonymisation techniques need
to be applied in order to ensure that external parties cannot infer information
about speci c learners from having access to those tools and data.</p>
      <p>
        Beyond privacy however, it is also important to ensure that the e ect of the
tool might not turn out to be negative. Existing work have shown a number
of ethical harms that might come out of enabling self-governance in a number
of domains, despite the obvious positive e ects [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. Those include introducing
biases towards common learning behaviors or pushing learners towards excessive
behaviors for the purpose of improving the values of indicators that are
necessarily only approximate representations of learning. Activities within and connected
to the AFEL project have for speci c objective to tackle those aspects, through
establishing contrasting scenarios of the possible e ect of self-tracking tools as
a basis to engage with users of those tools about the ways to avoid the negative
e ects while keeping the positive ones.
      </p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgement</title>
      <p>This work has received funding from the European Union's Horizon 2020
research and innovation programme as part of the AFEL (Analytics for Everyday
Learning) project under grant agreement No 687916.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>A.</given-names>
            <surname>Adamou</surname>
          </string-name>
          and
          <string-name>
            <surname>M.</surname>
          </string-name>
          <year>dAquin</year>
          .
          <article-title>On requirements for federated data integration as a compilation process</article-title>
          .
          <source>In Proceedings of the PROFILES 2015 workshop</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>U.</given-names>
            <surname>Cress</surname>
          </string-name>
          and
          <string-name>
            <given-names>J.</given-names>
            <surname>Kimmerle</surname>
          </string-name>
          .
          <article-title>A systemic and cognitive view on collaborative knowledge building with wikis</article-title>
          .
          <source>International Journal of Computer-Supported Collaborative Learning</source>
          ,
          <volume>2</volume>
          (
          <issue>3</issue>
          ),
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>S. A.</given-names>
            <surname>Crossley</surname>
          </string-name>
          , J. Green eld, and
          <string-name>
            <given-names>D. S.</given-names>
            <surname>McNamara</surname>
          </string-name>
          .
          <article-title>Assessing text readability using cognitively based indices</article-title>
          .
          <source>Tesol Quarterly</source>
          ,
          <volume>42</volume>
          (
          <issue>3</issue>
          ):
          <volume>475</volume>
          {
          <fpage>493</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>M. De Laat</surname>
            and
            <given-names>F. R.</given-names>
          </string-name>
          <string-name>
            <surname>Prinsen</surname>
          </string-name>
          .
          <article-title>Social learning analytics: Navigating the changing settings of higher education</article-title>
          .
          <source>Research &amp; Practice in Assessment, 9</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>S.</given-names>
            <surname>Dietze</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sanchez-Alonso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Ebner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. Qing</given-names>
            <surname>Yu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Giordano</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Marenzi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>B. Pereira</given-names>
            <surname>Nunes</surname>
          </string-name>
          .
          <article-title>Interlinking educational resources and the web of data: A survey of challenges and approaches</article-title>
          . Program,
          <volume>47</volume>
          (
          <issue>1</issue>
          ):
          <volume>60</volume>
          {
          <fpage>91</fpage>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>R.</given-names>
            <surname>Ferguson</surname>
          </string-name>
          .
          <article-title>Learning analytics: drivers, developments and challenges</article-title>
          .
          <source>International Journal of Technology Enhanced Learning</source>
          ,
          <volume>4</volume>
          (
          <issue>5</issue>
          -6):
          <volume>304</volume>
          {
          <fpage>317</fpage>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7. I.
          <string-name>
            <surname>Hasani-Mavriqi</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Geigl</surname>
            ,
            <given-names>S. C.</given-names>
          </string-name>
          <string-name>
            <surname>Pujari</surname>
            , E. Lex, and
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Helic</surname>
          </string-name>
          .
          <article-title>The in uence of social status and network structure on consensus building in collaboration networks</article-title>
          .
          <source>Social Network Analysis and Mining</source>
          ,
          <volume>6</volume>
          (
          <issue>1</issue>
          ):
          <fpage>80</fpage>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>J.</given-names>
            <surname>Kimmerle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Moskaliuk</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Oeberst</surname>
          </string-name>
          , and
          <string-name>
            <given-names>U.</given-names>
            <surname>Cress</surname>
          </string-name>
          .
          <article-title>Learning and collective knowledge construction with social media: A process-oriented perspective</article-title>
          .
          <source>Educational Psychologist</source>
          , (
          <volume>50</volume>
          ),
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>S.</given-names>
            <surname>Kopeinik</surname>
          </string-name>
          , E. Lex,
          <string-name>
            <given-names>P.</given-names>
            <surname>Seitlinger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Albert</surname>
          </string-name>
          , and
          <string-name>
            <given-names>T.</given-names>
            <surname>Ley</surname>
          </string-name>
          .
          <article-title>Supporting collaborative learning with tag recommendations: a real-world study in an inquiry-based classroom project</article-title>
          .
          <source>In LAK</source>
          , pages
          <volume>409</volume>
          {
          <fpage>418</fpage>
          ,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <given-names>J.</given-names>
            <surname>Lave</surname>
          </string-name>
          and
          <string-name>
            <given-names>E.</given-names>
            <surname>Wenger</surname>
          </string-name>
          .
          <article-title>Situated learning: Legitimate peripheral participation</article-title>
          . Cambridge University Press,
          <year>1991</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>C. D. Manning</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Surdeanu</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Bauer</surname>
            ,
            <given-names>J. R.</given-names>
          </string-name>
          <string-name>
            <surname>Finkel</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Bethard</surname>
            , and
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>McClosky</surname>
          </string-name>
          .
          <article-title>The stanford corenlp natural language processing toolkit</article-title>
          .
          <source>In ACL (System Demonstrations)</source>
          , pages
          <fpage>55</fpage>
          {
          <fpage>60</fpage>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>V.</given-names>
            <surname>Marsick</surname>
          </string-name>
          and
          <string-name>
            <given-names>K.</given-names>
            <surname>Watkins</surname>
          </string-name>
          .
          <article-title>Lessons from informal and incidental learning</article-title>
          .
          <source>Management learning: Integrating perspectives in theory and practice</source>
          ,
          <year>1997</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>C. McLoughlin</surname>
            and
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Lee</surname>
          </string-name>
          .
          <article-title>Personalised and self regulated learning in the web 2.0 era: International exemplars of innovative pedagogy using social software</article-title>
          .
          <source>Australasian Journal of Educational Technology</source>
          ,
          <volume>1</volume>
          (
          <issue>26</issue>
          ),
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <given-names>B.</given-names>
            <surname>Mutlu</surname>
          </string-name>
          , E. Veas, and
          <string-name>
            <given-names>C.</given-names>
            <surname>Trattner</surname>
          </string-name>
          . Vizrec:
          <article-title>Recommending personalized visualizations</article-title>
          .
          <source>ACM Trans. Interact. Intell. Syst.</source>
          ,
          <volume>6</volume>
          (
          <issue>4</issue>
          ):
          <volume>31</volume>
          :1{
          <fpage>31</fpage>
          :
          <fpage>39</fpage>
          ,
          <string-name>
            <surname>Nov</surname>
          </string-name>
          .
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <given-names>G.</given-names>
            <surname>Ne</surname>
          </string-name>
          and
          <string-name>
            <given-names>D.</given-names>
            <surname>Nafus. The</surname>
          </string-name>
          Self-Tracking. MIT Press,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <given-names>A.</given-names>
            <surname>Oeberst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kimmerle</surname>
          </string-name>
          , and
          <string-name>
            <given-names>U.</given-names>
            <surname>Cress</surname>
          </string-name>
          .
          <article-title>What is knowledge? who creates it? who possesses it? the need for novel answers to old questions</article-title>
          .
          <source>Mass collaboration and education</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <given-names>S.</given-names>
            <surname>Paavola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Lipponen</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.</given-names>
            <surname>Hakkarainen</surname>
          </string-name>
          .
          <article-title>Models of innovative knowledge communities and three metaphors of learning</article-title>
          .
          <source>Review of Educational Research</source>
          ,
          <volume>4</volume>
          (
          <issue>74</issue>
          ),
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <given-names>P.</given-names>
            <surname>Seitlinger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kowald</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Kopeinik</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Hasani-Mavriqi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Ley</surname>
          </string-name>
          , and
          <string-name>
            <given-names>E.</given-names>
            <surname>Lex</surname>
          </string-name>
          .
          <article-title>Attention please! a hybrid resource recommender mimicking attention-interpretation dynamics</article-title>
          .
          <source>In Proceedings of the 24th International Conference on World Wide Web, WWW '15 Companion</source>
          , pages
          <volume>339</volume>
          {
          <fpage>345</fpage>
          , New York, NY, USA,
          <year>2015</year>
          . ACM.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <given-names>A.</given-names>
            <surname>Sfard</surname>
          </string-name>
          .
          <article-title>On two metaphors for learning and the dangers of choosing just one</article-title>
          .
          <source>Educational Researcher</source>
          ,
          <volume>2</volume>
          (
          <issue>27</issue>
          ),
          <year>1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <given-names>S. B.</given-names>
            <surname>Shum</surname>
          </string-name>
          and
          <string-name>
            <given-names>R.</given-names>
            <surname>Ferguson</surname>
          </string-name>
          .
          <article-title>Social learning analytics</article-title>
          .
          <source>Journal of educational technology &amp; society, 15(3):3</source>
          ,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <given-names>M.</given-names>
            <surname>Simon</surname>
          </string-name>
          .
          <article-title>Reconstructing mathematics pedagogy from a constructivist perspective</article-title>
          .
          <source>Journal for Research in Mathematics Education</source>
          ,
          <volume>2</volume>
          (
          <issue>26</issue>
          ),
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <given-names>D.</given-names>
            <surname>Stanisavljevic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            <surname>Hasani-Mavriqi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Lex</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Strohmaier</surname>
          </string-name>
          , and
          <string-name>
            <given-names>D.</given-names>
            <surname>Helic</surname>
          </string-name>
          .
          <article-title>Semantic stability in wikipedia</article-title>
          .
          <source>In International Workshop on Complex Networks and their Applications</source>
          , pages
          <volume>379</volume>
          {
          <fpage>390</fpage>
          . Springer,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <given-names>H.</given-names>
            <surname>Stubbe</surname>
          </string-name>
          and
          <string-name>
            <given-names>N.</given-names>
            <surname>Theunissen</surname>
          </string-name>
          .
          <article-title>Self-directed adult learning in a ubiquitous learning environment: A meta-review</article-title>
          .
          <source>In Proceedings of the First Workshop on Technology Support for Self-Organized Learners</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Whitson</surname>
          </string-name>
          .
          <article-title>Foucaults tbit: Governance and gami cation</article-title>
          .
          <source>The Gameful World - Approaches</source>
          , Issues, Applications,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>