<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Linked Data based applications for Learning Analytics Research: faceted searches, enriched contexts, graph browsing and dynamic graphic visualisation of data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ricardo Alonso Maturana</string-name>
          <email>riam@gnoss.com</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>María Elena Alvarado</string-name>
          <email>elenaalvarado@gnoss.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Susana López-Sola</string-name>
          <email>susanalopez@gnoss.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>María José Ibáñez</string-name>
          <email>mariajoseibanez@gnoss.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Lorena Ruiz Elósegui</string-name>
          <email>lorenaruiz@gnoss.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>gnoss.com*</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>gnoss.com</institution>
          ,
          <addr-line>*Piqueras 31, 4th floor, E-26006 Logroño. La Rioja. Spain, +34 941248905</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2013</year>
      </pub-date>
      <abstract>
        <p>We present a case of exploitation of Linked Data about learning analytics research through innovative end-user applications built on GNOSS, a semantic and social software platform. It allows users to find and discover knowledge from two datasets, Learning Analytics Knowledge (LAK) and Educational Data Mining (EDM), and also reach some related external information thanks to the correlation with other datasets. We used four additional datasets, either to supplement information or to generate enriched contexts: Dbpedia, Geonames, DBLP-GNOSS (an index of scientific publications in Computer Science from DBLP) and DeustoTech Publications (publications of the Institute of Technology of the University of Deusto, and more specifically a selection of works by the DeustoTech Learning research unit). The featured applications are: faceted searches, enriched contexts, navigation through graphs and graphic visualization in charts or geographic maps. Faceted searches can be performed on three basic items: scientific publications, researchers (authors of the publications) and organizations in the learning analytics area. The search engine enables aggregated searches by different facets and summarization of results for each successive search. Analytics on data are provided firstly through that summarization given for results in every facet, and secondly through dynamic graphic representations for some attributes. Several charts are available to show the distribution of publications depending on different attributes (e.g. per publication type and year or per organization). The search results for organizations and researchers can be visualized in geographic maps. Classification Scheme: The 2012 ACM Computing Classification System (CCS) http://dl.acm.org/ccs.cfm General Terms</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Categories and Subject Descriptors</title>
      <p>Information systems - Data management systems - Database
design and models - Graph-based database models
Information systems - Information systems applications
Collaborative and social computing systems and tools - Social
networking sites</p>
    </sec>
    <sec id="sec-2">
      <title>1. MOTIVATION: PURPOSES OF THE</title>
    </sec>
    <sec id="sec-3">
      <title>SOLUTION</title>
      <p>
        The main purpose of the service developed on the gnoss.com1
software platform is to provide end-users with innovative
applications that allow them to find and discover knowledge
related to learning analytics research from the LAK and EDM
datasets [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].2 Based on the exploitation of Linked Data [
        <xref ref-type="bibr" rid="ref2 ref3">2, 3</xref>
        ], the
system includes faceted searches, recommendation systems and
adapted contexts. More specifically, the software solution serves
the following purposes:
1. Explore and navigate the datasets (LAK and EDM) through
faceted searches and graph browsing. It enables to find
publications, researchers and organizations in the area, as well as
to know about which topics are being investigated, who is
working in which fields, where those people and their
organizations are located, who has published about LAK in an
organization, or who is collaborating with whom, for instance.
2. Access a geographic visualisation of researchers and
organizations working in learning analytics, with the option of
filtering results by different and aggregated facets.
3. Visualise dynamic charts of some analytic information related
to the evolution and distribution of publications. The charts are
dynamic as the results evolve in the chart with the successive
selected filters in the search facets.
4. Discover related information within the dataset once the user
has access a specific item (internal context), such as related LAK
publications, co-authors, related and nearest organizations, etc.
1 GNOSS: http://www.gnoss.com/en/about-gnoss
2 LAK
      </p>
      <p>and EDM datasets are available
http://www.solaresearch.org/resources/lak-dataset
online
in:
5. Discover external related information through the correlation
with other datasets. Some examples of datasets have been chosen
for the demo to show the potential of the GNOSS platform tools.
6. Facilitate the potential relevant re-use of these datasets as
contexts in other scenarios by linking them to existing social
learning communities on gnoss.com, or any community related to
the study of those topics.</p>
    </sec>
    <sec id="sec-4">
      <title>2. DESCRIPTION OF THE SOLUTION</title>
    </sec>
    <sec id="sec-5">
      <title>AND DATASETS</title>
      <p>The solution exploiting the LAK datasets has been developed on
gnoss.com, a social and semantic platform with a deep focus on
the generation of social knowledge ecosystems and end-user
applications in a Linked Data environment. It includes faceted
searches, recommendation systems and adapted contexts in
education, university and enterprises. GNOSS could be conceived
as a network of networks or a linked networks space oriented to
using semantic technologies for data and service integration. It
expresses the data generated by users with default basic semantic
standard vocabularies. This is done automatically when a user
shares content on the platform. Besides, GNOSS has an engine for
developing specific ontologies and, as a consequence, specific
search engines if necessary. Moreover, it has a wide range of
configurable social tools, which have been mostly deactivated for
this demo, except for comments and the option to share the link
via email or other social networks.</p>
    </sec>
    <sec id="sec-6">
      <title>2.1 The basis: LAK and EDM datasets</title>
      <p>The baseline information to develop the solution was obtained
from two datasets related to learning analytics: 1) Learning
Analytics and Knowledge (LAK) 2011-2012 and 2) Educational
Data Mining (EDM) 2008-2012. Both of them have information
about people (researchers), organizations in which they work, and
publications (proceedings, inproceedings and articles).
The information of the original datasets was enriched with data
coming from Dbpedia3 and Geonames,4 and also with
automatically generated tags. Moreover, some duplicated data
(researchers and organizations) that appeared when unifying the
two datasets were eliminated.</p>
      <p>This information was uploaded to an online space inside the
gnoss.com platform to consume and exploit the data and present
the end-user applications.</p>
      <p>We prepared a general navigation through tabs that includes a
homepage with content selection and three other tabs
corresponding to the three entities from the datasets: publications,
researchers and organizations.</p>
      <p>The three previous entities were represented on the platform with
their specific ontologies thanks to the semantic CMS of GNOSS
following the standard vocabularies of the original data: FOAF
(Friend-of-a-Friend),5 SWRC (Semantic Web for Research
Communities),6 DC (Dublin Core),7 etc. In addition, other
vocabularies were included for representing the extended
3 Dbpedia: http://dbpedia.org/</p>
      <sec id="sec-6-1">
        <title>4 Geonames: http://www.geonames.org/</title>
        <p>5 FOAF Vocabulary specification: http://xmlns.com/foaf/spec/
6 SWRC ontology: http://swrc.ontoware.org/ontology#</p>
      </sec>
      <sec id="sec-6-2">
        <title>7 Dublin Core terms: http://purl.org/dc/terms/</title>
        <p>information and/or correlating datasets, for instance SIOC
(Semantically Interlinked Online Communities),8 SKOS (Simple
Knowledge Organization System),9 DBPROP10 or GN
(Geonames).11</p>
      </sec>
    </sec>
    <sec id="sec-7">
      <title>2.2 Other datasets</title>
      <p>Besides the direct consumption of the information provided in the
datasets on learning analytics research, we used other additional
datasets, either to supplement the information of the former ones
(as explained above) or to automatically generate dynamic
contexts with external related information. The following
additional datasets were employed:
1. Dbpedia, for supplementing the data of organizations and
obtaining geographic information that enables the connection with
Geonames.</p>
      <p>The automation of this process gave rise to incomplete
information for some items, in such a way that we could not
obtain the necessary information to represent all the researchers
and organizations in the geographic map. As a consequence, the
presentation of results differs from the ‘mosaic view’ (it includes
all the results) and the ‘map view’ (it only represents the
geolocated data). These data could be improved in the future.
This is a common problem in the Web of Data. Datasets usually
need to be refined because of one or more of the following
reasons: incomplete data, insufficient (missing) data or
inconsistent data (data are not well described or depicted or are
named wrongly). This complicates to provide an adequate service
and, specially, this makes it difficult to upload datasets and set
relations between data.
2. Geonames, with the aim of recovering geolocation data and use
them to develop the exploitation of geographic visualization of
data.
3. Two GNOSS existing datasets of scientific publications that we
found interesting as contexts in the field of learning analytics:
DBLP-GNOSS12 and DeustoTech publications. 13
‘DBLP-GNOSS’ is and index with over two million scientific
publications in IT, developed by GNOSS in collaboration with the
University of Deusto. The data of DBLP-GNOSS have been
obtained from the dataset in the LOD cloud ‘DBLP’ promoted by
the University of Trier, and have been enriched with abstracts and
key words.
‘DeustoTech publications’ is the dataset of scientific publications
of the Technology Institute of the University of Deusto,
DeustoTech. As a demo of a relevant external context, we
included a selection of the publications produced by the research
team DeustoTech Learning.
8 SIOC Core Ontology Specification: http://rdfs.org/sioc/spec/
9 SKOS namespace: http://www.w3.org/2004/02/skos/core#
10 Dbpedia ontology: http://dbpedia.org/Ontology
11 Geonames ontology: http://www.geonames.org/ontology
12 GNOSS Research Groups: http://researchgroups.gnoss.com
13DeustoTech publications:
http://deusto.gnoss.com/comunidad/DeustoTech/Publications</p>
    </sec>
    <sec id="sec-8">
      <title>2.3 Faceted searches</title>
      <p>The web of structured data makes it possible to develop strategies
for intelligent information retrieval based on faceted searches [4,
5, 6 and 7]. GNOSS has a powerful faceted search engine that is
generated by the GNOSS semantic graphs (RDF triplets); the
search engine exploits that graphs through reasoned or
inferencebased searches.</p>
      <sec id="sec-8-1">
        <title>The main advantages of facet-based searches are:</title>
        <p>They are based on meaning and concepts, and relations
between them.</p>
        <p>Users obtain reduced lists of results based on semantic
properties or attributes of the data.</p>
        <p>They allow reasoning: a new search allows restricting the
subset of data from the previous search across multiple
facets. You can progressively filter results until you reach a
manageable data set.</p>
        <p>Searches on the LAK Data Challenge space in GNOSS can be
started from two approaches:</p>
      </sec>
      <sec id="sec-8-2">
        <title>As a meta-search, seeking in any kind of content.</title>
        <p>Or selecting the item type to perform the search, either
choosing it from the facet ‘item type’ in the home webpage,
or navigating through the corresponding tab for every item.
In this case, there are three basic item types: publications,
researchers and organizations.</p>
        <p>Once an item type is selected, the search engine provides specific
facets for each of them, which are configurable in function of the
available data. The relevant facets that have been set for each case
of the LAK Data are:</p>
        <p>For publications: categories, tags, author, year, conference
and publication type.</p>
        <p>For researchers: categories, tags, affiliation (organization)
and country.</p>
        <p>For organizations: categories, tags, country, region, city and
students number.</p>
        <p>The GNOSS faceted search engine allows concatenated searches,
and all relationships among the facets are recalculated with each
successive filter for the corresponding set of results.
2.3.1 Summarization of results: direct quantitative
exploitation of data
GNOSS offers summarization of the number of results in each
property represented in the facets. The values are recalculated for
every set of results in aggregated searches. This gives direct
analytic information that is represented in the form of facets for
searches (see example in Figure 1).</p>
        <p>Thus, the search results give a lot of information through the
facets: how they relate to the other searching attributes. For
example, you look for publications with the tag ‘intelligent
tutoring system’, you obtain 12 results and know who worked on
this topic and who published the most papers, and you know that
the author Zachary A. Pardos, for example, wrote 3 publications
in the field. If you select this author, all the facets are recalculated
and you can see how they relate to the publications about
intelligent tutoring system by Pardos, for example, that they were
published in 2008, 2011 and 2012, that one of them is related to
Bayesian knowledge, and that he collaborated with other four
authors.</p>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>2.4 Navigation trough graphs and relationships between entities and properties</title>
      <p>The possibilities of navigation through graphs that connect
entities and properties (among them and with each other) are
immense and n-dimensional. Just to give some examples of items
relationships and possible navigation paths in the present case:
Authors and papers: authors who wrote articles on a specific
topic, authors of a publication you are interested in, papers
written by a selected author.</p>
      <p>Researchers and organizations in which they work: related
organizations working on similar topics.</p>
      <p>Authors and co-authors: if you find a researcher, you could
be interested in the people working with him, and then
discover other research areas the latter are working at, and
see their location in a map.</p>
      <p>Related topics and their relation with researchers (authors):
you look for a key word and you see other related terms and
the researchers publishing on that subject. You can navigate
through the authors and discover new publications,
coauthors, etc.</p>
      <p>Location, people and research topics: you look for
researchers by geographic criteria, e.g. United Kingdom, and
you get the topics they are working at (tags).</p>
    </sec>
    <sec id="sec-10">
      <title>2.5 Enriched contexts of information and recommendations</title>
      <p>The Web of Data also enables to connect information
significantly, which can be exploited in GNOSS for the generation
of dynamic contexts that can be customized for each case.
In the present work on LAK and EDM data, we set several
demonstration contexts depending on the object or entity that the
user is viewing:
1. Contexts for the entity ‘publication’: related LAK and EDM
publications (internal), related DBLP publications (external),
DeustoTech Learning publications (external).
2. Contexts for the entity ‘researcher’: co-authors (internal),
related organizations by topics (internal) and related DBLP
publications (external).
3. Contexts for the entity ‘organization’: related organizations
(internal), related researchers (internal), nearest organizations
(internal), geolocation (external) and its visualization on a map.
4. Contexts of general purpose: Freebase definitions of tags of the
contents (when the concepts have an article in Freebase). For
example, if you select a publication about data mining, when you
put the mouse on the tag ‘data mining’, a window appears with its
definition on Freebase and the link to the Freebase and Wikipedia
articles.</p>
    </sec>
    <sec id="sec-11">
      <title>2.6 Geographical visualisation of data</title>
      <p>The present work includes the development of an application to
represent a set of geolocated results in a geographical map. In the
case of LAK and EDM datasets, this visualisation is enabled for
researchers and organizations (see example in Figure 2),
combined with the option of filtering results by different and
aggregated facets.</p>
    </sec>
    <sec id="sec-12">
      <title>2.7 Visualisation of analytics with dynamic charts</title>
      <p>The analytics provided by summarization in search facets was
supplemented with some graphic visualisations. Google charts
tools14 were integrated in the platform to represent some analytics
related to the evolution and distribution of publications. Four
types of charts were used: column chart, intensity map, pie chart
and bar chart. The user can choose among several charts, and
continue filtering through facets successively, thus seeing how the
results evolve in the chart with the selected filters. Six charts were
included to analyse LAK data:</p>
      <p>Evolution of number of publications per year and publication
type (column chart, Figure 3). It shows how the number of
publications in this area has increased during the last years,
and how they are distributed in inproceedings (the main
part), articles and proceedings. These results can be restricted
to selected criteria filtering through facets.</p>
      <p>Distribution of number of publications per country (intensity
map, Figure 4). It gives a quick idea about the countries with
more scientific production in the field, according to search
criteria (total number, one or more specific topics, a selected
year, etc.).
14 Google chart tools. Information for developers available in
https://developers.google.com/chart/.</p>
      <p>Figure 5 Distribution of number of researchers with
publications per year (pie chart)
Distribution of number of publications per organization (bar
chart, Figure 6). The first view shows the total number of
publications for each organization along the years, and shows
clearly which ones have produced the larger amount, with the
Worcester Polytechnic Institute leading the list. By filtering
through facets, like year, tags or publication type, the user
can observe how the chart changes depending on those filter
options.</p>
      <p>Figure 6 Distribution of number of publications per
organization
Distribution of number of publications per author (bar chart).
It is similar to the previous one, but representing authors
instead of organizations.</p>
      <p>This work shows some examples of charts representing analytics
on the LAK data, and it is extensible to additional similar
exploitations.</p>
    </sec>
    <sec id="sec-13">
      <title>4. ACKNOWLEDGMENTS</title>
      <p>Government of Spain, Ministry of Economy and Competitiveness,
CDTI (Centre for Industrial Technological Development (CDTI),
for the funding of a project financed by CDTI (Project
IDI20110600).</p>
      <p>University of Deusto, for the collaborative work in the generation
of external contexts.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Taibi</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dietze</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <article-title>Fostering analytics on learning analytics research: the LAK dataset</article-title>
          ,
          <source>Technical Report</source>
          , 03/
          <year>2013</year>
          , URL: http://resources.linkededucation.org/
          <year>2013</year>
          /03/lak-datasettaibi.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <surname>Bizer</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Heath</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <year>2009</year>
          .
          <article-title>Linked Data - The Story So Far</article-title>
          .
          <source>International Journal on Semantic Web and Information Systems</source>
          ,
          <volume>5</volume>
          (
          <issue>3</issue>
          ),
          <fpage>1</fpage>
          -
          <lpage>22</lpage>
          . DOI:
          <volume>10</volume>
          .4018/jswis.2009081901.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <surname>Bauer</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Kaltenböck</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <year>2012</year>
          .
          <article-title>Linked Open Data: The Essentials. A Quick Start Guide for Decision Makers</article-title>
          . Edition Mono/monochrom, Vienna, Austria. ISBN:
          <fpage>978</fpage>
          -3-
          <fpage>902796</fpage>
          - 05-09.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <surname>Suominen</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Viljanen</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Hyvönen</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <year>2007</year>
          .
          <article-title>User-centric faceted search for semantic portals</article-title>
          .
          <source>In The Semantic Web: Research and Applications. Proceedings of the 4th European Semantic Web Conference ESWC2007</source>
          ,
          <article-title>forth-coming (Innsbruck, Austria</article-title>
          , June 3-7,
          <year>2007</year>
          ),
          <fpage>356</fpage>
          -
          <lpage>370</lpage>
          . DOI:
          <volume>10</volume>
          .1007/978-3-
          <fpage>540</fpage>
          -72667-8_
          <fpage>26</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <surname>Stefaner</surname>
            ,
            <given-names>M</given-names>
          </string-name>
          : Ferré,
          <string-name>
            <given-names>S.</given-names>
            ;
            <surname>Perugini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ;
            <surname>Koren</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          ; Zhang,
          <string-name>
            <surname>Y.</surname>
          </string-name>
          <year>2009</year>
          .
          <article-title>User Interface Design</article-title>
          . In Dynamic Taxonomies and
          <string-name>
            <given-names>Faceted</given-names>
            <surname>Search</surname>
          </string-name>
          .
          <year>2009</year>
          . Sacco,
          <string-name>
            <given-names>G. M.</given-names>
            ;
            <surname>Tzitzikas</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y</surname>
          </string-name>
          . (Eds.)
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <surname>Ferré</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; Hermann,
          <string-name>
            <given-names>A.</given-names>
            ;
            <surname>Ducassé</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <year>2011</year>
          .
          <article-title>Semantic Faceted Search: Safe and Expressive Navigation in RDF Graphs. Research report</article-title>
          . ISSN:
          <fpage>2102</fpage>
          -
          <lpage>6327</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Dal</given-names>
            <surname>Mas</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <year>2012</year>
          .
          <article-title>Faceted Semantic Search for Personalized Social Search</article-title>
          . In Computing Research Repository,
          <year>2012</year>
          , abs-
          <volume>1202</volume>
          -
          <fpage>6685</fpage>
          . URL: http://arxiv.org/abs/1202.6685.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>