<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Linked-Data-based search system of educational tools for the Web of Data</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Adolfo Ruiz-Calleja</string-name>
          <email>adolfo@gsic.uva.es</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>School of Telecommunications Engineering, University of Valladolid</institution>
          <addr-line>Camino del Cementerio s/n, 47011 Valladolid</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
      </contrib-group>
      <fpage>79</fpage>
      <lpage>84</lpage>
      <abstract>
        <p>The number of learning situations that can be carried out in a VLE (Virtual Learning Environment) can be improved by the integration of third-party external tools. Before the integration takes place, it is compulsory to retrieve some information, both to select the most appropriate tool to support an speci c situation and to be able to integrate it in the VLE. Current tool registries have some drawbacks that make the data retrieval di cult for the educators; among these drawbacks, the most important one is that search engines are not able to automatically import information from external datasets. To overcome this limitation, this paper introduces an ongoing doctorate research that proposes the creation of a search engine based on the Linked-Data principles.</p>
      </abstract>
      <kwd-group>
        <kwd>Linked Data</kwd>
        <kwd>educational tools</kwd>
        <kwd>Semantic Web</kwd>
        <kwd>integration</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        designed to retrieve text documents and not data about software tools; these
documents may contain a text description about software tools, but data must
be extracted out of them by a human. In order to facilitate the software tool
data retrieval, some other search engines, such as Google Gadgets3 or Yahoo!
Widgets4, give support for searching tools, providing data about thousands of
tools. Moreover, they are commonly used by non-expert users. However, these
kind of search engines are not specialized in educational domain, so they do
not provide educational information related to the tools; in addition they are
keyword-based search systems, which are prone to obtain irrelevant results [8,
Pag. 91]. These drawbacks are overcome by OntoolSearch5, an educational tool
search engine based on semantic technologies that use the Ontoolcole [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]
ontology to describe software tools. Nevertheless, OntoolCole only allows to express
the functional properties of the tools, and not their non-functional properties.
Therefore, OntoolSearch cannot provide the information for integration of tools.
      </p>
      <p>Another important drawback that is common to all the aforementioned search
systems is that they are not able to automatically import information from
thirdparty external repositories. These search systems can only take information from
their own internal registry, which behaves as a independent data silo. Thus, each
search system is only able to provide the information that has been explicitly
described in its internal registry, even if there is more relevant information in
another dataset freely accessible through the Web. Moreover, a software tool may
be described in a search system internal registry but it may have been updated
in another external data source, so a teacher that uses this search system will
get out-of-date information.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Proposed approach</title>
      <p>
        The most important problem that this doctorate research work is trying to solve
is how to create an educational tool search engine able to automatically collect
information from third-party external data repositories. Obviously, the problem
of integrating external data does not only a ect to the educational domain; it
is a very important problem to solve related to the information retrievement
and data management. The Linked Data [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] approach is a recent proposal that
is expected to facilitate the automatic access to the information published in
external repositories.
      </p>
      <p>
        Linked Data is a methodology for publishing data in the Semantic Web [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
Its key idea is to identify concepts (both data and meta-data) with URIs and
reuse URIs de ned by external data providers. Following this approach, when
two repositories have information referred to the same entity, they will use the
same URI to de ne it and thus a software agent could automatically retrieve
the information related to a concept published in di erent data registries. This
proposal has been widely adopted and many data providers are linking their
3 http://desktop.google.com/plugins/
4 http://widgets.yahoo.com/
5 http://gsic.uva.es/ontoolsearch
datasets according to these principles, building the Web of Data, which is
motivated by the Linking Open Data Project6.
      </p>
      <p>In the Web of Data there are several interlinked data repositories with
information from multiple domains. Even if none of them provide speci c information
about educational tools, there are some data repositories where useful
information of software tools is registered. One of these datasets is DBpedia7, where
up-to-date structured information extracted from Wikipedia is freely available.
A key step to solve the proposed research question is to design a search system
able to extract structured data from di erent sources from the Web of Data, such
as Dbpedia, and automatically relate the data to the system internal vocabulary.
Note that the data extraction from each dataset will not be ad-hoc implemented
because, as di erent data sources are linked, a single software agent can extract
information from several datasets.</p>
      <p>However, there is no data registry in the Web of Data that describes software
tools contemplating their educational capabilities, nor there is any that give
details about how to integrate them in a VLE. So that, the support of searching
tools using educational abstractions will be limited. In order to solve this problem
a search engine is proposed; this search engine will have a collection of adaptors
that will allow to automatically extract data from di erent sources. Moreover,
in order to complete the description of software tools , teachers can publish their
educational capabilities and technical users can give details of how to integrate
them in a learning environment. This way, a new interlinked dataset will be
created, containing information with a speci c purpose and reusing information
from multiple previously available data repositories.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Methodology</title>
      <p>
        In order to design, develop and evaluate an educational tool search system, the
doctorate research is being iteratively performed according to the methodology
in engineering [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], following the next steps:
1. Research problem de nition. The rst stage consists on the de nition of a
relevant research problem after exploring the literature. The problem stated
is how to retrieve useful information for the selection of learning tools and
their integration in VLEs.
2. Solution proposal. The second stage tries to overcome the problem found
with a solution proposal. A software that supports the information retrieval
from third-party external data sources, as well as the publication of
educational information following the Linked Data principles is the current
proposal for the problem.
3. Solution design and development. This stage consists on the design of the
solution proposed in the previous stage and also implies the development of
a prototype. Therefore, a prototype of the architecture, including the search
6 http://linkeddata.org
7 http://dbpedia.org/About
engine and some adaptors will be developed. Moreover, the integration of
the prototype in at least a VLE will be carried out.
4. Solution assessment. The last stage shows that the solution overcomes the
problems that were detected in the rst stage of this methodology. In this
context, the system should be able to automatically retrieve and integrate
information from di erent sources and to registry educational information
about the tools following the Linked Data principles.
      </p>
      <p>The whole picture of the research proposal is represented in Figure 1.
Doctorate research has been working on some of the partial objectives described
in Section 2. Speci cally, previous work has focussed on the detection of the
information requirements for the tool integration in VLEs and in the design of
the search engine architecture.</p>
      <p>
        In order to detect the information that should be provided by the system,
some real examples were studied where a tool is published by a tool provider,
found and selected by a teacher and later on integrated in a VLE. Furthermore,
an analysis of the literature was carried out. Out of this review the tool
information requirements were established: the description of the tool should
contemplate both functional, technological and administrative parameters of the tool.
Once the information requirements were analyzed, an ontology was design in
order to de ne a vocabulary that can be used by the search system to describe the
learning tools. This ontology reuses other conceptualizations that can be found
in the literature; for example, Ontoolcole [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] is used to de ne the educational
concepts related to the tools while Dublin Core [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] provides the vocabulary of
the administrative domain. Nonetheless, a conceptualization for describing the
technical parameters has been speci cally de ned since there was not found any
ontology that de nes the needed concepts.
      </p>
      <p>
        As far as the search engine design concerns, the proposed architecture, which
is based on papers such as [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], is shown in Figure 2. The search engine has a
central manager that collects the query made by the teacher through its interface
and coordinates the data retrievement. The central manager sends the query
using a common language, de ne by the abovementioned ontology, to several
adaptors (two in the example of the Figure 2), which mediate in the data
exchange between the manager and the external data sources. Finally, there is an
educational data registry, which contains educational data about software tools
and enriches the information about tools provided by external data repositories.
      </p>
    </sec>
    <sec id="sec-4">
      <title>Conclusions and future work</title>
      <p>The present paper shows an ongoing doctorate research work. A research
question was detected, which consists on the design of an educational tool search
engine able to automatically collect information from external data repositories.
The proposed approach to solve this problem is based on the Linked Data
principles because it is a recent trend that allows software agents to automatically
retrieve information from the Web of Data.</p>
      <p>This approach overcomes the detected problems in current educational tool
search systems, such as OntoolSearch: rstly, it is possible to nd educational
tools that were not speci cally described in the system's data source; secondly,
data maintenance will be facilitated, as the system is able to automatically
import up-to-date information; nally, the publication of educational tool
information will be easier, since it will be possible to reuse the data retrieved from
external registries, so it will only be necessary to registry the tools educational
aspects. In addition, all the information created by the system will be published
and will be freely available on the Web, so it could be reused by other people
for some other educational applications.</p>
      <p>Future work will focus on the implementation of the search system described
in Section 4, taking into account the proposed architecture and the designed
ontology. In addition, the integration of the search system in at least a VLE will
be an interesting task to be carried out.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>W.R.</given-names>
            <surname>Adrion</surname>
          </string-name>
          .
          <article-title>Research methodology in software engineering: summary of the Dagstuhl workshop on future directions on software engineering</article-title>
          .
          <source>SIGSoft Software Engineering Notes</source>
          ,
          <volume>18</volume>
          (
          <issue>1</issue>
          ):
          <volume>36</volume>
          {
          <fpage>37</fpage>
          ,
          <year>January 1993</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>C.</given-names>
            <surname>Alario Hoyos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.I. Asensio</given-names>
            <surname>Perez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.L. Bote</given-names>
            <surname>Lorenzo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. Gomez</given-names>
            <surname>Sanchez</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          <article-title>Vega Gorgojo, and</article-title>
          <string-name>
            <given-names>A. Ruiz</given-names>
            <surname>Calleja</surname>
          </string-name>
          .
          <article-title>Integration of external tools in virtual learning environments: main design issues and alternatives</article-title>
          .
          <source>In Proceedings of 10th International Conference on Advanced Learning Technologies (ICALT)</source>
          , Sousse, Tunez,
          <year>July 2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>T.</given-names>
            <surname>Berners-Lee. Linked Data - Design Issues</surname>
          </string-name>
          ,
          <year>July 2007</year>
          . URL = http://www.w3.org/DesignIssues/LinkedData.html, ultima visita junio de
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>T.</given-names>
            <surname>Berners-Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Hendler</surname>
          </string-name>
          , and
          <string-name>
            <surname>O. Lassila.</surname>
          </string-name>
          <article-title>The semantic web</article-title>
          .
          <source>Scienti c American</source>
          ,
          <volume>284</volume>
          (
          <issue>5</issue>
          ):
          <volume>34</volume>
          {
          <fpage>43</fpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5. Miguel L.
          <string-name>
            <surname>Bote-Lorenzo</surname>
            ,
            <given-names>Eduardo</given-names>
          </string-name>
          <string-name>
            <surname>Gomez-Sanchez</surname>
          </string-name>
          , Guillermo Vega-Gorgojo,
          <article-title>Yannis A</article-title>
          .
          <string-name>
            <surname>Dimitriadis</surname>
            ,
            <given-names>Juan I.</given-names>
          </string-name>
          <string-name>
            <surname>Asensio-Perez</surname>
          </string-name>
          , and
          <string-name>
            <surname>Ivan M. Jorr</surname>
          </string-name>
          n-Abellan.
          <article-title>Gridcole: A tailorable grid service based system that supports scripted collaborative learning</article-title>
          .
          <source>Computers &amp; Education</source>
          ,
          <volume>51</volume>
          (
          <issue>1</issue>
          ):
          <volume>155</volume>
          {
          <fpage>172</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>DCMI</given-names>
            <surname>Usage</surname>
          </string-name>
          <article-title>Board</article-title>
          .
          <article-title>DCMI metadata terms</article-title>
          .
          <source>Speci cation</source>
          , DCMI,
          <year>January 2008</year>
          . URL = http://dublincore.org/documents/dcmi-terms/,
          <source>last visited July</source>
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>P.</given-names>
            <surname>Dillenbourg</surname>
          </string-name>
          .
          <article-title>Virtual learning environments</article-title>
          .
          <source>In Learning in the New Millennium: Building New Education Strategies for Schools. EUN Conference Workshop on Virtual Learning Environments</source>
          , Rhodes, Greece,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>D.</given-names>
            <surname>Fensel</surname>
          </string-name>
          .
          <article-title>Ontologies: A Silver Bullet for Knowledge Management</article-title>
          and
          <string-name>
            <given-names>Electronic</given-names>
            <surname>Commerce</surname>
          </string-name>
          . Springer-Verlag, Berlin, Germany, second edition,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>W.</given-names>
            <surname>Meng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Yu</surname>
          </string-name>
          , and
          <string-name>
            <given-names>K.L.</given-names>
            <surname>Liu</surname>
          </string-name>
          .
          <article-title>Building e cient and e ective metasearch engines</article-title>
          .
          <source>ACM Computing Surveys (CSUR)</source>
          ,
          <volume>34</volume>
          (
          <issue>1</issue>
          ):
          <volume>48</volume>
          {
          <fpage>89</fpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. G.
          <string-name>
            <surname>Vega-Gorgojo</surname>
            ,
            <given-names>M. L.</given-names>
          </string-name>
          <string-name>
            <surname>Bote-Lorenzo</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Gomez-Sanchez</surname>
            ,
            <given-names>J. I.</given-names>
          </string-name>
          <string-name>
            <surname>Asensio-Perez</surname>
            ,
            <given-names>Y. A.</given-names>
          </string-name>
          <string-name>
            <surname>Dimitriadis</surname>
            ,
            <given-names>and I. M.</given-names>
          </string-name>
          <string-name>
            <surname>Jorr</surname>
          </string-name>
          n-Abellan.
          <article-title>Ontoolcole: Supporting educators in the semantic search of CSCL tools</article-title>
          .
          <source>Journal of Universal Computer Science (JUCS)</source>
          ,
          <volume>14</volume>
          (
          <issue>1</issue>
          ):
          <volume>27</volume>
          {
          <fpage>58</fpage>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>