<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Towards Service-Oriented Resource Discovery by means of Semantic Web Reasoning</article-title>
      </title-group>
      <contrib-group>
        <aff id="aff0">
          <label>0</label>
          <institution>High Performance Computing Center Stuttgart (HLRS), University of Stuttgart</institution>
          ,
          <addr-line>Nobelstrasse 19, 70569 Stuttgart</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>67</fpage>
      <lpage>80</lpage>
      <abstract>
        <p>Reasoning is one of the essential tools of the modern Semantic Web. A number of applications for resource discovery on the Web such as random indexing enjoy a prominent place in face of the novel Semantic Web Reasoning trends. However, the reasoning algorithms are dealing with significant challenges when scaled up to the problem sizes addressed by the modern Semantic Web application. As such, they are not well-optimized to be applied to the emerging Internet-scale knowledge bases. We introduce a solution to building highly efficient and scalable reasoning applications based on the Large Knowledge Collider - a service-oriented incomplete reasoning platform breaking the scalability barriers of the existing solutions. We discuss the application of incomplete reasoning for the resource discovery tasks and demonstrate a serviceoriented realization for the query expansion and subsetting algorithms based on the random indexing knowledge extraction technique.</p>
      </abstract>
      <kwd-group>
        <kwd>Random Indexing</kwd>
        <kwd>Semantic Web Reasoning</kwd>
        <kwd>Large Knowledge Collider</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        The large- and internet-scale data applications is a primary challenge for the Semantic
Web, and in particular for reasoning algorithms, used for processing exploding
volumes of data, exposed currently on the Web. Reasoning is the process of making
implicit logical inferences from the explicit set of facts or statements, which constitute
the core of any knowledge base. The key problem for most of the modern reasoning
engines such as Jena [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] or Pellet [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] is that they can not efficiently be applied for the
real-life data sets that consist of tens, sometimes of hundreds of billions of triples (a
unit of the semantically annotated information), which can correspond to several
petabytes of digital information. Whereas modern advances in the Supercomputing
domain allow this limitation to be overcome, the reasoning algorithms and logic need
to be adapted to the demands of rapidly growing data universe, in order to be able to
take advantages of the large-scale and on-demand infrastructures such as high
performance computing or cloud technology. On the other hand, the algorithmic
principals of the reasoning engines need to be reconsidered as well in order to allow for
very large volumes of data. Service-oriented architectures (SOA) can greatly
contribute to this goal, acting as the main enabler of the newly proposed reasoning
techniques such as incomplete reasoning [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. This paper focuses on a service-oriented
solution for constructing Semantic Web applications of a new generation, ensuring
the drastic increase of the scalability for the existing reasoning applications, as
elaborated by the Large Knowledge Collider (LarKC)1 EU project.
      </p>
      <p>The paper is organized as follows. In Section 2, we collect our consideration
towards enabling the large-scale reasoning and its application for the resource discovery
tasks. In Section 3, we discuss LarKC – a service-oriented platform for development
of fundamentally new reasoning application, with much higher scalability barriers as
by the existing solutions. In Section 4, we introduce some successful resource
discovery applications implemented with LarKC, such as Random Indexing. In Section 5,
we discuss our conclusions and highlight the directions for future work in highly
scalable semantic reasoning.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Semantic Reasoning on the Web Scale</title>
      <p>
        Despite the majority of data on the Web is available as an unstructured text, e.g.
generated from the content kept in RDBM, the application areas of the modern Semantic
Web spawn a wide range of domains, from social networks to large-scale Smart Cities
projects in the context of the future internet [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ][
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. However, data processing in such
applications goes far beyond a simple maintenance of the collection of facts; based on
the explicit information, collected in datasets, and simple rule sets, describing the
possible relations, the implicit statements and facts can be acquired from those
datasets.
      </p>
      <p>Many data collections as well as application built on top of them allow for
rulebased inferencing to obtain new, more important facts. The process of inferring
logical consequences from a set of asserted facts, specified by using some kinds of logic
description languages (e.g., RDF/RDFS and OWL2), is in focus of semantic
reasoning. The goal is to provide a technical way to determine when inference processes is
valid, i.e., when it preserves truth. This is achieved by the procedure which starts
from a set of assertions that are regarded as true in a semantic model and derives
whether a new model contains provably true assertions.</p>
      <p>
        The latest research on the Internet-scale Knowledge Base Technologies, combined
with the proliferation of SOA infrastructures and cloud computing, has created a new
wave of data-intensive computing applications, and posed several challenges to the
Semantic Web community. As a reaction on these challenges, a variety of reasoning
methods have been suggested for the efficient processing and exploitation of the
semantically annotated data. However, most of those methods have only been approved
for small, closed, trustworthy, consistent, coherent and static domains, such as
synthetic LUBM [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] sets. Still, there is a deep mismatch between the requirements on the
      </p>
      <sec id="sec-2-1">
        <title>1 http://www.larkc.eu/</title>
        <p>2 http://www.w3.org/TR/owl-ref/
real-time reasoning on the Web scale and the existing efficient reasoning algorithms
over the restricted subsets.</p>
        <p>
          Whereas unlocking the full value of the scientific data has been seen as a strategic
objective in the majority of ICT- related scientific activities in EU, USA, and Asia
[
          <xref ref-type="bibr" rid="ref7">7</xref>
          ], the “Big Data” problem has been recognized as the primary challenger in
semantic reasoning [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ][
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. Indeed, the recent years have seen a tremendous increase of the
structured data on the Web with scientific, public, and even government sectors
involved. According to one of the recent IDC reports [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], the size of the digital data
universe has grown from about 800.000 Terabytes in 2009 to 1.2 Zettabytes in 2010,
i.e. an increase of 62%. Even more tremendous growth should be expected in the
future (up to several tens of Zettabytes already in 2012, according to the same IDC
report [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]).
        </p>
        <p>
          The “big data” problem makes the conventional data processing techniques, also
including the traditional semantic reasoning, substantially inefficient when applied for
the large-scale data sets. On the other hand, the heterogeneous and streaming nature
of data, e.g. implying structure complexity [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], or dimensionality and size [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ],
makes big data intractable on the conventional computing resource [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. The problem
becomes even worse when data are inconsistent (there is no any semantic model to
interpret) or incoherent (contains some unclassifiable concepts) [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
        </p>
        <p>
          The broad availability of data coupled with increasing capabilities and decreasing
costs of both computing and storage facilities has led the semantic reasoning
community to rethink the approaches for large-scale inferencing [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. Data-intensive
reasoning requires a fundamentally different set of principles than the traditional mainstream
Semantic Web offers. Some of the approaches allow for going far beyond the
traditional notion of absolute correctness and completeness in reasoning as assumed by the
standard techniques. An outstanding approach here is interleaving the reasoning and
selection [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. The main idea of the interleaving approach (see Fig. 1a) is to
introduce a selection phase so that the reasoning processing can focus on a limited (but
meaningful) part of the data, i.e. perform incomplete reasoning.
        </p>
        <p>
          As discussed before, the standard reasoning methods are not valid in the existing
configurations of the Semantic Web. Some approaches, such as incomplete reasoning,
offer a promising vision how a reasoning application can overcome the “big data”
limitation, e.g. by interleaving the selection with the reasoning in a single
“workflow”, as shown in Fig. 1a. However the need of combining several techniques within
a single application introduces new challenges, for example related to ensuring the
proper collaboration of team of experts working on a concrete part of the workflow,
either it is identification, selection, or reasoning. Another challenge might be the
adoption of the already available solutions and reusing them in the newly developed
applications, as for example applying selection to the JENA reasoner [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ], whose
original software design doesn’t allow for such functionality. The SOA approach can
help eliminate many of the drawbacks on the way towards creating new,
servicebased reasoning applications. Supposed that each of the construction blocks shown in
Fig. 1a is a service, with standard API that ensures easy interoperability with the other
similar services, quite a complex application can be developed by a simple
combination of those services in a common workflow (see Fig. 1b).
        </p>
        <p>
          Resource discovery is an essential feature of the Semantic Web, which involves
tasks of decentralized and autonomous control, distributed service discovery etc.
Reasoning can greatly contribute to solving these issues by for example improving the
fine-grained service matchmaking, resource ranking, etc. in typical resource discovery
workflows [
          <xref ref-type="bibr" rid="ref29">29</xref>
          ].
        </p>
        <p>
          Although utilizing reasoning in the resource discovery workflows is not a new
concept for the Semantic Web [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ][
          <xref ref-type="bibr" rid="ref18">18</xref>
          ], there was quite a big gap in realizing the single
steps of the reasoning algorithms (Fig. 1b) as a service. This was due to many
reasons, among them complexity of the data dependency management, ensuring
interoperability of the services, heterogeneity of the service’s functionality. Realizing a
system where a massive number of parties can expose and consume services via
advanced Web technology was also a research highlight for Semantic Web. An example
of very successful research on offering a part of the semantic reasoning logic as a
service is the SOA4ALL3 project, whose main goal was to study the service abilities
of development platforms capable of offering semantic services. Several useful
services wrapping such successful reasoning engines as IRIS [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ] and several others had
been developed in the frame of this project. Nevertheless, the availability of such
services is only an intermediate step towards offering reasoning as a service, as a lot
of efforts were required to provide interoperability of those services in the context of
a common application. Among others, a common platform is needed that would
allow the user to seamlessly integrate the service by annotating their dependencies,
manage the data dependencies intelligently, being able to specify parts of the
execution that should be executed remotely, etc.
        </p>
        <p>
          An outstanding effort to develop such a platform was performed in the LarKC
(Large Knowledge Collider) [
          <xref ref-type="bibr" rid="ref20">20</xref>
          ] project. In the following sections, we discuss the
main ideas, solutions, and outcomes of this project.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>3 http://www.soa4all.eu/</title>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Large Knowledge Collider Approach</title>
      <p>
        In order to create a technology for creation of trend-new applications for large-scale
reasoning, several leading Semantic Web research organizations and technological
companies have joined their efforts around the project of the Large Knowledge
Collider (LarKC), supported by the European Commission. The mission of the project
was to set up a distributed reasoning infrastructure for the Semantic Web community,
which should enable application of reasoning far beyond the currently recognized
scalability limitations [
        <xref ref-type="bibr" rid="ref22">22</xref>
        ], by implementing the interleaving reasoning approach. The
current and future Web applications that deal with “big data” are in focus of LarKC.
      </p>
      <p>The LarKC’s design has been guided by the primarily goal to build a scalable
platform for distributed high performance reasoning. Fig. 2 shows a conceptual view of
the LarKC platform’s architecture and the proposed development life-cycle. The
architecture was designed to holistically cover the needs of the three main categories of
users – semantic service (plug-in) developers, application (workflow) designers, and
end-users internet-wide. The platform’s design ensures a trade-off between the
flexibility and the performance of applications in order to achieve a good balance between
the generality and the usability of the platform by each of the categories of users.</p>
      <p>Below we introduce some of the key concepts of the LarKC architecture and
discuss the most important platform’s services and tools for them.</p>
      <sec id="sec-3-1">
        <title>Plug-in developers</title>
      </sec>
      <sec id="sec-3-2">
        <title>Workflow designers</title>
      </sec>
      <sec id="sec-3-3">
        <title>Application end-users</title>
        <p>SSeemmaanntitcicWWeebbSSeervrvicicee</p>
      </sec>
      <sec id="sec-3-4">
        <title>FFaarmrm/ /PPl ulugg-i-ninMMaarkrkeetptplalaccee</title>
        <p>DDeecciiddeerrss
IIddeennttiiffiieerrss
SSeelleeccttoorrss
TTrraannssffoorrmmeerrss
RReeaassoonneerrss
Plug-in
… Plug-in
Plug-in
Plug-in
Registry
Plug-in
Managers
Workflow
Support
System
End-points
RDF data base
SStotoraraggee</p>
      </sec>
      <sec id="sec-3-5">
        <title>LarKC platform</title>
        <p>Execution
Framework</p>
      </sec>
      <sec id="sec-3-6">
        <title>Infrastructure</title>
        <p>Highperformance
computer
CCoommppuutat atitoionn</p>
        <p>Data
Layer
(OWLIM)
Remote
Invocation
Framework</p>
        <p>(GAT)
Monitoring</p>
        <p>Service
MMoonnitiotorirningg
1. Plug-ins</p>
        <p>
          Plug-ins are standalone services implementing some specific parts of the reasoning
logic as discussed previously, whether it is selection, identification, transformation, or
reasoning algorithm, see more at [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ]. In fact, plug-ins can implement much broader
functionality as foreseen by the incomplete reasoning schema (Fig. 1), hence enabling
the LarKC platform to target much wider Semantic Web user community as originally
targeted, e.g. for machine learning or knowledge extraction. The services are referred
as plug-ins because of their flexibility and ability to be easily integrated, i.e. plugged
into a common workflow and hence constitute a reasoning application. To ensure the
interoperability of the plug-ins in the workflows, each plug-in should implement a
special plug-in API, based on the annotation language [
          <xref ref-type="bibr" rid="ref23">23</xref>
          ]. Most essentially, the API
defines the RDF schema (set of statements in the RDF format) taken as input and
produced as output by each of the plug-ins. The plug-in development is facilitated by
a number of special wizards, such as Eclipse IDE wizard or Maven archetype for
rapid plug-in prototyping. The ready-to-use plug-ins are uploaded and published on
the marketplace – a special web-enabled service offering a centralized, web-enabled
repository store for the plug-ins4.
2. Workflows
        </p>
        <p>The workflow designers get access to the Marketplace in order to construct a
workflow from the available plug-ins, combined to solve a certain task. In terms of LarKC,
workflow is a reasoning application that is constructed of the (previously developed
and uploaded on the Marketplace) plug-ins. The workflow’s topology is characterised
by the plug-ins included in the workflow as well as the data- and control flow
connections between these plug-ins.</p>
        <p>The complexity of the workflow’s topology is determined by the number of
included plug-ins, data connections between the plug-ins (also including multiple splits
and joins such as in Fig. 3a or several end-points such as in Fig. 3b), and control flow
events (such as instantiating, starting, stopping, and terminating single plug-ins or
even workflow branches comprising several plug-ins). Same as for plug-ins, the input
and output of the workflow is presented in RDF, which however can cause
compatibility issues with the user’s GUI, which are not obviously based on an RDF-compliant
representation. In order to confirm the internal (RDF) dataflow representation with
the external (user-defined) one, the LarKC architecture foresees special end-points,
which are the adapters facilitating the workflow usage in the tools outside of the
LarKC platform. Some typical examples of end-points, already provided by LarKC,
are e.g. SPARQL end-point (SPARQL query as input and set of RDF statements as
output) and HTML end-point (HTTP request without any parameters as input and
HTML page as output).</p>
        <p>
          For the specification of the workflow configuration, a special RDF schema was
elaborated for LarKC, aiming at simplification of the annotation efforts for the
work4 Visit the LarKC Plug-in Marketplace at http://www.larkc.eu/plug-in-marketplace/
72
flow designers. Fig. 4a shows a simple example of the LarKC workflow annotation.
Creation of the workflow specification can greatly be simplified by using upper-level
graphical tools, e.g. Workflow Designer that offers a GUI for visual workflow
construction (Fig. 4b) [
          <xref ref-type="bibr" rid="ref28">28</xref>
          ]. The elaborated schema makes specification of the additional
features such as remote plug-in execution extremely simple and transparent for the
users and can be used for tuning the front-end graphical interfaces of the applications
to adapt them to the user needs.
3. Applications
        </p>
        <p>
          Workflows are already standalone applications that can be submitted to the
platform and executed by means of such tools as Workflow Designer discussed above.
Nevertheless, workflows can also be wrapped into much more powerful user
interfaces, adapted to the needs of the targeted end-user communities, e.g. Urban
Computing [
          <xref ref-type="bibr" rid="ref24">24</xref>
          ], and using LarKC as a back-end engine. The service-oriented approach
makes possible hiding the complexity of the LarKC platform, by enabling its whole
power to the end-users through such interfaces. We present an exemplarily LarKC
application in Section 4.
4. Platform services
        </p>
        <p>
          All above-described activities related to plug-in creation, workflow design, and
application development are facilitated by an extensive set of the platform services, as
shown in Fig. 2. A detailed description of the main LarKC services can be found in
our previous publication [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ].
4
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Application Scenario – Random Indexing</title>
      <p>
        Random indexing [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ] is a distributional statistic technique used in resource discovery
for extracting semantically similar words from the word co-occurrence statistics in the
text data, based on high-dimensional vector spaces (Fig. 5).
      </p>
      <p>
        Random indexing offers new opportunities for a number of large-scale Web
applications performing the search and reasoning on the Web scale [
        <xref ref-type="bibr" rid="ref26">26</xref>
        ]. Prominent
application using random indexing is subsetting (Fig. 6a) and query expansion (Fig. 6b).
      </p>
      <p>
        Query expansion [
        <xref ref-type="bibr" rid="ref30">30</xref>
        ] is used in information retrieval with the aim to expand the
document collection returned as a result to a query, thus covering the larger portion of
the documents. Subsetting (also known as selection) [
        <xref ref-type="bibr" rid="ref31">31</xref>
        ], on the contrary, deprecates
the unnecessary items from a data set in order to achieve faster processing. Both
presented problems are complementary, as change properties of the query to best adapt it
to the search needs.
      </p>
      <p>The main complexity of the random indexing algorithms lies in the following:
• High dimensionality of the underlying vector space.</p>
      <p>A typical random indexing search algorithm performs traversal over all the entries of
the vector space. This means, that the size of the vector space to the large extent
defines the search performance. The modern data stores, such as Linked Life Data or
Open Phacts consolidate many billion of statements and result in vector spaces of a
very large dimensionality. Random indexing over such large data sets is
computationally very costly, with regard to both execution time and memory consumption. The
latter is of especial drawback for use of random indexing packages on the mass
computers. So far, only relatively small parts of the Semantic Web data have been indexed
and analyzed.
• High call frequency.</p>
      <p>Both indexing and search over the vector space is typically a one-time operation,
which means that the entire process should be repeated from scratch every time new
data is encountered.</p>
      <p>The implementation as a LarKC plug-in allows random indexing to take
advantages of the LarKC data and execution model, being seamlessly integrated with the
other plug-ins and building up a common workflow. This allows random indexing to
be coupled with reasoners to improve the resource discovery algorithm. On the other
hand, the reasoning process can also benefit from the integration, for example by
using random indexing to expand the initial query and improve the quality of the
obtained results, such as shown in Fig. 7.</p>
      <p>LarKC is the technology that not only enables the large-scale reasoning approach
for the already existing applications, but also facilitates their rapid prototyping with
low initial investments, leveraging the SOA approach through the unique platform
solutions. Furthermore, LarKC delivers a complete eco-system where the researches
from very different domains can team up in order to develop new challenging
mashup-applications, e.g. for the resource discovery, hence having a dramatic impact
on a lot of problem domains.</p>
      <p>Query expansion part</p>
      <p>Reasoning part
We proposed a technology that allows a resource discovery process to be enhanced by
integration with the reasoning. The technology is based on the Large Knowledge
Collider (LarKC). LarKC is very promising platform for creation of new-generation
semantic reasoning applications. The LarKC’s main value is twofold. On the one hand,
it enables a new approach for large-scale reasoning based on the technique for
interleaving the identification, the selection, and the reasoning phases. On the other hand,
through over the project’s life time (2008-2011), LarKC has evolved in an
outstanding, service-oriented platform for creating very flexible but extremely powerful
applications, based on the plug-in’s realization concept. The LarKC plug-in
marketplace has already comprised several tens of freely available plug-ins, which
implement new know-how solutions or wrap existing software components to offer their
functionality to a much wider range of applications as even originally envisioned by
their developers. Moreover, LarKC offers several additional features to improve the
performance and scalability of the applications, facilitated through the parallelization,
distributed execution, and monitoring platform. LarKC is an open source
development, which encourages collaborative application development for Semantic Web.
Despite being quite a young solution, LarKC has already established itself as a very
promising technology in the Semantic Web world. Some evidence of its value was a
series of Europe- and world-wide Semantic Web challenges won by the LarKC
applications. It is important to note that the creation of LarKC applications, including the
ones discussed in the paper, was also possible and without LarKC, but would have
required much more (in order of magnitude) development efforts and financial
investments.</p>
      <p>We believe that the availability of such platform as LarKC will make a lot of
developers to rethink their current approaches for resource discovery as well as semantic
reasoning towards their tighter coupling and wider adoption of the service-oriented
paradigm.
6</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgment</title>
      <p>This work has been performed for the LarKC project (http://www.larkc.eu), partly
funded by the European Commission's IST activity of the 7th Framework Programme
(ICT-FP7-215535).
7</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>McCarthy</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          : Introduction to Jena. IBM Developer Works, http://www.ibm.com/developerworks/xml/library/j-jena/
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Sirin</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Parsia</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cuenca Grau</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kalyanpur</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Katz</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Pellet: a practical owl-dl reasoner</article-title>
          .
          <source>Journal of Web Semantics</source>
          , http://www.mindswap.org/papers/PelletJWS.pdf
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Fensel</surname>
          </string-name>
          , D., van
          <string-name>
            <surname>Harmelen</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Unifying Reasoning and Search to Web Scale</article-title>
          .
          <source>IEEE Internet Computing</source>
          ,
          <volume>11</volume>
          (
          <issue>2</issue>
          ),
          <fpage>96</fpage>
          --
          <lpage>95</lpage>
          (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Broekstra</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Klein</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Decker</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fensel</surname>
          </string-name>
          , D., van
          <string-name>
            <surname>Harmelen</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Horrocks</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Enabling knowledge representation on the Web by extending RDF schema</article-title>
          .
          <source>Proceedings of the 10th international conference on World Wide Web (WWW '01)</source>
          , ACM,
          <fpage>467</fpage>
          --
          <lpage>478</lpage>
          (
          <year>2001</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Donovang-Kuhlisch</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Smart City Process Support and Applications as a Service - from the Future Internet</article-title>
          .
          <source>Future Internet Assembly</source>
          <year>2010</year>
          , http://fi-ghent.fiweek.eu/files/2010/12/1430-
          <string-name>
            <surname>Margarete-</surname>
          </string-name>
          Donovang-Kuhlisch.pdf (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Guo</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pan</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Heflin</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>LUBM: A Benchmark for OWL Knowledge Base Systems</article-title>
          .
          <source>Web Semantics</source>
          ,
          <volume>3</volume>
          (
          <issue>2</issue>
          ),
          <fpage>158</fpage>
          --
          <lpage>182</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>High</given-names>
            <surname>Level</surname>
          </string-name>
          Expert EU Group:
          <article-title>Riding the wave - How Europe can gain from the rising tide of scientific data</article-title>
          .
          <source>Final report, October</source>
          <year>2010</year>
          , http://ec.europa.eu/information_society/newsroom/cf/document.cfm?action=display&amp;doc_ id=
          <fpage>707</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Thompson</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Personick</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Large-scale mashups using RDF and bigdata</article-title>
          .
          <source>Semantic Technology Conference</source>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Hustadt</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sattler</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Data Complexity of Reasoning in Very Expressive Description Logics</article-title>
          .
          <source>Proc. IJCAI</source>
          <year>2005</year>
          ,
          <article-title>Edinburgh</article-title>
          ,
          <string-name>
            <surname>UK</surname>
          </string-name>
          ,
          <source>July 30-August</source>
          <volume>5</volume>
          <fpage>2005</fpage>
          . Morgan Kaufmann Publishers,
          <fpage>466</fpage>
          --
          <lpage>471</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>McKendrick</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Size of the data universe: 1.2 zettabytes and growing fast</article-title>
          ,
          <source>ZDNet.</source>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>Della</given-names>
            <surname>Valle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Ceri</surname>
          </string-name>
          , S., van
          <string-name>
            <surname>Harmelen</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fensel</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          :
          <article-title>It's a streaming world! Reasoning upon rapidly changing information</article-title>
          .
          <source>IEEE Intelligent Systems</source>
          ,
          <volume>24</volume>
          (
          <issue>6</issue>
          ),
          <fpage>83</fpage>
          --
          <lpage>89</lpage>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Fensel</surname>
          </string-name>
          , D., van
          <string-name>
            <surname>Harmelen</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          :
          <article-title>Unifying Reasoning and Search to Web Scale</article-title>
          .
          <source>IEEE Internet Computing</source>
          .
          <volume>11</volume>
          (
          <issue>2</issue>
          ),
          <fpage>96</fpage>
          --
          <lpage>95</lpage>
          (
          <year>2007</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Cheptsov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Assel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Towards High Performance Semantic Web - Experience of the LarKC Project</article-title>
          . inSiDE - Journal of Innovatives Supercomputing in Deutschland,
          <volume>9</volume>
          (
          <issue>1</issue>
          ),
          <fpage>72</fpage>
          - -
          <lpage>75</lpage>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>van Harmelen</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teije</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Reasoning with inconsistent ontologies</article-title>
          .
          <source>Proceedings of the International Joint Conference on Artificial Intelligence, IJCAI'05</source>
          ,
          <fpage>454</fpage>
          --
          <lpage>459</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Bozsak</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ehrig</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Handschuh</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hotho</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Maedche</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motik</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Oberle</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmitz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Staab</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stojanovic</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stojanovic</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Studer</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stumme</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sure</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tane</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Volz</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zacharias</surname>
          </string-name>
          , V.:
          <article-title>KAON - Towards a Large Scale Semantic Web</article-title>
          . Tjoa,
          <source>Proceedings of the Third international Conference on E-Commerce and Web Technologies</source>
          ,
          <fpage>304</fpage>
          --
          <lpage>313</lpage>
          (
          <year>2002</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Interleaving Reasoning and Selection with Semantic Data</article-title>
          .
          <source>Proceedings of the 4th International Workshop on Ontology Dynamics (IWOD-10)</source>
          ,
          <source>ISWC2010 Workshop</source>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Deelman</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gannon</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shields</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
            <given-names>I.</given-names>
          </string-name>
          :
          <article-title>Workflows and e-Science: An overview of workflow system features and capabilities</article-title>
          .
          <source>Future Generation Computer Systems</source>
          ,
          <volume>25</volume>
          (
          <issue>5</issue>
          ) (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Gil</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ratnakar</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fritz</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Assisting Scientists with Complex Data Analysis Tasks through Semantic Workflows</article-title>
          .
          <source>Proceedings of the AAAI Fall Symposium on Proactive Assistant Agents, Arlington</source>
          , VA.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>IRIS - Integrated Rule Inference System - API and User Guide</surname>
          </string-name>
          , http://irisreasoner.org/pages/user_guide.pdf
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          20.
          <string-name>
            <surname>Fensel</surname>
          </string-name>
          , D., van
          <string-name>
            <surname>Harmelen</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Andersson</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Brennan</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cunningham</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Della</given-names>
            <surname>Valle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Fischer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            ,
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            ,
            <surname>Kiryakov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            ,
            <surname>Schooler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            ,
            <surname>Tresp</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            ,
            <surname>Wesner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            ,
            <surname>Witbrock</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Zhong</surname>
          </string-name>
          , N.:
          <article-title>Towards LarKC: A Platform for Web-Scale Reasoning</article-title>
          .
          <source>Proceedings of the 2008 IEEE international Conference on Semantic Computing ICSC</source>
          ,
          <fpage>524</fpage>
          --
          <lpage>529</lpage>
          (
          <year>2008</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          21.
          <string-name>
            <surname>Assel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cheptsov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gallizo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Celino</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Dell'Aglio</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bradeško</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Witbrock</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>Della</given-names>
            <surname>Valle</surname>
          </string-name>
          , E.:
          <article-title>Large knowledge collider: a service-oriented platform for large-scale semantic reasoning</article-title>
          .
          <source>Proceedings of the International Conference on Web Intelligence, Mining and Semantics</source>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          22.
          <string-name>
            <surname>Assel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cheptsov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gallizo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benkert</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tenschert</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          :
          <article-title>Applying High Performance Computing Techniques for Advanced Semantic Reasoning</article-title>
          . eChallenges e-2010
          <source>Conference Proceedings. Paul Cunningham and Miriam</source>
          Cunningham (Eds).
          <source>IIMC International Information Management Corporation</source>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          23.
          <string-name>
            <surname>Roman</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bishop</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Toma</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gallizo</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fortuna</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>LarKC Plug-in Annotation Language</article-title>
          .
          <source>Proceedings of The First International Conferences on Advanced Service Computing - Service Computation</source>
          <year>2009</year>
          (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          24.
          <string-name>
            <given-names>Della</given-names>
            <surname>Valle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            ,
            <surname>Celino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Dell'Aglio</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.</surname>
          </string-name>
          :
          <article-title>The Experience of Realizing a Semantic Web Urban Computing Application</article-title>
          .
          <source>T. GIS</source>
          , vol.
          <volume>14</volume>
          ,
          <issue>iss</issue>
          .
          <volume>2</volume>
          ,
          <fpage>163</fpage>
          --
          <lpage>181</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          25.
          <string-name>
            <surname>Sahlgren</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>An introduction to random indexing</article-title>
          .
          <source>Proceedings of Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering TKE</source>
          <year>2005</year>
          ,
          <fpage>1</fpage>
          --
          <lpage>9</lpage>
          (
          <year>2005</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          26.
          <string-name>
            <surname>Jurgens</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stevens</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <string-name>
            <surname>The S-Space</surname>
            <given-names>Package</given-names>
          </string-name>
          :
          <article-title>An Open Source Package for Word Space Models</article-title>
          .
          <source>Proceedings of the ACL 2010 System Demonstrations</source>
          ,
          <fpage>30</fpage>
          --
          <lpage>35</lpage>
          (
          <year>2010</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          27.
          <string-name>
            <surname>Assel</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cheptsov</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Czink</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Damljanovic</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Quesada</surname>
            ,
            <given-names>J.:</given-names>
          </string-name>
          <article-title>MPI Realization of High Performance Search for Querying Large RDF Graphs using Statistical Semantics</article-title>
          .
          <source>Proceedings of the 1st Workshop</source>
          on
          <article-title>High-Performance Computing for the Semantic Web (HPCSW2011), co-located with the 8th Extended Semantic Web Conference</article-title>
          , ESWC2011, Heraklion, Greece, May
          <volume>29</volume>
          (
          <year>2011</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref28">
        <mixed-citation>
          28.
          <string-name>
            <surname>Le Phuoc</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polleres</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Morbidoni</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hauswirth</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tummarello</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Rapid semantic web mashup development through semantic web pipes</article-title>
          .
          <source>Proceedings of WWW2009 Research</source>
          Track (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref29">
        <mixed-citation>
          29.
          <string-name>
            <surname>Ruta</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>If objects could talk: novel resource discovery approaches in pervasive environments</article-title>
          . http://www.iaria.org/conferences2010/filesUBICOMM10/MicheleRuta_NexTech2010_
          <article-title>Ke ynote_Speech-2</article-title>
          .pdf
        </mixed-citation>
      </ref>
      <ref id="ref30">
        <mixed-citation>
          30.
          <string-name>
            <surname>Efthimiadis</surname>
            ,
            <given-names>E.: Query</given-names>
          </string-name>
          <string-name>
            <surname>Expansion. Martha E. Williams</surname>
          </string-name>
          (ed.),
          <source>Annual Review of Information Systems and Technology (ARIST)</source>
          ,
          <year>v31</year>
          ,
          <fpage>121</fpage>
          --
          <lpage>187</lpage>
          (
          <year>1996</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref31">
        <mixed-citation>
          31.
          <string-name>
            <surname>Quilitz</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Leser</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          :
          <article-title>Querying Distributed RDF Data Sources with SPARQL</article-title>
          .
          <source>Proceedings of the 5th European Semantic Web Conference (ESWC2008)</source>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>