<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Efficient Selection and Integration of Data Sources for Answering Semantic Web Queries</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Abir Qasem</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Dimitre A. Dimitrov</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jeff Heflin</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Lehigh University</institution>
          ,
          <addr-line>19 Memorial Drive West, Bethlehem, PA 18015</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Tech-X Corporation</institution>
          ,
          <addr-line>5621 Arapahoe Avenue, Suite A, Boulder, CO 80303</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>We present an approach to identifying the minimal set of potentially relevant Semantic Web data sources for a given query. Our solution involves the adaptation of an efficient information integration algorithm that has polynomial time complexity. We then use these selected sources and an OWL reasoner to answer queries on the Semantic Web. We introduce a concept of source relevance expressed in OWL to reduce the number of sources needed to get the answers to a query. As the Semantic Web is an autonomous entity, some of the data sources may contain data that are not described directly in terms of a given query ontology. In our solution we define and use a mapping language that is a subset of OWL for the purpose of aligning heterogeneous ontologies. Our implemented system supports a subset of SPARQL queries, simple OWL ontologies and data sources that commit to them. Since the time to load sources is a dominating factor in performance, and our system identifies the minimal set of potentially relevant sources, it is very efficient. We have conducted an experiment using synthetic ontologies and data sources which demonstrates that our system performs well over a wide range of queries. A typical response time for a given work load of 20 domain ontologies, 20 map ontologies and 400 data sources is approximately 1 second. Furthermore, our system returned correct answers to 200 randomly generated queries in three different data configurations.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Regardless of the advances in reasoning techniques and our most sincere efforts in
clever coding we will always have to deal with data that are just too big for a given
Semantic Web knowledge base system. Fensel and van Harmelen [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] suggest that one
way around this problem is to incrementally refine a subset of data sources that will
give us an acceptable result. In this work we consider an approach for identifying a
sufficient subset in a single shot. Our approach uses a concept that we refer to as “source
relevance” in conjunction with an adapted information integration algorithm to select
the data sources that are sufficient to provide the answers to a given query. Intuitively,
a data source is potentially relevant to a query if it has some information on some
predicate of the query. In our approach a data source can make assertions about its contents
by means of what we call REL statements. We then use these selected sources and an
OWL reasoner to obtain answers to distributed queries on the Semantic Web data. We
put forward, that this selective loading of only relevant data sources will provide a more
optimal solution because in the Semantic Web, data sources significantly outnumber
ontologies . This is reflected in Swoogle’s present cache which has 10,000 ontologies
but a staggering 2.5 million documents[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Currently our implemented system supports
a restricted subset of SPARQL queries, simple OWL ontologies and data sources that
commit to them.
      </p>
      <p>When identifying sources it is essential that we do not miss relevant sources that
commit to ontologies different from the one used in the query. In order to align
heterogeneous ontologies, we use the notion of map ontologies. In our solution, these are like
any other OWL ontology except they consist solely of axioms that relate concepts from
one ontology to concepts of another ontology. We use the term domain ontologies for
all other ontologies. The choice of OWL to articulate the alignments make these maps
shareable via the Web, where any one can create alignments and publish them in OWL
for others to use. Such maps may be created manually or by using state-of-the-art
ontology alignment tools. We note that aligning ontologies is a difficult but relevant problem
but this is not the focus of this paper.</p>
      <p>We hypothesize that a web like alignment framework will enable integration to be an
emergent property of the Web. Furthermore, existing OWL tools can be used to process
these maps. We note that we will not have alignments between all pairs of ontologies,
but it should be possible to compose an alignment from existing alignments.</p>
      <p>
        Once we have the maps between the ontologies established, we need to use them
to translate a query in terms of the data sources that are available. Database researchers
have developed information integration algorithms that address similar problems. In
our work we adapt one such algorithm by Halevy et al. known as the PDMS
reformulation algorithm [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. PDMS uses both Global-As-View (GAV) and Local-As-View
(LAV) maps, which are the two most well known information integration formalisms.
From a knowledge representation point of view, a GAV rule is essentially equivalent to
a Horn clause without function symbols and a LAV rule is a First Order Logic (FOL)
implication with a single antecedent and multiple consequents.
      </p>
      <p>
        Our query language is based on the conjunctive query language for Description
Logic (DL) that has been proposed by Horrocks et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. This query language
overcomes the inadequacy of DL languages in forming extensional queries. Furthermore, it
corresponds to the most common SPARQL queries. Note, however, that our
implementation is restricted by the query language supported by KAON2 which we use as our
reasoner. KAON2 can only answer the so called DL-safe conjunctive queries [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ].
      </p>
      <p>
        This work is an extension of our initial work presented in Dimitrov et al. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. The
previous work was based on a two-tier ontology architecture so it had no support for
map composition. Our enhancements now enable us to support a multi-tier ontology
architecture and we can compose maps of arbitrary length. Specifically we make the
following two technical contributions in this paper.
1. We present our source selection algorithm that uses OWL maps and REL statements
to identify relevant data sources for a given query.
2. We demonstrate that by loading the sources selected by our algorithm into a DL
reasoner, we achieve an efficient and effective solution for answering distributed
queries on the Semantic Web.
      </p>
      <p>The rest of the paper is organized as follows: In Section 2, we describe our
mapping language that is compatible with the mapping formalisms used by the PDMS and
introduce a mechanism for describing source relevance. In Section 3, we describe the
details of the Ontology Based Information Integrator (OBII), including a source
selection algorithm. In Section 4, we describe some experiments that we have conducted to
evaluate our system. In Section 5, we compare some related work with our approach
and in Section 6 we conclude and discuss future work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Mapping Language</title>
      <p>In this section, we introduce OWL for Information Integration (OWLII). It is a subset
of OWL, the design of which is influenced by the PDMS algorithm. We use OWLII to
describe maps and data sources in the Semantic Web using GAV and LAV rules. Since
OWL DL is decidable, its subset OWLII is also decidable.</p>
      <p>
        We start with Description Horn Logic (DHL) [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], which is contained in the
intersection of Description Logic and Horn Logic. More specifically, it is a fragment of
OWL corresponding to Horn clauses. As we have observed GAV rules are essentially
equivalent to Horn clauses without function symbols. Thus if we were using a GAV
information integration system, DHL would be a suitable mapping language.
      </p>
      <p>Although DHL is a very good starting point, the PDMS query reformulation
algorithm also supports LAV rules, and as such we can use a language that is more
expressive than DHL.</p>
      <p>
        We follow a process similar to Grosof et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] to define the subset of DL for
OWLII. In the discussion below the subscript a is used to refer to a DL language whose
classes can be mapped to antecedents of a FOL implication and the subscript c is used
to refer to a DL language whose classes can be mapped to consequents. Similarly, we
use the subscript ac to refer to classes that can be mapped to either the antecedent or the
consequent.
      </p>
      <p>Definition 1 Lac is a DL language where A is an atomic class and i is an individual.
If C and D are classes and R is a property, then C u D, ∃ R.C and ∃ R.{i} are also
classes.</p>
      <p>Note: ∃ R.{i} allows us to incorporate owl:hasValue in our language. Otherwise,
nominals are not supported.</p>
      <p>Definition 2 La includes all classes in Lac. Also, if C and D are classes then C t D is
also a class.</p>
      <p>Definition 3 Lc includes all classes in Lac. Also, if C and D are classes then ∀ R.C is
also a class.</p>
      <p>Definition 4 We now define a OWLII map ontology as a set of OWLII axioms of the
form C v D, A ≡ B, P v Q, P ≡ Q, P≡ Q−, where C is an La class, D is an Lc class,
A, B are Lac classes and P, Q are properties</p>
      <p>
        We have also defined a translation function T which takes a DL axiom of the form
C v D, where C is an La class and D is an Lc class, and maps it into the FOL format
for OWLII. This definition expands the one for DHL [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Due to limited space, we do
not present it here. Please see our technical report [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] for details.
      </p>
      <p>In Section 1, we introduced the notion of a REL statement. If a source can express
that it has relevant information, we can choose to query it as opposed to other sources
that do not express this information. In this way we can locate the desired information
without querying every possible source. Having relevant information, however, does
not mean that the source is capable of answering the query completely. It just says that
the source may have some useful information on the query.</p>
      <p>
        For example, consider the statement REL(http://sourceURL, Electronics,
CinemaDisplay u ∃ madeBy.“DELL”). It asserts that http://sourceURL contains some information
about cinema displays made by Dell and the data is marked up with the class
“Electronics”. In OWL we express the REL statement by introducing four new predicates in a
new name space "meta". They are meta:RelStatement, meta:contained, meta:container
and meta:source. In the above example the meta:container will be the class expression
that defines CinemaDisplay u ∃ madeBy.“DELL”, the meta:contained will be
Electronics and meta:source will be http://sourceURL. The meta:RelStatement will encapsulate
these three predicates. REL statements can be translated into LAV rules using a minor
variation of the function T mentioned above. Due to limited space, we can not present
the detailed description of the REL statements in this paper. Please see our technical
report [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] for details.
3
      </p>
    </sec>
    <sec id="sec-3">
      <title>OBII: A Semantic Web Query Answering System</title>
      <p>In this section we present our modified algorithm and briefly discuss the implemented
architecture of OBII to give the readers a flavor of the working system.</p>
      <p>
        Our algorithm is based on the PDMS algorithm, which takes as input a query, a set
of views describing the sources and the maps, and computes a reformulation strictly in
terms of the sources. As long as there are no cycles in the maps, the PDMS algorithm
computes complete reformulations in polynomial time [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The PDMS imposes certain
restrictions on the input for it to remain polynomial time. In our adaption we ensure
that our input language conforms to these restrictions. The algorithm constructs a
“rulegoal” tree: where goal nodes are labeled with atoms of the peer relations, and rule nodes
are labeled with peer maps. Each AND-OR traversal from root to leaf of the rule-goal
tree represents one way of answering the query. The reformulation then is obtained by
the union of all of these traversals.
      </p>
      <p>We now describe our source selection algorithm. Note: from here on we refer to an
ontology that only has classes, properties, subClassOf and subPropertiesOf axioms as
a simple ontology. For our algorithm we assume that all domain ontologies are simple
ontologies given their present dominance in the Semantic Web. Our mapping
ontologies are described in OWLII (see Section 2), and sources are described using the REL
statements and only contain ABox assertions that use named classes and properties.</p>
      <p>
        Given a conjunctive query we unfold it to give us the starting point of our rule goal
tree. Now for each atom in the unfolded query we attempt to expand it using maps and
source descriptions that are available to the system. By recursively continuing with this
expansion until we do not have any more maps (or source descriptions) available we
build a rule goal tree in a similar fashion as the basic PDMS described above. However,
our expansion is different due to the presence of domain ontologies with class and
property taxonomies. We implement taxonomic reasoning as follows. For each query goal
we use a reasoner to find the set of subclasses (sub properties). We then implement a
variation of standard unification process that attempts to unify any of the sub predicates
of a node with a given map. Our extended expansion algorithm is presented as
Algorithm 1. Due to limited space we can not explain the algorithm in details. Interested
readers are referred to our technical report [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. We however, note the following for
readability: a) the subPred and the match routine implements the taxonomic reasoning
b) the details of the MINICON routine can be found in Pottinger and Halevy [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and c)
the MapViews and SourceViews objects are data structures that store the LAV and GAV
maps and LAV source descriptions respectively. Note, both of these sets are indexed by
their source ontologies for efficient retrieval where the source ontology of a map is the
left hand side (LHS) ontology of a GAV map or the right hand side (RHS) ontology of
a LAV map. After we have built the rule goal tree we read off all of the unique sources
from the leaves of the rule goal tree.
      </p>
      <p>Algorithm 1 OBII node expansion.</p>
      <p>EXPAND(Node n, MapViews MV, SourceViews SV)
1: sp ← SUBPRED(n.pred, n.ont)
2: omaps ← {m | (n.ont, m) ∈ MV}
3: for each v ∈ omaps do
4: if v has not been used then
5: if GAV(v) and MATCH(n, sp, HEAD(v)) then
6: for each sub goal ∈ v do
7: create an AND child goal node
8: for each child goal node cgn do
9: EXPAND(cgn, MV, SV)
10: else if LAV(v) and MATCH(n, sp, b) for some b ∈ BODY(v) then
11: create an OR child node cgn for the view using MINICON
12: EXPAND(cgn, MV, SV)
13: smaps ← {m | (n.ont, m) ∈ SV}
14: for each v ∈ smaps do
15: if v has not been used then
16: if GAV(v) and MATCH(n, sp, HEAD(v)) then
17: for each sub goal ∈ v do
18: create an AND child goal node
19: else if LAV(v) and MATCH(n, sp, b) for some b ∈ BODY(v) then
20: create an OR child node cgn for the view using MINICON</p>
      <p>The basic PDMS does not allow cycles in the maps as this makes query answering
undecidable. It prevents cycles by never expanding a goal node with a map that was
already used on the path from the root to that node in question. Therefore the set of
AND-OR traversals will not be a complete reformulation in the presence of cyclic maps.
However, our source selection algorithm only needs to identify the unique set of sources
from the leaves of the tree. Since the "cycle check" only prunes subtrees that would be
exact copies of other portions of the rule-goal tree, we do not miss any relevant sources,
even in the presence of cyclic maps.</p>
      <p>Figure 1 shows the architecture of our system. OBII executes in two asynchronous
phases: a conversion phase and the query phase. The conversion phase, is done by the
OWLIIRuleProcessor and occurs when a new source or a map ontology becomes
available to the system. OWLIIRuleProcessor parses the OWLII map ontologies and REL
meta data into OBII’s MapKB as LAV and GAV rules. In this way the system’s
knowledge base always has the necessary information to reformulate a query.</p>
      <p>User</p>
      <p>MapKB
SPARQL
results</p>
      <p>maps</p>
      <p>The query phase occurs when a query comes in. The Source Selector then takes the
user’s query and mapKB as input and uses our source selection algorithm to produce a
set of selected sources which may contribute to an answer. Note, the module will also
load some domain ontologies for use in taxonomic reasoning. The AnsweringEngine
loads the sources selected by the Source Selector, the domain ontologies that are used
in the node expansion and all the relevant map ontologies into KAON2. Then it issues
the original query to the reasoner and formats the retrieved answers. Note: by loading
only the used ontologies we provide a system that will scale well in terms of reasoning
when we have a potentially large number of ontologies.
4</p>
    </sec>
    <sec id="sec-4">
      <title>Evaluation</title>
      <p>We implemented a workload generator that allows us to control several characteristics
of our dataset. In generating the synthetic domain ontologies we decided to have on
the average 20 classes and 20 properties (influenced by the dominance of small
ontologies in the current Semantic Web). The class and property taxonomy have an average
branching factor of 4 and an average depth of 3. In generating the OWLII map
ontologies we chose to have an even distribution of various OWLII axioms and chose to
map about 30% of the classes and 30% of the properties of a given domain ontology.
The resulting GAV and LAV maps contain an average of 5 predicates with some maps
containing up to 11 predicates. The average data source has 75 triples and uses 30% of
the classes and 30% of the properties of the domain ontology that it commits to. We
generate 200 random conjunctive queries with 1 to 3 predicates (75% are properties as
opposed to a class).</p>
      <p>
        In choosing the configurations for our experiments we decided to vary two
parameters: the number of data sources that commits to an ontology and the maximum number
of maps required to translate from any source ontology to any target ontology. This
number is referred to as the diameter by Halevy et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. We adopt the term in our
discussion. We conducted two sets of experiments to evaluate the systems. In the first
experiment (herein after referred to as experiment 1) we have varied the number of data
sources that commits to a given ontology. In the second experiment (herein after
referred to as experiment 2) we have varied the diameter. In both experiments we kept the
number of ontologies to 20. We denote an experiment configuration as follows:
(nOnD-nS) where nO is number of ontologies, nD is the diameter and nS is the number of
sources that commit to an ontology.
      </p>
      <p>In our experiments the two main metrics we collected and examined are response
time and the percentage of complete responses to queries. The response time is the time
it takes from the issue of a query to the delivery of its result. We compared three systems
in our experiments: A baseline system that loads all the ontologies and data sources
and reasons over the complete knowledge base, a system that uses the original PDMS
algorithm for source selection and our OBII system. We should note here that because
the baseline system has a very different architecture, its response time is calculated
differently. For the baseline system we add the load time (i.e. the time to load semantic
web space) and the reasoning time to get the answers. The load time is added because
as we are considering a dynamic environment, we should always work on fresh data,
therefore each query results in a new knowledge base. For the other two systems, load
time is calculated as the time to load the domain ontologies, the map ontologies that
have been used in the source selection and the selected data sources. The response time
for these two systems then is a sum of load time, source selection time and the reasoning
time.</p>
      <p>In determining the completeness of queries, we consider the baseline system’s
answers to be the reference set. This metric is the percentage of queries where a system
returns the same answer as baseline, considering only queries that entail at least one
answer. Note: KAON2 is only sound and complete for DL-safe conjunctive queries
therefore the baseline system may not be complete for some queries.</p>
      <p>The first observation from our experiments is that OBII is complete with respect to
KAON2, where as the basic PDMS drops in completeness as we add data sources or
increase the diameter. This is evident from Figure 2(a) which shows the completeness
over the set of queries for experiment 1. The second observation is that this
completeness comes at a price. In figures 2(b) and 3(a) where we show the average response
time for each system it is clear that OBII is twice as slow as the basic PDMS. Note:
(a) Completeness of OBII and PDMS (b) Scalability w.r.t. the increase of
compared to baseline sources</p>
      <p>(a) Scalability w.r.t. the increase of (b) Query to Load time ratio for PDMS and
diameter OBII
the graphs use a logarithmic scale. However, OBII is 10 times faster than the baseline
system, which provides identical functionality.</p>
      <p>This time penalty versus the PDMS is essentially unavoidable. In order to get
complete answers OBII loads more sources than PDMS. Recall this is because we
incorporate taxonomic reasoning in our algorithm which the original PDMS does not. The cost
of loading these sources generally dominates its response time. This is evident from
the figure 3(b). However, for large diameter this dominance is reduced as OBII works
with deeper rule goal trees. Even so, in the experiment that had the largest diameter,
the response time for OBII is little over 2.5 seconds (as opposed to PDMS’s 1 second).
Furthermore, PDMS is only 60% complete in that scenario.
5</p>
    </sec>
    <sec id="sec-5">
      <title>Related Work</title>
      <p>
        Serafini and Tamilin have developed the DRAGO system that reasons with multiple
semantically related ontologies by using semantic mappings to combine the inferences of
local reasoning of each ontology. Although their original work focused only on TBoxes
they have recently extended this work to accommodate ABoxes to perform instance
retrieval queries [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Their work is different from ours in that they consider the map
processing (translating query to source) as part of the reasoning process. Therefore they
have to work on a much larger knowledge base as they have to consider all the maps
available to the system. The Piazza system [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] that uses the PDMS algorithm focuses
more on integrating XML documents. The treatment of OWL is limited in this work
and is described as a fairly difficult problem.
      </p>
      <p>
        Haase and Motik [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] have described a mapping system for OWL and proposed a
query answering algorithm. They identify a mapping language that is similar to ours.
However, as their language adds rules to OWL, it is undecidable and as such they need
to introduce restrictions to achieve decidability. Our language, on the other hand, is a
sublanguage of a decidable language. Furthermore, similar to the DRAGO approach,
Haase and Motik do not rely on an explicit reformulation step and process all the maps
for a query reformulation.
      </p>
      <p>
        Peer-to-peer systems like Bibster [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and SomeWhere [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] have shown promises in
providing query answering solutions for the Semantic Web. However, a peer-to-peer
system needs special software installed at every server. Our system on the other hand
makes use of the existing infrastructure of the Web.
      </p>
      <p>
        There has been some work done in selecting the appropriate RDF source for a query.
One approach is to develop an index of sources [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and develop a query answering
algorithm that exploits these indexes. However, their source description language is
limited to RDF where as our OWLII is a much richer language for source description.
6
      </p>
    </sec>
    <sec id="sec-6">
      <title>Conclusion and Future Work</title>
      <p>In this paper we have introduced a source selection problem for the Semantic Web.
We have defined OWLII, a subset of OWL that is compatible with the GAV and LAV
formalisms and which is more expressive than DHL. We have adapted a query
reformulation algorithm to solve our source selection problem. As our experiments demonstrate
our system is 10 times faster than a naive approach of reasoning with all sources, and
only twice as slow as the incomplete information integration algorithm that it is based
on. Our system returned correct answers with respect to KAON2 to 200 randomly
generated queries in three different data configurations.</p>
      <p>This work opens up some interesting avenues for further research. First, we have
assumed that the mapped ontologies only have simple taxonomic axioms. We have not
considered the more advanced axioms that are available in OWL. One way to address
this is to view the axioms as self-referential maps. Second, we want to investigate ways
that will determine the best path to a translation. This may not always be the shortest
path. Sometimes due to a translation that loses information, we may choose to follow a
path that results in the least loss of information as opposed to the least number of
translations. Third, we intend to explore methods of automatically generating high quality
REL statements. Finally, we have observed that our rule-goal trees get very big as we
increase the diameter of the system. We intend to explore optimizations that remove
redundancy from these trees.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgment</title>
      <p>This work was funded by Department of Energy via a Tech-X Corporation subcontract.
We thank Ameet Chitnis and Fabiana Prabhakar for comments on this paper.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>P.</given-names>
            <surname>Adjiman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Chatalic</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Goasdoué</surname>
          </string-name>
          ,
          <string-name>
            <surname>M.-C. Rousset</surname>
            , and
            <given-names>L. Simon.</given-names>
          </string-name>
          <article-title>SomeWhere in the semantic web</article-title>
          . In F. Fages and S. Soliman, editors,
          <source>Principles and Practice of Semantic Web Reasoning</source>
          , Third International Workshop, PPSWR, Lecture Notes in Computer Science, pages
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          . Springer,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>J.</given-names>
            <surname>Broekstra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Ehrig</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Harmelen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Menken</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mika</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Schnizler</surname>
          </string-name>
          , and
          <string-name>
            <given-names>R.</given-names>
            <surname>Siebes</surname>
          </string-name>
          .
          <article-title>Bibster: A semantics-based bibliographic peer-to-peer system</article-title>
          .
          <source>In International Semantic Web Conference</source>
          , pages
          <fpage>122</fpage>
          -
          <lpage>136</lpage>
          . Springer-Verlag,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Dimitrov</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Heflin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Qasem</surname>
          </string-name>
          , and
          <string-name>
            <given-names>N.</given-names>
            <surname>Wang</surname>
          </string-name>
          .
          <article-title>Information integration via an end-toend distributed semantic web system</article-title>
          .
          <source>In 5th International Semantic Web Conference, Lecture Notes in Computer Science</source>
          , pages
          <fpage>764</fpage>
          -
          <lpage>777</lpage>
          . Springer,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>L.</given-names>
            <surname>Ding</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Finin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. S.</given-names>
            <surname>Cost</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Peng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Reddivari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. C.</given-names>
            <surname>Doshi</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Sachs</surname>
          </string-name>
          .
          <article-title>Swoogle: A Search and Metadata Engine for the Semantic Web</article-title>
          .
          <source>In Proceedings of the Thirteenth ACM Conference on Information and Knowledge Management</source>
          . ACM Press,
          <year>November 2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>D.</given-names>
            <surname>Fensel</surname>
          </string-name>
          and
          <string-name>
            <surname>F. van Harmelen</surname>
          </string-name>
          .
          <article-title>Unifying reasoning and search to web scale</article-title>
          .
          <source>IEEE Internet Computing</source>
          ,
          <volume>11</volume>
          (
          <issue>2</issue>
          ):
          <volume>96</volume>
          ,
          <fpage>94</fpage>
          -
          <lpage>95</lpage>
          ,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>B.</given-names>
            <surname>Grosof</surname>
          </string-name>
          ,
          <string-name>
            <surname>I. Horrocks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Volz</surname>
          </string-name>
          , and
          <string-name>
            <given-names>S.</given-names>
            <surname>Decker</surname>
          </string-name>
          .
          <article-title>Description logic programs: Combining logic programs with description logic</article-title>
          .
          <source>In Proceedings of WWW2003</source>
          , Budapest, Hungary, May
          <year>2003</year>
          . World Wide Web Consortium.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>P.</given-names>
            <surname>Haase</surname>
          </string-name>
          and
          <string-name>
            <given-names>B.</given-names>
            <surname>Motik</surname>
          </string-name>
          .
          <article-title>A mapping system for the integration of OWL-DL ontologies</article-title>
          . In A. Hahn,
          <string-name>
            <given-names>S.</given-names>
            <surname>Abels</surname>
          </string-name>
          , and L. Haak, editors,
          <source>Proceedings of the first international ACM workshop on Interoperability of Heterogeneous Information Systems (IHIS'05)</source>
          , pages
          <fpage>9</fpage>
          -
          <lpage>16</lpage>
          . ACM,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>A.</given-names>
            <surname>Halevy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Ives</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Madhavan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Mork</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Suciu</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Tatarinov.</surname>
          </string-name>
          <article-title>The Piazza peer-data management system. Transactions on Knowledge and Data Engineering, Special issue on Peer-data management</article-title>
          , pages
          <fpage>764</fpage>
          -
          <lpage>777</lpage>
          ,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>A.</given-names>
            <surname>Halevy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Ives</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Suciu</surname>
          </string-name>
          ,
          <string-name>
            <surname>and I. Tatarinov.</surname>
          </string-name>
          <article-title>Schema mediation in peer data management systems</article-title>
          .
          <source>In Proc. of ICDE</source>
          ,
          <year>2003</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. I. Horrocks and
          <string-name>
            <given-names>S.</given-names>
            <surname>Tessaris</surname>
          </string-name>
          .
          <article-title>A conjunctive query language for description logic aboxes</article-title>
          .
          <source>In AAAI/IAAI</source>
          , pages
          <fpage>399</fpage>
          -
          <lpage>404</lpage>
          ,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <given-names>R.</given-names>
            <surname>Pottinger</surname>
          </string-name>
          and
          <string-name>
            <surname>A. Halevy.</surname>
          </string-name>
          <article-title>MiniCon: A scalable algorithm for answering queries using views</article-title>
          .
          <source>The VLDB Journal</source>
          ,
          <volume>10</volume>
          (
          <issue>2-3</issue>
          ):
          <fpage>182</fpage>
          -
          <lpage>198</lpage>
          ,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>A.</given-names>
            <surname>Qasem</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. A.</given-names>
            <surname>Dimitrov</surname>
          </string-name>
          , and
          <string-name>
            <given-names>J.</given-names>
            <surname>Heflin</surname>
          </string-name>
          .
          <article-title>An efficient and complete distributed query answering system for semantic web data</article-title>
          .
          <source>Technical report</source>
          , Lehigh University,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <given-names>L.</given-names>
            <surname>Serafini</surname>
          </string-name>
          and
          <string-name>
            <given-names>A.</given-names>
            <surname>Tamilin</surname>
          </string-name>
          .
          <article-title>Distributed instance retrieval in heterogeneous ontologies</article-title>
          .
          <source>In In Proc. of the Second Italian Semantic Web Workshop Semantic Web Applications and Perspectives (SWAP'05)</source>
          ,
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14. H.
          <string-name>
            <surname>Stuckenschmidt</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Vdovjak</surname>
            ,
            <given-names>G.-J.</given-names>
          </string-name>
          <string-name>
            <surname>Houben</surname>
            , and
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Broekstra</surname>
          </string-name>
          .
          <article-title>Index structures and algorithms for querying distributed rdf repositories</article-title>
          .
          <source>In Proceedings of WWW '04</source>
          , pages
          <fpage>631</fpage>
          -
          <lpage>639</lpage>
          , New York, NY, USA,
          <year>2004</year>
          . ACM Press.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>