<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Semantic Information storage and retrieval in a Peer-to-Peer corporate memory</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Ana B. Rios-Alvarado</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ricardo Marcelin-Jim´enez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>R. Carolina Medina-Ram´ırez</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universidad Aut ́onoma Metropolitana - Iztapalapa M ́exico</institution>
          ,
          <addr-line>DF</addr-line>
        </aff>
      </contrib-group>
      <abstract>
        <p>The paper presents a semantic approach for storing and retrieving documents from a Corporate Semantic Web (CSW). We illustrate the approach through the embedding of two graphs G1 into G2. G1 represents the CSW and whose nodes represent a collection of documents having a common range of semantic indices. G2 represents a P2P storage network. We use ”Ant Colony Optimization” metaheuristic, to solve the corresponding instances of graph embedding.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        The semantic Web approach [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] relies on ontologies, annotations and formal
knowledge representation languages. A ”Corporate Semantic Web (CSW) is
built up from ontologies, resources (documents or humans) and annotations on
these resources, where these annotations rely on the ontologies[
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. There is a
meetpoint between Web and corporate memories: both gather heterogeneous
and distributed information and share the same concern about the relevance of
information retrieval. Nevertheless, corporate memory has a context, an
infrastructure and a scope limited to the organisation where they are applied.
      </p>
      <p>IP routing task, at the Internet, is supported by two complementary
procedures: table maintenance and table querying. In this work, we propose the
organization of document storage and retrieval in a Corporate Semantic Web
(CSW), based on two procedures: First, we solve content location and built a
table, whose entries shows the places in charge of a given set of documents.
Second, we perform look-up on this table in order to consult the corresponding
contents. An ontology can be regarded as a hierarchy of concepts. Each of them
corresponds to a semantic index. Besides, each semantic index has associated
a collection of documents belonging to the CSW. Therefore, we can model a
CSW as a graph G1 (Fig. 1), where each node is featured by two parameters: a
range of semantic indices and a weight. The first one represents the concepts it
gathers according to its place in the hierarchy. The second one, represents the
amount of information given by the collection of documents in the given range.
We model the storage network using a second graph G2. Each of its nodes (from
now on stores) has an associated capacity cj that features the maximal amount
of information it is able to contain.</p>
    </sec>
    <sec id="sec-2">
      <title>Methodology and assumptions</title>
      <p>
        Content placement implies the embedding of G1 into G2. We decided to tackle
our instances of graph embedding using the ant colony optimization algorithm
(ACO)[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Our method consists of creating z scout ants. Every ant is charged to
perform a random depth first search on G1. As each ant travels across the graph,
it associates the nodes that visits to a given store j of G2. When the aggregated
nodes weight exceeds the capacity of the current store, it reassigns the last node
to successor store j+1 and starts this filling process over again, as long as there
are still nodes to visit.
      </p>
      <p>
        When our particular instance of graph embedding is successfully solved, each
store receives a copy of the look-up table. Each row in this table has two parts,
the left entry indicates a range of semantic indices, while the right entry indicates
the store in charge of the documents in this range. Figure 1 shows how G1 has
been embedded into G2 and the Look-up Table. We have used a discrete event
simulator [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], for implementing our algotithm.
      </p>
      <p>1..4 5..7
A
1..4</p>
      <p>C1
5..7</p>
      <p>B
8..11</p>
      <p>We have run our simulation using a variable number z of ants, nodes in G1
have weights following an uniform random distribution, and stores in G2 have a
constant capacity.
3</p>
    </sec>
    <sec id="sec-3">
      <title>Conclusion</title>
      <p>We have presented a semantic approach for storing and retrieving documents
from a Corporate Semantic Web (CSW). We illustrate the approach through the
embedding of two graphs G1 into G2. We have used ”Ant Colony Optimization”,
to solve the corresponding graph embedding.</p>
      <p>From preliminar results, we can say that there is an optimal number of initial
ants producing the highest variance. This optimal depends on the size of G1,
and is roughly O(v(n)), where n is the total number of nodes in G1.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Shadbolt</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Berners-Lee</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hall</surname>
            <given-names>W.:</given-names>
          </string-name>
          <article-title>The semantic Web Revisited</article-title>
          ,
          <source>IEEE intelligent Systems, V21, N3</source>
          , pp
          <fpage>96</fpage>
          -
          <lpage>101</lpage>
          .
          <year>2006</year>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Dieng-Kuntz</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          :Corporate Semantic Webs, ERCIM News No.
          <volume>51</volume>
          ,
          <fpage>pp19</fpage>
          -
          <lpage>21</lpage>
          .
          <year>2002</year>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Dorigo</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <string-name>
            <surname>Optimization</surname>
            , Learning and
            <given-names>Natural</given-names>
          </string-name>
          <string-name>
            <surname>Algorithms</surname>
          </string-name>
          .
          <source>Ph.D. Thesis</source>
          , Dept. of Electronics, Politecnico di Milano.
          <source>Italy</source>
          (
          <year>1992</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. Marcel´ın-Jim´enez, R.:
          <article-title>A Flexible Simulator for Distributed Algorithms</article-title>
          .
          <article-title>Proceedings of the ENC03 by the</article-title>
          IEEE Computer Society Press, (
          <year>2003</year>
          ) pp.
          <fpage>176</fpage>
          -
          <lpage>181</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>