<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Contribution to the Classi cation of Web of Data based on Formal Concept Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Justine Reynaud</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Yannick Toussaint</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Amedeo Napoli</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>INRIA Nancy-Grand Est, 54600 Villers-les-Nancy</institution>
          ,
          <addr-line>F-54600</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Universite de Lorraine, LORIA, UMR 7503, Vandoeuvre-les-Nancy</institution>
          ,
          <addr-line>F-54506</addr-line>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>During the last decade, the web has taken a huge importance in everyday life, and has become what is commonly called a web of data. The available resources can be used by human agents but also by software agents, as it is the case for very large ontologies such as YAGO or resources such as DBpedia. These particular datasets can be linked together for constituting the Linked Open Data (LOD) cloud, where basic data are expressed as (subject, predicate, object) triples. One issue of main interest is knowledge discovery within LOD, which can help information retrieval and knowledge engineering. Formal concept analysis (FCA), which is a mathematical theory allowing classi cation and data analysis, was already used to classify LOD elements. In this research work, we are interested in analyzing the di erent approaches (extensions) based on FCA for knowledge discovery in the web of data. One objective is to study the e ciency and the applicability of the existing approaches and to propose some improvements.</p>
      </abstract>
      <kwd-group>
        <kwd>Formal Concept Analysis</kwd>
        <kwd>Pattern Structures</kwd>
        <kwd>Triadic Concept Analysis</kwd>
        <kwd>Linked Open Data</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        In this paper, we would like to extend and complete a previous work on the
classi cation of Linked Open Data (LOD) [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. The basic unit of LOD is the
RDF triple which is composed of three elements, namely a subject, a predicate
and an object, i.e. (subject, predicate, object). It can be noticed that subjects,
predicates and objects can be organized into a partial ordering depending on a
speci c schema (i.e. RDFS) or an ontology (e.g. YAGO or DBpedia Ontology).
      </p>
      <p>
        The classi cation of LOD should take into account the RDF triple as a basic
unit and this can be done in several ways. We distinguish ways that relies on a
\triples" view of LOD and ways that relies on a \graph" view. Following the lines
of [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], we consider an approach which considers triples and which is based on
pattern strucures [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. We complete this approach by integrating the organization
of predicates in the classi cation of RDF triples.
      </p>
      <p>
        In [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], it was shown how the classi cation of RDF triples amounts to
classify pairs of objects and attributes {as in a binary context{ where attributes
are partially ordered. Actually, given a subject which corresponds to an object
in a binary context, predicates are considered \one by one" and attributes are
viewed as \ranges" of the predicates. Attributes are organized within a partial
ordering and there are two main ways of dealing with this order. The rst one
is to consider a \scaling" as in [
        <xref ref-type="bibr" rid="ref3 ref4">3,4</xref>
        ] where the description of an attribute
correspond to the set of its ancestors in the attribute hierarchy, and the similarity
between two attributes is given by the minimal elements in the intersection of
their descriptions In the second way, it can be shown that pattern structures
for structured sets of attributes are very well adapted to solve the problem of
classifying RDF triples for analyzing the content of LOD. As it was precised
in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], it can be considered that the description of an attribute is an antichain
and the similarity is given by the intersection of antichains (which is actually an
alternative way of handling the scaling introduced just above).
      </p>
      <p>
        Both approaches are discussed in [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], where in addition a speci c procedure
based on RMQ is used for e ciently computing the intersection of antichains. In
the present work we add the \predicate dimension" within RDF triples and we
propose rst elements for extending our preceding approach by considering the
predicate classi cation. The basics of the approach are detailed but for the
moment no experiments are proposed, which is planned in a future work. However,
we show that this new proposal is sound and formally consistent.
      </p>
      <p>The paper is organized as follows. First, we present some preliminaries about
the web of data and LOD. Then, we recall and discuss the previous approaches for
classifying RDF triples or RDF graphs. Finally, we detail and illustrate our new
proposal, and prove a main proposition on the similarity of object descriptions.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Web of data</title>
      <p>Basically, the web of data consists of resources and relations between those
resources. It can be represented as a graph where nodes are resources and edges
are relations.</p>
      <p>
        The core of the web of data is the RDF (Resource Description Framework)
language, based on graph model where basic units are (subject, predicate, object)
triples. These triples, also called RDF statements, describe facts [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
De nition 1 (RDF Triple). Given a set of URIs U, blank nodes B and literals
L, an RDF triple is represented as t = (s,p,o) 2 (U [ B) U (U [ B [ L),
where:
{ s is called subject, p predicate and o object;
{ U is the set of all resources identi ed by a URI (Universal Resource
Identier);
{ B is the set of resources that are unidenti ed (called blank node);
{ L is the set of literals, which are values like strings, dates or integers.
As its name suggests, RDF allows one to describe resources. Resources can refer
to any object or thing. For convenience, the terms borrowed from description
logics can be used to distinguish di erent types of resources:
Properties: express binary relationships between any two entities;
Classes: represent sets of entities;
Instances: entities that belong to a class;
Variables: unidenti ed resources (i.e. blank nodes).
      </p>
      <p>RDF has some speci c vocabulary. For example, the property rdf:type enables
to declare an instance as belonging to a class. This expression has two parts,
separated by a colon. The second part identi es the speci c resource of this
vocabulary, here \type". The rst part, also called pre x, is an abbreviation
for \http://www.w3.org/1999/02/22-rdf-syntax-ns#", the namespace which
refers to the RDF vocabulary. Each vocabulary has its namespace.</p>
      <p>In order to give RDF some structure, schemas are used. Schemas describe
constraints on facts. They correspond to the TBox in description logics terms.
Each vocabulary may have its own schema, that is a structure between its
entities, called its reference schema. The structure of RDF triples is based on RDFS
(RDF Schema), bringing additionnal properties such as rdfs:subPropertyOf and
rdfs:subClassOf. These two additionnal properties enable to build a hierarchy
of relations one the one hand and classes on the other hand. Afterwards, these
properties will be denoted subP and subC, respectively. Instances can be linked
to the hierarchy of classes, but they are not part of it, since they are related to
a class with rdf:type. Thus, in the following, a hierarchy will refer to an
orderred set of classes, and instances will be attached to their class (for simplicity,
a unique class per instance is considered here).</p>
      <p>The language SPARQL o ers to run queries on the web of data. A query is
composed of RDF triples containing variables. For example, the query SELECT ?x
WHERE f?x rdf:type Cg returns all the instances of C.</p>
      <p>
        A toy knowledge base, freely inspired by the example of S. Ferre in [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ], is
presented in Figure 1. The Figures 1b and 1d correspond to a set of RDF
statements and the associated graph respectively. The Figures 1c and 1a represent
the background knowledge. Instances of the knowledge base are considered and
linked to the class hierarchy.
      </p>
      <p>Reference schemas. Resulting from the linked open data, each data set
is connected to others. Thus, hierarchies are not guarantee to be a tree {
instances belonging to classes of their schema and to classes of another schemata
for example.</p>
      <p>In this work, we consider that all the classes on one hand and all the
properties on the other hand belong to the same reference schema; that is, have the
same namespace. We will also assume that, for any reference schema, there are
neither subP nor subC cycle.</p>
      <p>Another concern is that, the tree structure is not guaranteed, even if there is
no cycle. If C1 and C2 are two incomparable classes, both subclasses of C3 and
C4 that are also incomparable, then, we lost the tree structure and we have a
con ict. Here, we suppose that we know how to linearize the hierarchy of each
class as the linearization of a product of two partial orderings.</p>
      <p>Place</p>
      <p>PopulatedPlace
City</p>
      <p>State</p>
      <p>Country
Capital
Honolulu Hope Hawaii Arkansas United States Barack Obama Bill Clinton Michelle Obama
&gt;
(a) subC relation</p>
      <p>&gt;
Politician
President</p>
      <p>Person</p>
      <p>Award
PublicFigure
FirstLady</p>
      <p>Barack Obama president United States .</p>
      <p>Bill Clinton president United States .</p>
      <p>NobelPrize BBairlalcCkliOnbtaomna bbiirrtthhPPllaaccee HHoopneol.ulu .</p>
      <p>Honolulu capital Hawaii .</p>
      <p>Hope city Honolulu .</p>
      <p>PeaceNobelPrize Arkansas country United States .</p>
      <p>Hawaii country United States .</p>
      <p>Barack Obama spouse Michelle Obama .</p>
      <p>Michelle Obama office First Lady of the United States .</p>
      <p>Michelle Obama spouse Barack Obama .</p>
      <p>Barack Obama award x .
x type PeaceNobelPrize .
x year 2009 .
(b) Knowledge base as RDF triples
hasLocation o ce
associatedWith</p>
      <p>year
birthPlace country
sameSettingAs coParticipatesWith
leader spouse</p>
      <p>award
president
(c) subP relation</p>
      <p>United States
Bil Clinton
birthPlace
president country country</p>
      <p>president
Arkansas</p>
      <p>Hawai
cityOf
Hope
capital
Honolulu</p>
      <p>First Lady of the United States</p>
      <p>office</p>
      <p>Michele Obama
spouse spouse</p>
      <p>Barack Obama
birthPlace award type
2009 year x</p>
      <p>PeaceNobelPrize
(d) Knowledge base as graph</p>
    </sec>
    <sec id="sec-3">
      <title>3 Formal Concept Analysis</title>
      <p>
        Formal Concept Analysis (FCA) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] is a mathematical framework used for
classi cation and knowledge discovery. As a learning process, FCA allows one to
build an ordered set of concepts where objects are classi ed w.r.t. the attributes
that they share.
      </p>
      <p>A lot of extensions have been proposed, and some of them can be usefull to
deal with WOD. Two approaches are possible: the rst consists in considering the
RDF graph itself whereas the second consists in considering the RDF statements.</p>
      <p>Classifying the web of data enables to nd inconsistencies resulting from
the merging of di erent data sets. It is also a way to discover relationships or
implications that are not explicit in any single data set. Moreover, the visual
support given by the lattice allows users to easily navigate through hierarchy for
exploratory research.
3.1</p>
      <sec id="sec-3-1">
        <title>WOD as a set of statements</title>
        <p>
          The rst approach consider WOD as a set of RDF triples. This approach is related
to the works of [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ], [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ], [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] and [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
        </p>
        <p>
          Classi cation w.r.t. background knowledge The WOD can be classi ed
through RDF triples. However, in order to be interesting enough, this approach
has to take into account background knowledge such as the classes and properties
hierarchies. This problem has been developed in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. The authors consider a set
of documents as objects and a set of terms as attributes. Terms belong to a
thesaurus and they are organized in a tree structure. In order to build the lattice,
they consider M the set of terms in the thesaurus, and de ne an order 6M ,
meaning that \any attribute implies any of its more general attributes." For
example, if the term indexing is more speci c than information-analysis in the
thesaurus and if it is an attribute of a document d, then information-analysis is
also an attribute of d. Then, the authors de ne the intersection \ of two intents
m1 and m2 as \the most speci c attributes in M that are more general than m1
and m2." With this new operator, they can build concept lattices taking into
account background knowledge. In the following, we will present an extension of
this approach.
        </p>
        <p>
          Triadic Concept Analysis The triadic concept analysis (TCA) [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ] considers
an additionnal dimension (the modus). This implies a ternary relation Y
G M B which can be interpreted as \the object g takes the value b under
the condition m." The corresponding Galois connections are more complex: they
are three and each one corresponds to a dimension expressed in terms of the two
others.
        </p>
        <p>
          Mining triconcepts. TCA is used to describe folksonomies, i.e.
communities of users U who can annotate resources R with keywords (tags) T . An example
of taxonomy is Bibsonomy, a tool allowing users to manage publications and to
tag them. Here a concept would be a group of users tagging a set of resources
with identical keywords The algorithm Trias [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] has been proposed to mine
tri-concepts. It is based on a projection of the triadic context (U; T; R; Y ) onto a
dyadic context (U; (T R); I). For each concept (A; J ) found in (U; (T R); I)
context, a new dyadic context (T; R; J ) is built. For each concept (B; C) found,
(A; B; C) is a concept of (U; T; R; Y ).
        </p>
        <p>
          Finding biclusters. In [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], the authors aim to nd biclusters in a numerical
dataset. The context is similar to a standard formal context, but instead of a
binary context, a context with numerical values is considered. In order to have a
triadic context, objects and attributes remain the same and the numerical values
are transformed into a third dimension by means of an interordinal scaling. A
threshold is provided and the new dimension is made up of intervals whose
range sizes are less or equal to .
        </p>
        <p>
          Limitations. Given that TCA can handle three dimensions, it could be
interesting for WOD. However, a simple approach does not take into account
the background knowledge. Thus, taking into account the hierarchies implies a
scaling. This implies to add all classes in the dimension of subjects and objects,
and all properties in the dimension of predicates. This approach works well in the
case of biclustering, when a threshold is given, limiting the number of intervals
to consider. By contrast, with WOD, such a threshold can hardly be considered.
Moreover, having this constraint does not guarantee the scalability.
Pattern structures Pattern structures (PS) [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] are proposed in order to
consider data which are not binary data. Attributes then become descriptions which
are partially ordered thanks to a similarity operation. Formally, a pattern
structure is de ned as follows:
        </p>
        <sec id="sec-3-1-1">
          <title>De nition 2 (Pattern structure). Let G be a set of objects, (D; u) a semi</title>
          <p>lattice and : G ! D a mapping. Then (G; (D; u); ) is called a pattern
structure. The new Galois connections are the following:
{ A
{ d
= dg2A (g) for A</p>
          <p>G
= fg 2 G j d v (g)g for d 2 D</p>
          <p>
            Pattern structures are used on triples in [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ]. In this work, the authors aim
to provide a navigation space over RDF resources. The underlying idea is that,
when a resource is a subject in a triple, we can consider that the pair (p,o) is
describing this resource. Thus, two resources can be compared regarding how
they are described by the triples in which they are subjects. Moreover, there is
background knowledge related to objects provided by the properties rdf:type
and subC. Considering a set of triples B = (si; pj ; ok) and a hierarchy of classes,
this idea can be materialized in term of formal concepts. To that purpose, a
pattern structure (G; (D; u); ) is constructed as described below.
          </p>
          <p>Entities3 and their descriptions.</p>
          <p>The set B of RDF statements is built with a SPARQL query on a speci c
namespace. The associated reference schema is used to construct the hierarchy
of classes from this namespace.</p>
          <p>First, resources that are subjects of at least one triple in B are considered
as entities of the pattern structure: G = fs1; : : : sng. They will be compared
regarding the objects they share for each predicates. Objects can be instances or
classes, but here only classes are considered. Indeed, the hierarchy between the
objects is based on the property rdfs:subClassOf, but instances are linked to
classes with the relation rdf:type. In order to maintain the consistency, objects
that are instances are replaced by the class they belong to.</p>
          <p>Each pair (p; o) such that (s; p; o) 2 B is mapped to a description d 2 D.
A description d 2 D is a pair (pi; Oi) where pi is a predicate and Oi is a set
of objects in the range of pi. Given an entity s 2 G, its description is de ned
as follow: (s) = fds1; : : : ; dsng where djs = (pj ; Ojs) with j = f1; : : : ; ng. Each
3 The term object is ambiguous: it denotes both the rst part of a pattern structure
and the last element of an RDF triple.The term entity will be used to denote objects
of the pattern structure. The term object remains for the triples.
elementary description (pi; oj ) is replaced with (pi; C(oj )) where C(oj ) is the
class of oj in the reference schema.</p>
          <p>Given two descriptions (x) = fd1x; : : : ; dxng and (y) = fd1y; : : : ; dymg, we have
(x) v (y) if 8dix 2 (x); 9djy 2 (y) s.t. i = j and 8oix 2 Oix; 9ojy 2 Ojy s.t.
C(oix) subC C(ojy).</p>
          <p>Similarity between descriptions. The hierarchy of classes from the
reference schema is considered. Thus, the similarity between two objects is de ned
as follows:</p>
          <p>=min
(x) u (y) =fd1x; : : : ; dxng u fd1y; : : : ; dymg
[</p>
          <p>fdixg u fdjyg
i2f1;:::;ng
j2f1;:::;mg
fdixg u fdjyg =
where min returns the minimal elements
(lcs(C(oix); C(ojy)) if i = j</p>
          <p>
            else
;
where lcs returns the most speci c superclass
This de nition is close to the \ used in [
            <xref ref-type="bibr" rid="ref4">4</xref>
            ]. The u operation between two
descriptions corresponds to nding the most speci c attribute in M . Considering
only the minimal of the union corresponds to \retaining only the most speci c
elements of the set generated this way ".
          </p>
          <p>
            Limitations. This work introduces a method to mine triples with pattern
structures as in [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ]. The main limitation is that, the hierarchy of predicates is
not considered.
3.2
          </p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>WOD as a graph</title>
        <p>
          Instead of considering RDF triples, it is possible to consider the associated graph.
This is what is done in some approaches like [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] and [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ].
        </p>
        <p>
          Pattern structures for graphs In [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], pattern structures are used to classify
graphs. Each graph is considered as an object and the set of all its subgraphs is
considered as the description. Thus, the similarity between two graphs is the set
af all the subgraphs they have in common.
        </p>
        <p>As this approach is expensive, graphs can be simpli ed by the mean of
projections. That is, instead of considering all the subgraphs of a graph, the
description is something simpler like the set of chains of a certain size that compose the
graph.</p>
        <p>
          Concept lattices of conceptual graphs In [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], an extension to FCA for
conceptual graphs, called G-FCA, is proposed. Compared to RDF graphs,
conceptual graphs (CG) are oriented bipartite graphs. The two kinds of nodes are
classes in one hand and relations in the other hand. Contrary to RDF graphs
which consider only binary relations, CGs handle n-ary relations. Projected graph
patterns are introduced as concepts. It is similar to a SPARQL query where the
graph query is the intent and the candidate solutions are the extent.
Example 1. Given the Figure 1, we have the concept (f(Hope,Arkansas),
(Honolulu,Hawaii)g, f(?p, birthPlace, ?x), (?x, city, ?y)g). Note that, the syntax is
di erent from [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], since the example is adapted to RDF. This concept is associated
to the query SELECT ?x ?y WHERE f ?p birthPlace ?x . ?x city ?y.g
Concept lattices of RDF graphs In [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ], in addition to the RDF graph, a
formal context corresponding to the background knowledge is considered. This
context has a set of resources that are objects and attributes at the same time.
Considering one resource as object, its attributes are the set of resources that
are \more general" regarding subC and subP properties.
        </p>
        <p>Example 2. Given the knowledge base Figure 1, a part of the formal context
could be the following:</p>
        <p>City Capital president sameSettingAs Honolulu Hawaii</p>
        <p>City
Capital
president
sameSettingAs</p>
        <p>Honolulu
Hawaii</p>
        <p>The extent of the pattern is a set of resources whereas the intent is a triple
graph. A triple graph is basically a subgraph and some background knowledge.
A morphism between two triples graphs is de ned and corresponds to an order
on intents. Moreover, a product between triple graphs is de ned such that the
join of two concepts (i.e. triple graphs) corresponds to their product.
Example 3. Given the knowledge context and the graph pattern corresponding
to the triple (Honolulu,capital,Hawaii), the graph corresponding to the triple
(Honolulu,city,Hawaii) is more general.
4</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Taking into account the three parts of the triple</title>
      <p>
        In this section, we propose a generalization of [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] : instead of considering
subjects described by (predicate,object) pairs, we consider the entire triple as a
description.
      </p>
      <p>Entities and descriptions We suppose that each triple has a unique identi er,
like a transaction id. These identi ers are the entities of the pattern structure :
G = ft1; : : : ; tng. The description of an entity is a mapping to the triple itself,
where instances are replaced by the class they belong. As to not complefy the
notation, C(s) and C(o) will be written s and o.</p>
      <p>Order on descriptions Given the descriptions (ti) = (si; pi; oi) and (tj ) =
(sj ; pj ; oj ), the partial order on descriptions is de ned as follow:
(si; pi; oi) v (sj ; pj ; oj ) , sj subC si; pj subP pi; oj subC oi
Similarity between descriptions The similarity between two triples ti and
tj is de ned as follow:</p>
      <p>(ti) u (tj ) = (lcsc(si; sj ); lcsp(pi; pj ); lcsc(oi; oj ))</p>
      <sec id="sec-4-1">
        <title>Proposition 1. (ti) v (tj ) ,</title>
        <p>(ti) u (tj ) = (ti).</p>
        <p>Proof.</p>
        <p>(ti) v (tj ) , sj subC si; pj subP pi; oj subC oi
, lcsc(si; sj ) = si; lcsp(pi; pj ) = pi; lcsc(oi; oj ) = oi
, (lcsc(si; sj ); lcsp(pi; pj ); lcsc(oi; oj )) = (si; pi; oi)
, (si; pi; oi) u (sj ; pj ; oj ) = (si; pi; oi)
, (ti) u (tj ) = (ti)
Similarity between set of triples The similarity of two triples can be
generalized to set of triples. The description of a set of triple T is the set of descriptions
of each of its triples : (T ) = f (t) j t 2 T g.</p>
        <p>Given two sets of triples T1 and T2, the similarity T1 u T2 is the set of
minimal triples (ti) u (tj ) for all ti in T1 and for all tj in T2 given the order
on descriptions.
5</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusion</title>
      <p>In this paper we discussed some approaches for dealing with the classi cation of
web of data, and more precisely of sets of RDF triples, i.e. (subject, predicate,
object). Actually, this kind of classi cation process is based on the classi cation
of pairs (subject, attribute) which simulate the RDF triples and where attributes
correspond to object triples and are structured within a hierarchy. Here, we
proposed an extension to a previous work which takes into account the classi cation
of predicates which was not the case before. We gave a formal presentation of
this proposal and for future work we are planning to make a series of experiments
which should (hopefully) validate the current approach.</p>
    </sec>
    <sec id="sec-6">
      <title>Acknowledgments</title>
      <p>Justine Reynaud is preparing her PhD Thesis with the support of \Region
Lorraine" and \Delegation Generale de l'Armement".</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Serge</given-names>
            <surname>Abiteboul</surname>
          </string-name>
          , Ioana Gabriela Manolescu Goujot, and
          <string-name>
            <given-names>Philippe</given-names>
            <surname>Rigaux</surname>
          </string-name>
          .
          <article-title>Web data management</article-title>
          . Cambridge University Press, New York,
          <year>2012</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <given-names>Mehwish</given-names>
            <surname>Alam</surname>
          </string-name>
          , Aleksey Buzmakov, Amedeo Napoli, and
          <string-name>
            <given-names>Alibek</given-names>
            <surname>Sailanbayev</surname>
          </string-name>
          .
          <article-title>Revisiting Pattern Structures for Structured Attribute Sets</article-title>
          . In Sadok Ben Yahia and Jan Konecny, editors,
          <source>The Twelth International Conference on Concept Lattices and their Applications { CLA</source>
          <year>2015</year>
          ,
          <article-title>Clermont-</article-title>
          <string-name>
            <surname>Ferrand</surname>
          </string-name>
          , France, pages
          <volume>241</volume>
          {
          <fpage>252</fpage>
          .
          <string-name>
            <given-names>LIMOS</given-names>
            <surname>Clermont-Ferrand</surname>
          </string-name>
          ,
          <year>2015</year>
          . CEUR Workshop Proceedings 1466 http: //ceur-ws.
          <source>org/</source>
          Vol-
          <volume>1466</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <given-names>Claudio</given-names>
            <surname>Carpineto</surname>
          </string-name>
          and
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Romano</surname>
          </string-name>
          .
          <article-title>A lattice conceptual clustering system and its application to browsing retrieval</article-title>
          .
          <source>Machine Learning</source>
          ,
          <volume>24</volume>
          (
          <issue>2</issue>
          ):
          <volume>95</volume>
          {
          <fpage>122</fpage>
          ,
          <year>1996</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Claudio</given-names>
            <surname>Carpineto</surname>
          </string-name>
          and
          <string-name>
            <given-names>Giovanni</given-names>
            <surname>Romano</surname>
          </string-name>
          .
          <source>Concept Data Analysis: Theory and Applications</source>
          . John Wiley &amp; Sons, Chichester, UK,
          <year>2004</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Sebastien</given-names>
            <surname>Ferre</surname>
          </string-name>
          .
          <article-title>A Proposal for Extending Formal Concept Analysis to Knowledge Graphs</article-title>
          .
          <source>In Formal Concept Analysis, volume LNCS 9113</source>
          , pages
          <fpage>271</fpage>
          {
          <fpage>286</fpage>
          ,
          <string-name>
            <surname>Nerja</surname>
          </string-name>
          , Spain,
          <year>June 2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Bernhard</given-names>
            <surname>Ganter</surname>
          </string-name>
          and
          <string-name>
            <given-names>Sergei O.</given-names>
            <surname>Kuznetsov</surname>
          </string-name>
          .
          <article-title>Pattern structures and their projections</article-title>
          .
          <source>In ICCS, LNCS 2120</source>
          , pages
          <fpage>129</fpage>
          {
          <fpage>142</fpage>
          . Springer,
          <year>2001</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Bernhard</given-names>
            <surname>Ganter</surname>
          </string-name>
          and
          <string-name>
            <given-names>Rudolf</given-names>
            <surname>Wille</surname>
          </string-name>
          .
          <source>Formal concept analysis - mathematical foundations</source>
          . Springer,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8. Robert Jaschke, Andreas Hotho, Christoph Schmitz, Bernhard Ganter, and
          <string-name>
            <given-names>Gerd</given-names>
            <surname>Stumme</surname>
          </string-name>
          .
          <article-title>TRIAS - an algorithm for mining iceberg tri-lattices</article-title>
          .
          <source>In Proceedings of the 6th IEEE International Conference on Data Mining (ICDM</source>
          <year>2006</year>
          ),
          <fpage>18</fpage>
          -
          <lpage>22</lpage>
          December 2006,
          <string-name>
            <given-names>Hong</given-names>
            <surname>Kong</surname>
          </string-name>
          , China, pages
          <volume>907</volume>
          {
          <fpage>911</fpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <given-names>Mehdi</given-names>
            <surname>Kaytoue</surname>
          </string-name>
          ,
          <string-name>
            <surname>Sergei O. Kuznetsov</surname>
            , Juraj Macko, and
            <given-names>Amedeo</given-names>
          </string-name>
          <string-name>
            <surname>Napoli</surname>
          </string-name>
          .
          <article-title>Biclustering meets triadic concept analysis</article-title>
          .
          <source>Annals of Mathematics and Arti cial Intelligence</source>
          ,
          <volume>70</volume>
          (
          <issue>1-2</issue>
          ):
          <volume>55</volume>
          {79, feb
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Jens</surname>
          </string-name>
          <article-title>Kotters. Concept lattices of rdf graphs</article-title>
          .
          <source>Formal Concept Analysis and Applications FCA&amp;A</source>
          <year>2015</year>
          , page
          <volume>81</volume>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Sergei</surname>
            <given-names>O.</given-names>
          </string-name>
          <string-name>
            <surname>Kuznetsov</surname>
          </string-name>
          .
          <source>Computing Graph-Based Lattices from Smallest Projections</source>
          , pages
          <volume>35</volume>
          {
          <fpage>47</fpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg,
          <year>2011</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>Fritz</given-names>
            <surname>Lehmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>Rudolf</given-names>
            <surname>Wille</surname>
          </string-name>
          .
          <article-title>A triadic approach to formal concept analysis</article-title>
          , pages
          <volume>32</volume>
          {
          <fpage>43</fpage>
          . Springer Berlin Heidelberg, Berlin, Heidelberg,
          <year>1995</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>