=Paper= {{Paper |id=None |storemode=property |title=Human-machine Collaboration for Enriching Semantic Wikis using Formal Concept Analysis |pdfUrl=https://ceur-ws.org/Vol-632/paper12.pdf |volume=Vol-632 |dblpUrl=https://dblp.org/rec/conf/semwiki/BlanscheSMN10 }} ==Human-machine Collaboration for Enriching Semantic Wikis using Formal Concept Analysis == https://ceur-ws.org/Vol-632/paper12.pdf
      Human-ma hine Collaboration for Enri hing
    Semanti             Wikis using Formal Con ept Analysis


      Alexandre Blans hé, Hala Skaf-Molli, Pas al Molli, and Amedeo Napoli


                                        LORIA
                                     Nan y, Fran e
                             {firstname.lastname}loria.fr



         Abstra t. Semanti wikis are new generation of ollaborative tools.
         They allow to embed semanti annotations in the wiki ontent. These
         annotations allow to better organize and stru ture the wiki ontents. It
         is then possible for users to build knowledge understandable by humans
         and omputers. By this way, ma hines are allowed to produ e or update
         semanti wiki pages as humans an do. In this paper, we propose a new
         smart agent based on Formal Con ept Analysis. This smart agent an
          ompute automati ally ategory trees based on dened semanti proper-
         ties. In order to redu e human-ma hine ollaboration problems, humans
         just validate hanges proposed by the smart agent. A distributed version
         of wiki is used to ensure onsisten y of the ontent during the validation
         pro ess.
         Keywords. Formal Con ept Analysis, Semanti Wiki, Human-Ma hine
         Collaboration


1      Introdu tion
Semanti      wikis are new generation of     ollaborative tools [1,2,3,4℄. They allow
to embed semanti         annotations in the wiki   ontent. These annotations allow to
better organize and stru ture the wiki       ontents. Semanti     wikis allow mass    ol-
laboration for    reating and emerging ontologi al resour es. They guide the users
from informal knowledge        ontained in do uments to more formal stru tures.
      Semanti wikis allow users to build knowledge understandable by humans and
 omputers. By this way, they also allow ma hines to produ e or update semanti
wiki pages as humans       an do. This opens the opportunity to    onsider ma hines as
new member of       ommunities to produ e and maintain knowledge. Consequently,
su h smart agents       an redu e signi antly the overhead of    ommunities in the
pro ess of    ontinuously knowledge building and       orre t humans errors.
      In [5℄, authors   oupled a   ase-based reasoner with a semanti   wiki. The     ase-
based reasoner      an enri h the wikis with new semanti       pages and thus       an be
 onsidered as a smart agent. As pointed out in [5℄, human-ma hine         ollaboration
 an lead to unstable system if not managed. For example, if humans                  hange
the    ategory tree used by the      ase-based reasoner, the   ase-based reasoner     an
produ e in orre t results from the point of view of humans users.
    In this paper, we propose a new smart agent based on Formal Con ept Analy-
sis (FCA) [6℄. This smart agent     an   ompute automati ally       ategory trees based
on dened semanti      properties. By this way, the FCA smart agent leverages hu-
mans from these tasks. In order to redu e human-ma hine              ollaboration prob-
lems, humans just validate      hanges proposed by the FCA smart agent. This is
a hieved using the DSMW [7℄ semanti           mediawiki extension.
    The paper is organized as follows. Se tion 2 introdu es the FCA framework.
Se tion 3 shows how the FCA smart agent is used to enri h the wiki. Se tion
4 details the validation pro ess. The last se tion          on ludes and points future
works.



2    Formal Con ept Analysis
In this paper, we present a smart agent that enri h a wiki based on a lassi ation
method. A tually, any     lassi ation methods might be used. We          hoose Formal
Con ept Analysis (FCA) be ause it extra ts             on epts organized into a latti e,
whi h is interesting for the navigation into the wiki. In this se tion, we briey
introdu e FCA.
    Formal Con ept Analysis [6℄ is a        lassi ation method allowing to build a
 on ept latti e where     on epts are      omposed of an intent, a maximal set of
attributes, and an extent, a maximal set of obje ts sharing the attributes.
    A    ontext K relies on a set of obje ts G, a set of attributes M and a relation
between obje ts of attributes I ⊆ G × M . Considering an obje t g ∈ G and an
attribute m ∈ M , (g, m) ∈ I means that g has the attribute m.
    A    ontext   an be visualized as a binary table. Table 1 shows a (simple) ex-
ample of    ontext about animals. There are ve attributes that des ribe animals.
Animals may have hair, feather, wings. They might breath in air or water. Ob-
je ts are animals: bat, bird,   at and sh. In the table, a     ross in one   ell indi ate
the animal has the     orresponding attribute.
                                         Breathe in water
                                         Breathe in air
                                         Has feather
                                         Has wings
                                         Has hair




                                  Bat Ö Ö Ö
                                  Bird Ö Ö Ö
                                  Cat Ö      Ö
                                  Fish         Ö
                        Table 1. Example of ontext (animals)
    FCA allows to build on epts organized into a latti e. A on ept C1 =
(A1 , B1 ) is dened by an extent A1 (a set of obje ts) and an intent B2 (a set
of attributes that dene the on ept). If C2 = (A2 , B2 ) is a sub on ept of C1
(denoted by C2 ⊑ C1 ), then A2 ⊆ A1 and B1 ⊆ B2 . The top on ept ⊤ ontains
all the obje ts and usually its intent is empty (unless an attribute is present
in ea h obje t). The bottom         on ept ⊥ is dened by all attributes but usually
 ontains no obje ts (unless an obje t has all attributes).
     On gure 1 is shown the        on ept latti e of the    ontext of table 1. On the
graph, every node is a on ept. A link between two nodes indi ates a subsumption
relation (a    on ept is a sub on ept of another       on ept). The intent of a    on ept
is written on a gray ba kground, the extent on white ba kground.




                Fig. 1. Galois latti e based on the    ontext from table 1




3     Wiki Enri hment
3.1 Prin iples
We developed a method that reorganizes the            ategories of the wiki a   ording to
the result of FCA. A new wiki will be      reated with the same pages and properties,
but dierent     ategories, based on the latti e of     on epts.
     The new    ategories will be    reated based on the previous ones, and on seman-
ti   links between pages. Useful      ategories human users did not      reate might be
dis overed. It is even possible to start a wiki without      reating any     ategories but
only semanti     links between pages, and then let the smart agent build the          ate-
gories, based on the semanti        links. The new    ategories fa ilitate the navigation
in the wiki and provide an expli it and       omplete organization of the pages.
       A mapping between original           ategories and latti e    on epts is performed.
Ea h      ategory maps one (and only one)         on ept: the most general     on ept    on-
taining the     ategory in its intent (the attribute        on ept). Ea h     on ept maps
zero, one or several      ategories. If a     on ept maps a single    ategory the   ategory
will be preserved. If a     on ept maps two        ategories or more, it means these      at-
egories are identi al and should be merged (however this              ase is very unlikely).
If a    on ept does not map any         ategory, a new    ategory will be   reated.
   Currently, the enri hment is performed by a Java appli ation that a                ess the
 ontent of the wiki and          reate an enri hed version of it.



3.2 Case study
The method presented in this paper will be illustrated by a wiki                 on erning
a ademi s. Here we present the initial          ontent of the wiki. We have the following
(user-dened)       ategories:

  Category:Professor;
  Category:Topi ;
  Category:Course;
  Category:Level whi h               ontains two sub ategories:      Category:Master 1
       Level and Category:Master 2 Level.
       We also dened two properties:

  Property:isTaughtBy, the domain is a ourse, the range a professor;
  Property:isAbout, the domain is a ourse, the range a topi .
       Finally, we added pages in the wiki:

  Prof. Smith and Prof. Jones in the Professor ategory;
  Artifi ial Intelligen e, Software Engineering and Networks in the
       Topi    ategory;
  Knowledge Dis overy, in the Course and Master 1 Level                         ategories,
       this page has two semanti        links isAbout:Artifi        ial Intelligen e and
       isTaughtBy:Prof. Smith;
  Semanti Wiki, in the Course and Master 2 Level                           ategories,   this
       page   has   two   semanti     links   isAbout:Artifi ial Intelligen e and
       isTaughtBy:Prof. Smith;
  Semanti Web, in the Course, Master 1 Level and Master 2 Level
    ategories, this page has two semanti       links isAbout:Artifi ial
   Intelligen e and isTaughtBy:Prof. Smith;
  Design Patterns, in the Course and Master 1 Level ategories, this
   page has two semanti         links isAbout:Software Engineering and
   isTaughtBy:Prof. Jones;
  Network Administration, in the Course and Master 1 Level ategories,
   this page has two semanti links isAbout:Networks and isTaughtBy:Prof.
   Jones;
  IPv6 Proto ol, in the Course and Master 2 Level ategories, this page
   has two semanti links isAbout:Networks and isTaughtBy:Prof. Jones;
3.3 Formal on ept analysis applied on the wiki
FCA      an be applied on the      ontent of the wiki. Obje ts to be    lassied by the
FCA algorithm are the standard pages of the wiki.
     The des ription of a page is     omposed of two parts: the    ategories it belongs
to and the semanti      properties it has (in our rst prototype, we only        onsidered
wiki properties of type Page). Ea h of these two parts allow to build a          ontext.
We    an   ombine these two       ontext by apposition.
     Based on the     ontent of the wiki, as des ribed above, we            an   reate the
 ontext shown on table 2. When applied to this        ontext, FCA returns the latti e
shown on gure 2.




                          Table 2. Context based on the wiki




                                           isAbout:Software Engineering
                                           isAbout:Arti ial Intelligen e
                                           isTaughtBy:Prof. Smith
                                           isTaughtBy:Prof. Jones


                                           isAbout:Networks
                                           Master 1 Level
                                           Master 2 Level
                                           Professor

                                           Course
                                           Level
                                           Topi




                        Prof. Smith       Ö
                        Prof. Jones       Ö
                   Arti ial Intelligen e   Ö
                         Networks           Ö
                   Software Engineering     Ö
                   Knowledge Dis overy        ÖÖÖ Ö Ö
                      Semanti Web             ÖÖ Ö Ö Ö
                      Semanti Wiki            ÖÖÖÖ Ö Ö
                     Design Patterns          ÖÖÖ   Ö Ö
                       IPv6 Proto ol          ÖÖ Ö Ö    Ö
                  Network Administration      ÖÖÖ   Ö   Ö


     In the   ase study, as one    an see on gure 2, four    on epts mat h one       ate-
gory: Professor, Topi , Master  1 Level, and Master 2 Level. One on ept
mat hes two ategories: Course and Level. All the other on epts do not mat h
any   ategory at all.
     How to     reate the new      ategories depends on the number of            ategories
mat hed by ea h         on ept. Depending on that number dierent methods are
used. However, no       ategories are   reated for the two    on epts ⊤ and ⊥, as ⊤
always     ontains all pages and ⊥ does not     ontain any page.
                 Fig. 2. Galois latti e based on the     ontext from table 2


3.4 Preserving of an original ategory
If a    on ept mat hes one and only one       ategory, this    ategory will simply be pre-
served in the enri hed wiki. This is the       ase of the     ategory Topi , for instan e.
       A tually, in most   ases, all the original   ategories are preserved.



3.5 Category merging
If a     on ept mat hes two        ategories or more, a new      ategory is    reated. This
new      ategory will merge the      ontent of the original mat hing     ategories: text of
ea h pages are       on atenated together. A default title is given to the       ategory.
       Category merging should be rare. It only happens if two or more            ategories
always appear in the exa t same pages. This would happen if several users use
dierent terms for the same         on ept. Bit by bit, after a number of wiki edition,
these dierent       ategories will appear in all the same pages and then will be
merged by the FCA.
       This is the   ase of the two    ategories Course and Level. Having these two
 ategories is due to a naming problem. The enri hed wiki has now only one
 ategory for this      on ept.



3.6 New ategories
If a    on ept mat hes no        ategory, a new one is   reated, with a default title.
      This might happens in two (non-ex lusive)          ases:


  a page belongs to two ategories or more;
  several pages having some identi al properties.
      A   ategory about     ourses on software engineering has been        reated, based
on the semanti       relation in the page Design       Patterns. Also, a   ategory about
 ourses available for both Master 1 and Master 2 students has been               reated,
Semanti Web is a page of this        ategory.



3.7 Category enri hment
Whatever the       reation method of a     ategory, all the new   ategories are enri hed
with new text       ontent, based on properties. Senten es like The pages belonging
to this   ategory seems to have relation T with the page P . would be appended in
the page. This will help human users to understand the meaning of the           ategory.
      For instan e, the    ategory of    ourses about software engineering will      on-
tain the senten e The pages belonging to this           ategory seems to have relation
Property:isAbout with the page Software Engineering., as a des ription of
the     ategory.



4       Validation
4.1 Validation by human users
After the enri hment, new       ategories need to be validated by human users. Some
merged      ategories might be spit, some new           ategories removed. Also, human
users should edit all the      ategories: default titles should be    hanged into more
relevant ones, text should be rened. We will present three examples of valida-
tion.
      The rst one    on erns the two     ategories Course and Level that have been
merged. Having this two ategories was a mistake. Human users will a knowledge
that and rename the merged         ategory Course. They will also rename two of
the sub ategories Master       1 Course and Master 2 Course to make them more
intelligible.
      Another example       on erns a new       ategory that has been      reated based
on the semanti        relation in the page Design       Patterns with a default name
(Category:New Category          42, for instan e). As explained in previously, the new
 ategory will      ontain a text des ribing some properties of the    on ept. A human
user will understand that this       ategory     ontains    ourses about software engi-
neering and will rename it       onsequently. The same thing will be done for the
 ategory about       ourses taught by Prof. Jones.
      The last example     on erns a sub ategory of        Master 1 Course and Prof.
Jones' Course. One might          onsider this    ategory to be irrelevant, or at least
not useful. A human user would de ide to remove this              ategory from the wiki
and update the hierar hal links         onsequently.
4.2 Distributed wiki organization




                      Fig. 3. Man-ma hine    ollaboration pro ess


     In order to ensure     onsisten y of the data, we used a distributed wiki. Two
semanti     mediawiki sites are syn hronized with the DSMW extension
                                                                            1 [7℄ (see
gure 3).


  The rst one is the Semanti Wiki1 wiki. Humans a ess this wiki as usual.
  From this Semanti Wiki1, the FCA smart agent reates the latti e in the
     Semanti Wiki2 site.
  Human users will then he k the ontent of this se ond wiki site, orre t and
     rene the    ontent.
  Next, they an push the ontent of Semanti Wiki2 on a push feed.
  Finally, administrator of Semanti Wiki1 an pull validated modi ations
     from Semanti Wiki2 into Semanti Wiki1.


     This s enario demonstrates how the DSMW extension              an be used to im-
plement pro esses. In this      ase, a simple pro ess allows validation of    hanges
produ ed by the FCA smart agent and avoids the problem of instability of
human-ma hine       ollaboration.



4.3 Enri hed wiki ontent
After validation, here is the       ontent of the enri hed wiki (Semanti Wiki1 in
gure 3) in the    ase study:


  Category:Professor, ontains pages about Prof. Smith and Prof. Jones;
1
    http://dsmw.org
  Category:Topi , ontains pages about Networks, Arti ial Intelligen e and
    Software Engineering;
  Category:Course;
  Category:Master 1 Course, a sub ategory of Category:Course;
  Category:Master 2 Course, a sub ategory of Category:Course;
  Category:Artifi ial Intelligen e Course,      a    sub ategory                  of
    Category:Course, the page indi ates that Prof. Smith is tea hing all
    the   ourses in this    ategory;
  Category:Prof. Jones' Course, a sub ategory of Category:Course;
  Category:Master 1 Artifi ial Intelligen e Course, a sub ategory
   of Category:Master 1 Course and Category:Artifi ial Intelligen e
   Course, ontains the page about Knowledge Dis overy;
  Category:Master 2 Artifi ial Intelligen e Course, a sub ategory
   of Category:Master 2 Course and Category:Artifi ial Intelligen e
   Course, ontains the page about Semanti Wiki;
  Category:Master 1 and 2 Artifi ial Intelligen e Course, a sub-
    ategory of Category:Master 1 Artifi ial Intelligen e Course and
   Category:Master 2 Artifi ial Intelligen e Course,    ontains the
    page about Semanti        Web;
  Category:Networks Course, a sub ategory of Category:Prof. Jones'
    Course;
  Category:Software Engineering Course,                 a        sub ategory      of
    Category:Prof. Jones' Course              and     Category:Master 1 Course,
     ontains the page about Design Patterns;
  Category:Master 1 Networks Course, a sub ategory of Category:Master
    1 Course and Category:Networks Course,            ontains the page about Net-
    work Administration;
  Category:Master 2 Networks Course, a sub ategory of Category:Master
    2 Course and Category:Networks Course,            ontains the page about IPv6
    Proto ol.



5    Con lusion and future work
Semanti      wikis allow users to build knowledge understandable by humans and
 omputers. By this way, they also allow ma hines to produ e or update semanti
wiki pages as humans       an do. This opens the opportunity to   onsider ma hines as
new member of      ommunities to produ e and maintain knowledge. Consequently,
su h smart agents    an redu e signi antly the overhead of      ommunities in the
pro ess of    ontinuously knowledge building and     orre t humans errors.
    In this paper, we proposed a new smart agent based on Formal Con ept
Analysis. This smart agent allows to reorganize the wiki: new           ategories are
 omputed and pages are pla ed into these new         ategories. This allows a better
organization of the    ontent and fa ilitate the navigation in the wiki.
    The refa toring pro ess needs to be validated by human users. Consisten y
of the wiki is ensured by the use of DSMW: a se ond wiki site is used to store
the result of the smart agent and is pulled ba k to the main wiki after human
validation.
    This paper presented an early work, and more resear h have to be done in the
future. Clearly, if applied on a real wiki, a method su h as FCA would produ e a
large amount of    on epts, and it would by impossible for human users to validate
any one of them. Some ltering methods should be used to prevent irrelevant
 ategories to be added, based on the number of instan es in a       ategory or other
 riteria.
    Using Relational Con ept Analysis instead of FCA should provide interesting
results. Other    lustering methods will also be   onsidered.
    In the    urrent version of our method, human users have a feedba k from the
smart agent, they will take into   onsideration the new    ategories that have been
 reated. However, the smart agent does not have a feedba k from the human
users: if a   ategory has been reje ted during the validation pro ess, the smart
agent will    reate it again when the pro ess will be reiterated. To avoid this
problem, the smart agent has to be history-aware and use the information of
the modi ation by human users during the validation pro ess.



6    A knowledgments
This resear h was part of the CyWiki proje t, funded by the Université Henri
Poin aré of Nan y.



Referen es
1. Völkel, M., Krötzs h, M., Vrande i , D., Haller, H., Studer, R.: Semanti wikipedia.
   In: WWW '06: Pro eedings of the 15th international onferen e on World Wide
   Web. (2006) 585594
2. S haert, S.: IkeWiki: A semanti wiki for ollaborative knowledge management. In:
   1st International Workshop on Semanti Te hnologies in Collaborative Appli ations
   (STICA06), Man hester, UK. (2006)
3. Bua, M., Ereteo, G., Faron-Zu ker, C., Gandon, F., Sander, P.: SweetWiki: A
   semanti wiki. Journal of Web Semanti s, spe ial issue on Web 2.0 and the Semanti
   Web 6(1) (2008)
4. Krötzs h, M., Vrande i , D., Völkel, M., Haller, H., Studer, R.: Semanti wikipedia.
   Journal of Web Semanti 5(4) (2007) 251261
5. Cordier, A., Lieber, J., Molli, P., Nauer, E., Skaf-Molli, H., Toussaint, Y.: Wiki-
   taaable: A semanti wiki as a bla kboard for a textual ase-based reasoning system.
   In: 4th Workshop on Semanti Wikis (SemWiki2009), held in the 6th European
   Semanti Web Conferen e. (2009) 1832
6. Ganter, B., Wille, R.: Formal Con ept Analysis, Mathemati al Foundation. Springer
   (1999)
7. Rahhal, C., Skaf-Molli, H., Molli, P., Weiss, S.: Multi-syn hronous ollaborative
   semanti wikis. In: 10th International Conferen e on Web Information Systems-
   Wise 2009. Volume 5802 of Le ture Notes in Computer S ien e. (2009)