<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Adaptation of Navigation by the Modified Results of Full Scan Algorithm in Adaptive Hypermedia Systems</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Marek Bober</string-name>
          <email>marek.bober@vsb.cz</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Petr Šaloun</string-name>
          <email>petr.saloun@osu.cz</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Informatics and Computers, Faculty of Science, University of Ostrava</institution>
          ,
          <addr-line>30. dubna 22, 703 01 Ostrava</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Dept. of Computer Science, Faculty of Electrical Engineering and Computer Science, VŠB-Technical University of Ostrava</institution>
          ,
          <addr-line>17. listopadu 15/2172, 708 33 Ostrava</addr-line>
          ,
          <country country="CZ">Czech Republic</country>
        </aff>
      </contrib-group>
      <fpage>111</fpage>
      <lpage>119</lpage>
      <abstract>
        <p>Models of behavior mining from logs in adaptive systems are one from many methods how to obtain useful information for personalized navigation. Target of this paper is to describe modified results of full scan algorithm used to creation recommended links to concepts. For creation recommended links is used technique of traversal patterns. Established weights up to concepts and links along with the results of full scan algorithm created relevant recommended links within the scope of all subject matter and within the scope of topic in actual concept. The fundamental part is facilitating orientation for user in hyperspace.</p>
      </abstract>
      <kwd-group>
        <kwd>knowledge discovery</kwd>
        <kwd>adaptive navigation</kwd>
        <kwd>traversal patterns</kwd>
        <kwd>concept recommendation</kwd>
        <kwd>adaptive web-based educational hypermedia</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>content from author,
list of attributes, that describe concepts,
list of rules, (conditions) that determinate format and style of displayed parts
of content concepts.</p>
      <p>Chapters can be referenced to each other as well as concepts. References between
concepts in adaptive hypermedia systems based on WWW technologies are created by
hypertext links. In adaptive systems is mostly adapted content of concept, style and
navigation technique. By the references to other concepts we want to advise to user
which toward a given topic is relevant other topic. Mostly, there has been related
concepts with specific „relevancy“ weight to understanding subject matter to study
concept.</p>
      <p>This paper describes usage of full scan (FS) algorithm for finding recommended
links and new possibilities of modification of FS algorithm results for creating
recommended links, named in adaptive hypermedia systems. The main of target is
notify to user, which has the highest weight (relevancy) of link context to specific
concept and made easier decision for user about relevant concept reference to visit it
or not. Modified results can improve orientation in hyperspace than non-modified
results of FS algorithm. For modification of results we introduce concepts and links
relevancy weight, as you can see in next chapter. These values are necessary for
modification of FS algorithm results.</p>
    </sec>
    <sec id="sec-2">
      <title>2 Problem description</title>
      <p>To notify user about specific link relevancy for him and his study process, is
necessary to determinate not only weight of link but weight of content offered by
other concept. Weight determination is not simple and all weight values should have
been given to every user individually, or if we would like target group of users with
the similar characteristics, for each groups individually. Every user has a different
style of learning and personalization is basically difficult. The next problem is
occasion of author to advise which content of text are more and which less important.
One of the possibilities is non automatic weights setting of concepts by author. Non
automatic weights setting are necessary for creation new concepts. The better
occasion is for example automatic (semi-automatic) setting by the ontology and
modifying by the neural networks. It is subject of next research.</p>
      <p>We can determinate several relevancy weights (WR) for all concepts. For example
we can use percentage valuation by displaying weights in ever concepts. Now is
possible say to user, which is the important study content for him in light of relevancy
in concrete subject, see Figure 1. The traditional case of referencing to several
concepts is self WR concept poor. From the other one point of view can be one
concept more relevant (it has greater valuation) than other concept for some case
based on subject summary matter. There are necessary to determinate WR between
the links of several concepts. If a concept has references at each other then weights
can have same or different valuation in both ways of relevancy view within actual
topic. Relevancy from the one side can be different than from the other side. It is
necessary set WR into links every way.</p>
      <p>Let C denotes concept. Every concept has to be denoted by unique identifier (C1,
C2, C2.1 …). Concept C1 has WR 0.8 (80 %) and referencing to next two concepts
(C2 and C21). Concept denoted as C21 has value 0.4 and concept C2 has value 0.8. In
the case of summary topic we can determinate relevancy of referenced concepts by
the value of weights into hypertext links of referenced concepts. From all of view at
content relevancy in actual concept we merge relevant and local evaluation of several
links, which links starts from concept or pointed to this concept end in concept. For
user, which study selected topic, is value of WR predicative about relevancy. In the
case of summary subject matter is unused. For example concept C21 can be only 20
% relevant in case of topic of concept C1 but in case of summary subject matter can
be 80 % relevant.</p>
      <p>Fig.1. Weights of concepts which determining relevancy of content within the scope
of summary subject matter.</p>
      <p>By the WR links valuation we can determinate, how is close the relation between
referenced concept and actual concept. Determination of WR in first phase (to create
new concepts) of personalization is difficult and requires cooperation of subject
material authors. In more advanced phase value of WR can be modified for example
by the Radial Basis Function from neural network etc. Determination of WR into
links is showed at Figure 2.</p>
      <p>Values wc1, wc2, wc21 are percentage of content relevancy to other concepts in case
of summary subject matter. Values wl2 and wl21 are percentage of related topic
relevancy to other concepts with concept from which is linked to. In this case
concepts cross references cannot use weight of link same for both ways. In opposite
direction relevancy of links can be different, see Figure 3.</p>
      <p>In line with value of WR wl21 topic of concept C2.1 is concerned per 75% of topic
concept C1. But value of WR in the opposite direction wl1 determine related content
C1 only per 15%. Concepts represent vertexes and hypertext links represent angles.
Now they creating oriented graph with valuated angles.</p>
      <p>
        WR of hypertext links and concepts can have relevancy restriction. WR can be
determinate in case of summary subject matter, one chapter, paragraph etc. In
adaptive environment system it can offer related links in ordered list by the value of
weights (link recommendation), for example by the traversal pattern algorithm [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
Recommended (related) links and the most frequently links from all users can be
string together by interconnection with properties of concrete user or user group and
the result is offered to concrete user or group[
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3 Related works</title>
      <p>
        Research in adaptive hypermedia systems is based on collection data, analyzing (data
mining) and their consequent usage for personalization. In systems based on WWW
pages the collection of data proceeds by hide form during user activity in system.
Collected data represents small form patterns complex. Relevant data obtaining
process and their usage for personalization can be divided into basic steps for
knowledge discovery and their later usage [
        <xref ref-type="bibr" rid="ref11 ref7">7, 11</xref>
        ]. The first step is the collection of
patterns that made preprocessing and cleaning. Follow choosing relevant attributes
and transforming data – data mining. In last step data are analyzed, interpreted and
used in work experience – personalization.
      </p>
      <p>
        Target of summary process research is the most effective content and style
adaptation for navigation to concrete user or user group in order to increase of
effectiveness learning and hyperspace orientation. Typical personalization properties
are links visualization, show or hide some text parts and show recommended links too
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. Follow chapters are targeted to show recommended links by modified results of
Full Scan (FS) algorithm of traversal patterns.
      </p>
      <sec id="sec-3-1">
        <title>3.1 Environment for personalization</title>
        <p>
          Every adaptive hypermedia system uses some technique for adaptation. There are two
basic adaptive techniques – content and navigation adaptation. Term adaptation is
substitute by term personalization [
          <xref ref-type="bibr" rid="ref1 ref6">1, 6</xref>
          ].
        </p>
        <p>
          Information from web pages still increases the value of orientation in these pages
with more exacting values. Navigation in WWW pages can be shown better and
allows user better orientation. Navigation and personalization target is to help user in
order to better orientation in hyperspace. There are five different methods of
adaptation: direct navigation, sorted links, hided links, links commentary, and map
adaptation [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ].
        </p>
        <p>
          The most systems that work with adaptive hypermedia are based on model. For
example adaptive hypermedia system AHA! [
          <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
          ] is based on referenced model
AHAM [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]. AHAM model define four basic models [
          <xref ref-type="bibr" rid="ref3 ref4">3, 4</xref>
          ]:
        </p>
        <p>Domain model (DM) describes structure of system content (fragments, pages
and concepts).</p>
        <p>User model (UM) store information about users and their behaviors in
system.</p>
        <p>Adaptation model (AM) is defined as set of rules. These rules define how
adaptation has proceeded.</p>
        <p>Adaptive mechanism provides an adaptation (outgoing from adaptive rules)
and generate pages by using different adaptive techniques.
•
•
•
•
•
•
•
•</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2 Data for knowledge discovery</title>
        <p>
          For adapting content is necessary obtaining information on which they are based
adaptation proceed – is necessary observe user behaviors in system, it is means to
observe his walk across system [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. In our case of content adaptation we display
recommended links within the scope of all subjects matter and within the scope of
actual topic. It is necessary data stored in user log.
        </p>
        <sec id="sec-3-2-1">
          <title>Student Id – student identification</title>
          <p>Date and time – time stamp: date and time by the login/logout of user
to/from system that represent start/end looking time over the concept.
Type of access – attribute can have tree values: access to concept with study
subject matter, access to test, and general access to system.</p>
        </sec>
        <sec id="sec-3-2-2">
          <title>Concept – Concept identification. System Domain model contains basic information, that identify every concepts. These data and user log are used to basic adaptation mechanism.</title>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Algorithms for data mining</title>
      <p>The main source of data for obtaining knowledge is log of user activities. Log
contains all information about user activities in system during current session
(realized by logout a login). Our target is exploring characteristic patterns of user's
navigation.</p>
      <p>
        Because user activities in system looks like transactions in electronic shops, it is
possible to use tree basic techniques for data mining: association rules mining,
sequential patterns mining and traversal patterns mining [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>
        Association rules mining. In the case of electronic shops this technique is used to
search group of similar products based on user knowledge (preferences). In adaptive
systems it is used to search relationships between concepts. This technique is suited to
authors, which produce or modify the course. For example Apriori algorithm
represents association rules mining [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        Sequential patterns mining. Technique is resembled as association rules mining.
This technique calculated only with visited concepts. Sequential patterns apply
oneself to recommended concepts. From the user log we can deduce, which concept is the
most visiting but it is not possible to specify, which concept will be visited in a future.
Technique of sequential patterns make possibility to find concepts, which user would
visit in future – recommended concepts. There is analogy with Apriori algorithm [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        A traversal pattern mining is sometime called as a continuous sequential patterns.
This technique is mostly used for web server log analysis. In adaptive systems is
recommended to user relevant concepts for visiting. Algorithms that represent
traversal patterns are full scan (FS) - selective scan [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>Next parts of document apply oneself to use full scan algorithm and consecutive
modification of results.</p>
      <sec id="sec-4-1">
        <title>4.1 Modified algorithm results for traversal patterns</title>
        <p>
          Specification of FS algorithm outgoes from DHP (direct hashing and pruning)
algorithm [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ]. FS algorithm was used in adaptive hypermedia systems ALEA, see [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ].
Authors compared tree techniques for data mining. The result of this comparing
shows recommended links for users. They start from visited to non-visited concepts
because in tutorial process users very often come back to previously visited concepts.
        </p>
        <p>Our main idea is integrate results of FS algorithm with WR within the scope of all
subject matter, and with WR of links within the scope of actual topic of concept. As a
result are recommended links to concepts in two categories: recommended links
within the scope of all subject matter and recommended links within the scope of
actual topic of concept. Usage of FS algorithm and modified results is shown on
Figure 5.</p>
        <p>Fig. 5. Results of FS algorithm</p>
        <p>Left table in Figure 5 contains session identifier TID, and set of visited concepts.
Results of FS algorithm are tree qualifications for concept recommendation C2, C3
and C5. System can display user links to concepts. But in case of concepts have WR
within the scope of all subject matter; see right table in Figure 5, the result of
algorithm can be modified. From the table we can see, that concepts C2 and C3 have
relevancy within the scope of all subject matter “relatively big”. Concept C5 has
relevancy “relatively small”. We determinate a minimum limit of relevancy within the
scope of all subject matter, denote L and determinate value to 0,35. By applying limit
LA to results of recommended concepts is possible to eliminate concepts, which are
not relevant for link in subject matter.</p>
        <p>In case of results, as shown Figure 5, are relevant recommended concepts only C2
and C3. Concept denoted C5 have not to be displayed. It is possible to use another
technique than displaying and hiding recommended links, for example highlighting of
background color. In the case of concept C5 can be highlighted of other color.</p>
        <p>If there are established WR for links, that represents relevancy within the scope of
topic concept, it is possible to offer user relevant links with relation to given topic.
Let {C2, C3, C5} is result of FS algorithm and structure of connection concepts is in
agreement with Figure 6. Algorithm proceeded from root, where is user occurred in
agreement with Figure 6, on concept C1.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions and future work</title>
      <p>Adaptation of content to several users or user groups by the different determinate
rules for all users individually is very complex problem, because several users (user
groups) have different style of learning. Techniques for data mining based on user
system behaviors helps to characterize specific groups with the same behaviors, and
based on obtained knowledge is possible to apply adaptive mechanisms for concrete
groups.</p>
      <p>
        One of the eventual adaptation techniques is presented ease in hyperspace by the
recommended links to relevant concepts. By using traversal patterns technique along
with value of weights in concepts and links is possible offer to user relevant content
for their next study of content. Modified results of FS algorithm can by tried in
adaptive hypermedia system AHA! [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] and LMS systems called Barborka, that is
develop on VŠB-Technical University of Ostrava, see [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. Modified algorithm for
traversal patterns can be extended by entries of ontology and information about
knowledge of user obtained during their study in adaptive hypermedia system.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Agrawal</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Srikant</surname>
          </string-name>
          , R.:
          <article-title>Fast algorithms for mining association rules</article-title>
          . In J. B.
          <string-name>
            <surname>Bocca</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Jarke</surname>
          </string-name>
          , and C. Zaniolo, editors,
          <source>Proc. of 20th Int. Conf. Very Large Data Bases</source>
          ,
          <string-name>
            <surname>VLDB</surname>
          </string-name>
          , Morgan Kaufmann (
          <year>1994</year>
          ),
          <fpage>487</fpage>
          -
          <lpage>499</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Ayres</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Flannick</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gehrke</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yiu</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          :
          <article-title>Sequential pattern mining using a bitmap representation</article-title>
          .
          <source>In Proc. of the 8th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining</source>
          (
          <year>2002</year>
          ),
          <fpage>429</fpage>
          -
          <lpage>435</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Bieliková</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Creation of content in adaptive hypermedia for e-learning. Tvorba obsahu v adaptívnych hypermédiách pre e-vzdelávanie. [in Slovak] In Technologie pro e-vzdělávání,</article-title>
          <string-name>
            <surname>Prague</surname>
          </string-name>
          (
          <year>2004</year>
          ),
          <fpage>13</fpage>
          -
          <lpage>23</lpage>
          . ISBN 81-01-03167-5.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4. [19]
          <string-name>
            <surname>Bober</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Šaloun</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Velart</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          :
          <article-title>Interactive PDF forms for multichoice testing in AHA!</article-title>
          .
          <source>ICETA</source>
          <year>2005</year>
          ,
          <volume>121</volume>
          -
          <fpage>124</fpage>
          , ISBN 80-8086-016-6
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>De</surname>
            <given-names>Bra</given-names>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            ,
            <surname>Aerts</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Smits</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ,
            <surname>Stash</surname>
          </string-name>
          , N.:
          <source>AHA! Version 2</source>
          .0,
          <string-name>
            <surname>More</surname>
            <given-names>Adaptation</given-names>
          </string-name>
          <article-title>Flexibility for Authors</article-title>
          .
          <source>In Proceedings of the AACE ELearn'2002 conference (2002)</source>
          ,
          <fpage>240</fpage>
          -
          <lpage>246</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Fasuga</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Holub</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Šarmanová</surname>
          </string-name>
          , J.:
          <article-title>Support of learning of technical subjects and their usage system Barborka. Podpora výuky odborných předmětů a jejich aplikace s použitím systému Barborka</article-title>
          . [in Czech]
          <source>In Geometry and Computer Graphics</source>
          <year>2004</year>
          ,
          <volume>25</volume>
          .
          <article-title>konference o geometrii a počítačové grafice</article-title>
          . Ostrava, Czech
          <string-name>
            <surname>Republic</surname>
          </string-name>
          (
          <year>2004</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Fayyad</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Piatetsky-Shapiro</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smyth</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Uthurusamy</surname>
          </string-name>
          , R. E.:
          <article-title>Advances in Knowledge Discovery in Data Mining</article-title>
          . AAAI Press/MIT Press,
          <year>1996</year>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>M. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Park</surname>
            ,
            <given-names>J. S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>P. S.</given-names>
          </string-name>
          :
          <article-title>Data mining for path traversal patterns in a web environment</article-title>
          .
          <source>In 16th Int. Conf. on Distributed Computing Systems</source>
          , pages
          <fpage>385</fpage>
          -
          <lpage>392</lpage>
          ,
          <year>1996</year>
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Krištofič</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bieliková</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Improving adaptation in web-based educational hypermedia by means of knowledge discovery</article-title>
          ,
          <source>In 6th ACM conference on Hypertext and hypermedia</source>
          , Salzburg, Austria (
          <year>2005</year>
          )
          <fpage>184</fpage>
          -
          <lpage>192</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Mobasher</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cooley</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Srivastava</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          :
          <article-title>Automatic personalization based on web usage mining</article-title>
          .
          <source>In Communications of the ACM</source>
          ,
          <volume>142</volume>
          -
          <fpage>151</fpage>
          ,
          <year>2000</year>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Pierrakos</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Paliouras</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Papatheodorou</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spyropoulos</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Web usage mining as a tool for personalization: A survey. User Modeling</article-title>
          and
          <string-name>
            <surname>User-Adapted</surname>
            <given-names>Interaction</given-names>
          </string-name>
          ,
          <fpage>311</fpage>
          -
          <lpage>372</lpage>
          ,
          <year>2003</year>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>