<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Cognitive Perspective on Emergent Semantics in Collaborative Tagging: The Basic Level Effect</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Tobias Ley</string-name>
          <email>tley@know-center.at</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paul Seitlinger</string-name>
          <email>paulchristian.seitlinger@edu.uni-graz.at</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          ,
          <addr-line>Inffeldgasse 21a, 8010 Graz</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Cognitive Science Section, University of Graz</institution>
          ,
          <addr-line>Universitätsplatz 2, 8010 Graz</addr-line>
          ,
          <country country="AT">Austria</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Researching the emergence of semantics in social systems needs to take into account how users process information in their cognitive system. We report results of an experimental study in which we examined the interaction between individual expertise and the basic level advantage in collaborative tagging. The basic level advantage describes availability in memory of certain preferred levels of taxonomic abstraction when categorizing objects and has been shown to vary with level of expertise. In the study, groups of students tagged internet resources for a 10-week period. We measured the availability of tags in memory with an association test and a relevance rating and found a basic level advantage for tags from more general as opposed to specific levels of the taxonomy. An interaction with expertise also emerged. Contrary to our expectations, groups that spent less time to develop a shared understanding shifted to more specific levels as compared to groups that spent more time on a topic. We attribute this to impaired collaboration in the groups. We discuss implications for personalized tag and resource recommendations.</p>
      </abstract>
      <kwd-group>
        <kwd>Tagging</kwd>
        <kwd>Categorization</kwd>
        <kwd>Personalized Recommendation</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>that was used in our study. Here, users collaboratively tag bookmarks and then use the
tags to build a shared vocabulary and a taxonomic structure.</p>
      <p>
        Our conjecture is that besides an understanding of the social (e.g. [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]) and
pragmatic processes (e.g. 6], it is equally important to understand the underlying cognitive
processes in collaborative tagging for offering effective recommendations. For
example, it has been shown that human categorization processes are highly variable and
adaptive. Categorization does, for instance, vary on the level of specificity depending
on a number of factors. Therefore, our intention with the study reported here is to look
at the temporal dynamics in the collaborative tagging environment both in terms of
the tagging activities and the associated cognitive processes over time. By doing so,
we would like to gain a better understanding of the variability in human
categorization as it can be observed in such an environment, and thereby enhance current
personalized tag recommendation mechanisms provided both in the process of tagging
and in the process of browsing tag clouds and resource collections. This should
enhance the emergence of stable patterns in these environments.
      </p>
      <p>
        The term basic level advantage [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] has been introduced to describe a preferred
level of taxonomic abstraction when classifying objects of the real world (e.g. a
preference for the term “dog” as opposed to “mammal” or “poodle”). In human
communication, the basic level has an important role as it contains categories that are most
easily retrieved from memory and have a high degree of information value in
describing objects. Among many others, an advantage for the basic level has been shown
when people verify the categories of pictures of objects [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], or in a free naming
paradigm [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. While the role of the basic level advantage in collaborative tagging is
often acknowledged [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], surprisingly little empirical research exists to inform
design decisions. In their study of delicious, Golder &amp; Huberman [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] suggest that
popular tags which are introduced very early for a certain bookmark correspond to
categories of the basic level. The authors also find that the tag distribution for a
certain bookmark quickly stabilizes over time suggesting an emerging consensus.
      </p>
      <p>The authors also point to a potential problem with the basic level advantage that
arises with differing levels of expertise. They hypothesize that there should be
systematic variations across individuals of “what constitutes a basic level”. In
collaborative tagging, this basic level variation is a potential drawback. When resources are
described on varying levels of specificity, it makes retrieval of information more
difficult both for experts and for novices. While for the former, the information value
of a basic level category is too low, for the latter the specific categories are not
sufficiently well represented in memory, and, hence, their labels difficult to comprehend.</p>
      <p>
        The hypothesized basic level variation is in line with cognitive research which has
found a basic level shift in various categorization paradigms, such as generating
attributes of category objects, free naming of category labels or verifying category
membership [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Basic level shift for more experienced persons leads to
better availability in memory of category members and their attributes on more
specific levels of the taxonomy. Following sensemaking research, we expect that in a
collaborative tagging environment a growing expertise in the domain can be observed
over time. Therefore, we hypothesize that users will use more specific categories, will
show better availability of these in memory and will ascribe more importance to more
specific categories, when they collaboratively tag for a longer as compared to a
shorter duration of time.
To test this hypothesis, we asked four groups of students to collaboratively collect
bookmarks related to their course subject and describe them with tags. Two of the
groups had to work on a topic for the whole duration of the semester (10 weeks), the
other two groups switched their topic at half time. Our hypothesis was that the long
duration (ld) groups would form a stronger representation in memory of the more
specific tags and that they would rate their relevance higher than the short duration
(sd) groups. Collaborative tagging among the students was realized through the social
bookmarking system SOBOLEO. In SOBOLEO, the tags and the tag taxonomy that is
collaboratively created are shared by all users of the system.
2.1
      </p>
    </sec>
    <sec id="sec-2">
      <title>Participants and Procedure</title>
      <p>The study took place in the context of a university course on cognitive models in
technology enhanced learning at the University of Graz. Subjects (N=25, mean age
M=23.3, SD=1.2) were psychology students participating for course credit. After an
introduction to SOBOLEO, a computer literacy questionnaire and a word association
test eliciting participants’ knowledge about central concepts of the given topics were
administered to the participants. Subjects were then assigned to four groups of 6 or 7
participants which were equivalent according to their scores on the word association
test and computer literacy questionnaire. Each group was provided with their own
SOBOLEO instantiation only accessible by personal usernames and passwords.</p>
      <p>E-mails were then sent out to inform the participants of the topic they had to work
on together with access details for their SOBOLEO environment. Two groups were
asked to research the topic “the use of Wikis in enterprises”, the other two groups “the
use of Weblogs in universities”. They were asked to prepare these topics as if they
were collaboratively working on a report of presentation. Both topics were chosen
because they were related to the course subject and because we expected the
participants to have only little prior knowledge about them.</p>
      <p>During the whole duration of the study (ten weeks) each student was expected to
post two relevant bookmarks per week to the SOBOLEO environment and describe
them with meaningful tags. The students were also required to collaboratively
organize their tag collection with the help of the SOBOLEO taxonomy editor. To facilitate
the emergence of consensus, the students were also encouraged to utilize the
SOBOLEO chat and an external discussion forum.</p>
      <p>After five weeks (at halftime), the SOBOLEO environments of two of the four
groups were cleared. They had to start from scratch and to work on the other topic for
another five weeks, making them the short duration (sd) groups. The other two groups
continued with their prior topic, making them the long duration (ld) groups. Right
before this topic switch, we also controlled for the fact that there still were no
differences between the two conditions in the word association test. At the end of the
semester, the association test and the relevance rating were administered to the 25
students in a group setting using a sample of tags they had created so far.
2.3</p>
    </sec>
    <sec id="sec-3">
      <title>Tag Samples, Tag Specificity and Dependent Measures</title>
      <p>By the end of the 10-week period, the four groups had created N=213 tags from
which n=76 tags were drawn as a sample. To yield the independent variable tag
specificity, tags were drawn from three different levels of the SOBOLEO taxonomies the
students had created: General tags were drawn from the taxonomy levels 1, medium
tags from level 2, and all tags below level 2 were allocated to the specific tags. From
each of the four SOBOLEO environments, 19 tags were randomly drawn: three
general (e.g. “weblogs”, “e-learning by collaborating”), eight medium (e.g. “kinds of
weblogs”, “psychology of weblogs”) and eight specific tags (e.g. “videoblogs”,
“microblogging”). Hence, the entire sample consisted of 76 tags: 12 general, 32 medium
and 32 specific tags.</p>
      <p>As a dependent measure, a relevance rating was collected at the end of the
semester asking subjects to rate each tag sampled from their own SOBOLEO environment
on a five-point Likert scale ranging from strongly relevant to strongly irrelevant for
describing and organizing resources. By averaging the ratings of all group members a
mean relevance rating for each tag was obtained. An association test was also
conducted at the end of the semester. This test elicits implicit knowledge about concepts
underlying verbal representations. Subjects were confronted with tags as stimulus
words and asked to write down all associations coming to their mind. Response time
was confined to 30 seconds. By counting the number of associations, the test informs
about the strength of representation of concepts in memory. Stimulus words were the
same tags used for the relevance rating. Again we averaged the number of
associations of all group members to obtain a mean number of associations for each tag.
3</p>
      <sec id="sec-3-1">
        <title>Results</title>
        <p>groups (M=2.65, SD=0.81; F1,59=13.01, p&lt;.01). The same applies to the relevance
rating (F1,59=9.12, p&lt;.001): the judged relevance of medium and specific tags is higher
in sd groups (M=2.56, SD=0.60) than in ld groups (M=2.22, SD=0.69).</p>
        <p>Results of a post-hoc questionnaire that had been administered to the students at
the end of the semester give insight into these counterintuitive findings. First, all
groups indicated they had been dissatisfied with the communication mechanisms (the
SOBOLEO Chat and discussion forum). Albeit having worked on their topic for a
longer time, groups of the ld condition gave significantly lower ratings when asked
for the understanding of the topic (M=1.67 on a 5-point Likert scale, SD=1.23) than
sd groups (M=2.69, SD=0.75; F1,23=6.44, p&lt;.05). Additionally, ld groups (M=1.92,
SD=1.00) perceived a lower quality of their taxonomy than sd groups (M=2.92,
SD=0.86; F1,23=7.33, p&lt;.05). Free text answers indicate that especially students in ld
groups found it more difficult to collaboratively work on the shared taxonomy in
SOBOLEO and they felt that the exercise had resulted in a chaotic collection of
bookmarks and tags where it was rather difficult to keep an overview.
4</p>
      </sec>
      <sec id="sec-3-2">
        <title>Discussion and Outlook</title>
        <p>We conclude from the study that a strong basic level effect could be observed for an
implicit memory measure (number of associations) as well as an explicit measure
(relevance rating), where for the latter this only showed for one of the groups.
However, our manipulation (duration of engagement with a topic) was obviously not
effective in producing a stronger representation in memory. Quite to the contrary, the fact
that environments of students in sd groups were cleared after half time actually helped
them to build a more effective and shared external knowledge representation. The
negative effect for ld groups was exacerbated by missing effective communication
mechanisms in the SOBOLEO system. Similarly, we assume that it was students from
sd groups that developed a more shared and stronger internal representation. If this
was the case, then there is clear evidence for a shift in the basic level. This already
showed after a comparatively little time (5 weeks), and produced a strong and also
practically significant effect (an increase of 0.64 associations on average).</p>
        <p>
          Results of this study have practical significance for tag and resource
recommendation in collaborative environments (e.g. [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]) as they suggest that effective tag
recommendations need to take tag specificity into account. Experts in a domain would
benefit from more specific tag recommendations or from recommendations of resources
with more specific tag assignments. The study also suggests that temporal dynamics
need to be taken into account where shifts in basic level already take place after a few
weeks of collaboration. Finally, in case tag specificity could be captured, this would
also have implications for user modelling as the level of expertise pertaining to a
certain topic could be derived for any user from his or her tag assignments.
        </p>
        <p>
          A limitation of our results relates to the manual creation of the taxonomy by
students which extends the (normally flat) folksonomy by a taxonomic relation. For our
future work, we plan to draw on statistical approaches, such as [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] who found
different tag similarity measures (tag co-occurrence vs. distributional measures) to
correspond to different taxonomic relationships between tags. Moreover, these results seem
to be moderated by particular behavioural tendencies of users using the tagging
system [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ].
        </p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Braun</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Schmidt</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Walter</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nagypal</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zacharias</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Ontology Maturing: a Collaborative Web 2.0 Approach to Ontology Engineering</article-title>
          .
          <source>In: 16th int'l WWW conference</source>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Carmagnola</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vernero</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Grillo</surname>
            ,
            <given-names>P.:</given-names>
          </string-name>
          <article-title>SoNARS: A Social Networks-Based Algorithm for Social Recommender Systems</article-title>
          .
          <source>In: 17th int'l UMAP conference</source>
          , pp.
          <fpage>223</fpage>
          -
          <lpage>234</lpage>
          (
          <year>2009</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Cattuto</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benz</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hotho</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stumme</surname>
          </string-name>
          , G.:
          <article-title>Semantic Grounding of Tag Relatedness in Social Bookmarking Systems</article-title>
          .
          <source>In: 7th ISWC Conference</source>
          , pp.
          <fpage>615</fpage>
          -
          <lpage>631</lpage>
          (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Fu</surname>
          </string-name>
          , Wai-Tat.:
          <article-title>The microstructures of social tagging: a rational model</article-title>
          .
          <source>In: Proceedings of the ACM 2008 conference on Computer supported cooperative work</source>
          , pp.
          <fpage>229</fpage>
          -
          <lpage>238</lpage>
          (
          <year>2008</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Golder</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huberman</surname>
            ,
            <given-names>B.A.</given-names>
          </string-name>
          <article-title>The Structure of Collaborative Tagging Systems</article-title>
          .
          <source>J. Information Sciences</source>
          <volume>32</volume>
          ,
          <fpage>198</fpage>
          -
          <lpage>208</lpage>
          (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Körner</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Benz</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hotho</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Strohmaier</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stumme</surname>
          </string-name>
          , G.:
          <article-title>Stop Thinking, Start Tagging: Tag Semantics Emerge From Collaborative Verbosity</article-title>
          .
          <source>In: 19th Int'l WWW Conference</source>
          , ACM, New York,
          <year>2010</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Marlow</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Naaman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boyd</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Davis</surname>
          </string-name>
          , M: HT06, tagging paper, taxonomy, Flickr, academic article, to read.
          <source>In Proc. HYPERTEXT '06</source>
          , ACM Press,
          <fpage>31</fpage>
          -
          <lpage>40</lpage>
          (
          <year>2006</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Mika</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Ontologies Are Us: A Unified Model of Social Networks and Semantics</article-title>
          .
          <source>J. Web Semantics: Science, Services and Agents on the World Wide Web 5</source>
          ,
          <fpage>5</fpage>
          -
          <lpage>15</lpage>
          (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Rogers</surname>
          </string-name>
          , T.T.,
          <string-name>
            <surname>Patterson</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          :
          <article-title>Object categorization: reversals and explanations of the basiclevel advantage J</article-title>
          .
          <source>Experimental Psychology: General</source>
          <volume>136</volume>
          ,
          <fpage>451</fpage>
          -
          <lpage>469</lpage>
          (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Rosch</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mervis</surname>
            ,
            <given-names>C.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gray</surname>
            ,
            <given-names>W.D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Johnson</surname>
            ,
            <given-names>D.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Boyes-Braem</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>Basic objects in natural categories</article-title>
          .
          <source>J. Cognitive Psychology</source>
          <volume>8</volume>
          ,
          <fpage>382</fpage>
          -
          <lpage>439</lpage>
          (
          <year>1976</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Specia</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Motta</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          :
          <article-title>Integrating Folksonomies with the Semantic Web</article-title>
          .
          <source>In: 4th ESWC2007 European Semantic Web Conference</source>
          , pp.
          <fpage>624</fpage>
          --
          <lpage>639</lpage>
          . Springer, Heidelberg (
          <year>2007</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Tanaka</surname>
            ,
            <given-names>J.W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taylor</surname>
          </string-name>
          , M.:
          <article-title>Object Categories and Expertise: Is the Basic Level in the Eye of the Beholder?</article-title>
          J.
          <source>Cognitive Psychology</source>
          <volume>23</volume>
          ,
          <fpage>457</fpage>
          --
          <lpage>482</lpage>
          (
          <year>1991</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>