<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Organizing Publications and Bookmarks in BibSonomy</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Robert J a¨schke</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andreas Hotho</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Christoph Schmitz</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Beate Krause</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Gerd Stumme</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Knowledge &amp; Data Engineering Group, Dept. of Mathematics and Computer Science Univ. of Kassel</institution>
          ,
          <addr-line>Wilhelmsh o ̈her Allee 73, D-34121, Kassel</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Miranda Grahl</institution>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Research Center L3S, Univ. of Hannover</institution>
          ,
          <addr-line>Appelstr. 9a, D-30167 Hannover</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2007</year>
      </pub-date>
      <fpage>8</fpage>
      <lpage>12</lpage>
      <abstract>
        <p>BibSonomy is a web-based social resource sharing system which allows users to organise and share bookmarks and publications in a collaborative manner. Apart from standard folksonomy features such as an intuitive user interface, navigation along all dimensions, or browser integration via RSS feeds, BibSonomy provides tag hierarchies, group management and privacy features, and numerous import and export functions.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>SHARING BOOKMARKS AND PUBLI</title>
    </sec>
    <sec id="sec-2">
      <title>CATIONS WITH BibSonomy</title>
      <p>Social resource sharing systems are web-based systems used to
manage resources on the web in a collaborative way. Users can
describe the resources with arbitrary words, so-called tags. The
systems can be distinguished according to what kind of resources
are supported. Flickr, for instance, allows the sharing of photos,
del.icio.us the sharing of bookmarks, CiteULike1 and Connotea2
the sharing of bibliographic references, and 43Things3 even the
sharing of personal goals and resolutions. Our own system,
BibSonomy,4 can be used for sharing bookmarks and BIBTEX entries
simultaneously. In their core, these systems are all very similar.
Once a user is logged in, he can add a resource to the system and
assign arbitrary tags to it.</p>
      <p>
        The collection of all users’ tag assignments is called a
folksonomy. The word ‘folksonomy’ (coined by Vander Wal in [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]) is
      </p>
      <sec id="sec-2-1">
        <title>1http://www.citeulike.org</title>
        <p>
          2http://www.connotea.org
3http://www.43things.com
4http://www.bibsonomy.org
a blend of the words ‘taxonomy’ and ‘folk’, and stands for
conceptual structures created by the people. Folksonomies are thus
a bottom-up complement to more formalized Semantic Web
technologies, as they rely on emergent semantics [
          <xref ref-type="bibr" rid="ref8 ref9">8, 9</xref>
          ] which result
from the converging use of the same vocabulary.
        </p>
        <p>A typical user interface allows for exploration of the folksonomy
in all dimensions: for a given user one can see all resources he has
uploaded, together with the tags he has assigned to them; when
clicking on a resource one sees which other users have uploaded
this resource and how they tagged it; and when clicking on a tag
one sees who assigned it to which resources (see Figure 1).</p>
        <p>
          After an introduction to the user interface and architecture of
BibSonomy, we give an overview about some of its advanced
features. For the underlying data structure of the system and more
details, we refer to [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ].
1.1
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>User Interface</title>
      <p>
        BibSonomy with its more than 5,000 registered users allows to
share bookmarks (i.e., URLs) as well as publication references.
The data model of the publication part is based on BIBTEX [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], a
popular literature management system for LATEX [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. A typical list
of posts is depicted in Figure 1 which shows bookmark and
publication posts in a column layout containing the tag web. The page
is divided into four parts: the header (showing information such as
the current page and path, navigation links and search boxes), two
lists of posts – one for bookmarks and one for publications – each
sorted by date in descending order, and a list of tags related to the
posts. This scheme holds for all pages showing posts and allows
for navigation in all dimensions of the folksonomy.
      </p>
      <p>A detailed view of one bookmark post from the list in Figure 1
can be seen in Figure 2. The first line shows in bold the title of
the bookmark which has the URL of the bookmark as
underlying hyperlink. The second line shows an optional description the
user can assign to every post. The last two lines belong together
and show detailed information: first, all the tags the user has
assigned to this post (web, service, tutorial, guidelines and api),
second, the user name of that user (hotho) followed by a note, how
many users tagged that specific resource. These parts have
underlying hyperlinks, leading to the corresponding tag pages of the
user (/user/hotho/web 5, /user/hotho/service, . . . ),
5All paths given in parentheses are relative to http://www.
the users page (/user/hotho) and a page showing all four posts
(i. e., the one of user hotho and those of the 3 other people) of this
resource (/url/r, where r is a hashed representation of the
resource). The last part shows the posting date and time followed
by links for actions the user can do with this post – depending on
whether this is his own post (edit, delete) or another user’s post
(copy).</p>
      <p>The structure of a publication post displayed in BibSonomy is
very similar, as shown in Figure 3. The first line shows again the
title of the post, which equals the title of the publication in BIBTEX.
It has an underlying link leading to a page which shows detailed
information on that post. This line is followed by the authors or
bibsonomy.org.
editors of the publication, the journal or book title and the year.
The following lines show the tags assigned to this post by the user,
whose user name comes next, followed by a note on how many
people tagged this publication. As described for bookmark posts,
these parts link to the respective pages. After date and time of
the posting follow the actions the user can do, which in this case
include picking the entry for later download, copying it, accessing
the URL of the entry or viewing the BIBTEX source code.
1.2</p>
    </sec>
    <sec id="sec-4">
      <title>Additional Features</title>
      <p>This section briefly describes some extensions of BibSonomy
which go beyond the basic folksonomy model and have evolved
during the practical use of the system.
1.2.1</p>
      <sec id="sec-4-1">
        <title>Tag Hierarchy</title>
        <p>
          Tagging gained so much popularity in the past two years because
it is simple and no specific skills are needed for it. Nevertheless the
longer people use systems like BibSonomy, the more often they ask
for options to structure their tags. A user specific binary relation ≺
between tags as described in our model of a folksonomy (cf. [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ] for
details) is an easy way to arrange tags.
        </p>
        <p>Therefore we included this possibility in BibSonomy and
extended it further to use it for conceptual navigation. For instance, it
is possible, given a tag, to show all posts with one of the subtags of
the given tag.</p>
        <p>Figure 4 shows details of the relation management in
BibSonomy. Relations can be edited manually, but usually they will be
created in the normal tagging process by tagging a resource, e. g.
as eclipse-&gt;java, expressing that the tags eclipse and java should
be assigned to a particular resource, and that the pair (eclipse, java)
should be inserted into the user’s ≺ relation. When browsing the
folksonomy, a user can decide if she wants to make use of the tag
hierarchy, e. g. when querying for the tag java, resources that are
tagged with eclipse can be included in the result set, even if they do
not have the tag java themselves. In order to distinguish between
simple tag queries and those involving subtags, we call the latter
one a query for java as a concept.</p>
        <p>While it is not enforced upon the user how the ≺ relation is to
be used, we expect that most of the time, it will be used in order to
express subsumptions in an is-a hierarchy. The actual use we are
observing confirms this assumption.
1.2.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Duplicate detection.</title>
        <p>In particular for literature references there is the problem of
detecting duplicate entries, because there are large variations in how
users enter fields such as journal name or author. On the one hand
it is desirable to allow a user to have several entries which differ
only slightly. On the other hand one might want to find other users’
entries which refer to the same paper or book even if they are not
completely identical. Hence it is necessary to map these entries
together to allow such browsing functionality.</p>
        <p>To fulfill both goals we implemented two hashes to compare
publication entries at different levels of granularity. One is used to warn
the user if she posts very similar BIBTEX entries twice, possibly
creating unwanted redundancy. The other one is used to aggregate
BIBTEX entries that were posted by more than one user in a
common view, providing an opportunity to pick the most complete one
or copy over missing data to one’s own version of the publication
entry. The implemented solution does not allow for a common
creation of a single (correct) entry in a wiki style as we want to allow
every user to store BIBTEX in the way she likes without changes
from other users.
1.2.3</p>
      </sec>
      <sec id="sec-4-3">
        <title>Import of resources.</title>
        <p>To encourage users to transition from other systems, we
implemented an import functionality. For del.icio.us, this functionality
also takes into account the del.icio.us bundles which are named sets
of tags. We map bundles to relations. Furthermore it is possible to
import bookmark files of the Firefox6 web browser, where the
typical folder hierarchy of the bookmarks can be added to the user’s ≺
relation.</p>
        <p>Existing BIBTEX entries can be imported by uploading files,
past</p>
        <sec id="sec-4-3-1">
          <title>6http://www.mozilla.com/firefox/</title>
          <p>ing BIBTEX snippets into BibSonomy, or just marking a BIBTEX
entry on a web page and hitting the post button. Furthermore, for
numerous digital library services including the ACM Digital Library,
SpringerLink, arXiv, and CiteSeer, automatic screen scrapers for
publication metadata are provided.</p>
          <p>For unstructured publication metadata, such as the publication
lists often found on researchers’ web pages, semi-automatic
extraction using the Mallet7 information extraction tool is supported.
1.2.4</p>
        </sec>
      </sec>
      <sec id="sec-4-4">
        <title>Export facilities.</title>
        <p>The content of any page in the BibSonomy user interface can
be presented in different formats for export. Exporting BIBTEX
is accomplished by preceding the path of an URL with the string
/bib – this returns all publications shown on the respective page in
BIBTEX format. For example the page http://www.bibsonomy.org/
bib/search/text+clustering returns a BIBTEX file containing all
literature references which contain the words “text” and “clustering” in
their fulltext.</p>
        <p>HTML-formatted8 publication lists can be exported which can
be easily included into personal homepages. For example, a user
schmitz might want to tag his own publications with myown and
the year of publication. This enables him to include the page http://
www.bibsonomy.org/publ/user/schmitz/myown+2006 into his
personal web page to get an automatically updated list of his
publications on the web.</p>
        <p>Other exports such as XML, RSS and BURST9 feeds, RDF
according to the SWRC ontology, BIBTEX and EndNote work
similarly.</p>
        <p>Furthermore, links can be provided to an OpenURL10 resolver.
Such a resolver allows every user to find any publication presented
in BibSonomy in her own local library.
1.2.5</p>
        <p>Group Management and Privacy.</p>
        <sec id="sec-4-4-1">
          <title>7http://mallet.cs.umass.edu/</title>
          <p>8A small heuristic is applied to handle special letters e.g.
accents but LATEX commands are not used to format the output.
9http://www.cs.vu.nl/∼pmika/research/burst/BuRST.html
10http://www.exlibrisgroup.com/sfx openurl.htm
with</p>
          <p>In many situations it is desirable to share resources only among
certain people. If the resources can be public, then one could agree
to tag them with a special tag and use that tag to find the shared
resources. The disadvantage is, that this could be undermined by
other users (or spammers) by using the same tag. To solve this
problem and also to allow resources to be visible only for certain
users, we introduced groups in BibSonomy which gives users more
options to decide with whom they share their resources.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>CONCLUSION AND OUTLOOK</title>
    </sec>
    <sec id="sec-6">
      <title>Summary</title>
      <p>BibSonomy is, to the best of our knowledge, the only
folksonomy system currently online which combines bookmark and
publication management in a common user interface.</p>
      <p>In addition to standard folksonomy features, BibSonomy
provides capabilities for structuring a user’s tag cloud, group and
privacy management, and various import and export options,
including screen scrapers for popular publication services as well as for
unstructured publications lists.
2.2</p>
    </sec>
    <sec id="sec-7">
      <title>Ongoing and Future Work</title>
      <p>
        There are several important topics which we will be address in
the near future. As stated in the introduction folksonomies can be
seen as a lightweight knowledge representation. One major goal
is therefore the convergence with the semantic web effort which
is also called Web 3.0. To reach this goal a more machine
understandable tagging is needed which can be reached be using so
called “machine tags”11 but also by developing new methods to
extract semantics from folksonomies. Our next steps in this direction
but also to enhance the usability are as follows:
Ranking The FolkRank [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] ranking algorithm has been developed
which allows for a topic-specific ranking of folksonomy
resources. Incorporating a ranking scheme to enhance the
simple reverse-chronological presentation of posts is ongoing
work.
      </p>
      <p>
        Conceptual Clustering and Community Detection We are
currently investigating different approaches [
        <xref ref-type="bibr" rid="ref3 ref4 ref7">4, 3, 7</xref>
        ] for finding
coherent clusters within the folksonomy; these clusters could
be viewed as communities of users being interested in
common topics.
      </p>
      <p>
        Tag Recommender and Ontology Learning As users are
providing tag-tag relations in BibSonomy, we are currently
investigating techniques which enable the semi-automatic learning
of the tag-tag relation. A first step in that direction is the
learning of subsumption relations using association rules on
the folksonomy data set [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. The same techniques can also
be used to generate tag recommendations.
      </p>
      <p>
        API Experience has shown, that an Application Programming
Interface (API) is crucial for a folksonomy system to gain
success. It is something which has been requested by many
people and which allows for easy interaction of BibSonomy with
other systems. Hence we are currently investigating several
approaches to add an API to BibSonomy. Most systems use
lightweight APIs similar to the idea of REST [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] which can
be used and accessed in a very straightforward and
easy-toimplement fashion. Nevertheless, with SOAP12 there exists a
standard for web services which should also be taken into
account. Since the process of defining an API for BibSonomy
has just started, this is still an open task.
3.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>Roy</surname>
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Fielding</surname>
          </string-name>
          .
          <article-title>Architectural Styles and the Design of Network-based Software Architectures</article-title>
          .
          <source>PhD thesis</source>
          , University of California, Irvine,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Hotho</surname>
          </string-name>
          , Robert Ja¨schke, Christoph Schmitz, and Gerd Stumme.
          <article-title>BibSonomy: A social bookmark and publication sharing system</article-title>
          .
          <source>In Proceedings of the Conceptual Structures Tool Interoperability Workshop at the 14th International Conference on Conceptual Structures</source>
          , pages
          <fpage>87</fpage>
          -
          <lpage>102</lpage>
          ,
          <year>2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>Andreas</given-names>
            <surname>Hotho</surname>
          </string-name>
          , Robert Ja¨schke, Christoph Schmitz, and
          <string-name>
            <given-names>Gerd</given-names>
            <surname>Stumme</surname>
          </string-name>
          .
          <article-title>Information retrieval in folksonomies: Search and ranking</article-title>
          .
          <source>In Proceedings of the 3rd European Semantic Web Conference, LNCS</source>
          , pages
          <fpage>411</fpage>
          -
          <lpage>426</lpage>
          , Budva, Montenegro, June 2006. Springer.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Robert</given-names>
            <surname>Ja</surname>
          </string-name>
          ¨schke, Andreas Hotho, Christoph Schmitz, Bernhard Ganter, and
          <string-name>
            <given-names>Gerd</given-names>
            <surname>Stumme</surname>
          </string-name>
          .
          <article-title>Trias - an algorithm for mining iceberg tri-lattices</article-title>
          .
          <source>Hong Kong</source>
          ,
          <year>December 2006</year>
          . (to appear).
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>Leslie</given-names>
            <surname>Lamport. LaTeX: A Document Preparation System. Addison-Wesley</surname>
          </string-name>
          ,
          <year>1986</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>Oren</given-names>
            <surname>Patashnik. BibTeXing</surname>
          </string-name>
          ,
          <year>1988</year>
          .
          <article-title>(Included in the BIBTEX distribution)</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>Christoph</given-names>
            <surname>Schmitz</surname>
          </string-name>
          , Andreas Hotho, Robert Ja¨schke, and Gerd Stumme.
          <article-title>Mining association rules in folksonomies</article-title>
          .
          <source>In Proc. IFCS 2006 Conference</source>
          , pages
          <fpage>261</fpage>
          -
          <lpage>270</lpage>
          , Ljubljana,
          <year>July 2006</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>S.</given-names>
            <surname>Staab</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Santini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nack</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Steels</surname>
          </string-name>
          ,
          <article-title>and</article-title>
          <string-name>
            <given-names>A.</given-names>
            <surname>Maedche</surname>
          </string-name>
          .
          <article-title>Emergent semantics</article-title>
          .
          <source>Intelligent Systems, IEEE [see also IEEE Expert]</source>
          ,
          <volume>17</volume>
          (
          <issue>1</issue>
          ):
          <fpage>78</fpage>
          -
          <lpage>86</lpage>
          ,
          <year>2002</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>L.</given-names>
            <surname>Steels</surname>
          </string-name>
          .
          <article-title>The origins of ontologies and communication conventions in multi-agent systems</article-title>
          .
          <source>Autonomous Agents and Multi-Agent Systems</source>
          ,
          <volume>1</volume>
          (
          <issue>2</issue>
          ):
          <fpage>169</fpage>
          -
          <lpage>194</lpage>
          ,
          <year>October 1998</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <surname>Thomas</surname>
            <given-names>Vander</given-names>
          </string-name>
          <string-name>
            <surname>Wal</surname>
          </string-name>
          .
          <article-title>Folksonomy definition and wikipedia</article-title>
          .
          <source>November</source>
          <year>2005</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>