<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Tag Recommendations Based on Tracking Social Bookmarking Systems</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Szymon</forename><surname>Chojnacki</surname></persName>
							<affiliation key="aff0">
								<orgName type="department" key="dep1">Department of Artificial Intelligence</orgName>
								<orgName type="department" key="dep2">Institute of Computer Science</orgName>
								<orgName type="institution">Polish Academy of Sciences</orgName>
							</affiliation>
						</author>
						<title level="a" type="main">Tag Recommendations Based on Tracking Social Bookmarking Systems</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">B8CA5E7CBD3480C8A3218EDE74BE23B6</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-24T06:52+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Information Retrieval</term>
					<term>Searching Engines</term>
					<term>Tag Recommendations</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The purpose of this paper is to describe our approach to Tag Recommendation Task during ECML PKDD 2009 Challenge. The organizers supplied a training set of tagged webpages and publications from BibSonomy portal. Our goal was to build a model which can predict tags for new users bookmarking digital resources. Our strategy was based on an assumption that users tend to tag the same resources in various systems. Therefore, have we developed a tracking engine, which was adjusted to the profile of BibSonomy users in selection of RSS feeds and utilized the training data to optimize the list of tracked URLs. We had over 90 days to collect the data from the feeds, but this period did not overlap with the dates of posts from the training set. As a result we had to set manually parameters responsible for a trade-off between recall and accuracy of the model. We stored all downloaded feed entries in a searching engine. The recommendation was based on tags attached to the documents retrieved from the engine by means of typical information retrieval query.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>The development of collaborative society that we experience in recent years can be characterized by four principles: being open, peering, sharing and acting globally <ref type="bibr" target="#b5">[6]</ref>. These principles determine the way we exchange information and organize the knowledge. Very important part of this phenomenon is the popularity of social classification, indexing and tagging. Attaching labels to common resources (webpages, blogs, music, videos, photos) can on one hand shed a new light on information retrieval problems, on the other hand poses new challenges concerning uncontrolled explosion of folksonomy size and its usability. The goal of our research is to build a tag recommendation system that would influence user's selection of tags and as a result enable us to reuse folksonomy entries in more efficient way than we observe currently This paper describes our attempt to predict tags already chosen by BibSonomy users. This was the Task 1 in ECML PKDD 2009 Challenge. However, we believe</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Examined datasets</head><p>We used cleaned dump dataset which consisted of three tables: bibtex (158 924 records), bookmark (263 004 records) and tas (1 401 104 records). The dump contained all public bookmarks and publication posts of BibSonomy until (but not including) 2009-01-01. Posts from the user dblp (a mirror of the DBLP Computer Science Bibliography) as well as all posts from users which have been flagged as spammers have been excluded. Furthermore, the tags were cleaned. Java method was used to remove all characters which were neither numbers nor letters and removed those tags, which were empty after cleansing or matched one of the tags imported, public, systemimported, nn, systemunfiled. The tas table (Tag Assignments) was a fact table with information about who attached which tag to which resource/content. The bookmark table consisted of following columns (content_id, url_hash, url, description, extended description and date). The bibtex table was described by following dimensions (content_id, journal, volume, chapter, edition, month, day, booktitle, howPublished, institution, organization, publisher, address, school, series, bibteXKey, url, type, description, annote, note, pages, key, number, crossref, misc, bibtexAbstract, simhash0, simhash1, simhash2, entrytype, title, author, edition, year).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Our approach</head><p>In this section we describe three main parts of our system. Firstly we focus on a selection of RSS feeds and the problems we encountered while downloading the posts. In the second part we define the vector space in which the posts were stored as well as main characteristics of deployed database. Finally we present the details of the tag recommendation algorithm. The algorithm is divided into four steps: searching of matching resources based on URL address, retrieval of the most similar cluster, selection of the post with highest overlap score and ranking of suggested tags.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>4.1</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>RSS Feeds selection</head><p>Our strategy was to optimize a set of keywords that we were going to track in popular bookmarking systems as well as in a variety of domain portals. We analyzed distribution of most common tags in BibSonomy and Delicious and decided that tracking only the most recent posts would be biased (Table <ref type="table" target="#tab_0">1</ref>). We decided to enrich the most recent posts with a set of 100 most popular tags (out of 93 757 unique tags) in BibSonomy training data. We had to face different problems in case of bookmarking systems and domain portals. We used Google Reader to search for top 10 domain portals and their RSS URLs for each chosen keyword. Because some feeds appeared in different searching results we end up with 734 feeds.</p><p>An example of feeds recommended by Google Reader for a keyword "linux" is presented in Table <ref type="table" target="#tab_1">2</ref>. Even though numerous feeds use the most recent RSS or Atom standard and we could easily parse the content of XML files, it is uncommon to fill in the category field by feed editors. We can see in the Table <ref type="table" target="#tab_1">2</ref>, that out of 10 sources: one did not contain proper URL, four did not deliver information about category, one marked each feed entry with the same category. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>RSS Feed</head><p>Categories of updated entries On the other hand the problem with typical bookmarking systems is the fact that when we subscribe most recent posts for a given keyword we get only tags of a particular user who bookmarked the resource. As a consequence we need to crawl a service in order to find out about most typical tags for a given resource. The problem of connection limits arises when we want to crawl every out of 100 entries downloaded for a given keyword. Because of this, we decided to verify if we can cluster tags based on their cooccurence score. Table <ref type="table" target="#tab_2">3</ref> contains 20 pairs of tags with highest symmetric Jaccard cooccurance coefficient calculated as a division of number of posts with both tags by a number of all posts with any of the tags. We can see that "ccp" and "jrr" always appear together. Also "genetic", "algorithms" and "programming" create a cloud of tags. Four tags "emulationgames", "emulationvideogames", "aaaemulationgames", "classicemulatedremakeretrogames" create another cloud. However, the Jaccard coefficient drops very fast below 20% level and therefore we decided not to abandon the idea of tag clustering.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Linux</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>4.2</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Data storage</head><p>In order to recommend tags online we needed a fast engine that does not need to be taught every time we get need posts from a scratch. The Beatca system (developed in our Institute <ref type="bibr" target="#b0">[1,</ref><ref type="bibr" target="#b3">4]</ref>) is an example of such engine. It performs online incremental hierarchical clustering of documents and proved very effective in the field of intelligent Information Retrieval. Soft classification of documents and construction of conceptual closeness graph is based on large-scale Bayesian networks. Optimal document map search and document clustering is based on SOM (self-organizing maps), AIS (artificial immune systems), and GNG (growing neural gas). Each post is defined as a point in a multidimensional space in which coordinates represent frequency of a token appearing in a post's title or description. Because some tokens are very common and others are present in only few posts we selected only the most informative tokens as coordinates in our vector space. The dictionary optimization was based on a entropy-like quality measure Q(t i ) of a token t i :</p><formula xml:id="formula_0">˝{ˮ { ˚ ˚Ñ . ˚ ˚ Ñ ˚ ˚ (# ˚<label>(1)</label></formula><p>where N ij is the number of occurrences of term t i in document d j , N j is the number of documents that contains term t i and N is the total number of documents. We removed tokens with Q(t i ) measure below 0.01 or above 0.95. We implemented term frequency inverse document frequency weighting scheme. According to the scheme we divided term frequency in a single document by the number of documents in which the term appears.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.3">Tag recommendations</head><p>Our tag recommendation consisted of four steps. If we had a positive result in the first step then we went directly to the final fourth step.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Step One</head><p>In the first step we checked if a post is present in the BibSonomy training set or an URL of the post is among downloaded RSS entries. If the answer was true then we selected all tags attached to these resources and moved to the Step Four.</p><p>Step Two</p><p>In the second step we retrieved a group (cluster) of documents that was the most similar to the post's description or title field. The similarity was measured as a cosine of an angel between vectors x={x 1 ,…,x n } and y={y 1 ,…,y n } representing the resources in our database and the post (Eq. 2). For example, one of the posts had following title: "Attribute Grammar Based Programming and its Environment". The query consisting of the first five informative tokens from the above title returned a cluster of four documents:</p><p>1 </p><formula xml:id="formula_1">{ { ˲ ˩ ˳ ˩ J ˩ ŵ ˲ ˩ Ŷ J ˩ ŵ ˳ ˩ Ŷ J ˩ ŵ<label>(2)</label></formula><p>A cluster of all the retrieved posts was transferred to the next step.</p><p>Step Three For all the posts retrieved in the second step we calculated normalized overlap score and chosen the post with the highest score. The overlap was defined as a maximum length of n-gram appearing in both posts. In order to compute the score we used all the words from title/description fields (not only the most informative tokens). The overlap score was divided by the length of title/description field of the candidate posts. For example, normalized overlap score between "Attribute Grammar Based Programming and its Environment" and "Attribute grammar based language extension for Java" equals to 3/7=0.42. The post with highest score was transferred to the final fourth step if the value of a score was greater than 0.6 threshold.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Step four</head><p>In the last step we ordered the tags of selected post according to their count in BibSonomy training set. Top five tags were selected as predictions in the Challenge.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5">Evaluation</head><p>The F1-Measure common in Information Retrieval was used to evaluate the recommendations. The precision and recall were first computed for each post in the test data by comparing the recommended tags against the tags the user has originally assigned to this post <ref type="bibr" target="#b1">[2]</ref>. Then the average precision and recall over all posts in the test data was used to calculate the F1-Measure as f1 = (2 * precision * recall) / (precision + recall). The number of tags one can recommend was not restricted. However, the organizers regarded the first five tags only. We computed both precision and recall measures for various levels of a threshold parameter from step three in our recommendation algorithm (Fig. <ref type="figure" target="#fig_1">1</ref>). According to these simulations optimum level of the threshold is approximately 0.6 and yields F1-measure between 3% and 4%. During the challenge we obtained overall F1-measure of 4,6%, which was slightly better than in our simulations, but incomparable to the results of the best teams.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="6">Conclusions</head><p>We must admit that the way we approached the problem needs substantial computing power and disc space. Unfortunately the quality of our tag recommendations was below an average and probably this direction of research in the field of tag recommending systems is not a promising one. </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Insider 3 | 7 |</head><label>37</label><figDesc>Community | Community | Distros | Licensing | Financial News | Mobile | Community | Community programming | xwindows | google | gui | software | microsoft | portables | os | storage | linuxbusiness | security | gnu | education | caldera | portables Linux.com :: Features 8 URL broken HowtoForge -Linux Howtos and Tutorials 9 | Ubuntu | Debian | Ubuntu | Desktop | Debian | Lighttpd | Ubuntu | Desktop | Virtualization | Ubuntu | Desktop | Security | Ubuntu | CentOS | Samba | Ubuntu | Desktop | Linux | Ubuntu | Security | Ubuntu | Desktop | Fedora | Security Linux and Open Source -RSS Feeds 10 No Categories LinuxQuestions.org 11 | Linux -Newbie | Linux -Newbie | Linux -Newbie | Linux -Software | Programming | Red Hat | Linux -General | Linux -General | Linux -Laptop and Netbook | Puppy | Ubuntu | Linux -Desktop | Ubuntu | Ubuntu | Linux -Security LXer Linux News 12 | linux | linux | linux | linux | linux | linux | linux | linux | linux | linux | linux | linux | linux | linux | linux | linux | linux | linux | linux</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head>Fig. 1 .</head><label>1</label><figDesc>Fig. 1. Values of precision, recall and F1 measures for different levels of overlap threshold.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>The most common tags in Delicious and Bibsonomy</figDesc><table><row><cell></cell><cell>Tag</cell><cell>Delicious 1</cell><cell>BibSonomy 2</cell></row><row><cell>1</cell><cell>design</cell><cell>1.69%</cell><cell>27</cell></row><row><cell>2</cell><cell>blog</cell><cell>1.29%</cell><cell>13</cell></row><row><cell>3</cell><cell>tools</cell><cell>1.05%</cell><cell>10</cell></row><row><cell>4</cell><cell>software</cell><cell>0.96%</cell><cell>4</cell></row><row><cell>5</cell><cell>webdesign</cell><cell>0.92%</cell><cell>54</cell></row><row><cell>6</cell><cell>programming</cell><cell>0.89%</cell><cell>5</cell></row><row><cell>7</cell><cell>tutorial</cell><cell>0.85%</cell><cell>44</cell></row><row><cell>8</cell><cell>art</cell><cell>0.75%</cell><cell>83</cell></row><row><cell>9</cell><cell>reference</cell><cell>0.72%</cell><cell>33</cell></row><row><cell>10</cell><cell>video</cell><cell>0.72%</cell><cell>3</cell></row><row><cell>11</cell><cell>inspiration</cell><cell>0.71%</cell><cell>587</cell></row><row><cell>12</cell><cell>music</cell><cell>0.66%</cell><cell>25</cell></row><row><cell>13</cell><cell>web2.0</cell><cell>0.65%</cell><cell>7</cell></row><row><cell>14</cell><cell>education</cell><cell>0.63%</cell><cell>17</cell></row><row><cell>15</cell><cell>photography</cell><cell>0.52%</cell><cell>166</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Results of searching first ten feeds for "Linux" keyword in Google Reader.</figDesc><table /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 .</head><label>3</label><figDesc>Co-occurances of pairs of tags, occurances and normalized Jaccard coefficient.</figDesc><table><row><cell>Tag 1</cell><cell>Tag 2</cell><cell>Éˮ # ˮ $ É</cell><cell>|t 1 |</cell><cell>|t 2 |</cell><cell>Éˮ # ˮ $ É Éˮ # ˮ $ É</cell></row><row><cell>ccp</cell><cell>jrr</cell><cell>4294</cell><cell>4294</cell><cell>4294</cell><cell>1.00</cell></row><row><cell>algorithms</cell><cell>genetic</cell><cell>5775</cell><cell>6220</cell><cell>5888</cell><cell>0.91</cell></row><row><cell>aaaemulation-topgames</cell><cell>emulation-videogames</cell><cell>3653</cell><cell>3653</cell><cell>4576</cell><cell>0.80</cell></row><row><cell>emulationgames</cell><cell>emulation-videogames</cell><cell>4576</cell><cell>6055</cell><cell>4576</cell><cell>0.76</cell></row><row><cell>aaaemulation-topgames</cell><cell>classicemulated-remakeretrogames</cell><cell>2472</cell><cell>3653</cell><cell>2472</cell><cell>0.68</cell></row><row><cell>aaaemulation-topgames</cell><cell>emulationgames</cell><cell>3653</cell><cell>3653</cell><cell>6055</cell><cell>0.60</cell></row><row><cell>classicemulated-remakeretrogames</cell><cell>emulation-videogames</cell><cell>2472</cell><cell>2472</cell><cell>4576</cell><cell>0.54</cell></row><row><cell>genetic</cell><cell>programming</cell><cell>5262</cell><cell>5888</cell><cell>9491</cell><cell>0.52</cell></row><row><cell>journal</cell><cell>medical</cell><cell>1693</cell><cell>2566</cell><cell>2448</cell><cell>0.51</cell></row><row><cell>algorithms</cell><cell>programming</cell><cell>5303</cell><cell>6220</cell><cell>9491</cell><cell>0.51</cell></row><row><cell>classicemulated-remakeretrogames</cell><cell>emulationgames</cell><cell>2472</cell><cell>2472</cell><cell>6055</cell><cell>0.41</cell></row><row><cell>book</cell><cell>nlp</cell><cell>1230</cell><cell>2614</cell><cell>2027</cell><cell>0.36</cell></row><row><cell>education</cell><cell>learning</cell><cell>2143</cell><cell>5021</cell><cell>4751</cell><cell>0.28</cell></row><row><cell>media</cell><cell>texts</cell><cell>1998</cell><cell>7149</cell><cell>2012</cell><cell>0.28</cell></row><row><cell>analysis</cell><cell>data</cell><cell>1187</cell><cell>3352</cell><cell>2589</cell><cell>0.25</cell></row><row><cell>folksonomy</cell><cell>tagging</cell><cell>1027</cell><cell>2561</cell><cell>3083</cell><cell>0.22</cell></row><row><cell>emulationgames</cell><cell>zzztosort</cell><cell>2844</cell><cell>6055</cell><cell>11839</cell><cell>0.19</cell></row><row><cell>audio</cell><cell>music</cell><cell>919</cell><cell>1857</cell><cell>4142</cell><cell>0.18</cell></row><row><cell>howto</cell><cell>tutorial</cell><cell>850</cell><cell>2876</cell><cell>2798</cell><cell>0.18</cell></row><row><cell>bookmarks</cell><cell>indexforum</cell><cell>9164</cell><cell>52795</cell><cell>9183</cell><cell>0.17</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head></head><label></label><figDesc>However we believe that there are certain situations in which best tags are not a function of words contained in title of a post and in our future research we would like to focus on such examples. Despite of unsatisfactory result in the first Task of ECML PKDD 2009 Challenge we are going to verify our recommendations within the third online recommendation task.</figDesc><table><row><cell></cell><cell>10%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>9%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>8%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>7%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>6%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell>Value</cell><cell>5%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>4%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>3%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>2%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>1%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>0%</cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell>0</cell><cell>0.1</cell><cell>0.2</cell><cell>0.3</cell><cell>0.4</cell><cell>0.5</cell><cell>0.6</cell><cell>0.7</cell><cell>0.8</cell><cell>0.9</cell><cell>1</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell></cell><cell cols="3">Overlap threshold</cell><cell></cell><cell></cell><cell></cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>precision</cell><cell></cell><cell>recall</cell><cell></cell><cell>F1</cell><cell></cell><cell></cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">Relative frequency of a tag in a random collection of 603 750 downloaded from the Delicious.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="2" xml:id="foot_1">Rank of a corresponding tag in the BibSonomy.</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="3" xml:id="foot_2">http://www.linuxinsider.com/perl/syndication/rssfull.pl</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="4" xml:id="foot_3">http://www.linux-mag.com/cache/rss20.xml</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="5" xml:id="foot_4">http://distrowatch.com/news/dw.xml</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="6" xml:id="foot_5">http://linuxtoday.com/backend/biglt.rss</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="7" xml:id="foot_6">http://rss.slashdot.org/Slashdot/slashdotLinux</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="8" xml:id="foot_7">http://www.linux.com/index.rss</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="9" xml:id="foot_8">http://www.howtoforge.com/node/feed</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="10" xml:id="foot_9">http://rssnewsapps.ziffdavis.com/eweeklinux.xml</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="11" xml:id="foot_10">http://www.linuxquestions.org/syndicate/lqlatest.xml</note>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="12" xml:id="foot_11">http://lxer.com/module/newswire/headlines.rss</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">On some clustering algorithms for document maps creation</title>
		<author>
			<persName><forename type="first">Draminski</forename><surname>Ciesielski</surname></persName>
		</author>
		<author>
			<persName><surname>Klopotek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Wierzchon</forename><surname>Kujawiak</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the Intelligent Information Processing and Web Mining Conference (IIS:IIPWM-2005)</title>
				<meeting>the Intelligent Information Processing and Web Mining Conference (IIS:IIPWM-2005)<address><addrLine>Gdansk,</addrLine></address></meeting>
		<imprint>
			<publisher>Springer-Verlag</publisher>
			<date type="published" when="2005">2005</date>
		</imprint>
	</monogr>
	<note>Advances in Soft Computing</note>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Tag Recommendations in Social Bookmarking Systems</title>
		<author>
			<persName><forename type="first">Marinho</forename><surname>Jäschke</surname></persName>
		</author>
		<author>
			<persName><surname>Hotho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Stumme</forename><surname>Schmidt-Thieme</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">AI Communications</title>
				<meeting><address><addrLine>Amsterdam</addrLine></address></meeting>
		<imprint>
			<publisher>IOS Press</publisher>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Multilabel Text Classification for Automated Tag Suggestion</title>
		<author>
			<persName><forename type="first">Tsoumakas</forename><surname>Katakis</surname></persName>
		</author>
		<author>
			<persName><surname>Vlahavas</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of ECML PKDD Discovery Challenge (RSDC08)</title>
				<meeting>ECML PKDD Discovery Challenge (RSDC08)</meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Coexistence of Fuzzy and Crisp Concepts in Document Maps</title>
		<author>
			<persName><forename type="first">Wierzchon</forename><surname>Klopotek</surname></persName>
		</author>
		<author>
			<persName><surname>Ciesielski</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Kujawiak</forename><surname>Draminski</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the International Conference on Artificial Neural Networks (ICANN 2005)</title>
		<title level="s">Lecture Notes in Artificial Intelligence</title>
		<meeting>the International Conference on Artificial Neural Networks (ICANN 2005)</meeting>
		<imprint>
			<publisher>Springer-Verlag</publisher>
			<date type="published" when="2005">2005</date>
			<biblScope unit="volume">3697</biblScope>
		</imprint>
	</monogr>
	<note>LNAI</note>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Tag Recommendation for Folksonomies Oriented towards Individual Users</title>
		<author>
			<persName><surname>Lipczak</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of ECML PKDD Discovery Challenge (RSDC08)</title>
				<meeting>ECML PKDD Discovery Challenge (RSDC08)</meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<monogr>
		<title level="m" type="main">Wikinomics</title>
		<author>
			<persName><forename type="first">Williams</forename><surname>Tapscott</surname></persName>
		</author>
		<imprint>
			<date type="published" when="2008">2008</date>
			<publisher>Atlantic Books</publisher>
			<pubPlace>London</pubPlace>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Tag Recommendations using Bookmark Content</title>
		<author>
			<persName><forename type="first">Srikanth</forename><surname>Tatu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D'</forename><surname>Silva</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of ECML PKDD Discovery Challenge (RSDC08)</title>
				<meeting>ECML PKDD Discovery Challenge (RSDC08)</meeting>
		<imprint>
			<date type="published" when="2008">2008</date>
		</imprint>
	</monogr>
	<note>RSDC&apos;08</note>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
