<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>ONEMercury: Towards Automatic Annotation of Environmental Science Metadata</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Suppawong Tuarob</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Line C. Pouchard</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Natasha Noy</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jeffery S. Horsburgh</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Giri Palanisamy</string-name>
        </contrib>
      </contrib-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>1Pennsylvania State University, University Park, PA, USA
2Oak Ridge National Laboratory, Oak Ridge, TN, USA
3Stanford University, Stanford, CA, USA
4Utah State University, Logan, UT, USA
suppawong@psu.edu, pouchardlc@ornl.gov, noy@stanford.edu, jeff.horsburgh@usu.edu,
palanisamyg@ornl.gov</p>
    </sec>
    <sec id="sec-2">
      <title>Abstract. The rapid growth of diverse data types and greater vol</title>
      <p>umes available to environmental sciences prompts the scientists to seek
knowledge in data from multiple places, times, and scales. To facilitate
such need, ONEMercury has recently been implemented as part of the</p>
    </sec>
    <sec id="sec-3">
      <title>DataONE project to serve as a portal for accessing environmental and</title>
      <p>observational data across the globe. ONEMercury harvests metadata
from the data hosted by multiple repositories and makes it searchable.
However, harvested metadata records sometimes are poorly annotated
or lacking meaningful keywords, and hence would unlikely be retrieved
during the search process. In this paper, we develop an algorithm for
automatic metadata annotation. We transform the problem into a tag
recommendation problem, and propose a score propagation style
algorithm for tag recommendation. Our experiments on four data sets of
environmental science metadata records not only show great promises
on the performance of our method, but also shed light on the different
natures of the data sets.</p>
      <sec id="sec-3-1">
        <title>Introduction</title>
        <p>Environmental sciences have become both complex and data-intensive, needing
accesses to heterogenous data collected from multiple places, times, and
thematic scales. For example, research on climate changes would involve exploring
and analyzing observational data such as the migration of animals and
temperature shifts across the world, from time to time. While the needs to access such
heterogenous data are apparent, the rapid expansion of observational data, in
both quantity and heterogeneity, poses huge challenges for data seekers to obtain
the right information for their research. Such problems behoove tools that
automatically manage, discover, and link big data from diverse sources, and present
the data in the forms that are easily accessible and comprehensible.</p>
        <sec id="sec-3-1-1">
          <title>1.1 ONEMercury Search Service</title>
          <p>
            Recently, DataONE, a federated data network built to facilitate accesses and
preservation about environmental and ecological science data across the world,
has come to exist and gain increasingly popularity[
            <xref ref-type="bibr" rid="ref7">7</xref>
            ]. DataONE harvests
metadata from different environmental data providers and make it searchable via the
search interface ONEMercury1, built on Mercury2, a distributed metadata
man1 https://cn.dataone.org/onemercury/
2 http://mercury.ornl.gov/
agement system. ONEMercury offers two modes of searching: basic and advance.
The basic mode only requires the user to input a set of keywords and the system
would return matching results; while the advance mode adds the capability to
further filter search results by authors, projects, and keywords.
          </p>
        </sec>
        <sec id="sec-3-1-2">
          <title>1.2 Challenge and Proposed Solution</title>
          <p>Linking data from heterogenous sources always has a cost. One of the biggest
problems that ONEMercury is facing is the different levels of annotation in
the harvested metadata records. Poorly annotated metadata records tend to be
missed during the search process as they lack meaningful keywords.
Furthermore, such records would not be compatible with the advance mode offered by
ONEMercury as it requires the metadata records be semantically annotated with
keywords from the keyword library. The explosion of the amount of metadata
records harvested from an increasingly number of data repositories makes it even
impossible to annotate the harvested records manually by hand, urging the need
for a tool capable of automatically annotating poorly curated metadata records.</p>
          <p>In this paper, we address the problem of automatic annotation of metadata
records. Our goal is to build a fast and robust system that annotates a given
metadata record with meaningful and related keywords from a given ontology.
The idea is to annotate a poorly annotated record with keywords associated to
the well annotated records that it is most similar with. We propose a solution to
this problem by first transforming the problem into a tag recommendation
problem where the set of recommended tags is used to annotate the given metadata
record, and then propose an algorithm that deals with the problem.</p>
        </sec>
        <sec id="sec-3-1-3">
          <title>1.3 Problem De nition</title>
          <p>We define a document as a tuple of textual content and a set of tags. That is
d = &lt;c, e&gt;, where c is the textual content, represented by a sequence of terms,
of the document d and e is a set of tags associated with the document. Given
a tag library T , a set of annotated documents S, and a non-annotated query
document q, our task is to recommend a ranked set of K tags taken from T
to the query q. A document is said to be annotated if it has at least one tag;
otherwise, it is non-annotated. The formal description of each variable is given
below:</p>
        </sec>
        <sec id="sec-3-1-4">
          <title>1.4 Contributions</title>
          <p>This paper has four key contributions as follows:
1. We address a real word problem of linking data from multiple archives faced
by ONEMercury. We transform the problem into the tag recommendation
problem, and generalize the problem so that the proposed solution can apply
to other domains.
2. We propose a novel score propagation technique for tag recommendation.</p>
          <p>Given a document query q, we first calculate the similarity score between
the query and each document in the source S. The score then is propagated
to the tags of each document in the source. Tags then are ranked by the
scores, and the top K tags are returned for recommendation. We propose
two different measures for computing the similarity between two documents:
term frequency-inverse document frequency (TFIDF) and topic model (TM).
3. We crawl environmental science metadata records from 4 different archives
for our data sets: the Oak Ridge National Laboratory Distributed Active
Archive Center (DAAC)3, Dryad Digital Repository4, the Knowledge
Network for Biocomplexity (KNB)5, and TreeBASE: a repository of
phylogenetic information6. We select roughly 1,000 records from each archive for
the experiments.
4. We validate our proposed method using aggressive empirical evaluations.</p>
          <p>We use document wise 10 fold cross validation to evaluate our schemes with
5 evaluation metrics: Precision, Recall, F1, MRR (Mean Reciprocal Rank),
and BPref (Binary Preference). These evaluation metrics are extensively used
together to evaluate recommendation systems.
2</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>Related Works</title>
        <p>
          Since we choose to transform our setting to a tag recommendation problem.
We briefly state the related literature here. Tag recommendation has gained
substantial amount of interest in recent years. Most work, however, focuses on
personalized tag recommendation, suggesting tags to a user’s object based on
the user’s preference and social connection. Mishne et al. [
          <xref ref-type="bibr" rid="ref8">8</xref>
          ] employ the social
connection of the users to recommend tags for weblogs, based on similar weblogs
tagged by the same users. Wu et al.[
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] utilize the social network and the
similarity between the contents of objects to learn a model for recommending tags.
Their system aims towards recommending tags for Flickr photo objects. While
such personalized schemes have been proven to be useful, some domains of data
have limited information about authors (users) and their social connections. Liu
et al. [
          <xref ref-type="bibr" rid="ref6">6</xref>
          ] propose a tag recommendation model using Machine Translation. Their
algorithm basically trains the translation model to translate the textual
description of a document in the training set into its tags. Krestel et al.[
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] employ topic
modeling for recommending tags. They use the Latent Dirichlet Allocation
algorithm to mine topics in the training corpus, using tags to represent the textual
content. They evaluate their method against the association rule based method
proposed in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ].
3
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>Data Sets</title>
        <p>We obtain the data sets of environmental metadata records for the experiments
from 4 different archives: the Oak Ridge National Laboratory Distributed Active
Archive Center (DAAC), Dryad Digital Repository (DRYAD), the Knowledge
Network for Biocomplexity (KNB), and TreeBASE: a repository of phylogenetic
information (TREEBASE). The statistics of the data sets including the number
3 http://daac.ornl.gov/
4 http://datadryad.org/
5 http://knb.ecoinformatics.org/index.jsp
6 http://treebase.org/treebase-web/home.html
of documents, total number of tags, average number of tags per document,
number of unique tags (tag library size), tag utilization, number of all words (data
set size), and average number of word per document, are summarized in Table
1. Tag utilization is the average number of documents where a tag appears in,
# all tags
and is defined as # unique tags .</p>
        <p>In our setting, we assume that the documents are independently annotated,
so that the tags in our training sets represent the gold-standard. However, some
metadata records may not be independent since they may be originated from
the same projects or authors, hence annotated with similar styles and sets of
keywords. To mitigate such problem, we randomly select a subset of 1,000
annotated documents (except DAAC data set, which only has 978 documents, hence
we select them all.) from each archive for our experiments. We combine all the
textual attributes (i.e. Title, Abtract, Description) together as the textual
content for the document. We preprocess the textual content in each document
by removing 664 common stop words and punctuation, and stemming the words
using the Porter2 stemming algorithm.
4</p>
      </sec>
      <sec id="sec-3-4">
        <title>Preliminaries</title>
        <p>Our proposed solution is built upon the concepts of Cosine Similarity, Term
Frequency-Inverse Document Frequency (TFIDF), and Latent Dirichlet
Allocation (LDA). We briefly introduce them here to fortify readers’ background before
going further.</p>
        <sec id="sec-3-4-1">
          <title>4.1 Cosine Similarity</title>
          <p>In general, cosine similarity is a measure of similarity between two vectors by
measuring the cosine of the angle between them. Given two vectors A and B,
the cosine similarity is defined using a dot product and magnitude as:
CosineSim(A; B) = A · B ∑iN=1 Ai × Bi (4)
∥A∥ ∥B∥ = √∑iN=1(Ai)2 × √∑iN=1(Bi)2</p>
          <p>
            CosineSim(A,B) outputs [
            <xref ref-type="bibr" rid="ref1">0,1</xref>
            ], with 0 indicating independence, and the
value in between indicates the level of similarity. The cosine similarity is heavily
used to calculated the similarity between two vectorized documents.
          </p>
        </sec>
        <sec id="sec-3-4-2">
          <title>4.2 Term Frequency-Inverse Document Frequency</title>
          <p>TF-IDF is used extensively in the information retrieval area. It reflects how
important a term is to a document in a corpus. TF-IDF has two components: the
term frequency (TF) and the inverse document frequency (IDF). The TF is the
frequency of a term appearing in a document. The IDF of a term measures how
important the term is to the corpus, and is computed based on the document
frequency, the number of documents in which the term appears. Formally, given
a term t, a document d, and a corpus (document collection) D:
(5)
We can then construct a TF-IDF vector for a document d given a corpus D as
follows:</p>
          <p>T F IDF (d; D) =&lt; tfidf(t1; d; D); tfidf(t2; d; D); · · · ; tfidf(tn; d; D) &gt; (6)
Consequently, if one wishes to compute the similarity score between two
documents d1 and d2, the cosine similarity can be computed between the TF-IDF
vectors representing the two documents:</p>
          <p>DocSimT F IDF (d1; d2; D) = CosineSim (T F IDF (d1; D); T F IDF (d2; D)) (7)</p>
        </sec>
        <sec id="sec-3-4-3">
          <title>4.3 Latent Dirichlet Allocation</title>
          <p>
            In text mining, the Latent Dirichlet Allocation (LDA) [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ] is a generative model
that allows a document to be represented by a mixture of topics. The basic
intuition of LDA for topic modeling is that an author has a set of topics in mind
when writing a document. A topic is defined as a distribution of terms. The
author then chooses a set of terms from the topics to compose the document.
With such assumption, the whole document can be represented using a mixture
of different topics. Mathematically, the LDA model is described as follows:
jZj
P (ti|d) = ∑ P (ti|zi = j) · P (zi = j|d) (8)
          </p>
          <p>j=1</p>
          <p>P (tijd) is the probability of term ti being in document d. zi is the latent
(hidden) topic. jZj is the number of all topics. This number needs to be
predefined. P (tijzi = j) is the probability of term ti being in topic j. P (zi = jjd) is
the probability of picking a term from topic j in the document d.</p>
          <p>After the topics are modeled, we can assign a distribution of topics to a
given document using a technique called inference. A document then can be
represented with a vector of numbers, each of which represents the probability
of the document belonging to a topic.</p>
          <p>Inf er(d; Z) =&lt; z1; z2; :::; zQ &gt;; jZj = Q
Where Z is a set of topics, d is a document, and zi is a probability of the
document d falling into topic i. Since a document can be represented using a vector
of numbers, one can then compute the topic similarity between two documents
d1 and d2 using cosine similarity as follows:</p>
          <p>DocSimT M (d1; d2; Z) = CosineSim (Infer(d1; Z); Infer(d2; Z)) (9)
5</p>
        </sec>
      </sec>
      <sec id="sec-3-5">
        <title>Method</title>
        <p>In this section, we describe our score propagation method for tag
recommendation. We show how our algorithm works using a simple example in Section 5.1,
and later discuss the variation of document similarity measures that we use.</p>
        <sec id="sec-3-5-1">
          <title>5.1 System Overview</title>
          <p>Figure 1 illustrates a flow of our score propagation algorithm on a simple
example. Three documents in the source are annotated with tags fwater, seagullg,
fseagull, soil, birdg, and fbird, airg respectively. Our algorithm proceeds
as follows:
STEP1 The document similarity score is computed between the document query
6
and each document in the source.</p>
          <p>STEP2 The scores then are propagated to the tags in each source document.The
scores are combined if a tag receives multiple scores. In the example, tags
seagull and bird obtain multiple scores (0.7+0.5) and (0.5+0.3) respectively.
STEP3 The tags are ranked by the scores. Then the top K tags are returned as
suggested tags.</p>
        </sec>
        <sec id="sec-3-5-2">
          <title>5.2 Document Similarity Measures</title>
          <p>We explore two different document similarity measures when computing the
similarity between the document query and the documents in the source.</p>
          <p>TFIDF Based. The first measure relies on the term frequency-inverse
document frequency discussed in Section 4.2. In our setting, D is the document
source. In order to compute the IDF part of the scheme, all the documents
in the source need to first be indexed. Hence the training phase (preprocess)
involves indexing all the documents. We then compute the similarity between
the query q and a source document d using DocSimT F IDF (q; d; D) as defined
in Equation 7. We use LingPipe7 to perform the indexing and calculating the
TFIDF based similarity.</p>
          <p>
            TM Based. The second document similarity measure utilizes topic
distributions of the documents. Hence the training process involves modeling topics from
the source using LDA algorithm as discussed in Section 4.3. We use Stanford
Topic Modeling Toolbox8 with the collapsed variational Bayes approximation[
            <xref ref-type="bibr" rid="ref1">1</xref>
            ]
to identify topics in the source documents. For each document we generate
unigrams, bi-grams, and tri-grams, and combine them to represent the textual
content of the document. The algorithm takes two input parameters: the number
of topics to be identified and the maximum number of the training iterations.
After some experiments on varying the two parameters we fix them at 300 and
7 http://alias-i.com/lingpipe/
8 http://nlp.stanford.edu/software/tmt/tmt-0.4/
1,000 respectively. For assigning a topic distribution to a document, we use the
inference method proposed by [
            <xref ref-type="bibr" rid="ref1">1</xref>
            ].
6
          </p>
        </sec>
      </sec>
      <sec id="sec-3-6">
        <title>Evaluation and Discussion</title>
        <p>We evaluate our methods using the tag prediction protocol. We artificially create
a test query document by removing the tags from an annotated document. Our
task is to predict the removed tags. There are two reasons behind the choosing
of this evaluation scheme:
1. The evaluation can be done fully automatically. Since our data sets are large,
manual evaluation (i.e. having human identify whether a recommended tag
is relevant or not) would be infeasible.
2. The evaluation can be done against the existing gold standard established
(manually tagged) by expert annotators who have good understanding about
the data, while manual evaluation could lead to evaluation biases.
We test our algorithm using with document similarity measures on each data
set, using two different source (training set) modes: self-source and cross-source.
For the self-source mode, the documents in the training set are selected from
the same archive as the query; while the cross-source mode combines the
training documents from all the archives together. We evaluate our algorithms with
different source modes using document-wise 10 fold cross validation, where each
data set is split into 10 equal subsets, and for each fold i 2 f1; 2; 3; :::; 10g the
subset i is used for the testing set, and the other 9 subsets are combined and
used as the source (training set). The results of each fold are summed up and
the averages are reported. The evaluation is done on a Windows 7 PC with Intel
Core i7 2600 CPU 3.4 GHz and 16GB of ram.</p>
        <sec id="sec-3-6-1">
          <title>6.1 Evaluation Metrics</title>
          <p>Precision, Recall, F1.</p>
          <p>For each document query in the test set, we use the original set of tags as the
ground truth Tg. Assume that the set of recommended tags are Tr , so that the
correctly recommended tags are Tg ∩ Tr. Precision, recall and F1 measures are
defined as follows:
precision = |Tg ∩ Tr| ; recall = |Tg ∩ Tr| ; F 1 = 2 · precision · recall</p>
          <p>|Tr| |Tg| precision + recall
In our experiments, the number of recommended tags ranges from 1 to 30. It is
wise to note that better tag recommendation systems tend to rank correct tags
higher than the incorrect ones. However, the precision, recall, and F1 measures
do not take ranking into account. To evaluate the performance of the ranked
results, we employ the following evaluation metrics.</p>
          <p>Mean Reciprocal Rank (MRR).</p>
          <p>MRR measure takes ordering into account. It measures how well the first
correctly recommended tag is ranked. Formally, given a testing set Q, let rankq be
the rank of the first corrected answer of query q 2 Q, then MRR of the query
set Q is defined as follows:
Binary Preference (Bpref ).</p>
          <p>
            Bpref measure considers the order of each correctly recommended tag [
            <xref ref-type="bibr" rid="ref3">3</xref>
            ]. Let S
be the set of recommended tags by the system, R be the set of corrected tags ,
r 2 R be a correct recommendation, and i 2 S R be an incorrect
recommendation. The Bpref is defined as follows:
          </p>
          <p>Bpref = 1 ∑ 1 − |i ranked higher than r|</p>
          <p>|R| r2R |S|
6.2</p>
        </sec>
        <sec id="sec-3-6-2">
          <title>Evaluation on TFIDF-Based Method</title>
          <p>(a) Precision
(b) Recall</p>
          <p>We run our algorithm with TFIDF based document similarity on each of
the 4 data sets, using both self and cross source modes. Figure 2 summarizes
the precision, recall, F1, and the prec. vs rec. graph. Table 3 summarizes the
MMR, Bpref, and other time-wise (training and recommending) performances.
TTT, TRT, ATT, and ART stand for Total Train Time, Total Recommend
Time, Average Train Time (per fold), and Average Recommend Time (per fold)
respectively.
Metric daac Seldfr-ryeacdommenkdnabtion treebase daac Crodsrsy-raedcommeknndbationtreebase
MRR 0.564 0.202 0.494 0.089 0.532 0.032 0.514 0.058
Bpref 0.818 0.440 0.665 0.069 0.630 0.245 0.648 0.044
TTT (sec) 52.937 63.23 60.634 61.5 69.322 69.251 69.176 69.221
TRT (sec) 9.946 11.940 12.295 10.752 43.817 44.963 44.478 43.581
ATT (sec) 5.293 6.323 6.063 6.15 6.932 6.925 6.917 6.922
ART (sec) 0.994 1.194 1.229 1.075 4.381 4.496 4.447 4.358
Table 3: MRR, Bpref, and other time-wise performance statistics of the TFIDF
based method performed on different data sets and source selection modes.</p>
          <p>From the results, we get the following observations. First, the performance
differs significantly across data sets. Overall, the TFIDF based method performs
better on DAAC and KNB data sets. DAAC data set has smaller tag library
(only 611 unique tags), hence the chance of recommending correct tags (which
is reflected by the recall growth rates) is higher than those of other data sets.
The KNB data set, though has the largest tag library, has a high tag utilization
rate, hence the chance of correctly guessing the tags is expectedly higher.</p>
          <p>Second, self-source recommendations always perform better than cross-source
recommendations with respect to our evaluation scheme. This is because, given
a document query, the cross-recommendation system may introduce alien tags
from other data sets, which would most certainly be identified as incorrect tags.
Note that, even though the cross-recommendation systems may perform worse
than the self-recommendation ones with respect to our evaluation setting, in real
world these alien tags may actually have semantic relevance to the query.</p>
        </sec>
        <sec id="sec-3-6-3">
          <title>6.3 Evaluation on TM-Based Method</title>
          <p>We run our score propagation algorithm with the TM based document similarity
on each data set, using both self and cross recommendation modes. Figure 3
summarizes the precision, recall, F1, along with the prec. vs rec. comparison.
Table 3 summarizes the MMR, Bpref, and other time-wise performances.</p>
          <p>The comparison among different data sets is similar to the result from the
TFIDF based method, except that the performance on the KNB data set seems
to be surprisingly outstanding.</p>
        </sec>
        <sec id="sec-3-6-4">
          <title>6.4 Performance Comparison Between the Methods</title>
          <p>We compare the performances of the two document similarity measures (TFIDF
based and TM based) on the 4 data sets with self-recommendation. Figure 4
(a) Precision
(b) Recall
summarizes the precision, recall, F1, and prec. vs rec. graphs. The TM based
approach obviously outperforms the TFIDF based approach in DAAC, DRYAD,
and KNB data sets. The performance of the TM based approach is dominant
when applied to the KNB data set, as seen in the precision vs recall graph in
Figure 4(d), where the curve forms the shape close to the ideal
precision-vsrecall curve. The comparison, however, is not dominant on the TreeBASE data
set. Actually, both algorithms perform very poorly on the TreeBASE data set.
We hypothesize that this is because the TreeBASE documents are very sparse
and have very few tags. From our statistics, each document in the TreeBASE
data set has only 11 words and only 0.7 tags on average. Such sparse texts lead
to weak relationship when finding textually similar documents in the TFIDF
based approach, and the poor quality of the topic model used by the TM based
approach. The small number of tags per document makes it even harder to
predict the right tags.</p>
          <p>We note that, though overall the TM based approach recommends better
quality of tags, the training times take significantly longer than those of the
TFIDF based approach. For example, it takes roughly 33 minutes to train the
(a) Precision
(b) Recall</p>
          <p>TM based method (modeling topics) using 3,600 documents, while it takes only
7 seconds to train (index) the same amount of the documents via the TFIDF
based approach. However, also note that the evaluation is done on a local PC.
The issue of training times could be much diminished if the system is employed
on a powerful computing server.</p>
        </sec>
      </sec>
      <sec id="sec-3-7">
        <title>7 Conclusions and Future Research</title>
        <p>In this paper we propose an algorithm for automatic metadata annotation. We
are inspired by the real world problem faced by ONEMerucy, a search system
for environmental science metadata harvested from multiple data archives, in
which the metadata from different archives has different levels of curation and
hence behooves the system that automatically annotates poorly annotated
metadata records. We treat each metadata record as a tagged document, and then
transform the problem into a tag recommendation problem.</p>
        <p>We propose the score propagation model for tag recommendation, with two
variations of document similarity measures: TFIDF based and Topic Model (TM)
based. The TM based approach yields impressive results, though with a cost of
longer training times.</p>
        <p>
          Our future works include evaluating our approaches against a well known
state-of-the-art such as the method that mines association rules for tag
recommendation[
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. We also plan to adopt a classification technique such as [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] to
rank tags in the tag library. Finally, we aim to implement our automatic
metadata annotation system into ONEMercury search service. This would give rise
to further implementation and system integration issues.
        </p>
      </sec>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Asuncion</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Welling</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Smyth</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teh</surname>
            ,
            <given-names>Y.W.</given-names>
          </string-name>
          :
          <article-title>On smoothing and inference for topic models</article-title>
          .
          <source>In: Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence</source>
          . pp.
          <fpage>27</fpage>
          -
          <lpage>34</lpage>
          . UAI '09, AUAI Press, Arlington, Virginia, United
          <string-name>
            <surname>States</surname>
          </string-name>
          (
          <year>2009</year>
          ), http://dl.acm.org/citation.cfm?id=
          <volume>1795114</volume>
          .
          <fpage>1795118</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Blei</surname>
            ,
            <given-names>D.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ng</surname>
            ,
            <given-names>A.Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jordan</surname>
            ,
            <given-names>M.I.</given-names>
          </string-name>
          :
          <article-title>Latent dirichlet allocation</article-title>
          .
          <source>J. Mach. Learn. Res</source>
          .
          <volume>3</volume>
          ,
          <fpage>993</fpage>
          -
          <lpage>1022</lpage>
          (
          <year>Mar 2003</year>
          ), http://dl.acm.org/citation.cfm?id=
          <volume>944919</volume>
          .
          <fpage>944937</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Buckley</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Voorhees</surname>
            ,
            <given-names>E.M.:</given-names>
          </string-name>
          <article-title>Retrieval evaluation with incomplete information</article-title>
          .
          <source>In: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval</source>
          . pp.
          <fpage>25</fpage>
          -
          <lpage>32</lpage>
          . SIGIR '04,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2004</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/1008992.1009000
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Heymann</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ramage</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Garcia-Molina</surname>
          </string-name>
          , H.:
          <article-title>Social tag prediction</article-title>
          .
          <source>In: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval</source>
          . pp.
          <fpage>531</fpage>
          -
          <lpage>538</lpage>
          . SIGIR '08,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2008</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/1390334.1390425
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Krestel</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fankhauser</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Nejdl</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          :
          <article-title>Latent dirichlet allocation for tag recommendation</article-title>
          .
          <source>In: Proceedings of the third ACM conference on Recommender systems</source>
          . pp.
          <fpage>61</fpage>
          -
          <lpage>68</lpage>
          . RecSys '09,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2009</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/1639714.1639726
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Chen</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>M.:</given-names>
          </string-name>
          <article-title>A simple word trigger method for social tag suggestion</article-title>
          .
          <source>In: Proceedings of the Conference on Empirical Methods in Natural Language Processing</source>
          . pp.
          <fpage>1577</fpage>
          -
          <lpage>1588</lpage>
          . EMNLP '
          <volume>11</volume>
          ,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computational Linguistics, Stroudsburg, PA, USA (
          <year>2011</year>
          ), http://dl.acm.org/citation.cfm?id=
          <volume>2145432</volume>
          .
          <fpage>2145601</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Michener</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vieglais</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vision</surname>
          </string-name>
          , T.,
          <string-name>
            <surname>Kunze</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cruse</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , Jan´ee, G.:
          <article-title>DataONE: Data Observation Network for Earth - Preserving Data and Enabling Innovation in the Biological and Environmental Sciences</article-title>
          .
          <source>DLib Magazine</source>
          <volume>17</volume>
          (
          <issue>1</issue>
          /2),
          <fpage>1</fpage>
          -
          <lpage>12</lpage>
          (
          <year>2011</year>
          ), http://www.dlib.org/dlib/january11/michener/01michener.html
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Mishne</surname>
          </string-name>
          , G.:
          <article-title>Autotag: a collaborative approach to automated tag assignment for weblog posts</article-title>
          .
          <source>In: Proceedings of the 15th international conference on World Wide Web</source>
          . pp.
          <fpage>953</fpage>
          -
          <lpage>954</lpage>
          . WWW '06,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2006</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/1135777.1135961
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Treeratpituk</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Teregowda</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Huang</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Giles</surname>
            ,
            <given-names>C.L.</given-names>
          </string-name>
          :
          <article-title>Seerlab: A system for extracting key phrases from scholarly documents</article-title>
          .
          <source>In: Proceedings of the 5th International Workshop on Semantic Evaluation</source>
          . pp.
          <fpage>182</fpage>
          -
          <lpage>185</lpage>
          . SemEval '10,
          <string-name>
            <surname>Association</surname>
          </string-name>
          for Computational Linguistics, Stroudsburg, PA, USA (
          <year>2010</year>
          ), http://dl.acm.org/citation.cfm?id=
          <volume>1859664</volume>
          .
          <fpage>1859703</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yang</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hua</surname>
            ,
            <given-names>X.S.</given-names>
          </string-name>
          :
          <article-title>Learning to tag</article-title>
          .
          <source>In: Proceedings of the 18th international conference on World wide web</source>
          . pp.
          <fpage>361</fpage>
          -
          <lpage>370</lpage>
          . WWW '09,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          , New York, NY, USA (
          <year>2009</year>
          ), http://doi.acm.
          <source>org/10</source>
          .1145/1526709.1526758
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>