<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Active Stream Learning with an Oracle of Unknown Availability for Sentiment Prediction</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Elson Serrao</string-name>
          <email>elson.serrao@gmail.com</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Myra Spiliopoulou</string-name>
          <email>myra@ovgu.de</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Otto-von-Guericke-University Magdeburg</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
      </contrib-group>
      <fpage>36</fpage>
      <lpage>47</lpage>
      <abstract>
        <p>Active learning holds the promise of learning models from the data with minimal expert input. However, it assumes that the expert is always available or only at the beginning. We waive this assumption and investigate to what extent active learning is effective in practice. We focus on sentiment classification over real streams of opinions. We show that at least for the two real streams we have analyzed, the random strategy is very competitive, and querying the expert in an intelligent way does not bring many advantages, at least when the expert is irregularly available.</p>
      </abstract>
      <kwd-group>
        <kwd>active learning</kwd>
        <kwd>oracle availablity</kwd>
        <kwd>polarity model learning</kwd>
        <kwd>opinion stream mining</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        The objective of active learning is to obtain better or comparable performance
to a fully supervised learner with fewer labels if the learner is given the
opportunity to select the instances for which it requires labels [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Active learning is
thus very suitable in those scenarios where there is an abundance of unlabeled
data and obtaining new labels is rather expensive. Labels are obtained using an
oracle who can, for example, be a domain expert or a human annotator from
a crowd-sourcing platform. However, it is often assumed that there is a single
oracle that is always correct, always available and inexpensive to query. While
there are surveys [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], and [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and studies [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] that provide insights to the
above mentioned challenges in active learning, only a few studies focus on the
availability of the oracle for streaming data [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        In this paper, we consider a stream of opinionated documents and try to
predict the sentiment of the document as being either positive, negative or
neutral. Over time drift may be observed in the stream due to evolving topics, data
and vocabulary, requiring the classifier to adapt to the opinionated stream. For
this we use active learning to obtain new labels from the data stream. Instances
to be labeled by the oracle are sampled using an appropriate query strategy.
However, we assume that the oracle is available irregularly i.e. according to a
pattern unknown to the learner. This implies that the oracle may be queried
at each moment, but will respond by delivering the label only if it is available.
Most of the algorithms for active learning on streams either assume infinite
verification latency, whereupon they invoke semi-supervised learning [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] or they
assume that the Oracle is always available to provide labels [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ], [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ], [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ],
[
        <xref ref-type="bibr" rid="ref12">12</xref>
        ].
      </p>
      <p>
        Shickel and Rashidi in [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] propose a framework that is aware of the oracle’s
availability for data streams. Their framework considers multiple oracles and
focuses on querying first those oracles that have a higher availability. They try to
achieve a cost-benefit tradeoff by using a dynamic labeling budget proportional
to the oracle’s availability with the cost of labeling an instance inversely
proportional to the oracle’s availability. However, such a cost-benefit tradeoff seems
unrealistic in real-world scenarios where the cost is likely to be regulated by the
difficulty in obtaining the label and other factors [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. No experiments were
conducted with oracles of varying expertise, which is often seen in active learning
literature considering multiple oracles.
3
      </p>
    </sec>
    <sec id="sec-2">
      <title>Active Learning on an Opinionated Stream</title>
      <p>We consider a data stream D of opinionated documents observed at distinct
timepoints t0, t1, . . . , ti, . . . where at each timepoint ti we receive a batch of
documents. We define the timepoint on a temporal level where, for example, the
timepoint could be a week. Consequently, all the documents arriving during the
time period from ti−1 to ti would comprise the batch of documents for ti.</p>
      <p>Our framework encompasses an algorithmic core described in Section 3.1 and
the query strategies in Section 3.2. We link this framework with a simulator of
the oracle’s availability, described in Section 4.1, and with an algorithm for the
preparation of the opinionated data stream, described in Section 4.2.
3.1</p>
      <sec id="sec-2-1">
        <title>Modeling Oracle Unavailability during Querying</title>
        <p>We consider the beginning of the stream at t0 to be characterized by the
availability of an initial set of labeled documents L0. The initially labeled documents
L0 are used to initialize the classifier Δ. At subsequent timepoints i.e. t1
onwards, we receive unlabeled data Ut. If the budget B is not exceeded, for every
unlabeled document x, we use the trained classifier Δ to predict the probability
P (yˆc|x) ∀c ∈ C, where c represents the sentiment of the document, namely,
positive, negative or neutral. Our method calculates the confidence of the learner’s
prediction I using metric φ, and launches a request for the true label y when
necessary. The oracle provides the true label y only if it is available and the
document x is added to the labeled data for the next iteration. In the event that
the oracle is unavailable, x is not used to adapt the learner. An overview of our
framework is shown in Algorithm 1.</p>
        <p>We assume the cost of labeling is the same for every document at any time
t. If nt is the number of queries sent to the oracle at t, then the utilized budget
at t is given by nt</p>
        <p>|Ut|</p>
        <p>
          To adapt to the evolving data stream, we utilize a sliding window W [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]. At
every timepoint t, we add the labeled documents Lt to the window. Once the
window is full, the documents from the oldest timepoint within the window are
forgotten. Depending on the chosen classifier Δ, for every iteration, there may
be a need to retrain Δ with the documents in the window.
        </p>
        <p>Although the framework is capable of using any confidence metric such as
maximum posterior probability or the maximum margin between the first and
second most probable class, we propose calculating the confidence of a prediction
using entropy as,
&lt; B
φH = 1 −
"
− X P (yˆc|x) log|C| P (yˆc|x)
c∈C
#
(1)
(2)
3.2</p>
      </sec>
      <sec id="sec-2-2">
        <title>Query Strategies</title>
        <p>
          We use uncertainty based query strategies for the active learner. We also use the
variable uncertainty and variable randomized uncertainty strategies introduced
by Zˇliobaite˙ et al. in [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ] and [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ]. The difference in our implementation is
that we have generalized the strategies to allow the use of any confidence metric
while determining if an instance needs to be sampled.
        </p>
        <p>Random Strategy: This strategy randomly selects an instance to be labeled
by the oracle with a probability given by the budget B. In this sense, it is very
naive and is used as the baseline strategy.
7
8
9
10
11
12
13
14
15
16</p>
        <p>Algorithm 1: Active Learning with an Irregularly Available Oracle
Input: Δ - classifier with relevant parameters; size - window size; B - budget;
o - oracle; φ - confidence metric; queryStrategy - query strategy with
parameters;</p>
        <p>Initialize: t ← 0; W ← SlidingWindow(size)
1 Receive labeled data Lt
2 W ← addToWindow(Lt)
3 for t = 1, 2, . . . do
4 Train classifier Δ with instances in W
5 Receive unlabeled data Ut
6 nt ← 0</p>
        <p>Lt ← ∅
for each instance x ∈ Ut do</p>
        <p>Pc ← PΔ(yˆc|x) ∀c ∈ C // predict the probability
yˆ ← arg maxc(Pc) // predicted label
if (nt/ |Ut|) &lt; B then</p>
        <p>I ← φ(Pc) // compute the confidence
if queryStrategy(B, I, . . . ) = T rue then
nt ← nt + 1
if isAvailable(o) then
y ← get true label of x from the oracle o</p>
        <p>Lt ← Lt ∪ (x, y)</p>
        <p>W ← addToWindow(Lt)</p>
        <p>Fixed Uncertainty Strategy: This strategy samples those instances which
the learner are least confident of by comparing the confidence of prediction of
an instance to a fixed threshold θ.</p>
        <p>Variable Uncertainty Strategy: For a learner to adapt to an evolving
data stream, obtaining labels for the least confident instances within each
timepoint would be more beneficial. The variable uncertainty strategy described in
Algorithm 2 provides a variable threshold that adjusts itself to the incoming
data stream. When the data stream speeds up, the confidence of the learner
decreases. In such cases, it decreases its threshold so that least confident instances
are queried first. On the other hand, at times when the learner is confident of its
prediction, the threshold is increased to capture the most uncertain instances.</p>
        <p>Algorithm 2: VariableUncertainty(I, s)</p>
        <p>Input: I - confidence score; s ∈ (0, 1] - threshold adjustment step
Output: T rue if true label is required else F alse</p>
        <p>Initialize: labeling threshold θ ← 1
1 if I &lt; θ then
2 θ ← θ (1 − s)// uncertain instance: decrease the threshold</p>
      </sec>
      <sec id="sec-2-3">
        <title>3 return T rue</title>
      </sec>
      <sec id="sec-2-4">
        <title>4 else</title>
        <p>5 θ ← θ (1 + s)// confident instance: increase the threshold</p>
      </sec>
      <sec id="sec-2-5">
        <title>6 return F alse</title>
        <p>Variable Randomized Uncertainty Strategy: Uncertainty based active
learning query strategies generally focus on sampling those instances that are
close to the decision boundary of the learner. In evolving data streams changes
may occur anywhere in the instance space. So as to not miss the change that
may occur elsewhere, the variable randomized uncertainty strategy occasionally
samples those instances that the learner is confident of. As shown in Algorithm 3,
the threshold is multiplied by a normally distributed random variable to sample
the confident instances.</p>
        <p>Algorithm 3: VariableRandomizedUncertainty(I, s, δ)</p>
        <p>Input: I - confidence score; s ∈ (0, 1] - threshold adjustment step
δ - variance of threshold randomization
Output: T rue if true label is required else F alse</p>
        <p>Initialize: labeling threshold θ ← 1
1 θrandomized ← θ × η, where η ∈ N (1, δ) is a random multiplier
2 if I &lt; θrandomized then
3 θ ← θ (1 − s)// uncertain instance: decrease the threshold</p>
      </sec>
      <sec id="sec-2-6">
        <title>4 return T rue</title>
      </sec>
      <sec id="sec-2-7">
        <title>5 else</title>
        <p>6 θ ← θ (1 + s) // confident instance: increase the threshold</p>
      </sec>
      <sec id="sec-2-8">
        <title>7 return F alse</title>
        <p>4</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Experiment Setup</title>
      <p>The goal of our experiments is to study how the performance of the learner is
affected when the availability of the oracle changes. The following sub-sections
describe the oracle availability simulator, the datasets used, and the evaluation
criteria and strategy.
4.1</p>
      <sec id="sec-3-1">
        <title>Simulator of Oracle Availability</title>
        <p>At timepoint t, we consider the oracle to be available with a probability of αt. In
our experiments, at any timepoint t, we set the oracle’s availability αt = α and
is considered to be independent of the availability at t − 1. Algorithm 4 provides
a more formal definition of our simulator.</p>
        <p>Algorithm 4: Oracle Availability Simulator</p>
        <p>Input: α ∈ (0, 1] - availability of the oracle;</p>
        <p>Output: T rue if the oracle is available else F alse
1 return uniform(0, 1 ) ≤ α
4.2</p>
      </sec>
      <sec id="sec-3-2">
        <title>Datasets and Feature Engineering</title>
        <p>Yelp: The Yelp Dataset 1 contains about 5.2 million reviews of various businesses
over a period of 13 years from 11 metropolitan areas across 4 countries. We
1 https://www.yelp.com/dataset/challenge
filtered the dataset for English reviews and additionally removed those reviews
whose length was less than 15 words. For our data stream, we considered the
reviews from 2009 onwards.</p>
        <p>
          Amazon: The Amazon dataset introduced in [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ], contains reviews of
several product categories. Using the 5-core datasets of the product categories, we
build a dataset with an intention of introducing concept drift. For this
purpose, we randomly selected three product categories every three months from
among the following nine categories: Home and Kitchen, Kindle Store, Health
and Personal Care, Cell Phones and Accessories, Apps for Android,
Electronics, Clothing, Shoes and Jewelry, CDs and Vinyl, and Beauty. We removed the
duplicate reviews arising from the product having multiple categories.
        </p>
        <p>Both the Yelp and Amazon employ a 5-star rating scheme, where 5-stars is
the highest rating while 1-star is the lowest. We considered the 1 and 2-star
rating to be negative, 3-star rating neutral and 4 and 5-star rating positive.</p>
        <p>Feature Engineering: We preprocessed the reviews by replacing URLs,
negations and currencies with the tokens URL, NEGATION and CURRENCY
respectively and some emoticons with tokens like SMILE and HEART. We
further suppressed repeated letters, expanded contractions (e.g. ”I’m” into ”I am”),
removed stopwords and replaced words by their lemmas.</p>
        <p>After preprocessing, we extracted features from the reviews using word
ngrams with n = 3 along with its corresponding frequency of occurrence. We
selected the most relevant 15000 features using the chi-square test. Our feature
vectors were then constructed using the TF-IDF weighting scheme.2
4.3</p>
      </sec>
      <sec id="sec-3-3">
        <title>Evaluation Strategy and Evaluation Criteria</title>
        <p>
          We define the duration of a timepoint to be a week and maintain a sliding
window of five weeks. We perform prequential evaluation [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ]: we first test on
all documents for the incoming week and then adapt by using only the sampled
documents whose labels have been provided by the oracle.
        </p>
        <p>
          As we use entropy to calculate our confidence measure, we evaluate on log
loss decrease [
          <xref ref-type="bibr" rid="ref2">2</xref>
          ]. The log loss lt at timepoint t is given by,
1
        </p>
        <p>X X bij log pij
lt = − |Ut| i∈Ut j∈C
where bij is a binary indicator of whether or not label j is the correct classification
for instance i, and pij is the model probability of assigning label j to instance i.
5</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>Experimental Evaluation</title>
      <p>
        As the Stochastic Gradient Descent classifier was found to be effective for
sentiment analysis in [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], we used the same with hinge loss, l2 penalty and alpha
value of 0.0001 to optimize the objective function of a linear support vector
machine as our base learner. The base learner was calibrated using Platt Scaling to
obtain probabilistic outputs.
2 Code and supplementary material available at https://github.com/elrasp/osm
(3)
      </p>
      <p>
        For each timepoint t, we set a fixed budget B = 0.1. For the query strategies,
we set the fixed threshold θ = 0.9, and the suggested values of 0.01 and 1 for
the threshold adjustment step s and the variance of the normally distributed
random number generator δ respectively [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ].
      </p>
      <p>In Section 5.1, we describe the underlying class distribution of the data
stream for these datasets and in Section 5.2, we analyze the influence of the
oracle’s availability on the performance of the learner.
5.1</p>
      <sec id="sec-4-1">
        <title>Distribution of Data</title>
        <p>The weekly underlying class distribution of data in the stream of opinionated
documents for the Yelp and Amazon datasets are shown in Figure 3(a) and
Figure 3(b) respectively.</p>
        <p>Yelp: For the Yelp dataset, we observe a gradual increase in the number of
reviews obtained over time. The proportion of neutral reviews received remain
almost constant for the entire data stream. In comparison, the positive and
negative reviews are always increasing. Also, we find that the positive reviews
dominate the class distribution accounting for more than 50% of the reviews in
any week.</p>
        <p>Amazon: Unlike the Yelp dataset, the amazon dataset exhibits sudden
bursts in the volume of reviews received. This occurs as some chosen
product categories are more popular than others and receive more reviews. We also
observe that mostly there is a burst in the positive reviews received as compared
to the negative and neutral reviews. Similar to the Yelp dataset, the positive
reviews dominate the class distribution.
5.2</p>
      </sec>
      <sec id="sec-4-2">
        <title>Impact of the Oracle’s Availability on Learning</title>
        <p>2500
d
re2000
sen
w
irsA1500
e
e
uQ1000
500</p>
        <p>In Figure 3 we show the results of evaluation for the Yelp (on the left) and
Amazon (on the right) datasets. We aggregated the log loss results for all the
timepoints every two months and plot the mean (lines) and standard deviation
(colored area around the line). For the Yelp dataset, in general, we observe that
the error in performance gradually reduces as the stream progresses irrespective
of the oracles availability and the query strategy used. On the other hand, the
learner finds it much more difficult to adapt to the evolving data stream of the
Amazon dataset.</p>
        <p>In the early stages of the stream, where the data volume in the stream is low,
we observe the learner performing better for lower oracle availabilities. As more
and more data is accumulated, the need to have the oracle for the Yelp dataset
drops but remains for the Amazon dataset as it exhibits more drift.</p>
        <p>
          We further conducted experiments with oracle availabilities varying between
0.01 and 0.1 in steps of 0.01 and compared the different query strategies at
varying availabilities with the non-parametric Friedman’s test followed by Nemenyi
post-hoc [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ]. Friedman’s test proceeds by ranking the models under
consideration. The best performing model is given the rank 1, the second best 2 and so
on. If two or more models have identical performance they are given an average
rank. The null hypothesis of Friedman’s test states that all the models perform
the same and thus, will have the same average rank. If the test rejects the null
hypothesis, we proceed with the Nemenyi post-hoc test that makes pair-wise
comparisons of the different models. It identifies statistically significant models
if the difference between their average rank is more than the critical distance.
        </p>
        <p>Figure 4(a) and Figure 4(b) shows the critical distance diagram of the Yelp
and Amazon dataset respectively. In these diagrams the better performing
models are ranked higher and are placed to the right. The model name corresponds
to a combination of the query strategy and the value of the oracle availability,
whereupon ”rand” in the name refers to the random strategy. The models whose
difference in average rank is less than the calculated critical distance (CD) are
connected to each other by a horizontal line, indicating that these models are
statistically indifferent to each other.</p>
        <p>As we can see in these figures, there are strategies that which perform
significantly better when the oracle availability changes. At very low oracle
availabilities we find that there is no strategy that performs significantly better than the
others. For the Amazon dataset, even at higher oracle availabilities there is no
difference in the performance of the strategies. This could mainly be attributed
to the nature of the drift exhibited in both the datasets.
6</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Conclusions</title>
      <p>The need for an oracle depends on the overall variability of the dataset. If there
is convergence over time as in the case of the Yelp dataset, the need for an
oracle is limited, because the learner can predict the labels by itself. Hence, low
availability of the oracle is only relevant if there is drift.
availability
0.1
0.4</p>
      <p>Jul 2012 Jul 2013 Jul 2014
(b) Amazon Dataset: Weekly Distribution of Data</p>
      <p>Jul</p>
      <p>Jul</p>
      <p>2012 Jul 2013 Jul
(d) Amazon Dataset: Random Strategy
2014</p>
      <p>Jul</p>
      <p>Jul</p>
      <p>Jul
25000
20000
15000
10000
5000
positive
neutral
negative
1.1
1.0
0.9</p>
      <p>Jul
availability
0.1
0.4
0.7
1.0</p>
      <p>If an oracle is available, the random strategy is a good choice, as it shows the
same tendency as the other strategies, is easy to implement and is fast. However,
more experiments are needed to check whether stronger active learning strategies
can beat the random sampler in this setting, by capitalizing more effectively on
the few available labels. Experiments are also required for different domains.</p>
      <p>
        To improve model quality, we intend to consider more elaborate querying
strategies [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ], and to investigate whether instance-based active learning might
deliver better results than the block-based active learning paradigm we currently
use.
      </p>
      <p>
        To compensate for oracle inavailability, we also intend to combine active
learning with semi-supervised learning. Semi-supervised methods are used to
propagate labels to the arriving instances, cf. [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ], [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ]. In that case, we want to
investigate how disagreement between oracle and self-learner can be alleviated
in a seamless way.
      </p>
      <p>Acknowledgements. This work was inspired and partially conducted (last
author) within the project OSCAR Opinion Stream Classification with Ensembles
and Active Learners, funded by the German Research Foundation.
variable_rand_0.3
fixed_0.3
variable_0.3
random_0.3
variable_1.0
variable_0.7
variable_rand_0.7
variable_rand_0.1
fixed_0.1
variable_rand_1.0
fixed_0.7
random_0.1</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Settles</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Active learning literature survey</article-title>
          .
          <source>Computer Sciences Technical Report 1648</source>
          , University of Wisconsin-Madison (
          <year>2009</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Fu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>A survey on instance selection for active learning</article-title>
          .
          <source>Knowledge and Information Systems</source>
          <volume>35</volume>
          (
          <issue>2</issue>
          ) (May
          <year>2013</year>
          )
          <fpage>249</fpage>
          -
          <lpage>283</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Lughofer</surname>
          </string-name>
          , E.:
          <article-title>On-line active learning: A new paradigm to improve practical useability of data stream modeling methods</article-title>
          .
          <source>Information Sciences 415-416</source>
          (nov
          <year>2017</year>
          )
          <fpage>356</fpage>
          -
          <lpage>376</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>Wu</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , Liu,
          <string-name>
            <given-names>Y.</given-names>
            ,
            <surname>Guo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            ,
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <surname>X.</surname>
          </string-name>
          :
          <article-title>A probabilistic model of active learning with multiple noisy oracles</article-title>
          .
          <source>Neurocomputing</source>
          <volume>118</volume>
          (
          <year>2013</year>
          )
          <fpage>253</fpage>
          -
          <lpage>262</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Donmez</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          , Carbonell, J.G.:
          <article-title>Proactive learning: Cost-sensitive active learning with multiple imperfect oracles</article-title>
          .
          <source>In: Proceedings of the 17th ACM Conference on Information and Knowledge Management. CIKM '08</source>
          , New York, NY, USA, ACM (
          <year>2008</year>
          )
          <fpage>619</fpage>
          -
          <lpage>628</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Shickel</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rashidi</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          :
          <article-title>ART: an availability-aware active learning framework for data streams</article-title>
          . In
          <string-name>
            <surname>Markov</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Russell</surname>
          </string-name>
          , I., eds.
          <source>: Proceedings of the Twenty-Ninth International Florida Artificial Intelligence Research Society Conference, FLAIRS</source>
          <year>2016</year>
          ,
          <string-name>
            <given-names>Key</given-names>
            <surname>Largo</surname>
          </string-name>
          , Florida, May
          <volume>16</volume>
          -18,
          <year>2016</year>
          ., AAAI Press (
          <year>2016</year>
          )
          <fpage>92</fpage>
          -
          <lpage>97</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Zimmermann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ntoutsi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spiliopoulou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Adaptive semi supervised opinion classifier with forgetting mechanism</article-title>
          .
          <source>In: Proceedings of the 29th Annual ACM Symposium on Applied Computing. SAC '14</source>
          , New York, NY, USA, ACM (
          <year>2014</year>
          )
          <fpage>805</fpage>
          -
          <lpage>812</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Zimmermann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ntoutsi</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spiliopoulou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Incremental active opinion learning over a stream of opinionated documents</article-title>
          .
          <source>WISDOM'15 (Workshop on Issues of Sentiment Discovery and Opinion Mining) 2015 at Knowledge Discovery and Data Mining, KDD'15 Workshops</source>
          <year>2015</year>
          , Sydney, Australia,
          <year>August 10</year>
          ,
          <year>2015</year>
          (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9. Zˇliobaite˙,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Bifet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Pfahringer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Holmes</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          :
          <article-title>Active learning with evolving streaming data</article-title>
          . In Gunopulos, D.,
          <string-name>
            <surname>Hofmann</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Malerba</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Vazirgiannis</surname>
          </string-name>
          , M., eds.:
          <source>Machine Learning and Knowledge Discovery in Databases</source>
          , Berlin, Heidelberg, Springer Berlin Heidelberg (
          <year>2011</year>
          )
          <fpage>597</fpage>
          -
          <lpage>612</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10. Zˇliobaite˙,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Bifet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Holmes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            ,
            <surname>Pfahringer</surname>
          </string-name>
          ,
          <string-name>
            <surname>B.</surname>
          </string-name>
          :
          <article-title>Moa concept drift active learning strategies for streaming data</article-title>
          . In Diethe, T.,
          <string-name>
            <surname>Balcazar</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shawe-Taylor</surname>
          </string-name>
          , J.,
          <string-name>
            <surname>Tirnauca</surname>
          </string-name>
          , C., eds.
          <source>: Proceedings of the Second Workshop on Applications of Pattern Analysis. Volume 17 of Proceedings of Machine Learning Research., CIEM</source>
          ,
          <string-name>
            <surname>Castro</surname>
            <given-names>Urdiales</given-names>
          </string-name>
          , Spain,
          <source>PMLR (19-21 Oct</source>
          <year>2011</year>
          )
          <fpage>48</fpage>
          -
          <lpage>55</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11. Zˇliobaite˙,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Bifet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Pfahringer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            ,
            <surname>Holmes</surname>
          </string-name>
          ,
          <string-name>
            <surname>G.</surname>
          </string-name>
          :
          <article-title>Active learning with drifting streaming data</article-title>
          .
          <source>IEEE Transactions on Neural Networks and Learning Systems</source>
          <volume>25</volume>
          (
          <issue>1</issue>
          ) (
          <year>Jan 2014</year>
          )
          <fpage>27</fpage>
          -
          <lpage>39</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12. Smailovi´c, J., Grˇcar, M.,
          <string-name>
            <surname>Lavraˇc</surname>
          </string-name>
          , N.,
          <string-name>
            <surname>Zˇnidarˇsiˇc</surname>
          </string-name>
          , M.:
          <article-title>Stream-based active learning for sentiment analysis in the financial domain</article-title>
          .
          <source>Inf. Sci. 285</source>
          (C) (
          <year>November 2014</year>
          )
          <fpage>181</fpage>
          -
          <lpage>203</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Gama</surname>
            ,
            <given-names>J.a.</given-names>
          </string-name>
          , Zˇliobaite˙,
          <string-name>
            <given-names>I.</given-names>
            ,
            <surname>Bifet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ,
            <surname>Pechenizkiy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            ,
            <surname>Bouchachia</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          :
          <article-title>A survey on concept drift adaptation</article-title>
          .
          <source>ACM Comput. Surv</source>
          .
          <volume>46</volume>
          (
          <issue>4</issue>
          ) (
          <year>March 2014</year>
          )
          <volume>44</volume>
          :
          <fpage>1</fpage>
          -
          <lpage>44</lpage>
          :
          <fpage>37</fpage>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>McAuley</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Targett</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Shi</surname>
            ,
            <given-names>Q.</given-names>
          </string-name>
          , van den Hengel, A.:
          <article-title>Image-based recommendations on styles and substitutes</article-title>
          .
          <source>In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR</source>
          <year>2015</year>
          , ACM Press (
          <year>2015</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Bifet</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Frank</surname>
          </string-name>
          , E.:
          <article-title>Sentiment knowledge discovery in twitter streaming data</article-title>
          .
          <source>In: Proceedings of the 13th International Conference on Discovery Science. DS'10</source>
          , Berlin, Heidelberg, Springer-Verlag (
          <year>2010</year>
          )
          <fpage>1</fpage>
          -
          <lpage>15</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          16.
          <string-name>
            <surname>Demˇsar</surname>
          </string-name>
          , J.:
          <article-title>Statistical comparisons of classifiers over multiple data sets</article-title>
          .
          <source>J. Mach. Learn. Res. 7 (December</source>
          <year>2006</year>
          )
          <fpage>1</fpage>
          -
          <lpage>30</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          17.
          <string-name>
            <surname>Kottke</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Krempl</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Spiliopoulou</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          :
          <article-title>Probabilistic active learning in datastreams</article-title>
          . In Fromont, E.,
          <string-name>
            <surname>De Bie</surname>
          </string-name>
          , T., van Leeuwen, M., eds.
          <source>: Advances in Intelligent Data Analysis XIV, Cham</source>
          , Springer International Publishing (
          <year>2015</year>
          )
          <fpage>145</fpage>
          -
          <lpage>157</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          18.
          <string-name>
            <surname>Dyer</surname>
            ,
            <given-names>K.B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Capo</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Polikar</surname>
          </string-name>
          , R.:
          <article-title>COMPOSE: A semisupervised learning framework for initially labeled nonstationary streaming data</article-title>
          .
          <source>IEEE Transactions on Neural Networks and Learning Systems</source>
          <volume>25</volume>
          (
          <issue>1</issue>
          ) (jan
          <year>2014</year>
          )
          <fpage>12</fpage>
          -
          <lpage>26</lpage>
          Journal Article Research Support, U.S. Gov't,
          <string-name>
            <surname>Non-P.H.S.</surname>
          </string-name>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          19.
          <string-name>
            <surname>Souza</surname>
            ,
            <given-names>V.M.A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Silva</surname>
            ,
            <given-names>D.F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gama</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Batista</surname>
            ,
            <given-names>G.E.A.P.A.</given-names>
          </string-name>
          :
          <article-title>Data stream classification guided by clustering on nonstationary environments and extreme verification latency</article-title>
          . In Venkatasubramanian, S.,
          <string-name>
            <surname>Ye</surname>
          </string-name>
          , J., eds.
          <source>: Proceedings of the 2015 SIAM International Conference on Data Mining. [Society for Industrial and Applied Mathematics] (jun</source>
          <year>2015</year>
          )
          <fpage>873</fpage>
          -
          <lpage>881</lpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>