<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Identifying Complaints from Product Reviews: A Case Study on Hindi</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Raghvendra Pratap Singh</string-name>
          <email>raghvendra.singh6@mail.dcu.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rejwanul Haque</string-name>
          <email>rejwanul.haque@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mohammed Hasanuzzaman</string-name>
          <email>mohammed.hasanuzzaman@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Andy Way</string-name>
          <email>andy.way@adaptcentre.ie</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>ADAPT Centre</institution>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Cork Institute of Technology</institution>
          ,
          <addr-line>Cork</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>School of Computing, Dublin City University</institution>
          ,
          <addr-line>Dublin</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>When an expectation does not meet reality in a real-world situation, the diference is usually expressed and communicated via an act which is complaint. Customers often post reviews on the products or services they purchase on the retailer websites and diferent social media platforms, and the reviews may reflect complaints about the products or services. Automatic recognition of customers' complaints on products or services that they purchase can be crucial for the organisations, multinationals and online retailers since they can exploit this information to fulfil the customers' expectations including managing and resolving the complaints. In this work, we present the supervised and semi-supervised learning strategies to identify users' complaints from the language they use to post their reviews. In other words, we automatically identify complaints from the opinionated texts (reviews) about products posted in Hindi. For this, first we automatically crawled the Hindi reviews on different products from the the websites of the retail giant Amazon and the popular social media platform YouTube, and prepared a gold-standard data set via a systematic manual annotation process. We use state-ofthe-art classification algorithms for the complaints identification task and our classification models achieve reasonable classification accuracy (F1 = 68.38%) on a gold-standard evaluation test set.</p>
      </abstract>
      <kwd-group>
        <kwd>Random walk</kwd>
        <kwd>LSTM</kwd>
        <kwd>fastText</kwd>
        <kwd>Dice coefficient</kwd>
        <kwd>SMOTE</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1 Introduction</title>
      <p>Text classification is an active field of natural language processing (NLP) and
data mining. Almost all online retailers allow users to freely express their
opinions and thoughts on products via their websites and the relevant social media
platforms. The customers who intend to purchase a product may take
purchasing decisions based on the reviews of the product. Accordingly, the commercial
and retail companies considers product reviews as an important source of
information, and could exploit this information to build their marketing tool and
strategy, and to resolve any issues in relation to the product. This could also
benefit the users with the suggestions on the quality of the products or services
that they want to purchase. As for the number of reviews of a product posted
by the users, they could range from several hundreds to tens of thousands. The
e-commerce companies and online retailers want to identify complaints given the
reviews of a product for their own benefit. Likewise, the customers who want to
buy a product or service may need such information while avoiding having to go
through thousands of reviews about the product.</p>
      <p>
        In this context, Gupta et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] identified the relationship between users’
purchase intent from their social media forums such as Quora1 and Yahoo!
Answers.2 They primarily carried out text analysis to detect purchase intent from
user-generated content (UGC). Wang et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] investigated the problem of
identifying purchase intent. In particular, the authors proposed a graph-based
learning approach to identify intent tweets and classify them into six categories.
For this, they retrieved tweets with a bootstrap method, with using a list of seed
intent-indicators (e.g. ‘want to’), and manually created training examples from
the collected tweets. Haque et al. [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] extends the work of Wang et al. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] while
increasing the coverage of the purchase intent indicators with the distributed
vector representation of words using the continuous skip-gram model [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>
        Recently, Preotiuc-Pietro et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] automatically identify complaints from
the tweets posted by the social media users and the potential customers. To
the best of our knowledge, the most relevant works to ours come from
PreotiucPietro et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. In fact, to a certain extent, our proposed methods can be
viewed as the extension of Preotiuc-Pietro et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] with mainly the following
additions as far as this task is concerned: (i) we considered product reviews
instead of tweets as in Preotiuc-Pietro et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] and (ii) we explore a
resourcepoor and less-explored language, Hindi, for our investigation, (iii) we applied
a state-of-the-art sampling strategy [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] in order to encounter class imbalance
problem in the training data, and (iv) we explore applying a semi-supervised
classification algorithm in the complaint identification task. Moreover, one of
the key contributions to this work is creation of the gold-standard dataset in
Hindi.
      </p>
      <p>The remainder of the paper is organised as follows. In Section 2, we detail
how we semi-automatically created training data for our experiments. In
Section 3, we present our experimental methodology and setups. Section 4 presents
our evaluation results, with some discussions. Section 5 concludes and provides
avenues for further work.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Dataset Creation</title>
      <sec id="sec-2-1">
        <title>1 www.quora.com 2 www.answers.yahoo.com</title>
        <p>This section details the creation of training data that has been used in this task.</p>
        <sec id="sec-2-1-1">
          <title>Collecting Hindi Reviews</title>
          <p>To the best of our knowledge, there is no existing (freely available) annotated
review data (complaint and non-complaint) for Hindi. For this task we needed
an annotated review dataset for Hindi. In order to create an annotated review
dataset for Hindi, we first collected Hindi reviews posted online. The reviews were
taken from two diferent sources: (i) the websites of the retail giant Amazon, and
(ii) YouTube. The users usually post their Hindi reviews on these two platforms.</p>
          <p>In order to collect the reviews from Amazon, we used amazon-reviews-scraper
Python library3 which takes a product name as input and provides the reviews
about the product across the diferent languages. Similarly, in order to collect the
reviews from YouTube, we used youtube-comment-downloader Python library.4
This script provided us reviews on the products across the diferent languages.</p>
          <p>In order to remove noise (e.g. HTML tags, special characters) from reviews,
we applied a number cleaning scripts including a language identifier. 5 We also
removed emojis from the review texts. Each of the collected clean reviews is
manually tagged with a particular category, namely complaint or non-complaint.
The annotation scheme and results are presented in the next section.
सोनी का साउंड बहुत बेहतरीन होता इसलए मुझे ये बहुत पसंद है....
्परयागराज मे जयो बहुत ही घटया चल रहा है
फ़ोन का कै मरा बहुत अ है डले भी और बैटरी तो तबाही हैं
पृ सयंख्ा के बाद के पृ उटल्े लगाए गए हैं
सब इधर उधर का चोरी कया हुआ लखा है...मकसद फमल्ों में स्करपट लखने का काम करना...
माउस काम नहीं कर रहा सवस बेकार है
य्का खूब लखा है मजा आ गया सयत् भैया ऐसे ही लखते रहे
काफ रोमांचक है और कु छ हट के भी
कहानी का शानदार आग़ाज़ बेहतरीन अंदाज़ आप आगे पढ़ने के लए मजबूर होते हैं
चाजग करते समय बहुत गरम हो जाता है, फटने का दर है
A1 A2 A3
3 https://github.com/philipperemy/amazon-reviews-scraper. Accessed on
August 2020
4 https://github.com/egbertbouman/youtube-comment-downloader. Accessed on</p>
          <p>August 2020.
5 https://pypi.org/project/pycld2/
answer two questions for a given review. While the first question is related to the
decision as to whether a review is complaints or non-complaints, the motivation
behind the second question is to collect a more fine-grained (book, phone, tv
etc.) gold standard dataset.6</p>
          <p>To have a concrete idea about the agreement between annotators, we
calculated the majority class for each review in our dataset. A review belongs to a
majority class k if the most frequent annotation for the review was selected by at
least k annotators. As a consequence, a large percentage of review belonging to
high majority classes are symptomatic of good inter-annotator agreement.
Similar to earlier studies, we consider all annotations with a majority class greater
than 2 as reliable. In this case, for the complaints or non-complaints annotation
scheme, over 88% of the review were annotated identically by the majority of
annotators, while for fine-grained annotation scheme, 7 over 85% of the
annotations fell into this case. As such, we can be confident that the annotation process
was successful and the dataset is reliable. A sample of our annotated dataset is
presented in Table 1, and statistics about the dataset are presented in the next
section.
2.3</p>
        </sec>
        <sec id="sec-2-1-2">
          <title>Data Statistics</title>
          <p>We report the statistics of the our gold standard data set in Table 2. The table
shows the numbers of the complaint and non-complaint reviews which are listed
according to diferent product types.</p>
          <p>
            We divided the annotated set of reviews (i.e. 3,711 reviews) into train,
development and test sets. The test and development set reviews were randomly
sampled from the all reviews. The statistics about the train, development and
test set reviews are shown in Table 3. The training, test and development data
sets have been released publicly and can be downloaded from https://github.
com/MrRaghav/Complaints-mining-from-Hindi-product-reviews.
6 The details of the annotation guidelines are out of the scope of this paper.
7 Annotation of reviews tagged with ‘complaints’ into product types such as phone,
book, tv etc.
Nowadays, recurrent neural network (RNN), in particular with long-short term
memory (LSTM) [
            <xref ref-type="bibr" rid="ref7">7</xref>
            ] hidden units, has been proved to be an efective model for
many classification tasks in NLP, e.g. sentiment analysis [
            <xref ref-type="bibr" rid="ref14">14</xref>
            ], text classification
[
            <xref ref-type="bibr" rid="ref15 ref8">8, 15</xref>
            ]. RNN is an extension of the feed-forward neural network (NN), which has
the gradient vanishing or exploding problems. LSTM deals with the exploding
and vanishing gradient problems of RNN. An RNN composed of LSTM hidden
units is often called an LSTM network. A common LSTM unit is composed of
a cell, an input gate, an output gate and a forget gate. More formally, each cell
in LSTM can be computed as follows:
          </p>
          <p>X =
[
ht−1
xt</p>
          <p>]
ft = σ(Wf · X + bf )
it = σ(Wi · X + bi)
ot = σ(Wo · X + bo)
ct = ft ⊙ ct−1 + it ⊙ tanh (Wc · X + bc)
ht = ot ⊙ tanh (ct)
(1)
(2)
(3)
(4)
(5)
(6)
where Wi, Wf , Wo ∈ Rd×2d are the weighted matrices and bi, bf , bo ∈ Rd are
biases of LSTM, which need to be learned during training, parameterising the
transformations of the input, forget and output gates, respectively. σ is the
sigmoid function, and ⊙ stands for element-wise multiplication. xt includes the
inputs of LSTM cell unit. The vector of hidden layer is ht. The final hidden
vector hN represents the whole input review, which is passed to softmax layer
after linearising it into a vector whose length is equal to the number of class
labels. In our work, the set of class labels includes complaint and non-complaint
categories.</p>
        </sec>
        <sec id="sec-2-1-3">
          <title>3.2 fastText</title>
          <p>
            We used fastText [
            <xref ref-type="bibr" rid="ref8">8</xref>
            ], a fast and efficient classifier, in our task. fastText is often
on par with the state-of-the-art deep learning classifiers in terms of accuracy.
It is also found to be faster for training and evaluation. fastText has many
advance features, e.g. hierarchical softmax, hidden states shared among features
and classes, and bag of n-grams features.
3.3
          </p>
        </sec>
        <sec id="sec-2-1-4">
          <title>Graph-based Classifier</title>
          <p>We also used a semi-supervised classification algorithm, i.e. random walk on
graph. A graph, G is a pair of vertices V and edges E. We use a weighted graph
for the classification. A weighted graph is a graph where each edge, Ei, is linked
to a numeric value Wij where i, j ∈ V . For our approach, it is necessary that
W ij = W ji
(7)</p>
          <p>
            Random walk is based on the concept of randomly determined processes. It
considers a transition probability of the random walker that it will reach from
a vertex Vi to the other vertex Vj with each step being either +1 or -1 with
equal probability. As for implementing random walk for this task, we follow the
approach and python implementation of the algorithm described in [
            <xref ref-type="bibr" rid="ref12">12</xref>
            ].
3.4
          </p>
        </sec>
        <sec id="sec-2-1-5">
          <title>Classical Supervised Classification Models</title>
          <p>Furthermore, we compare the deep learning model (the LSTM network), fastText
and semi-supervised classification model (random walk) presented above with
the classical supervised classification models. We employ the following classical
supervised classification techniques in our task:
– Logistic Regression (LR)
– Decision Tree (DT)
– Random Forest (RF)
– Naïve Bayes (NB)
– Support Vector Machine (SVM)</p>
          <p>These classical learning models (LR, DT, RF, NB and SVM) can be viewed
as the baselines in this task. Thus, we obtain a comparative overview on the
performances of diferent supervised and semi-supervised classification models
including the LSTM network.
3.5</p>
        </sec>
        <sec id="sec-2-1-6">
          <title>Training Setup</title>
          <p>In order to build LR, DT, RF and NB classification models, we use the
wellknown scikit-learn machine learning library,8 and performed all the experiments</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>8 https://scikit-learn.org/stable/</title>
        <p>with default parameters set by scikit-learn. As for the representation space, each
review was represented as a vector of word unigrams weighted by their frequency
in the reviews.</p>
        <p>
          For the classifiers based on the neural networks, we use a 300-Dimensional
word embeddings from fastText. We use sigmoid activation function with Adam
optimizer [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ] and binary cross entropy loss function. The size of input layer of
the NN is 300. We employ layer normalisation [
          <xref ref-type="bibr" rid="ref1">1</xref>
          ] in the model. Dropout [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]
between layers is set to 0.10. The size of embedding and hidden layers are 300.
The learning-rate is set to 0.0003, and the training examples were reshufled for
each epoch.
        </p>
        <p>
          As for the random walk algorithm, we use the similarity between the
sentences as the transition probability. Again, we create a network with the training
examples. We take a set of labelled vertices L and unlabelled vertices U , where
(L, U ) ⊆ G (G : set of all the labels). The random walk algorithm considers
the vertices of the network as the states of a Markov chain. We calculate the
similarity measure between the data points with the Dice coefficient (DC) [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ]
and reach to the final decision according to the probability. The DC measure
has been widely used for evaluating the degree of association between words.
3.6
        </p>
        <sec id="sec-2-2-1">
          <title>Handling Class Imbalance</title>
          <p>
            We recall Tables 2 and 3 where we can see the presence of class imbalance in
the training data. In order to encounter the class imbalance problem in our
training data, we followed Chawla et al. [
            <xref ref-type="bibr" rid="ref2">2</xref>
            ] who presented a combination of
oversampling of minority class with under-sampling of majority class. This is also
called SMOTE which is a kind of synthetic sampling technique and found to be
efective in our problem.
4
          </p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Results and Discussion</title>
      <p>We evaluate the performance our classifiers on the gold-standard test set (cf.
Table 3) and report the evaluation results in this section. In order to measure
classifier’s accuracy on the test set, we use three widely-used evaluation metrics:
precision, recall and F1 measures. The results obtained are reported in Table 4.
The first five rows of Table 4 represent our baseline classifiers (i.e. the classical
supervised classification models). The next row represents the random walk
algorithm. We see from the table that these classifiers performs below par and SVM
is the best-performing method among them (SVM: a 47.94 F1 score)) according
to the F1 scores.</p>
      <p>As for NN-based classifiers, the LSTM network trained on fastText
embeddings performed reasonable as we see from Table 4 that it produces a moderate
F1 score (68.38 F1) on the test set. The fastText classifier also performs
reasonably well; however, it could not surpass the LSTM network.</p>
      <p>
        In Table 5, we show the F1 scores of the best-performing classifiers of the
complaint identification task presented in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ]. As can be seen from Table 5,
their best complaint identification models produce F1 scores in the range of
78–79 which are more than 10 F1 points higher than the F1) score of our
bestperforming complaint identification model (cf. second last row of Table 4). Note
that, unlike us, Preotiuc-Pietro et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] carried out experiments on English
and the data sets including the evaluation test set are diferent too. Moreover,
the nature of data is also diferent to us. Preotiuc-Pietro et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] focused
on identifying complaints on tweets and we focused on identifying the same
on the product reviews posted on Amazon and YouTube. Naturally, the scores
presented in Preotiuc-Pietro et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] cannot be directly compared to those
presented in this paper. Given the fact that English is a high resource language
and there are a plenty of linguistic tools and resources freely available in English,
Preotiuc-Pietro et al. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] exploited many linguistic resources, models and tools in
their task for feature engineering, e.g. sentiment and emotion analysis, temporal
model, part-of-speech information. In Hindi, it is difficult to obtain such resources
and standard tools, and many such tools are not available to use. Moreover,
unlike English, Hindi is a morphological complex and highly inflected language.
Therefore, we believe that identifying complaints on Hindi texts is to be more
challenging in comparison to that in English or other high resource languages.
In this paper, we presented supervised and semi-supervised learning models to
identify customers’ complaints from the review data in Hindi, a low-resource,
lessexplored, and morphological rich highly inflected Indic language. There is no
publicly available gold-standard training data as far as identifying complaints from
user generated Hindi texts about products is concerned. Accordingly, we
semiautomatically created a gold-standard dataset for complaints identification on
Hindi. We conducted our experiments with state-of-the-art LSTM classifier and
classical supervised classification models. With our LSTM classifier, we achieved
a competent accuracy (a F1 score of 68.38) on the gold-standard evaluation test
set. There are limited linguistic resources and tools freely available for research
in Hindi in comparison to many high-resource languages including English. We
believe that this work would add an additional value to the social media analytics
research in low-resource scenarios.
      </p>
      <p>In future, we intend to test our method on diferent low-resource and
nonEnglish languages. We also plan to investigate applying more sophisticated and
linguistic features in our model, e.g. part-of-speech information. We removed
emojis from reviews at the time data prereprocessing. Since emojis may capture
polarity and encode information regarding users’ experiences and complaints, in
future, we aim to carry out experiments while keeping emojis in the reviews.
Our classifiers were trained on the data which is a mixture of reviews from
diferent product types. We intend to train classifiers on reviews from specific
product type, i.e. book or phone reviews. By this, we can compare the classifiers
trained on the data from individual product type to the one trained on the data
consisting reviews from all product types.</p>
      <p>Singh et al.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <given-names>Jimmy</given-names>
            <surname>Lei</surname>
          </string-name>
          <string-name>
            <surname>Ba</surname>
          </string-name>
          , Jamie Ryan Kiros and
          <string-name>
            <given-names>Geofrey E</given-names>
            <surname>Hinton</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Layer normalization</article-title>
          .
          <source>arXiv preprint arXiv:1607</source>
          .
          <fpage>0645</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Kevin</surname>
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Bowyer</surname>
          </string-name>
          ,
          <string-name>
            <surname>Nitesh</surname>
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Chawla</surname>
            , Lawrence O. Hall and
            <given-names>W. Philip</given-names>
          </string-name>
          <string-name>
            <surname>Kegelmeyer</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>SMOTE: Synthetic Minority Over-sampling Technique</article-title>
          .
          <source>Journal of Artificial Intelligence Research</source>
          ,
          <volume>16</volume>
          :
          <fpage>321</fpage>
          -
          <lpage>357</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Lee</surname>
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Dice</surname>
          </string-name>
          .
          <year>1945</year>
          .
          <article-title>Measures of the Amount of Ecologic Association Between Species</article-title>
          .
          <source>Journal of Ecology</source>
          ,
          <volume>26</volume>
          (
          <issue>3</issue>
          ):
          <fpage>297</fpage>
          -
          <lpage>302</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <given-names>Yarin</given-names>
            <surname>Gal</surname>
          </string-name>
          and
          <string-name>
            <given-names>Zoubin</given-names>
            <surname>Ghahramani</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>A Theoretically Grounded Application of Dropout in Recurrent Neural Networks</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          , pages
          <fpage>1019</fpage>
          -
          <lpage>1027</lpage>
          , Barcelona, Spain.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <given-names>Vineet</given-names>
            <surname>Gupta</surname>
          </string-name>
          , Devesh Varshney, Harsh Jhamtani, Deepam Kedia and
          <string-name>
            <given-names>Shweta</given-names>
            <surname>Karwa</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Identifying Purchase Intent from Social Posts</article-title>
          .
          <source>In Proceedings of the Eighth International AAAI Conference on Weblogs and Social Media</source>
          , pages
          <fpage>180</fpage>
          -
          <lpage>186</lpage>
          , Ann Arbor, MI.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <given-names>Rejwanul</given-names>
            <surname>Haque</surname>
          </string-name>
          , Arvind Ramadurai, Mohammed Hasanuzzaman and
          <string-name>
            <given-names>Andy</given-names>
            <surname>Way</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Mining Purchase Intent in Twitter</article-title>
          . Computación y Sistemas,
          <volume>23</volume>
          (
          <issue>3</issue>
          ):
          <fpage>871</fpage>
          -
          <lpage>881</lpage>
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <given-names>Sepp</given-names>
            <surname>Hochreiter</surname>
          </string-name>
          and
          <string-name>
            <given-names>Jürgen</given-names>
            <surname>Schmidhuber</surname>
          </string-name>
          .
          <year>1997</year>
          .
          <article-title>Long short-term memory</article-title>
          .
          <source>Neural Computation</source>
          , pages
          <volume>9</volume>
          (
          <issue>8</issue>
          ):
          <fpage>1735</fpage>
          -
          <lpage>1780</lpage>
          . MIT Press.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <given-names>Armand</given-names>
            <surname>Joulin</surname>
          </string-name>
          , Edouard Grave, Piotr Bojanowski and
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Bag of Tricks for Efficient Text Classification</article-title>
          .
          <source>In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume</source>
          <volume>2</volume>
          ,
          <string-name>
            <surname>Short</surname>
            <given-names>Papers</given-names>
          </string-name>
          , pages
          <fpage>427</fpage>
          -
          <lpage>431</lpage>
          , Valencia, Spain.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Diederik</surname>
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Kingma</surname>
            and
            <given-names>Jimmy</given-names>
          </string-name>
          <string-name>
            <surname>Ba</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Adam: A Method for Stochastic Optimization</article-title>
          .
          <source>arXiv preprint arXiv:1412</source>
          .
          <fpage>6980</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Tomas</surname>
            <given-names>Mikolov</given-names>
          </string-name>
          , Ilya Sutskever, Kai Chen, Greg S Corrado and
          <string-name>
            <given-names>Jef</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Distributed representations of words and phrases and their compositionality</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          , pages
          <fpage>3111</fpage>
          -
          <lpage>3119</lpage>
          ,
          <string-name>
            <surname>Lake</surname>
            <given-names>Tahoe</given-names>
          </string-name>
          , NV.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Daniel</surname>
          </string-name>
          Preotiuc-Pietro,
          <article-title>Mihaela Gaman</article-title>
          and
          <string-name>
            <given-names>Nikolaos</given-names>
            <surname>Aletras</surname>
          </string-name>
          .
          <year>2019</year>
          .
          <article-title>Automatically Identifying Complaints in Social Media</article-title>
          .
          <source>In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistic</source>
          , pages
          <fpage>219</fpage>
          -
          <lpage>224</lpage>
          . Los Angeles, CA.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <given-names>Martin</given-names>
            <surname>Szummer</surname>
          </string-name>
          and
          <string-name>
            <given-names>Tommi</given-names>
            <surname>Jaakkola</surname>
          </string-name>
          .
          <year>2002</year>
          .
          <article-title>Partially labeled classification with Markov random walks: Advances in neural information processing systems</article-title>
          .
          <source>In Advances in neural information processing systems</source>
          , pages
          <fpage>945</fpage>
          -
          <lpage>952</lpage>
          , Vancouver, BC.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Jinpeng</surname>
            <given-names>Wang</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Gao Cong</surname>
          </string-name>
          ,
          <article-title>Wayne Xin Zhao</article-title>
          and
          <string-name>
            <given-names>Xiaoming</given-names>
            <surname>Li</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Mining user intents in twitter: a semi-supervised approach to inferring intent categories for tweets</article-title>
          .
          <source>In Twenty-Ninth AAAI Conference on Artificial Intelligence</source>
          , pages
          <fpage>318</fpage>
          -
          <lpage>324</lpage>
          . Austin, TX.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          14.
          <string-name>
            <surname>Yequan</surname>
            <given-names>Wang</given-names>
          </string-name>
          , Minlie Huang,
          <source>Xiaoyan Zhu and Li Zhao</source>
          .
          <year>2016</year>
          .
          <article-title>Attention-based LSTM for aspect-level sentiment classification</article-title>
          .
          <source>In Proceedings of the 2016 conference on empirical methods in natural language processing</source>
          , pages
          <fpage>606</fpage>
          -
          <lpage>615</lpage>
          , Austin, TX.
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          15.
          <string-name>
            <surname>Peng</surname>
            <given-names>Zhou</given-names>
          </string-name>
          , Zhenyu Qi, Suncong Zheng, Jiaming Xu,
          <string-name>
            <given-names>Hongyun</given-names>
            <surname>Bao</surname>
          </string-name>
          and
          <string-name>
            <given-names>Bo</given-names>
            <surname>Xu</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Text classification improved by integrating bidirectional LSTM with twodimensional max pooling</article-title>
          .
          <source>arXiv preprint arXiv:1611</source>
          .
          <fpage>06639</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>