<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Aspect Detection in Book Reviews: Experimentations</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Jeanne Villaneau</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefania Pecore</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Farida Sad</string-name>
          <email>farida.saidg@univ-ubs.fr</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IRISA, Universite de Bretagne Sud</institution>
          ,
          <country country="FR">France</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>LMBA, Universite de Bretagne Sud</institution>
          ,
          <country country="FR">France</country>
        </aff>
      </contrib-group>
      <fpage>16</fpage>
      <lpage>27</lpage>
      <abstract>
        <p>Aspect Based Sentiment Analysis (ABSA) aims at identifying the aspects of entities and the sentiment expressed towards each aspect. Substantial work already exists in English language and in domains where aspects are easy to de ne such as restaurants, hotels, laptops, etc. This paper investigates detection of aspects in French language and in the books reviews domain where expression is more complex and aspects are less easy to characterize. On the basis of a corpus that we annotated, 21 aspects were de ned and categorized into eight main classes including a catch-all class, General, which was found to be absorbent. Several methods were carried out to address this di culty, with varying e ciency: Random Forest and SVM provided better results than kNN and Neural Net. Combining these methods with voting rules helped to improve noticeably the results. On another side, the di culty of the task and the limits of a lexical approach were further explored with a qualitative analysis of errors and a topological mapping of the data using Self Organising Maps.</p>
      </abstract>
      <kwd-group>
        <kwd>Aspect Based Sentiment Analysis aspect detection opinion mining</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>
        Aspect Based Sentiment Analysis (ABSA) systems aim at detecting the main
aspects (features) of an entity which are discussed in texts and at estimating
the orientation of the sentiment expressed per aspect (how positive or negative
the opinions are on each aspect) [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. ABSA was rst introduced as a shared
task in SemEval-2014 [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], with datasets in English in two domains: laptops and
restaurants. The task was repeated in SemEval-2015 and SemEval-2016, and
extended to new entities (hotel, restaurant, telecom, consumer electronics) and
to other languages (French, Dutch, Russian, Spanish and Turkish) [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ].
      </p>
      <p>
        ABSA is classically split into three subtasks: (i) extracting opinion
expressions, (ii) determining the aspect of these expressions and (iii) determining their
opinion value [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. In SemEval 2016, determining the aspects was the subtask of
ABSA (task 5) which called the largest number of contributions (216 over 245
submissions in total). As an example, French data sets were proposed in
restaurant domain with 6 types of entities and 6 types of attributes [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. On these data,
the best system obtained a F1 score of 0.612.
      </p>
      <p>Despite challenges as SemEval, few studies were conducted in languages other
than English and freely available data are scarse. We were interested in this work
in investigating this task in French language and in a domain where aspects are
more di cult to detect and where opinion is expressed in complex and varied
forms. This paper presents a book reviews corpus which we collected and the
work carried out to de ne aspects (Section 2) and to implement their automatic
detection by using lexical statistical methods (Sections 3). It was found that
these methods perform varyingly well and their performances can be improved
when they are combined. Moreover, an analysis of the errors gives an idea of the
di culty of the task and the limits we have to go beyond to improve the results
(Section 4).
2
2.1</p>
      <p>Training and Test corpora - Task and Approach</p>
      <p>Training Corpus and Annotation
We built a corpus of 900 reviews by concatenation of 450 book reviews from the
French Sentiment Corpus (FSC), which was produced between 2009 and 2013
by Vincent and Winterstein (2013), and 450 more recent book reviews which we
collected from the Amazon.fr website between 2016 and 2017 (NC).</p>
      <p>The total number of words in the corpus is about 72,000 words.</p>
      <p>We proposed an annotation schema suitable for all types of books, regardless
of genre, which is based on 5 aspects and 20 attributes (see Table 1). The 21
resulting classes can be gathered into metaclasses to meet di erent needs.</p>
    </sec>
    <sec id="sec-2">
      <title>Aspects</title>
      <sec id="sec-2-1">
        <title>General Feeling</title>
      </sec>
      <sec id="sec-2-2">
        <title>Text</title>
      </sec>
      <sec id="sec-2-3">
        <title>Illustration</title>
      </sec>
      <sec id="sec-2-4">
        <title>Author</title>
      </sec>
      <sec id="sec-2-5">
        <title>Form</title>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Attributes</title>
      <p>General, Subject, Style, Characters, Pace/Narration, Readability,</p>
      <p>Translation/Adaptation,Interest/Accuracy</p>
      <p>General, Interest/Accuracy, Graphic quality</p>
      <p>General, Text Author, Translator, Illustration Author,
General, Bookbinding, Typography, Inner structure, Distribution</p>
      <p>The complexity of the wording in book reviews makes di cult the task of
allocating a unique aspect to an entity as usually done, for example in SemEval
2016 annotation task. The following examples, yet very simple, illustrate how
entities, opinion phrases and context have to be taken into account to determine
proper aspects.
{ In the phrase "le livre est bien mal ecrit" [the book is very badly written],
the part which expresses sentiment is "bien mal ecrit" (very badly written)
(value: -2) and the entity is le livre [the book]. The appropriate aspect is Text
with Style for attribute, because of the verb ecrire [to write].
{ In the review, "la bobo au style frelate" [the boho with degenerated style],
the word degenerated expresses a very negative opinion (-2). It relates to the
entity Style and it is classi ed in Text#Style. Because of the reference to the
style, one can say that bobo refers to the author; "la bobo" represents both
the entity and the opinion of the reviewer.</p>
      <p>
        Since it often happens that entity and aspect do not coincide, it is essential to
include an aspect detection phase in the annotation process. For that, we proceed
in three steps:
{ selection of a group of contiguous words which indicate an opinion (evaluated
by an ordinal value),
{ detection of the entity to which the opinion refers (when it is expressed),
{ selection of an aspect and an attribute in the annotation schema.
The annotation task concerned about 4700 phrases related to 3300 opinion
expressions. More information on the corpus (statistics, annotators, inter-annotators
agreement) is given in [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ].
2.2
      </p>
      <p>Task, Test Corpus and Approach
Aspects were grouped into eight main classes because of the di culty met by
the annotators to separate certain aspects. More precisely, the following pairs
of aspects were aggregated: General with Text#General, Text#Readability with
Text#Style and Text#Interest with Text#Subject. The other considered aspects
are Text#Pace-Narration, Text#Characters, Illustrations, Form and Authors
regardless of attributes for the latter. Table 2 displays the relative importance of
these classes in the training corpus. The large prevalence of the class General
and the very limited size of the class Illustrations are to be mentioned.</p>
      <p>The test corpus consists of 340 sentences or parts of text selected from the
non-annotated part of the FSC corpus. The sentences were selected so as to
present a unique aspect each and to cover all aspect classes, thereby reducing
the prevalence of the class "General ". The resulting distribution of the aspect
classes is given in the last column of Table 2.</p>
      <p>As mentioned above, sentences presenting more than one aspect were
removed during the selection process, as in:</p>
      <p>"Tant dans le contenu que dans l'ecriture je n'ai pu trouver aucun inter^et a
cet ouvrage" [Both in the contents and in the writing I was not able to nd any
interest in this work.]</p>
      <p>Furthermore, it is whole sentences or their largest possible parts which were
selected, as in:</p>
    </sec>
    <sec id="sec-4">
      <title>Class Aspect/Attribute</title>
      <sec id="sec-4-1">
        <title>General (Ge) fGeneral Feel. - Text#Generalg</title>
        <sec id="sec-4-1-1">
          <title>Pace (Pa) Text#Pace-Narration</title>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>Interest (In) Text#fInterest-Accur., Subjectg</title>
        <sec id="sec-4-2-1">
          <title>Characters (Ch) Text#Characters</title>
        </sec>
      </sec>
      <sec id="sec-4-3">
        <title>Style (St) Text#fStyle, Readibilityg</title>
      </sec>
      <sec id="sec-4-4">
        <title>Authors (Au) Author#fall attributesg</title>
      </sec>
      <sec id="sec-4-5">
        <title>Illustrations (Il) Illustration#fall attributesg Form (Fo) Form #fall attributesg</title>
        <p>% Training
44.9%
11.5%
21.0%
8.5%
3.2%
4.5%
0.7%
5.7%</p>
        <p>"Tout sonne faux, les relations entre les protagonistes, les dialogues qui
semblent sortis de la bouche de mauvais acteurs, la psychologie des personnages."
[Everything rings false, the relations between the protagonists, the dialogues
which seem come out of the mouth of bad actors, the psychology of the
characters.]</p>
        <p>It should be noticed that some words which could seem to be key words in
the determination of the target (Aspect#Attribute), can turn out to be false
friends as in the previous sentence where the word personnage [character] can
lead to misclassify the sentence in Characters while a human annotator would
classify it in Interest.</p>
        <p>Detecting opinion polarity meets several di culties among which negation,
use of humoristic or indirect expression, etc. On the other hand, the success of
statistical methods based on simple bag of words (BoW) supports the
hypothesis that determining aspects is essentially a lexical task. We investigated the
e ciency of this approach on the corpus of book reviews.</p>
        <p>Following lemmatization (with Treetagger), a list of lemmas (names,
adjectives, verbs and adverbs excepting stop words) was selected according to their
frequency in the corpus (i). Each annotaded expression in the training corpus
is handled as a vector whose binary entries (0 or 1) code the co-occurrences of
the expression with the lemmas (ii). A co-occurence matrix is built and then
augmented with a column which speci es the aspect#attribute assigned to every
annotated expression (iii).</p>
        <p>Our attempts to enrich the model with linguistic parameters were not
conclusive and the performances achieved were low below the results presented in
the next section. Anyhow, the best results were obtained using lemmas rather
than forms, possibly because of the modest size of our corpus.
3</p>
        <p>Experiments and Results
Various experimentations were conducted using unsupervised and supervised
classi cation approaches, namely SOM (Self-Organising Maps), kNN (k-Nearest
Neighbours), NN (Neural Net), RF (Random Forest), SVM (Support Vector
Machine). Linguistic contexts of words were taken into account through the use of
Word2vec. The well known language and environment for statistical computing,
R, was used all along this work.</p>
        <p>The results of our experimentations are presented below and they re ect
well the di culty of the task. In all tables, aspect classes are identi ed by the
abbreviations given in Table 2.
3.1</p>
        <p>SOM
Self Organising Maps is a competitive learning network based on unsupervised
learning. It provides a low dimension representation of the input data and it
serves for representation as well as for clustering. We used in our
experimentations the kohonen R-package.</p>
        <p>The topological map in Figure 1 was obtained by combining the
observationlemma matrix (weight of 5) with the vector of related aspect classes (weight of
1).</p>
        <p>Legend
Black
Yellow 
Green
Red
Chocolate 
Grey
Orange
Blue
 : General
: Authors
 : Characters
 : Interest
: Style
 : Form
 : Illustrations
 : Pace
3.2
k-Nearest Neighbors (kNN) - Neural Networks - Fuzzy
classi cation
The best results with kNN are displayed in Table 3; they were obtained for k = 2.
These performances are disappointing and re ect the di culties encountered,
especially the absorbtion capacity of the class General, the only class showing
a precision score lower than the recall. As predicted by SOM, Characters is the
class that obtains the best results.</p>
        <p>Ge Pa Ch St Au In Il Fo
Ge 72 0 1 7 1 7 0 3
Pa 28 15 1 1 1 6 0 1
Ch 7 0 10 1 0 2 0 0
St 30 1 0 27 0 3 0 3
Au 14 1 0 3 2 0 0 0
In 32 3 1 3 0 14 0 0
Il 13 0 0 0 0 4 1 0
Fo 13 0 0 2 0 2 0 4</p>
        <sec id="sec-4-5-1">
          <title>Class Precision Recall F-measure</title>
          <p>General 0.344 0.791 0.48
Pace 0.41 0.75 0.411
Characters 0.769 0.5 0.606
Style 0.614 0.422 0.5
Authors 0.5 0.1 0.167
Interest 0.368 0.264 0.308
Illustrations 1 0.056 0.105</p>
          <p>Form 0.363 0.190 0.25</p>
          <p>
            Fuzzy logic and Neural Networks already proved to be e cient in Sentiment
analysis [
            <xref ref-type="bibr" rid="ref1 ref5">1,5</xref>
            ]. However, they provided very poor results when implemented on
our data (R-package frbs and neuralnet), with almost all expressions classi ed
in the class General.
3.3
          </p>
          <p>Random Forest
The statistical approach using Random Forest (ntree = 500) gives encouraging
results. The class General is still absorbent but all classes have their precision
and recall scores greatly improved. In accordance with SOM (Figure 1), class
Characters performs well. The results of class Author remain mediocre and those
of class Form are poor, while the recall of class Illustrations is very low.</p>
          <p>While names can be su cient for the determination of aspects in certain
domains, four parts of Speech are highlighted in our experimentation. Indeed, the
top twenty words in Random Forest consist in 9 names, 5 adjectives, 5 verbs and 1
adverb and among them, the adjectives interesting, clear and likeable which are
respectively associated with classes Interest, Style/Readibility, Characters and
the adverb facilement [easily] which is associated with the class Style/Readibility.
3.4</p>
          <p>
            SVM
In the eld of ABSA, SVM classi ers made their proof for both aspect and
polarity detection [
            <xref ref-type="bibr" rid="ref13 ref6">6,13</xref>
            ]. The classic approach by SVM with linear kernel outclasses
          </p>
          <p>Random Forests, however the improvement is not general: classes Pace,
Characters, Interest obtain poorer results. By contrast, the improvement of the results
of the class Form is particularly remarkable.</p>
          <p>Besides, we still observe the trend to an overuse of the class General.</p>
          <p>Ge Pa Ch St Au In Il Fo
Ge 75 3 2 6 1 2 0 2
Pa 20 27 1 2 2 1 0 0
Ch 0 0 19 0 1 0 0 0
St 13 1 0 45 3 2 0 0
Au 9 0 0 2 7 2 0 0
In 25 1 0 1 2 24 0 1
Il 8 0 0 1 0 1 8 0
Fo 5 2 0 0 0 0 0 14</p>
        </sec>
        <sec id="sec-4-5-2">
          <title>Class Precision Recall F-measure</title>
          <p>General 0.484 0.824 0.610
Pace 0.794 0.509 0.621
Characters 0.864 0.95 0.905
Style 0.789 0.703 0.744
Authors 0.467 0.35 0.4
Interest 0.75 0.453 0.565
Illustrations 1 0.444 0.615</p>
          <p>Form 0.824 0.667 0.737
Many of the words used in the test corpus do not appear in the training corpus
because of its small size. To deal with this di culty, the last approach makes
use of Word2Vec to enrich the space of words in the test corpus. Word2Vec was
trained with the corpora FSC, NC and Wikipedia.</p>
          <p>The training corpus remains unchanged and only the co-occurrence matrix is
modi ed: the entry in the co-occurence matrix of every name, adjective, verb or
adverb that does not appear in the training corpus, is replaced by its similarity
score with its closest lemma in the training corpus.</p>
          <p>The results were globally below expectations (except for class General) (cf.
Table 6); a reason for that could be that the noise brought by W2V limited the
global gain.
The last three approaches (Random Forest, SVM, SVMW2V) obtain
encouraging results and their performances are globally close. We combined them by
adopting a majority vote with a special handling of the class General. The voting
rules are presented in Table 7</p>
          <p>The second rule states that if at least one system out of three chooses a class
other than General, this class is favoured. The underlying purpose of this rule
is to reduce the absorbing bias of class General which was observed in all single
systems.</p>
          <p>The third rule speci es that in case of total disagreement between the three
systems, class General is chosen. This rule aims to avoid a random draw when
there is no well de ned class.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Results</title>
      <p>(1) Three equal results: r1 = r2 = r3 = C
(2) Two equal results (example: r1 = r2 = C, r3 = C0, C0 6= C)
if C 6=General
if C =General
(3) Three distinct results</p>
    </sec>
    <sec id="sec-6">
      <title>Choice</title>
      <p>C
C</p>
      <p>C0</p>
      <p>General</p>
      <p>The outcomes of the combined system are given in Table 8. In global or
on average across all classes, we notice that combining the three approaches
leads to a slight reduction in the precision compared with SVM, which is widely
compensated with an increase in the recall.</p>
      <p>Table 9 gives the results of the combined system by class. Before combination,
Random Forest outperformed the 2 other systems in 4 of the 8 classes, SVM
in 3 classes and SVMW2V in the class General. Random Forest outperforms
the combined system in 3 classes and SVM in the class Form. Seen from this
perspective, no system outclasses totally the others.
In 78 out of 340 tests, none of the statistical systems selected the same aspect
as the human annotators. A qualitative analysis of the disagreements allows to
go deeper in the understanding of the limits of the lexical approach.</p>
      <p>Disagreements can be classi ed into three classes:
1. There are 19 "false errors" for which the human annotation may be
questioned. For example :
(a) "C'est dr^ole et enleve, puissant et sensuel?: un chef-d'oeuvre de vie, dedie
a la vie d'une ville incomparable." [It is funny and spirited, powerful and
sensual?: a masterpiece of life, dedicated to the life of an incomparable
city.]
This test is classi ed as General by human annotators and as Style by
the three statistical systems. Both choices are justi ed: the rst choice is
understandable if we consider the whole sentence and the second choice
is essentially motivated by the rst part of the sentence. This example
shows the limits of a strict classi cation since classes are not necessarily
mutually exclusive.
(b) "Attention: livre impossible a l^acher avant la derniere page?!" [Attention:
book impossible to put down before the last page?!]
BoW systems classi ed the sentence as Pace (SVMW2V) or as
Interest (SVM and RF), while human annotators chose the class General,
possibly because the choice is unclear.</p>
      <p>The signi cant number of false errors points out the di culty of the task in
the eld of book reviews and the fuzzy outlines between the de ned classes.
2. Another group of errors (about 12) can be related to training bias due to
new words appearing in the tests. For example:
(a) "Pouchkine est un ecrivain au style su^r, simple et envou^tant" [Pushkin
is a writer with a sure, simple and mesmerizing style]
This sentence should be classi ed as Author since it expresses a general
opinion on Puchkin's style. However, all systems classi ed it as Style
because "Puchkin" did not appear in the training corpus. It is likely that
a list of authors' names would improve the results of the class Author.
(b) "Un tres joli livre, avec de tres belles peintures chinoises a l'interieur."
[A very attractive book, with very beautiful Chinese paintings inside.]
This sentence related to Illustrations is miclassi ed as General by the
systems. This error can be explained by the low occurrence of the
keyword "peinture" [painting] in the training corpus.</p>
      <p>One would hope that using Word2Vec would allow to go beyond the
limits of training corpus' vocabulary by extending it. However, in our
experiments, the noise introduced by the similarity scores negated the
expected improvement.
3. Lastly, the vast majority of errors is related to the limits of BoW approach.</p>
      <p>
        Firstly, representing a sentence as a bag of lemmas is very simplistic; on the
other hand, the understanding of the reader uses contexts of various types:
temporal, cultural, pragmatic, textual, of common sense, etc. [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
(a) "il manque l'essentiel, les bonnes adresses, les acces, les plages, bref,
aucun detail, c'est un TOP 10 sans le moindre inter^et." [The main part, the
good addresses, the accesses, the beaches, in brief, no detail is missing,
it is a PIP 10 without the slightest interest. ]
Here, the aspect is expressed in the word "Interest" and yet, the test is
classi ed by the systems as General, probably because the word is buried
in many others, as "essential". It can be assumed that linguistic context,
especially the adverb bref [in short] which introduces a conclusion, could
make it possible to give more importance to this keyword.
(b) "Je n'ai pas accroche a l'histoire, il convient su^rement a toutes les petites
et jeunes lles ans des poupees, mais la trame est cousue de ls blancs3
[I did not stick to the story, it is certainly advisable to all the girls and
the girls the years of dolls, but the framework is a blatant lie...]
The test is classi ed as General (instead of Pace) by all the systems
despite the keyword "histoire" ["story"]. The word trame [framework],
3 In French language, there is a play on words between trame, which means "weft"
or "framework" depending on the context, and phrase cousue de ls blancs litteraly
"sewn of white threads".
almost synonymic but much less common, was probably not taken into
account by the systems, including W2V.
(c) "le livre reste un catalogue d'interpretations deja connues." [ the book
remains a catalog of already known interpretations ]
BoW systems classify this test in General instead of Interest. The adverb
"deja" plays a key role to show the lack of interest of the book.
(d) "L'auteur abuse de mots aussi savants qu'inutiles qui detournent du sujet
traite?; un defaut di cilement pardonnable." [The author makes
excessive use of words as fancy as they are useless which divert from the
handled subject?; a fault hardly overlooked ]
The word "auteur" makes the test classi ed in class Authors, while the
sentence relates to the style of the book and not to its author in general.
(e) "Sauter de la page 288 a la 337 n'aide pas du tout a apprecier un roman,
notamment si celui-ci doit tre le dernier d'une serie." [Jumping from
page 288 to 337 does not help at all to appreciate a novel, especially if it
is the last of a series... ]
"l'auteur oublie ici et la des mots qui AIDE a comprendre les phrase."
[the author forgets here and there words which help to understand the
sentences]
Both tests express negative sentiments with a certain sense of humour
(irony or sarcasm), which is a real challenge for automatic systems. For
instance, a speci c session of SemEval was devoted to sarcastic tweets
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] and numerous works addressed this topic (see for example [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]) .
      </p>
      <p>Actual mistakes point clearly toward the need to take into account multiple
contexts and knowledges to improve systems, as emphasized by Benamara and
Co [2017]. Within our study, the most relevant aspects relate to the choice of
wording and its structure, and to take into account the linguistic context of the
words : expressions varyingly litteral ("cousu de l blanc" [blindingly obvious]),
linkage of adverbs and quali catives... not to mention the detection of irony, a
full study program in itself.
5</p>
      <p>Conclusion
In a complex eld where aspects are sometimes hard to sort out, even for a human
annotator, a simple SVM approach with a linear kernel on words (lemmas in this
instance) is, despite its lackings, relatively e cient. Regardless, the combination
with other statistical approaches, especially with Random Forest, noticeably
improves the attained results. Furthermore, an intake in lexical resources, like
the list of authors, could help to better circumvent some classes.</p>
      <p>Despite this, the analysis of errors brings to light the limits of the BoW
approach. An improvement of the results inevitably requires a better analysis
of contexts with the problems that come with the use of a language all-in-all
lacking in normalization on one hand and, on the other hand in French language
which proves much poorer in resources than English language.</p>
      <p>At present, our research concerns polarity determination. Besides BoW
approaches, we also take into account the linguistic context by implementing a
surface analysis. First results seem to evidence that the use of linguistic
parameters can allow to outclass widely a simple BoW approach in this task.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          1.
          <string-name>
            <surname>Afzaal</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Usman</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fong</surname>
            ,
            <given-names>A.C.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Fong</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhuang</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Fuzzy aspect based opinion classi cation system for mining tourist reviews</article-title>
          .
          <source>Advances in Fuzzy Systems</source>
          <year>2016</year>
          (
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          2.
          <string-name>
            <surname>Apidianaki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tannier</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Richart</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          :
          <article-title>Datasets for aspect-based sentiment analysis in french</article-title>
          .
          <source>In: Proceedings of LREC 2016. European Language Resources Association (ELRA)</source>
          , Paris, France (may
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          3.
          <string-name>
            <surname>Benamara</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Taboada</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mathieu</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          :
          <article-title>Evaluative language beyond bags of words: Linguistic insights and computational applications</article-title>
          .
          <source>Comput. Linguist</source>
          .
          <volume>43</volume>
          (
          <issue>1</issue>
          ),
          <volume>201</volume>
          {264 (Apr
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          4.
          <string-name>
            <surname>De Clercq</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Lefever</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Jacobs</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Carpels</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Hoste</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          :
          <article-title>Towards an integrated pipeline for aspect-based sentiment analysis in various domains</article-title>
          .
          <source>In: Proceedings of the 8th Workshop on Computational Approaches</source>
          to Subjectivity, Sentiment and
          <string-name>
            <surname>Social Media Analysis - WASSA</surname>
          </string-name>
          <year>2017</year>
          . pp.
          <volume>136</volume>
          {
          <fpage>142</fpage>
          .
          <source>EMNLP</source>
          <year>2017</year>
          (
          <year>2017</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          5.
          <string-name>
            <surname>Khalil</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>El-Beltagy</surname>
            ,
            <given-names>S.R.</given-names>
          </string-name>
          : Niletmrg at semeval
          <article-title>-2016 task 5: Deep convolutional neural networks for aspect category and sentiment extraction</article-title>
          .
          <source>In: Proceedings of SemEval-2016</source>
          . ACL, San Diego, California (
          <year>June 2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          6.
          <string-name>
            <surname>Kiritchenko</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cherry</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          : Nrc-canada-2014:
          <article-title>Detecting aspects and sentiment in customer reviews</article-title>
          .
          <source>In: Proceedings of SemEval</source>
          <year>2014</year>
          . ACL and Dublin City University, Dublin, Ireland (
          <year>August 2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          7.
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          :
          <article-title>Sentiment Analysis and Opinion Mining</article-title>
          . Morgan and Claypool Publishers (
          <year>2012</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          8.
          <string-name>
            <surname>Nakov</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Rosenthal</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kiritchenko</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Mohammad</surname>
            ,
            <given-names>S.M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Kozareva</surname>
            ,
            <given-names>Z.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Ritter</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Stoyanov</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhu</surname>
            ,
            <given-names>X.</given-names>
          </string-name>
          :
          <article-title>Developing a successful semeval task in sentiment analysis of twitter and other social media texts</article-title>
          .
          <source>Lang. Resour. Eval</source>
          .
          <volume>50</volume>
          (
          <issue>1</issue>
          ),
          <volume>35</volume>
          {65 (Mar
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          9.
          <string-name>
            <surname>Pecore</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Villaneau</surname>
          </string-name>
          , J.:
          <article-title>Complex and precise movie and book annotations in French language for aspect based sentiment analysis</article-title>
          .
          <source>In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation</source>
          ,
          <string-name>
            <surname>LREC</surname>
          </string-name>
          <year>2018</year>
          , Miyazaki, Japan, May 7-
          <issue>12</issue>
          ,
          <year>2018</year>
          . (
          <year>2018</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          10.
          <string-name>
            <surname>Pontiki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galanis</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Papageorgiou</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Androutsopoulos</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manandhar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>AL-Smadi</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Al-Ayyoub</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Zhao</surname>
            ,
            <given-names>Y.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Qin</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>De Clercq</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          , et al.:
          <article-title>Semeval2016 task 5: Aspect based sentiment analysis</article-title>
          .
          <source>In: ProWorkshop on Semantic Evaluation (SemEval-2016)</source>
          . pp.
          <volume>19</volume>
          {
          <fpage>30</fpage>
          .
          <article-title>Association for Computational Linguistics (</article-title>
          <year>2016</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          11.
          <string-name>
            <surname>Pontiki</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Galanis</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Pavlopoulos</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Papageorgiou</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Androutsopoulos</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Manandhar</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          :
          <article-title>Semeval-2014 task 4: Aspect based sentiment analysis</article-title>
          .
          <source>In: Proceedings of SemEval 2014</source>
          . Dublin, Ireland (
          <year>August 2014</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          12.
          <string-name>
            <surname>Vincent</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Winterstein</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          :
          <article-title>Construction et exploitation d'un corpus francais pour l'analyse de sentiment</article-title>
          .
          <source>In: TALN-RECITAL 2013. Les Sables d'Olonne</source>
          , France (
          <year>2013</year>
          )
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          13.
          <string-name>
            <surname>Wagner</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Arora</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Cortes</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Barman</surname>
            ,
            <given-names>U.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Bogdanova</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Foster</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Tounsi</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          :
          <article-title>Dcu: Aspect-based polarity classi cation for semeval task 4</article-title>
          .
          <source>In: Proceedings of SemEval 2014</source>
          . pp.
          <volume>223</volume>
          {
          <fpage>229</fpage>
          .
          <article-title>Association for Computational Linguistics</article-title>
          and Dublin City University, Dublin, Ireland (
          <year>August 2014</year>
          )
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>