<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Integrating Terminology Extraction and Word Embedding for Unsu- pervised Aspect Based Sentiment Analysis</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>France Grenoble</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>France Grenoble</string-name>
        </contrib>
      </contrib-group>
      <fpage>13</fpage>
      <lpage>18</lpage>
      <abstract>
        <p>English. In this paper we explore the advantages that unsupervised terminology extraction can bring to unsupervised Aspect Based Sentiment Analysis methods based on word embedding expansion techniques. We prove that the gain in terms of F-measure is in the order of 3%. Italiano. Nel presente articolo analizziamo l'interazione tra syistemi di estrazione “classica” terminologica e systemi basati su techniche di “word embedding” nel contesto dell'analisi delle opinioni. Domostreremo che l'integrazione di terminogie porta un guadagno in F-measure pari al 3% sul dataset francese di Semeval 2016.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>The goal of this paper is to bring a contribution
on the advantage of exploiting terminology
extraction systems coupled with word
embedding techniques. The experimentation is
based on the corpus of Semeval 2016. In a
previous work, summarized in section 4, we
reported the results of a system for Aspect Based
Sentiment Analysis (ABSA) based on the
assumption that in real applications a domain
dependent gold standard is systematically absent.
We showed that by adopting domain dependent
word embedding techniques a reasonable level of
quality (i.e. acceptable for a proof of concept) in
terms of entity detection could be achieved by
providing two seed words for each targeted
entity. In this paper we explore the hypothesis
that unsupervised terminology extraction
approaches could further improve the quality of
the results in entity extraction.</p>
      <p>The paper is organized as follows: In section 2
we enumerate the goal of the research and the
industrial background justifying it. In section 3
we provide a state of the art of ABSA
particularly focused towards unsupervised ABSA
and its relationship to terminology extraction. In
section 4 we summarize our previous approach
in order to provide a context for our
experimentation. In section 5 we prove the
benefit of the integration of unsupervised
terminology extraction with ABSA, whereas in 6
we provide hints for further investigation.
2</p>
    </sec>
    <sec id="sec-2">
      <title>Background</title>
      <p>
        ABSA is a task which is central to a number of
industrial applications, ranging from
ereputation, crisis management, customer
satisfaction assessment etc. Here we focus on a
specific and novel application, i.e. capturing the
voice of the customer in new product
development (NPD). It is a well-known fact that
the high rate of failure (76%, according to
Nielsen France, 2014) in launching new products
on the market is due to a low consideration of
perspective users’ needs and desires. In order to
account for this deficiency a number of methods
have been proposed ranging from traditional
methods such as KANO (Wittel et al., 2013) to
recent lean based NPD strategies
        <xref ref-type="bibr" rid="ref20">(Olsen, 2015)</xref>
        .
All are invariantly based on the idea of collecting
user needs with tools such as questionnaire,
interviews and focus groups. However with the
development of social networks, reviews sites,
forums, blogs etc. there is another important
source for capturing user insights for NPD: users
of products (in a wide sense) are indeed talking
about them, about the way they use them, about
the emotions they raise. Here it is where ABSA
becomes central: whereas for applications such
as e-reputation or brand monitoring capturing
just the sentiment is largely enough for the
specific purpose, for NPD it is crucial to capture
the entity an opinion is referring to and the
specific feature under judgment.
      </p>
      <p>ABSA for NPD is a novel technique and as
such it might trigger doubts on its adoption:
given the investments on NPD (198 000 M€ only
in the cosmetics sector) it is normal to find a
certain reluctance in abandoning traditional
methodologies for voice of the customer
collection in favor of social network based
ABABSA. In order to contrast this reluctance, two
conditions need to be satisfied. On the one hand,
one must prove that ABSA is feasible and
effective in a specific domain (Proof of Concept,
POC); on the other hand the costs of a high
quality in-production system must be affordable
and comparable with traditional methodologies
(according to Eurostat the spending of European
PME in the manufacturing sector for NPD will
be about 350,005.00 M€ in 2020, and PME
usually have limited budget in terms of “voice of
the customer” spending).</p>
      <p>If we consider the fact that the range of
product/services which are possible objects of
ABSA studies is immense1, it is clear that we
must rely on almost completely unsupervised
technologies for ABSA, which translates in the
capability of performing the task without a
learning corpus.
3
3.1</p>
    </sec>
    <sec id="sec-3">
      <title>State of the Art</title>
    </sec>
    <sec id="sec-4">
      <title>Semeval2016’s overview</title>
      <p>
        SemEval is “ an ongoing series of evaluations of
computational semantic analysis systems” 2 ,
organized since 1998. Its purpose is to evaluate
semantic analysis systems. ABSA (Aspect Based
Sentiment Analysis) was one of the tasks of this
event introduced in 2014. This type of analysis
provides information about consumer opinions on
products and services which can help companies
to evaluate the satisfaction and improve their
business strategies. A generic ABSA task consists
to analyze a corpus of unstructured texts and to
extract fine-grained information from the user
reviews. The goal of the ABSA task within
SemEval is to directly compare different datasets,
approaches and methods to extract such
information
        <xref ref-type="bibr" rid="ref22">(Pontiki et al., 2016)</xref>
        .
      </p>
      <p>In 2016, ABSA provided 39 training and
testing datasets for 8 languages and 7 domains.
Most datasets come from customer reviews
(especially for the domains of restaurants,
laptops, mobile phones, digital camera, hotels and
museums), only one dataset (telecommunication
domain) comes from tweets. The subtasks of the
sentence-level ABSA, were intended to identify
all the opinion tuples encoding three types of
information: Aspect category, Opinion Target
Expression (OTE) and Sentiment polarity. Aspect
is in turn a pair (E#A) composed of an Entity and
1 The site of UNSPC reports more than 40,000
categories of products (https://www.unspsc.org).
2 https://aclweb.org/aclwiki/SemEval_Portal, seen on
05/24/2018
an Attribute. Entity and attributes, chosen from a
special inventory of entity types (e.g.
“restaurant”, “food”, etc.) and attribute labels
(e.g. “general”, “prices”, etc.) are the pairs
towards which an opinion is expressed in a given
sentence. Each E#A can be referred to a linguistic
expression (OTE) and be assigned one polarity
label.</p>
      <p>
        The evaluation assesses whether a system
correctly identifies the aspect categories towards
which an opinion is expressed. The categories
returned by a system are compared to the
corresponding gold annotations and evaluated
according to different measures (precision (P),
recall (R) and F-1 scores). System performance
for all slots is compared to baseline score.
Baseline System selects categories and polarity
values using Support Vector Machine (SVM)
based on bag-of-words features
        <xref ref-type="bibr" rid="ref2">(Apidianaki et al.,
2016)</xref>
        .
3.2
      </p>
    </sec>
    <sec id="sec-5">
      <title>Related works on unsupervised</title>
    </sec>
    <sec id="sec-6">
      <title>ABSA</title>
      <p>Unsupervised ABSA. Traditionally, in ABSA
context, one problematic aspect is represented by
the fact that, given the non-negligible effort of
annotation, learning corpora are not as large as
needed, especially for languages other than
English. This fact, as well as extension to
“unseen” domains, pushed some researchers to
explore unsupervised methods. Giannakopoulos
et al. (2017) explore new architectures that can
be used as feature extractors and classifiers for
Aspect terms unsupervised detection.</p>
      <p>
        Such unsupervised systems can be based on
syntactic rules for automatic aspect terms
detection (Hercig et al., 2106), or graph
representations
        <xref ref-type="bibr" rid="ref12">(García-Pablos et al., 2017)</xref>
        of
interactions between aspect terms and opinions,
but the vast majority exploits resources derived
from distributional semantic principles
(concretely, word embedding).
      </p>
      <p>
        The benefits of word embedding used for
ABSA were successfully shown in
        <xref ref-type="bibr" rid="ref26">(Xenos et al.,
2016)</xref>
        . This approach, which is nevertheless
supervised, characterizes an unconstrained
system (in the Semeval jargon a system
accessing information not included in the
training set) for detecting Aspect Category,
Opinion Target expression and Polarity. The
used vectors were produced using the skip-gram
model with 200 dimensions and were based on
multiple ensembles, one for each E#A
combination. Each ensemble returns the
combinations of the scores of constrained and
unconstrained systems. For Opinion Target
expression, word embedding based features
extend the constrained system. The resulting scores
reveal, in general, rather high rating position of
the unconstrained system based on word
embedding. Concerning the advantages derived from
the use of pre-trained in domain vectors, they are
also described in
        <xref ref-type="bibr" rid="ref27">(Kim, 2014)</xref>
        , who makes use of
convolutional neural networks trained on top of
pre-trained word vectors and shows good
performances for sentence-level tasks, and
especially for sentiment analysis
      </p>
      <p>
        Some other systems represent a compromise
between supervised and unsupervised ABSA, i.e.
semi-supervised ABSA systems, such an almost
unsupervised system based on topic modelling
and W2V
        <xref ref-type="bibr" rid="ref14">(Hercig et al., 2016)</xref>
        , and W2VLDA
        <xref ref-type="bibr" rid="ref12">(García-Pablos et al., 2017)</xref>
        . The former uses
human annotated datasets for training, but enrich
the feature space by exploiting large unlabeled
corpora. The latter combines different
unsupervised approaches, like word embedding and
Latent Dirichlet Allocation
        <xref ref-type="bibr" rid="ref4">(LDA, Blei et al., 2003)</xref>
        to classify the aspect terms into three Semeval
categories. The only supervision required by the
user is a single seed word per desired aspect and
polarity. Because of that, the system can be
applied to datasets of different languages and
domains with almost no adaptation.
      </p>
    </sec>
    <sec id="sec-7">
      <title>Relationship with Term Extraction. Auto</title>
      <p>matic Terminology Extraction (ATE) is an
important task in NLP, because it provides a clear
footprint of domain-related information. All ATE
methods can be classified into linguistic,
statistical and hybrid (Cabré-Castellvi et al., 2001).</p>
      <p>
        The relationship between word embedding and
ATE method is successfully explored for tasks of
term disambiguation in technical specification
documents
        <xref ref-type="bibr" rid="ref17">(Merdy et al., 2016)</xref>
        . The
distributional neighbors of the 16 seed words were
evaluated on the basis of the three corpora of
different size: small (200,000 words), medium (2 M
words) and large (more than 200 M words). The
results of this study show that the identification
of generic terms is more relevant in the large
sized corpora, since the phenomenon is very
widespread over the contexts. For specified
terms, medium and large sized corpora are
complementary. The specialized medium corpora
brings a gain value by guaranteeing the most
relevant terms. As for the small corpora, it does not
seem to give usable results, whatever the term.
Thus, the authors conclude that word2vec is an
ideal technique to constitute semi-automatically
term lexicon from very large corpora, without
being limited to a domain.
      </p>
      <p>
        Word2vec's methods (such as skip-gram and
CBOW) are also used to improve the extraction
of terms and their identification. This is done by
the composed filtering of Local-global vectors
        <xref ref-type="bibr" rid="ref1">(Amjadian et al., 2016)</xref>
        . The global vectors were
trained on the general corpus with GloVe
        <xref ref-type="bibr" rid="ref21">(Pennington et al., 2014)</xref>
        , and the local vectors on the
specific corpus with CBOW and Skip-gram. This
filter has been made to preserve both
specificdomain and general-domain information that the
words may contain. This filter greatly improves
the output of ATE tools for a unigram term
extraction.
      </p>
      <p>The W2V method seems useful for the task of
categorizing terms using the concepts of an
ontology (Ferré, 2017). The terms (from medical
texts) were first annotated. For each term an
initial vector was generated. These term vectors,
embedded into the ontology vector space, were
compared with the ontology concept vectors. The
calculated closest distance determines the
ontological labeling of the terms.</p>
      <p>
        Word2vec method is used also to emulate a
simple ontology learning system to execute term
and taxonomy extraction from text
        <xref ref-type="bibr" rid="ref10 ref17 ref2 ref25 ref6 ref7">(Wohlgenannt and Minic, 2016)</xref>
        . The researchers apply the
built-in word2vec similarity function to get terms
related to the seed terms. But the minus-side of
the results shows that the candidates suggested
by word2vec are too similar terms, as plural
forms or near synonyms. On the other hand, the
evaluation of word2vec for taxonomy building
gave the accuracy of around 50% on taxonomic
relation suggestion. Being not very impressive,
the system will be improved by parameter
settings and bigger corpora.
      </p>
      <p>In the experiments described in this paper we
exploit only the Skip-gram approach based on
the word2vec implementation. It is important to
notice that this choice is not due to a principled
decision but to not functional constraints related
the fact that that algorithm has a java
implementation, is reasonably fast and it is already
integrated with Innoradiant NLP pipeline.
4</p>
    </sec>
    <sec id="sec-8">
      <title>Previous Investigations</title>
      <p>The experiments described in Dini et al. (under
review), have been performed by using
Innoradiant’s Architecture for Language Analytics
(henceforth IALA). The platform implements a
standard pipelined architecture composed of
classical NLP modules: Sentence Splitting →
Tokenization → POS tagging → lexicon access
→ Dependency Parsing → Feature identification
→ Attitude analysis. Inspired by Dini et al.
(2017) and Dini and Bittar (2016),
sentiment/attitude analysis in IALA is mainly
symbolic. The basic idea is that dependency
representations are an optimal input for rules
computing sentiments. The rule language formalism is
inspired by Valenzuela-Escárcega et al. (2015)
and thanks to its template filling capability, in
several cases, the grammar is able to identify the
perceiver of a sentiment and, most importantly
the cause, of the sentiment, represented by a
word in an appropriate syntactic dependency
with the sentiment-bearing lexical item. For
instance the representation of the opinion in I
hate black coffee. would be something such as:
&lt;Opinion
cause=”3”&gt;.</p>
      <p>trigger=”1”
perceiver=”0”
(where integers represent position of words in a
CONLL like structure).</p>
      <p>By default entities (which are normally
products and services under analysis) are identified
since early processing phases by means of
regular expressions. This choice is rooted in the fact
that by acting at this level multiword entities
(such as hydrating cream) are captured as single
words since early stages.</p>
      <p>The goal of the Dini et al. (2018) work was to
minimize the domain configuration overhead by
i) expanding automatically the polarity lexicon to
increase polarity recall and ii) to perform entity
recognition by providing only two words (seeds)
for each target entity.</p>
      <p>Both goals were achieved by exploiting a
much larger corpus than Semeval, obtained by
automatically scraping restaurant review from
TripAdvisor. The final corpus was composed of
3,834,240 sentence and 65,088,072 lemmas.
From this corpus we obtain a word2vec resource
by using the DL4j library (skip-gram). The
resource (W2VR, henceforth) was obtained by
using lemma rather than surface forms. Relevant
training parameters for reproducing the model
are described in that paper.</p>
      <p>
        We skip here the description of i) (polarity
expansion) as in the context of the present work we
kept polarity exactly as it was in Dini &amp; al.
(2018)3. We just mention the achieved results on
polarity only detection which were a precision of
0.78185594 and a recall of 0.54541063
(F3 Some previous works on unsupervised polarity lexicon
acquisition for sentiment analysis were done in
        <xref ref-type="bibr" rid="ref3 ref6">(Castellucci
et al., 2016; Basili et al., 2017)</xref>
        measure: 0.6425726). These numbers are
important because in our approach a positive match
is always given by a positive match of polarity
and a correct entity identification (in other words
a perfect entity detection system could achieve a
maximum of 0.64 precision).
4.1
      </p>
    </sec>
    <sec id="sec-9">
      <title>Entity Matching</title>
      <p>Entity matching was achieved by manually
associating two seed words to each Semeval entity
(RESTAURANT, FOOD, DRINK, etc.) and then
applying the following algorithm:
• Associate each entity to the average vector
of the seed words (e-vect. E.g.
evect(FOOD)=avg(vect(cuisine),vect(pizza)
).
• If a syntactic cause is found by the grammar
(as in “I liked the meal”)assign it the entity
associated to the closest e-vect.
• Otherwise compute the average vector of n
words surrounding the opinion trigger and
assign the entity associated to the closest
evect.</p>
      <p>With n=35 we obtain precision= 0.47914252,
recall= 0.4888 and F-measure=0.3998.
5</p>
    </sec>
    <sec id="sec-10">
      <title>Integrating terminology</title>
      <p>A possible path to improve results in entity
assignment can be found in the usage of
“synonyms” in the computation of the set of e-vect.
These can again be obtained from W2VR by
selecting the n closest world to the average of the
seeds and using them in the computation of the
e-vect. Expectedly, the value of n can influence
the result as shown in Figure 1.</p>
      <p>We notice that best results are achieved by using
a set of closest world around 10: after that
threshold the noise caused by “false synonyms”
or associated common words causes a decay in
the results. We also notice that overall the results
are better than the original seed-only method, as
now we obtain precision: 0.51 recall: 0.35
Fmeasure: 0.42. Here the positive fact is not only
a global raise of the f-measure, but the fact that
this is mainly caused by an increased precision,
which according to Dini et al. (2018) is the
crucial point in POC level applications.</p>
      <p>
        As a way to remedy to the noise caused by an
unselective use of the n closest words coming
from W2VR we decide to explore an approach
that filters them according to the words
appearing as terms in a terminology obtained from
unsupervised terminology extraction system. To
this purpose we adopted the software TermSuite
        <xref ref-type="bibr" rid="ref7">(Cram &amp; Daille, 2016)</xref>
        which implements a
classic two steps model of identification of term
candidates and their ranking. In particular TermSuite
is based on two main components, a UIMA
Tokens Regex for defining terms and variant
patterns over word annotations, and a grouping
component for clustering terms and variants that
works both at morphological and syntactic levels
        <xref ref-type="bibr" rid="ref7">(for more details cf. Cram &amp; Daille, 2016)</xref>
        . The
interest of using this resource for filtering results
from W2VR is that “quality word” lists are
obtained with the adoption of methods
fundamentally different from W2V approach and heavily
based on language dependent syntactic patterns.
      </p>
      <p>We performed the same experiments as
W2VR expansion for the computation of e-vect,
with the only difference that now the top n must
appear as closest terms in W2VR and as terms in
the terminology (The W2VR parameters,
including corpus are described in section 4; the
terminology was obtained from the same corpus about
restaurants). The results are detailed in Figure 2.</p>
      <p>We notice that all scores increase significantly.
In particular at top n=10 we obtain
P=0.550233483, R=0.381750288 and
F=0.450762752, which represents a 5% increase
(in F-measure) w.r.t. the results presented in Dini
et al. (2018).</p>
    </sec>
    <sec id="sec-11">
      <title>Conclusions</title>
      <p>Many improvements can be conceived to the
method presented here, especially concerning the
computation of the vector associated to the
opinionated windows, both in terms of size,
directionality and consideration of finer grained
features (e.g. indicators of a switch of topic).
However our future investigation will rather be
oriented towards full-fledged ABSA, i.e. taking into
account not only Entities, but also Attributes.
Indeed, if we consider that the 45% F measure is
obtained on a corpus where only 66% sentences
were correctly classified according to the
sentiment and if we put ourselves in a Semeval
perspective where entity evaluation is provided with
respect to a “gold sentiment standard” we
achieve a F-score of 68%, which is fully
acceptable for an almost unsupervised system.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <given-names>Ehsan</given-names>
            <surname>Amjadian</surname>
          </string-name>
          , Diana Inkpen,
          <string-name>
            <given-names>T.Sima</given-names>
            <surname>Paribakht</surname>
          </string-name>
          and
          <string-name>
            <given-names>Farahnaz</given-names>
            <surname>Faez</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Local-Global Vectors to Improve Unigram Terminology Extraction</article-title>
          .
          <source>Proceedings of the 5th International Workshop on Computational Terminology</source>
          , Osaka, Japan, Dec
          <volume>12</volume>
          ,
          <year>2016</year>
          ,
          <fpage>2</fpage>
          -
          <lpage>11</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <given-names>Marianna</given-names>
            <surname>Apidianaki</surname>
          </string-name>
          , Xavier Tannier and
          <string-name>
            <given-names>Cécile</given-names>
            <surname>Richart</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Datasets for Aspect-Based Sentiment Analysis in French</article-title>
          .
          <source>Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC</source>
          <year>2016</year>
          ). Portorož, Slovenia.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Basili</surname>
          </string-name>
          , Danilo Croce and
          <string-name>
            <given-names>Giuseppe</given-names>
            <surname>Castellucci</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Dynamic polarity lexicon acquisition for advanced Social Media analytics</article-title>
          .
          <source>International Journal of Engineering Business Management</source>
          , Volume
          <volume>9</volume>
          ,
          <fpage>1</fpage>
          -
          <lpage>18</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <given-names>David M.</given-names>
            <surname>Blei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Andrew Y.</given-names>
            <surname>Ng</surname>
          </string-name>
          and
          <string-name>
            <given-names>Michael I.</given-names>
            <surname>Jordan</surname>
          </string-name>
          .
          <year>2003</year>
          .
          <article-title>Latent Dirichlet Allocation</article-title>
          .
          <source>The Journal of machine Learning research</source>
          , Volume
          <volume>3</volume>
          ,
          <fpage>993</fpage>
          -
          <lpage>1022</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>M. Teresa Cabré</surname>
            <given-names>Castellví</given-names>
          </string-name>
          , Rosa Estopà Bagot, Jordi Vivaldi Palatresi.
          <year>2001</year>
          .
          <article-title>Automatic term detection: a review of current systems</article-title>
          . In Bourigault, D. Jacquemin,
          <string-name>
            <surname>C. L'Homme</surname>
            ,
            <given-names>M-C.</given-names>
          </string-name>
          <year>2001</year>
          .
          <source>Recent Advances in Computational Terminology</source>
          ,
          <fpage>53</fpage>
          -
          <lpage>88</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <given-names>Giuseppe</given-names>
            <surname>Castellucci</surname>
          </string-name>
          , Danilo Croce and
          <string-name>
            <given-names>Roberto</given-names>
            <surname>Basili</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>A Language Independent Method for Generating Large Scale Polarity Lexicons</article-title>
          .
          <source>Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC'16)</source>
          , Portoroz, Slovenia,
          <fpage>38</fpage>
          -
          <lpage>45</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <given-names>Damien</given-names>
            <surname>Cram</surname>
          </string-name>
          and
          <string-name>
            <given-names>Béatrice</given-names>
            <surname>Daille</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>TermSuite: Terminology Extraction with Term Variant Detection</article-title>
          .
          <source>Proceedings of the 54th Annual Meeting of</source>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <given-names>Luca</given-names>
            <surname>Dini</surname>
          </string-name>
          , Paolo Curtoni and
          <string-name>
            <given-names>Elena</given-names>
            <surname>Melnikova</surname>
          </string-name>
          .
          <year>2018</year>
          .
          <article-title>Portability of Aspect Based Sentiment Analysis: Thirty Minutes for a Proof of Concept</article-title>
          . Submitted to:
          <source>The 5th IEEE International Conference on Data Science and Advanced Analytics. DSAA</source>
          <year>2018</year>
          , Turin.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <string-name>
            <given-names>Luca</given-names>
            <surname>Dini</surname>
          </string-name>
          , André Bittar, Cécile Robin,
          <string-name>
            <given-names>Frédérique</given-names>
            <surname>Segond</surname>
          </string-name>
          and
          <string-name>
            <given-names>M.</given-names>
            <surname>Montaner</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>SOMA: The Smart Social Customer Relationship Management</article-title>
          .
          <article-title>Sentiment Analysis in Social networks</article-title>
          .
          <source>Chapter</source>
          <volume>13</volume>
          .
          <fpage>197</fpage>
          -
          <lpage>209</lpage>
          . DOI:
          <volume>10</volume>
          .1016/B978-0
          <source>-12-804412- 4</source>
          .
          <fpage>00013</fpage>
          -
          <lpage>9</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <given-names>Luca</given-names>
            <surname>Dini</surname>
          </string-name>
          and
          <string-name>
            <given-names>André</given-names>
            <surname>Bittar</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Emotion Analysis on Twitter: The Hidden Challenge</article-title>
          .
          <source>Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC</source>
          <year>2016</year>
          , Portorož, Slovenia,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <string-name>
            <given-names>Arnaud</given-names>
            <surname>Ferré</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Représentation de termes complexes dans un espace vectoriel relié à une ontologie pour une tâche de catégorisation</article-title>
          .
          <source>Rencontres des Jeunes Chercheurs en Intelligence Artifcielle (RJCIA</source>
          <year>2017</year>
          ),
          <year>Jul 2017</year>
          , Caen, France.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <string-name>
            <given-names>Aitor</given-names>
            <surname>García-Pablos</surname>
          </string-name>
          ,
          <article-title>Montse Cuadros</article-title>
          and
          <string-name>
            <given-names>German</given-names>
            <surname>Rigau</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>W2VLDA: Almost unsupervised system for Aspect Based Sentiment Analysis</article-title>
          .
          <source>Expert Systems with Applications</source>
          , (
          <volume>91</volume>
          ):
          <fpage>127</fpage>
          -
          <lpage>137</lpage>
          . arXiv:
          <volume>1705</volume>
          .07687v2 [cs.CL],
          <volume>18</volume>
          jul
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          <string-name>
            <given-names>Athanasios</given-names>
            <surname>Giannakopoulos</surname>
          </string-name>
          , Diego Antognini, Claudiu Musat,
          <string-name>
            <given-names>Andreea</given-names>
            <surname>Hossmann</surname>
          </string-name>
          and
          <string-name>
            <given-names>Michael</given-names>
            <surname>Baeriswyl</surname>
          </string-name>
          .
          <year>2017</year>
          .
          <article-title>Dataset Construction via Attention for Aspect Term Extraction with Distant Supervision</article-title>
          .
          <source>2017 IEEE International Conference on Data Mining Workshops (ICDMW)</source>
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <string-name>
            <given-names>Tomáš</given-names>
            <surname>Hercig</surname>
          </string-name>
          , Tomáš Brychcín, Lukáš Svoboda, Michal Konkol and
          <string-name>
            <given-names>Josef</given-names>
            <surname>Steinberger</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Unsupervised Methods to Improve Aspect-Based Sentiment Analysis in Czech</article-title>
          .
          <source>Computación y Sistemas</source>
          , vol.
          <volume>20</volume>
          , No.
          <volume>3</volume>
          ,
          <fpage>365</fpage>
          -
          <lpage>375</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <given-names>David</given-names>
            <surname>Jurgens</surname>
          </string-name>
          and
          <string-name>
            <given-names>Keith</given-names>
            <surname>Stevens</surname>
          </string-name>
          .
          <year>2010</year>
          .
          <article-title>The S-Space package: An open source package for word space models</article-title>
          .
          <source>Proceedings of the ACL 2010 Systel Demonstrations</source>
          ,
          <fpage>30</fpage>
          -
          <lpage>35</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <string-name>
            <given-names>Noriaki</given-names>
            <surname>Kano</surname>
          </string-name>
          , Nobuhiku Seraku, Fumio Takahashi, Shinichi Tsuji,
          <year>1984</year>
          . Attractive Quality and
          <string-name>
            <given-names>MustBe</given-names>
            <surname>Quality</surname>
          </string-name>
          .
          <source>Hinshitsu: The Journal of the Japanese Society for Quality Control</source>
          ,
          <volume>14</volume>
          (
          <issue>2</issue>
          ) :
          <fpage>39</fpage>
          -
          <lpage>48</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <given-names>Emilie</given-names>
            <surname>Merdy</surname>
          </string-name>
          , Juyeon Kang and
          <string-name>
            <given-names>Ludovic</given-names>
            <surname>Tanguy</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Identification de termes flous et génériques dans la documentation technique : expérimentation avec l'analyse distributionnelle automatique</article-title>
          . Actes de l'atelier
          <article-title>"Risque et TAL" dans le cadre de la conférence TALN</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          , Kai Chen, Greg Corrado,
          <string-name>
            <given-names>Jeffrey</given-names>
            <surname>Dean</surname>
          </string-name>
          .
          <year>2013a</year>
          .
          <article-title>Efficient estimation of word representations in vector space</article-title>
          .
          <source>arXiv preprint arXiv:1301.3781</source>
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          <string-name>
            <given-names>Tomas</given-names>
            <surname>Mikolov</surname>
          </string-name>
          ,
          <article-title>Wen-tau Yih and Geoffrey Zweig</article-title>
          . 2013b.
          <article-title>Linguistic regularities in continuous space word representations</article-title>
          .
          <source>Proceedings of NAACL-HLT</source>
          <year>2013</year>
          ,
          <volume>746</volume>
          -
          <fpage>751</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <given-names>Dan R.</given-names>
            <surname>Olsen</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>The Lean Product Playbook: How to Innovate with Minimum Viable Products and Rapid Customer Feedback</article-title>
          . John Wiley &amp; Sons Inc: New York, United States.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Jeffrey</surname>
            <given-names>Pennington</given-names>
          </string-name>
          , Richard Socher and
          <string-name>
            <given-names>Christopher D.</given-names>
            <surname>Manning</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>GloVe: Global Vectors for Word Representation</article-title>
          .
          <source>Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)</source>
          ,
          <fpage>1532</fpage>
          -
          <lpage>1543</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <string-name>
            <given-names>Maria</given-names>
            <surname>Pontiki</surname>
          </string-name>
          , Dimitrios Galanis, Haris Papageorgiou, Ion Androutsopoulos, Suresh Manandhar, Mohammad Al-Smadi, Mahmoud Al-Ayyoub,
          <string-name>
            <given-names>Yanyan</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Bing</given-names>
            <surname>Qin</surname>
          </string-name>
          , Orphée De Clecq, Véronique Hoste, Marianna Apidianaki, Xavier Tannier, Natalia Loukachevitch, Evgeny Kotelnikov, Nuria Bel,
          <string-name>
            <surname>Salud M. Jiménez-Zafra</surname>
            ,
            <given-names>Gülşen</given-names>
          </string-name>
          <string-name>
            <surname>Eryiğit</surname>
          </string-name>
          .
          <year>2016</year>
          . SemEval
          <article-title>-2016 Task 5: Aspect Based Sentiment Analysis</article-title>
          .
          <source>Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval</source>
          <year>2016</year>
          ). San Diego, USA.
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <given-names>Marco A.</given-names>
            <surname>Valenzuela-Escárcega</surname>
          </string-name>
          , Gustave V.
          <article-title>HahnPowell</article-title>
          and
          <string-name>
            <given-names>Mihai</given-names>
            <surname>Surdeanu</surname>
          </string-name>
          .
          <year>2015</year>
          .
          <article-title>Description of the Odin Event Extraction Framework and Rule Language</article-title>
          .
          <source>arXiv:1509.07513v1 [cs.CL], 24 Sep</source>
          <year>2015</year>
          , version
          <issue>1</issue>
          .0,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <given-names>Lars</given-names>
            <surname>Witell</surname>
          </string-name>
          , Martin Löfgren and
          <string-name>
            <given-names>Jens J.</given-names>
            <surname>Dahlgaard</surname>
          </string-name>
          .
          <year>2013</year>
          .
          <article-title>Theory of attractive quality and the Kano methodology - the past, the present, and the future</article-title>
          .
          <source>Total Quality Management &amp; Business Excellence</source>
          , (
          <volume>24</volume>
          ),
          <fpage>11</fpage>
          -
          <lpage>12</lpage>
          :
          <fpage>1241</fpage>
          -
          <lpage>1252</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <given-names>Gerhard</given-names>
            <surname>Wohlgenannt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Filip</given-names>
            <surname>Minic</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>Using word2vec to Build a Simple Ontology Learning System</article-title>
          .
          <source>Proceedings of the ISWC</source>
          <year>2016</year>
          co
          <article-title>-located with 15th International Semantic Web Conference (ISWC 2016)</article-title>
          . Vol-
          <volume>1690</volume>
          . Kobe, Japan, October
          <volume>19</volume>
          ,
          <year>2016</year>
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <given-names>Dionysios</given-names>
            <surname>Xenos</surname>
          </string-name>
          , Panagiotis Theodorakakos, John Pavlopoulos, Prodromos Malakasiotis,
          <string-name>
            <given-names>Ion</given-names>
            <surname>Androutsopoulos</surname>
          </string-name>
          .
          <year>2016</year>
          .
          <article-title>AUEB-ABSA at SemEval2016 Task 5: Ensembles of Classifiers and Embeddings for Aspect Based Sentiment Analysis</article-title>
          .
          <source>Proceedings of SemEval-2016</source>
          , San Diego, California,
          <fpage>312</fpage>
          -
          <lpage>317</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <given-names>Yoon</given-names>
            <surname>Kim</surname>
          </string-name>
          .
          <year>2014</year>
          .
          <article-title>Convolutional Neural Networks for Sentence Classification</article-title>
          .
          <source>Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)</source>
          ,
          <fpage>1746</fpage>
          -
          <lpage>1751</lpage>
          . arXiv:
          <volume>1408</volume>
          .
          <fpage>5882</fpage>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>