=Paper=
{{Paper
|id=Vol-1702/tass2016_proceedings_v216
|storemode=property
|title=GTI at TASS 2016: Supervised Approach for Aspect Based Sentiment Analysis in Twitter
|pdfUrl=https://ceur-ws.org/Vol-1702/tass2016_proceedings_v216.pdf
|volume=Vol-1702
|authors=Tamara Álvarez-López,Milagros Fernández-Gavilanes,Silvia García-Méndez,Jonathan Juncal-Martínez,Francisco Javier González-Castaño
|dblpUrl=https://dblp.org/rec/conf/sepln/Alvarez-LopezGG16
}}
==GTI at TASS 2016: Supervised Approach for Aspect Based Sentiment Analysis in Twitter==
TASS 2016: Workshop on Sentiment Analysis at SEPLN, septiembre 2016, pág. 53-57
GTI at TASS 2016: Supervised Approach for Aspect Based
Sentiment Analysis in Twitter∗
GTI en TASS 2016: Una aproximación supervisada para el análisis de
sentimiento basado en aspectos en Twitter
Tamara Álvarez-López, Milagros Fernández-Gavilanes, Silvia Garcı́a-Méndez,
Jonathan Juncal-Martı́nez, Francisco Javier González-Castaño
GTI Research Group, AtlantTIC
University of Vigo, 36310 Vigo, Spain
{talvarez,mfgavilanes,sgarcia,jonijm}@gti.uvigo.es, javier@det.uvigo.es
Resumen: Este artı́culo describe la participación del grupo de investigación GTI,
del centro AtlantTIC, perteneciente a la Universidad de Vigo, en el tass 2016. Este
taller es un evento enmarcado dentro de la XXXII edición del Congreso Anual de
la Sociedad Española para el Procesamiento del Lenguaje Natural. En este trabajo
se propone una aproximación supervisada, basada en clasificadores, para la tarea de
análisis de sentimiento basado en aspectos. Mediante esta técnica hemos conseguido
mejorar las prestaciones de ediciones anteriores, obteniendo una solución acorde con
el estado del arte actual.
Palabras clave: Análisis de sentimiento, aspectos, SVM, aprendizaje automático,
Twitter
Abstract: This paper describes the participation of the GTI research group of
AtlantTIC, University of Vigo, in tass 2016. This workshop is framed within the
XXXII edition of the Annual Congress of the Spanish Society for Natural Language
Processing event. In this work we propose a supervised approach based on classifiers,
for the aspect based sentiment analysis task. Using this technique we managed to
improve the performance of previous years, obtaining a solution reflecting the actual
state-of-the-art.
Keywords: Sentiment analysis, aspects, SVM, machine learning, Twitter
1 Introduction mum length of the post. However, tweets
have other elements we have to consider,
The social media activity is being profused
like hashtags, mentions and retweets. More
in the recent years, users post opinions and
concretely, aspect-based sentiment analysis
comments in Twitter and in other social plat-
(absa) consists of extracting opinions, i.e.
forms. Due to this, there is a huge amount
determining the sentiment polarity, from spe-
of information available that could be use-
cific entities in the text (Liu, 2012). There-
ful for business, in order to design marketing
fore, this task becomes a challenge on the
campaigns or to apply any kind of business
field of nlp.
analysis.
As a consequence, the research on text The tass Workshop (Garcı́a-Cumbreras
mining and also on the field of Sentiment et al., 2016) and the sepln conference of-
Analysis (sa) has grown considerably these fer an opportunity for participants to know
days. sa is the part of Natural Language Pro- about the latest advances on the field of nlp
cessing (nlp) responsible for determining the for Spanish language.
polarity of a text or a whole sentence. The Many approaches applied to sa can be
sa applied to Twitter has to be conducted found in the literature, where it is possi-
in a restricted scenario due to the maxi- ble to distinguish between knowledge based
∗
approaches (Brooke, Tofiloski, and Taboada,
This work was partially supported by the Minis-
terio de Economı́a y Competitividad under project
2009; Fernández-Gavilanes et al., 2016), us-
COINS (TEC2013-47016-C2-1-R) and by Xunta de ing grammars and thesaurus and others
Galicia (GRC2014/046). based on machine learning approaches (Mo-
ISSN 1613-0073
T. Álvarez-López, M. Fernández-Gavilanes, S. García-Méndez, J. Juncal-Martínez, F. J. González-Castaño
hammad, Kiritchenko, and Zhu, 2013). In plying sa to Twitter has been fully ad-
the last years we can also find deep learning dressed (Pak and Paroubek, 2010; Han and
approaches (Bengio, 2009), applied to this Baldwin, 2011). Within the chosen solu-
task. tions, we highlight the text normalization
We present our supervised machine learn- approach (Fabo, Cuadros, and Etchegoyhen,
ing (ml) system which consists of a Support 2013) and the use of key elements in classifi-
Vector Machine (svm) classifier. Our objec- cation approach (Wang et al., 2011). Others
tive is to conduct the sa process at an aspect hold the advantages of using deep learning
level, task 2, determining the polarity of a techniques in this task (dos Santos and Gatti,
specific given part of a sentence. 2014).
The article is structured as follows. Sec- According to the purpose of the developed
tion 2 is a review of the research involving sa systems, it is possible to find applications
in the Twitter domain. Then, the Section 3 like classification of product reviews and po-
describes the applied approach and the im- litical sentiment and election results pre-
plemented system. In Section 4, we show the diction (Bermingham and Smeaton, 2011),
experimental results of our system. Finally, among others.
in Section 5 we present the conclusions and
future works. 3 System Overview
In this section we make a brief description
2 Related work of the system submitted for Task 2: Aspect-
A large amount of literature related to Opin- based sentiment analysis. We developed a
ion Mining (om) and sa can be found (Pang supervised system, based on a svm classifier
and Lee, 2008; Martı́nez-Cámara et al., using different features. In the next subsec-
2016). Most of the systems are applied to tions we explain the different steps required.
Twitter. However others are applied to social
media platforms within the micro-blog con- 3.1 Preprocessing
text. Due to this, the approaches are varied Before applying any supervised approach to
technically and in connection with the pur- our corpus, some preprocessing is needed.
pose. First of all, we have to normalize the text,
Two main approaches exist in sa: super- since in Twitter language we can find abbre-
vised and unsupervised learning ones. Super- viations, mentions, hashtags, URLs or mis-
vised systems implement classification meth- spellings. In order to do that, we replace the
ods like svm, Logistic Regression (lr), Con- URLs with the “URL” tag and we replace the
ditional Random Fields (crf), K-Nearest abbreviations or misspellings with the correct
Neighbors (knn), etc. Cui, Mittal, and Datar entire word. For mentions and hashtags, we
(2006) affirmed that svm are more appro- keep them unchanged but deleting the “@”
priate for sentiment classification than gen- or “#” symbols. Moreover, when a hashtag
erative models, due to their capability for is composed of several words, we split and
working with ambiguity, that is, dealing with treat them as different tokens.
mixed feelings. Supervised algorithms are After this, a lexical analysis is carried out.
used when the number of classes, as well as It consists of lemmatization and POS tag-
the representative members of each class, are ging, which are performed by means of Freel-
known. ing tool (Atserias et al., 2006).
Unsupervised systems are based on lin- Once we have analysed lexically the texts,
guistic knowledge like lexicons, and syntactic we decided to separate the sentences by the
features in order to infer the polarity (Pal- different aspects. For doing that, the scope
toglou and Thelwall, 2012). These last tech- of each aspect is determined, applying the
niques represent a more effective approach in following rules, which are adapted from our
the cross-domain context and for multilingual English aspect based sentiment anaylisis sys-
applications. The unsupervised classification tem (Alvarez-López et al., 2016)
algorithms do not work with a training set,
in contrast, some of them use clustering algo- • If there is only one aspect in the sen-
rithms in order to distinguish groups (Li and tence, we keep the sentence unchanged,
Liu, 2010). and introduce it entirely as input for the
As noted earlier, the special case of ap- next step.
54
GTI en TASS 2016: Una aproximación supervisada para el análisis de sentimiento basado en aspectos en Twitter
• If there are multiple aspects, we separate a number of political issues, such as health
the sentences by punctuation marks, or economy, among others. These issues are
conjunctions or other aspects found. framed in the political campaign of Andalu-
sian elections in 2015, where each aspect re-
• If there are several aspects with no words
lates to one or several entities that corre-
between them, we consider that they be-
spond to one of the main political parties
long to the same context, and assign the
in Spain (PP, PSOE, IU, UPyD, Cs and
same polarity to all of them.
Podemos). The corpus is composed by 1,284
tweets, and has been divided into a training
3.2 SVM classifier set (784 tweets) and a set of evaluation (500
In this section we describe the strategy fol- tweets).
lowed to determine the sentiment (positive, In order to evaluate the performance of
negative or neutral) for each aspect prede- the various features for polarity classification
fined in corpus. at an aspect-based level, we perform a se-
We develop a svm classifier, using the lib- ries of ablation experiments as shown in Ta-
svm library (Chang and Lin, 2011). The in- ble 1. We start with the word token base-
puts for the svm will be the sentences sep- line classifier, and then add all four sets of
arated by contexts, as explained in the pre- features that help to increase performance as
vious subsection. The features extracted are measured by accuracy. As we might expect,
the following: including the aspect feature has the most
marked effect on the performance of polarity
• Word tokens of nouns, adjectives and classification, although all the features con-
verbs in the sentence. tributed to improving overall performance on
• Lemmas of verbs, nouns and adjectives stompol corpus.
that appear in each sentence.
Type Accuracy Improvement
• POS tags of nouns, adjectives and verbs.
Word token 56.12
• N-grams of different length, grouping the +Lemmas 57.64 +1.52%
words in each sentence. +pos tags 58.26 +0.62%
• Aspects appearing in the sentence. We +Aspects 59.94 +1.68%
join “aspect”-“entity”, defined in each +Negations 60.60 +0.66%
target as a feature.
• Negations. We create a negation dic- Table 1: Results for polarity feature ablation
tionary, which contains several parti- experiments on stompol corpus
cles indicating negation, such as “no”,
“nunca”, etc. Due to the low participation of research
teams in task 2 this year, we decided to com-
The previous features are all binary ones, pare our proposal to the systems presented
assigning the value 1 if the current feature is this year and also to that ones of last year,
present in the tweet and the value 0, if not. because of the use of the same dataset.
For this reason, Table 2 compares results
4 Experimental Results for our approach with different official ones
The Task 2: Sentiment Analysis at the as- submitted in 2015 and 2016 tass editions.
pect level consists of assigning a polarity label In this way, we compared our results for a
to each aspect, which were initially marked ml approach based on well-known squared-
in the stompol corpus (Martı́nez-Cámara et regularised logistic regression with a snippet
al., 2016) raised by the tass organization. In of length 4 (Lys-2) described in Vilares et
this way, this corpus provides both polarity al. (2015), a clustering method focused on
labels and the identification of the aspects grouping authors with similar sociolinguis-
that appear in each tweet. The aim is to be tic insights (TID-spark) described in Park
able to correctly assign to each aspect a pos- (2015), a recurrent neural network composed
itive, negative or neutral polarity. of a single long short term memory and a
In this regard, the stompol corpus con- logistic function (Lys-1) described in Vilares
sists of a set of Spanish tweets related to et al. (2015), a ml approach based on a
55
T. Álvarez-López, M. Fernández-Gavilanes, S. García-Méndez, J. Juncal-Martínez, F. J. González-Castaño
svm with a snipped of length 5,7 and 10 In Proceedings of LREC, volume 6, pages
(ELiRF) described in Hurtado, Plà, and Bus- 48–55.
caldi (2015), and the best performing run of
Bengio, Y. 2009. Learning deep architec-
the actual task 2 tass edition (ELiRF-UPV).
tures for AI. Found. Trends Mach. Learn.,
2(1):1–127, January.
Experiment Task edition Accuracy
Bermingham, A. and A. F. Smeaton. 2011.
ELiRF-UPV 2016 63.3
On using Twitter to monitor political sen-
ELiRF 2015 63.3
timent and predict election results.
GTI 2016 60.6
LyS-1 2015 59.9 Brooke, J., M. Tofiloski, and M. Taboada.
TID-spark 2015 55.7 2009. Cross-linguistic sentiment analysis:
Lys-2 2015 54.0 From english to spanish. In G. Angelova,
K. Bontcheva, R. Mitkov, N. Nicolov, and
N. Nikolov, editors, RANLP, pages 50–
Table 2: Results of different approaches in 54. RANLP 2009 Organising Committee
2015/2016 tass editions on stompol corpus / ACL.
Comparing the results, the performance of Chang, C.-C. and C.-J. Lin. 2011. Libsvm: a
our current model is close from the top rank- library for support vector machines. ACM
ing systems of this and last year. Transactions on Intelligent Systems and
Technology (TIST), 2(3):27.
5 Conclusions and future works
Cui, H., V. Mittal, and M. Datar. 2006.
This paper describes the participation of the
Comparative experiments on sentiment
GTI group in the tass 2016, Task 2: Aspect-
classification for online product reviews.
Based Sentiment Analysis. We developed a
In Proceedings of the 21st National Con-
supervised system based on a svm classifier
ference on Artificial Intelligence - Vol-
for the aspect-based sentiment analysis. The
ume 2, AAAI’06, pages 1265–1270. AAAI
performance of our approach has been com-
Press.
pared to that ones submitted this year but
also to that ones submitted last year. Exper- dos Santos, C. N. and M. Gatti. 2014. Deep
imental results suggest that we need to in- convolutional neural networks for senti-
clude explore new features, such as word em- ment analysis of short texts. In COLING,
bedding representations or paraphrase (Zhao pages 69–78.
and Lan, 2015), in order to improve the per-
Fabo, P. R., M. Cuadros, and T. Etchegoy-
formance.
hen. 2013. Lexical normalization of
As future work we plan to include new fea-
spanish tweets with preprocessing rules,
tures explained before and to develop a new
domain-specific edit distances, and lan-
system which combines different ml classifi-
guage models. In Proceedings of the Tweet
cation methods. We are also interested in
Normalization Workshop co-located with
considering different paradigms of heteroge-
29th Conference of the Spanish Society
neous classification, such as deep learning to
for Natural Language Processing (SEPLN
increase the performance.
2013), Madrid, Spain, September 20th,
References 2013., pages 59–63.
Alvarez-López, T., J. Juncal-Martınez, Fernández-Gavilanes, M., T. Álvarez-López,
M. Fernández-Gavilanes, E. Costa- J. Juncal-Martı́nez, E. Costa-Montenegro,
Montenegro, and F. J. González-Castano. and F. J. González-Castaño. 2016. Unsu-
2016. Gti at semeval-2016 task 5: Svm pervised method for sentiment analysis in
and crf for aspect detection and unsu- online texts. Expert Systems with Appli-
pervised aspect-based sentiment analysis. cations, 58:57–75.
Proceedings of SemEval, pages 306–311. Garcı́a-Cumbreras, M. A., J. Villena-Román,
Atserias, J., B. Casas, E. Comelles, E. Martı́nez-Cámara, M. C. Dı́az-Galiano,
M. González, L. Padró, and M. Padró. M. T. Martı́n-Valdivia, and L. A. Ureña-
2006. Freeling 1.3: Syntactic and seman- López. 2016. Overview of tass 2016. In
tic services in an open-source NLP library. Proceedings of TASS 2016: Workshop on
56
GTI en TASS 2016: Una aproximación supervisada para el análisis de sentimiento basado en aspectos en Twitter
Sentiment Analysis at SEPLN co-located (LREC’10), Valletta, Malta, may. Eu-
with the 32nd SEPLN Conference (SE- ropean Language Resources Association
PLN 2016), Salamanca, Spain, Septem- (ELRA).
ber.
Paltoglou, G. and M. Thelwall. 2012. Twit-
Han, B. and T. Baldwin. 2011. Lexi- ter, myspace, digg: Unsupervised sen-
cal normalisation of short text messages: timent analysis in social media. ACM
Makn sens a #twitter. In Proceedings of Transactions on Intelligent Systems and
the 49th Annual Meeting of the Associa- Technology (TIST), 3(4):66.
tion for Computational Linguistics: Hu- Pang, B. and L. Lee. 2008. Opinion min-
man Language Technologies - Volume 1, ing and sentiment analysis. Found. Trends
HLT ’11, pages 368–378, Stroudsburg, PA, Inf. Retr., 2(1-2):1–135, January.
USA. Association for Computational Lin-
guistics. Park, S. 2015. Sentiment classification us-
ing sociolinguistic clusters. In Proceedings
Hurtado, L. F., F. Plà, and D. Bus- of TASS 2015: Workshop on Sentiment
caldi. 2015. ELiRF-UPV en TASS Analysis at SEPLN co-located with 31st
2015: Análisis de sentimientos en Twit- SEPLN Conference (SEPLN 2015), Ali-
ter. In Proceedings of TASS 2015: Work- cante, Spain, September 15, 2015., pages
shop on Sentiment Analysis at SEPLN co- 99–104.
located with 31st SEPLN Conference (SE-
PLN 2015), Alicante, Spain, September Vilares, D., Y. Doval, M. A. Alonso, and
15, 2015., pages 75–79. C. Gómez-Rodrı́guez. 2015. Lys at
TASS 2015: Deep learning experiments
Li, G. and F. Liu. 2010. A clustering-based for sentiment analysis on spanish tweets.
approach on sentiment analysis. In Intel- In Proceedings of TASS 2015: Work-
ligent Systems and Knowledge Engineer- shop on Sentiment Analysis at SEPLN co-
ing (ISKE), 2010 International Confer- located with 31st SEPLN Conference (SE-
ence on, pages 331–337. IEEE. PLN 2015), Alicante, Spain, September
15, 2015., pages 47–52.
Liu, B. 2012. Sentiment Analysis and Opin-
ion Mining. Synthesis Lectures on Human Wang, X., F. Wei, X. Liu, M. Zhou, and
Language Technologies. Morgan & Clay- M. Zhang. 2011. Topic sentiment anal-
pool Publishers. ysis in Twitter: A graph-based hashtag
sentiment classification approach. In Pro-
Martı́nez-Cámara, E., M. A. Garcı́a- ceedings of the 20th ACM International
Cumbreras, J. Villena-Román, and Conference on Information and Knowl-
J. Garcı́a-Morera. 2016. Tass 2015 - the edge Management, CIKM ’11, pages 1031–
evolution of the spanish opinion mining 1040, New York, NY, USA. ACM.
systems. Procesamiento del Lenguaje
Natural, 56:33–40. Zhao, J. and M. Lan. 2015. Ecnu: Lever-
aging word embeddings to boost perfor-
Mohammad, S. M., S. Kiritchenko, and mance for paraphrase in Twitter. In Pro-
X. Zhu. 2013. Nrc-canada: Building the ceedings of the 9th International Work-
state-of-the-art in sentiment analysis of shop on Semantic Evaluation (SemEval
tweets. In Proceedings of the seventh in- 2015), pages 34–39, Denver, Colorado,
ternational workshop on Semantic Evalu- June. Association for Computational Lin-
ation Exercises (SemEval-2013), Atlanta, guistics.
Georgia, USA, June.
Pak, A. and P. Paroubek. 2010. Twit-
ter as a corpus for sentiment analy-
sis and opinion mining. In N. C. C.
Chair), K. Choukri, B. Maegaard, J. Mar-
iani, J. Odijk, S. Piperidis, M. Ros-
ner, and D. Tapias, editors, Proceedings
of the Seventh International Conference
on Language Resources and Evaluation
57