=Paper= {{Paper |id=Vol-2627/short11 |storemode=property |title=Text Mining for Aspect Based Sentiment Analysis on Customer Review : A Case Study in the Hotel Industry |pdfUrl=https://ceur-ws.org/Vol-2627/paper2.pdf |volume=Vol-2627 |authors=Fitra A. Bachtiar,Wirdhayanti Paulina,Alfi Nur Rusydi |dblpUrl=https://dblp.org/rec/conf/iicst/BachtiarPR20 }} ==Text Mining for Aspect Based Sentiment Analysis on Customer Review : A Case Study in the Hotel Industry== https://ceur-ws.org/Vol-2627/paper2.pdf

TEXT MINING FOR ASPECT BASED SENTIMENT ANALYSIS ON CUSTOMER
REVIEW: A CASE STUDY IN THE HOTEL INDUSTRY

Fitra A. Bachtiar, Wirdhayanti Paulina, Alfi Nur Rusydi
Faculty of Computer Science, Brawijaya University, fitra.bachtiar@ub.ac.id

ABSTRACT

The development of the role of the OTA (Online Travel Agent) site has become one of the E-WOM (Electronic
Word of Mouth) media in addition to its main function as a platform for ticket reservations to encourage
stakeholders in the hotel industry to utilize E-WOM for business continuity. One of the guest houses in Malang
realized the importance of E-WOM because 90 percent of the booking process originated from the OTA website.
However, the process of processing customer reviews only focuses on physical reviews, namely Guest Reviews.
Meanwhile, information from online sources can have a more significant impact on E-WOM. One of the
techniques of text mining is sentiment analysis which can be used to process and group text reviews. Sentiment
analysis can be done to determine the sentiment of opinions on customer reviews to determine customer
satisfaction with guest house services that aim to produce a positive E-WOM. Sentiment analysis is carried out at
the aspect level using aspects of location, room, food, price, and service. The text of the review used in Indonesian
originates from the sites Agoda.com, Expedia, Pegi-Pegi, Booking.Com, TripAdvisor and has a timeline from
2012 to 2019. This research yields findings in the form of customer satisfaction analysis of the five aspects where
food aspects have urgency to be addressed and corrected immediately. Evaluation of the classification results also
proves the effectiveness of the SVM method from Naïve Bayes
Key words: Guest House, E-WOM, Sentiment Analysis.

1. INTRODUCTION

The rapid growth of technology has encouraged the development of the role of the OTA (Online Travel Agent)
site as one of the E-WOM (Electronic Word Of Mouth) media in addition to its main function as a platform for
ticket reservations. Westbrook (1987) states that all informal communication aimed at consumers through internet-
based technology related to the use or characteristics of a product, service, or provider is called E-WOM. Through
these sites, customers are expected to provide reviews / reviews about what they feel and experience after a visit
at the place they are going to either a hotel, restaurant, amusement vehicle and so on. Positive e-WOM can be
generated through good customer reviews, while good reviews can be generated from satisfying customer
experience for the services and accommodations produced.
One of the guest houses in Malang realized the importance of E-WOM in its business continuity because 90
percent of the booking process originated from the OTA website. Currently, the guest house has been listed on
several OTA sites, namely TripAdvisor, Booking.com, Expedia, Agoda and Pegi-Pegi. However, the process of
processing customer reviews only focuses on physical reviews, namely Guest Reviews. Meanwhile, information
from online sources can have a more significant impact on E-WOM. The process of processing customer reviews
becomes ineffective because it only focuses on one source, while the evaluation of guest house management
services needs to be on target. In addition to these problems, the wide range of hotel attributes makes it difficult
for stakeholders to determine aspects that have urgency to be addressed immediately.
A method that can be used to process and group text reviews is a sentiment analysis. Sentiment analysis or
opinion mining is a computational study of people's opinions, sentiments, and emotions through entities and
attributes that are expressed in text form (Liu, 2012). This sentiment analysis can classify the polarity of the text
in sentences or documents to find out whether the opinions on the sentence or document are positive or negative.
Sentiment analysis can be done to determine the sentiment of opinions on customer reviews to determine customer
satisfaction with guest house services that aim to produce a positive E-WOM.
Ekawati and Khodra (2017) uses sentiment analysis on restaurant reviews to help restaurant owners improve
the quality of their products and services. Sentiment analysis is carried out at the aspect level using aspects of food,
service, price, and place. El-Jawad et al. (2018) describes several stages in sentiment analysis namely the data
collection phase, the preprocessing phase, the term weighting phase, the classification phase, and the evaluation
phase. Sentiment analysis will be carried out in 5 stages using the SVM method based on research Bhavitha et al.
(2017) who analyzes comparisons of the techniques used in sentiment analysis. Researchers compared the lexicon
based approach with the machine learning approach. Lexicon based has an average accuracy of 70% where

IICST2020: 5th International Workshop on Innovations in Information and Communication Science and Technology, Malang, Indonesia
Bachtiar F.A, Wirdhayanti P., Rusydi, A.N.

Machine Learning has an average accuracy that is much better above 80% and among the machine learning
classifier results Support Vector Machine has the best accuracy compared to other classifiers. Miao et al. (2018)
compared 3 machine learning methods, namely SVM, KNN, and Naïve Bayes for making a Chinese text news
classification system. SVM has advantages in the value of Recall, Precision, and Recall although it takes longer
than both methods because of the iteration process. Meanwhile, KNN and Naïve Bayes produce values that are
not much different. Finally, Shi and Li (2011) uses the SVM method to compare TF-IDF with Frequency in the
Term Weighting process in Sentiment Analysis. TF-IDF proved to be more effective with a Recall value of 89.2%,
Precision 85.2%, and F1-Score 87.2%.
This study will discuss the use of sentiment analysis will be carried out at the selected aspect level to group
reviews into 5 aspects and determine the sentiment of customer reviews by applying stages in text mining using
machine learning classification methods. These five aspects were chosen based on research by Dolnicar and Otter
(2003). The aspects used are the location, room, price, food and services chosen according to the needs of the
organization. The results of this study are in the form of findings that can help stakeholders understand what are
the customer complaints so that the decision making process to determine the services that need to be repaired and
addressed becomes more effective and targeted. This paper is divided into several sections, namely Introduction,
Literature Review, Methodology, Experiment, Analysis, and Conclusion.

2. LITERATUR REVIEW

2.1 Sentiment Analysis

Sentiment analysis or opinion mining is an umbrella of branches of study such as opinion extraction, sentiment
mining, subjectivity analysis, affect analysis, emotion analysis, mining review, etc. Sentiment analysis is a field
of study that analyzes opinions, praise sentiments, one's emotions towards entities such as products, services,
organizations, events, problems, and attributes of entities (Liu, 2012). Sentiment analysis is divided into 3 levels,
namely: Document Level, Sentence Level, and Entity Level (aspect).

2.2 Support Vector Machine

The SVM algorithm aims to find Maximum Marginal Hyperplane (MMH) using support vectors and margins.
MMH is the best hyperplane with the largest margin distance used to separate data maximally and accurately for
each class. Margin can be defined as the shortest distance of a hyperplane to one side of the margin is the same as
the hyperplane distance to the other side of the margin, provided that both margins are in a parallel position with
the hyperplane (Han et al., 2012).

Fig. 1. Small Hyperplane Vs Optimal Hyperplane
Source : Han et al. (2012)

If there is a dataset in the form (X1, y1), (X2, y2), (X3, y3), ... , (Xi, yi) where Xi is tuple training and yi is the
class label with 𝑖 = 1 .... 𝑁 𝑋𝑖 ∈ 𝑅𝑑 and 𝑦𝑖 ∈ {−1,1}. Every yi can choose one of two values either +1 or -1 SVM
will form a classifier as shown in the Equation (1) as follows:
(#,% )'
𝑓(𝑥! ) = {"#,%!"
!" &'
(1)
In SVM a hyperplane will be described in the following equation:
𝑾. 𝑿 + 𝑏 = 0 (2)
Based on equations (2), W is scalar Weight, n is attribute, b is scalar value or called bias, and X is training data
set or training tuples.
106

Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
Text Mining for Aspect Based Sentiment Analysis on Customer Review

2.3 Naïve Bayes

Naive Bayes is an algorithm put forward by a British scientist Thomas Bayes which is a classification algorithm
using probability and statistical methods. This algorithm predicts opportunities based on past experience to be used
in the future so it is known as the Bayes Theorem (Berrar, 2018). This study uses a multinomial model that takes
into account the frequency of each word that appears in the document (Manning et al., 2008). For example there
are documents d and class c. To calculate the class of document d, it can be calculated using the formula:
𝑷(𝒄|𝒅𝒐𝒄𝒖𝒎𝒆𝒏𝒕 𝒕𝒆𝒓𝒎 𝒅) = 𝑷(𝒄) 𝒙 𝑷(𝒕𝟏 |𝒄) 𝒙 𝑷(𝒕𝟐 |𝒄) 𝒙 𝑷(𝒕𝟑 |𝒄) 𝒙 … 𝒙 𝑷(𝒕𝒏 |𝒄) (3)
Based on Equation (3), P (c) is prior probability of class c, tn is word document d nth, P (c | term document d)
is the probability of a document including class c and P (tn | c) = N-word probability with class c. The probability
of prior class c is determined by the formula:
𝑵
𝑷(𝒄) = 𝑵𝒄 (4)
Based on Equation (4), Nc is number of class c in all documents, N is number of all documents. The n-word
probability is determined using the laplacian smoothing technique:
𝒄𝒐𝒖𝒏𝒕(𝒕𝒏,𝒄))𝟏
𝑷(𝒕𝒏 | 𝒄) = 𝒄𝒐𝒖𝒏𝒕(𝒄))|𝑽| (5)
In Equation (5), count (tn, c) is number of terms tn found in all training data with category c, count (c) is
number of terms in all training data with category c, V is number of all templates in the training data

2.4 TF-IDF

TF-IDF Term Weighting is a weighting that is often used and is a combination of Term Frequency and Inverse
Document Frequency. TF-IDF consists of frequency terms and inverse documents obtained from dividing the total
number of documents to the number of documents that have these terms (Feldman and Sanger, 2007).
𝑛! (𝑑) = 𝑑𝑓! ∗ log 𝐷/𝑑𝑓! (6)
Based on Equation (6), 𝑛! (𝑑) is the weight of the term t in the document d, 𝑑𝑓! is frequency of occurrence of
8
the term t in document d, and log 9: is inverse document frequency value of the term t.
!

2.5 Confusion Matrix

Confusion Matrix contains information about the performance of a classification system that is evaluated using
the data or metrics contained in the Confusion matrix. Confusion Matrix analyzes how well the classification has
been done on the actual class and the predicted class.

Table 1. Confusion Matrix.
Prediction
Positive Negative
Actual Positive TP FN
Negative FP TN
Source: Han et al. (2012).

Confusion matrix represents the level of accuracy of the classification process that has been done. Accuracy
shows the proportion of the number of true predictions.
;<);=
𝐴𝑐𝑐𝑢𝑟𝑎𝑐𝑦 = ;<)><);=)>= (7)

Precision is the proportion of correctly identified labeling (Completeness), the formula for finding Precision:
;<
𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 = (8)
;<);<

Recall is the proportion of information that can be found from a label, the search formula for Recall:
;<
𝑅𝑒𝑐𝑎𝑙𝑙 = ;<)>= (9)

Precision and Recall can be used to get the proportion of other measurements namely F1-Score. F1-Score is
the harmonic mean of the calculation of Precision and Recall, the formula to find F1-Score: