<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>IICST</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>ASPECT BASED SENTIMENT ANALYSIS FOR CUSTOMER REVIEWS ON BAKSO PRESIDENT MALANG</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Fitra A. Bachtiar</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Whita Parasati</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Nanang Yudi Setiawan</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Universitas Brawijaya</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Indonesia</string-name>
        </contrib>
        <contrib contrib-type="author">
          <string-name>fitra.bachtiar@ub.ac.id</string-name>
        </contrib>
      </contrib-group>
      <pub-date>
        <year>2020</year>
      </pub-date>
      <volume>5</volume>
      <fpage>12</fpage>
      <lpage>19</lpage>
      <abstract>
        <p>Bakso President Malang highly appreciate the opinions of customers regarding the products and services they provide to increase customer satisfaction. However, Bakso President Malang does not have customer opinion data, nor does the technology in processing and analyzing data that can produce information about customer perspectives on customer satisfaction aspects. To get the customer perspective of Bakso President Malang on customer satisfaction aspects can be done through Sentiment Analysis at the aspect level. In this study, sentiment analysis is performed on customer review using algorithms namely, Multinomial Naïve Bayes Classifier and Linear Support Vector Machine. This study uses 2,252 customer review data from 2012 to 2019. Customer review data is obtained through Web Scraping techniques on the TripAdvisor and Google Review sites. Experimental results shows that Linear Support Vector Machine has better accuracy performance compared to Multinomial Naïve Bayes Classifier.</p>
      </abstract>
      <kwd-group>
        <kwd>sentiment analysis</kwd>
        <kwd>naïve bayes</kwd>
        <kwd>support vector machine</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. INTRODUCTION</title>
      <p>Restaurants owner runs their business not only consider about strategies that prioritize products, but they should
implement strategies that could appraise what is needed by customer. This should be done in order to create a good
relationship between customers and restaurants. Therefore, restaurant owners need to pay attention to customer
review regarding the products and services that they have been provided. Bakso President Malang already has a
lot of customer reviews on sites like TripAdvisor and Google Reviews. The problem faced by Malang President's
Meatballs is that it has not been able to extract information from the many customer reviews that are expected to
improve service and product development. The existence of customer reviews will also have an impact on
competitive advantages and the brand image that will be experienced by Bakso President Malang.</p>
      <p>
        Based on the problems faced by Bakso President, one alternative that can be offered is through an analysis of
data related to customer opinions obtained from websites that provide customer reviews. Customer review data
are then to be processed using sentiment analysis to get customer perspective related to several aspects of customer
satisfaction. Customer satisfaction is obtained from the food quality, price, ambiance, and service quality that
provided by the restaurant
        <xref ref-type="bibr" rid="ref3">(Panthi and Karki, 2018)</xref>
        . Sentiment analysis focuses on opinions that express or imply
positive or negative sentiments. The results of this sentiment analysis will be used by Bakso President Malang to
help the decision making process related to strategies that are appropriate to customer needs.
      </p>
      <p>
        Previous research conducted by
        <xref ref-type="bibr" rid="ref8">Wulandini and Nugroho (2009)</xref>
        for the Indonesian language text categorization
found that Support Vector Machine (SVM) showed slightly better performance with an accuracy of 92.5%
compared to the method Naïve Bayes Classifier (NBC) with 90% accuracy. Compared to SVM, NBC method is
much more conventional methods and simpler.
        <xref ref-type="bibr" rid="ref4">Kang et al. (2012)</xref>
        proposed a new senti-lexicon for restaurant
review sentiment analysis. The use of the Improved Naïve Bayes algorithm and senti-lexicon showed increased
accuracy by 10.2% in recall and 26.2% in precision compared with when SVM was used. The proposed method
also has better performance when compared to Naïve Bayes with difference of 5.6% in 1.9% in precision. In other
study,
        <xref ref-type="bibr" rid="ref6">Shi and Li (2011)</xref>
        applied sentiment analysis in hotel review data on online media using Support Vector
Machine. The proposed method is to classify the document polarity with unigram features. Two features used in
their study is feature frequency and TF-IDF. The results shows that TF-IDF is more effective than feature
frequency.
      </p>
      <p>Based on the background that has been described and some research that has been done before, the authors are
interested in conducting sentiment analysis study. The sentiment is gathered from TripAdvisor and Google Review
sites related to customer satisfaction aspects of the Bakso President Malang. Sentiment analysis is carried out at
the aspect level using two classifier, the NBC and SVM. In this study NBC and SVM is chosen as the classifiers
because SVM works on high dimension data and NB classifier is a group of basic probabilistic classifiers
dependent on a typical assumption that all features are free of one another, given the class variable, and it is
regularly utilized as the pattern in text classification. Each of the classifier is combined with TF-IDF as Term
Weighting. This research will find out which methods have better performance to be implemented in sentiment
analysis. The paper is distributed in 6 different sections. Section 2 discuss about the related studies. Section 3
describe about the methodology of the proposed work, dataset and techniques used. Section 4 discuss about the
experiment results that performed in this paper. Section 5 describe the conclusion and future scope of the paper.</p>
    </sec>
    <sec id="sec-2">
      <title>2. LITERATURE REVIEW 2.1</title>
    </sec>
    <sec id="sec-3">
      <title>Web Scraping</title>
      <p>
        According to Urru and
        <xref ref-type="bibr" rid="ref7">Vargiu (2013)</xref>
        web scraping is a technique used to retrieve information from a website
automatically by extracting data that was not previously structured from the website into structured data. The
purpose of web scraping is to dig up information from different and unstructured websites and then transform it
into a neater and structured form in the format of spreadsheets, databases, or comma separated values (CSV).
2.2
      </p>
    </sec>
    <sec id="sec-4">
      <title>Sentiment Analysis</title>
      <p>
        Sentiment analysis or commonly referred as opinion mining is a field of study that analyzes an opinion, sentiment,
or someone's judgment regarding an entity such as a product, service, organization, individual, problem, event,
and topic. Unlike factual information, opinions and sentiments have characteristics that indicate that it is
subjective. An examination of a group of opinions from many parties is needed because it is a subjective view that
comes from more than one person so a summary is needed to represent an opinion
        <xref ref-type="bibr" rid="ref2">(Hu and Liu, 2004)</xref>
        .
2.3
      </p>
    </sec>
    <sec id="sec-5">
      <title>Naïve Bayes Classifier (NBC)</title>
      <p>
        The method used for text classification in sentiment analysis is Naïve Bayes Classifier. Multinomial Naive Bayes
Classifier is a classification method of Bayes algorithm that can be used in the classification of text or documents
        <xref ref-type="bibr" rid="ref5">(Rahman et al., 2017)</xref>
        . In the Multinomial Naive Bayes Classifier, the document class is not only determined by
the words that appear but also the number of occurrences. Calculating class from document d, can be calculated
using Equation 1:
 ( |
      </p>
      <p>) =  ( )   ( 1| )   ( 2| )   ( 3| )  …   (  | )</p>
      <sec id="sec-5-1">
        <title>In Equation (1):</title>
      </sec>
      <sec id="sec-5-2">
        <title>P(c) is a prior probability from class c.</title>
        <p>tn is a word document d to-n.</p>
      </sec>
      <sec id="sec-5-3">
        <title>P(c|document term d) is document probability.</title>
      </sec>
      <sec id="sec-5-4">
        <title>P(tn|c) is a probability term-n with c.</title>
      </sec>
      <sec id="sec-5-5">
        <title>Prior probability class c determined with Equation (2):</title>
      </sec>
      <sec id="sec-5-6">
        <title>In Equation (2):</title>
      </sec>
      <sec id="sec-5-7">
        <title>Nc is a total class c of all document.</title>
      </sec>
      <sec id="sec-5-8">
        <title>N is a total documents. Probability term to-n determined with Laplacian Smoothing as shown in Equation (3):</title>
        <p>( ) =
 (  |  ) =
( , )+1
( )+| |
(1)
(2)
(3)</p>
      </sec>
      <sec id="sec-5-9">
        <title>In Equation (3):</title>
        <p>count(tn,c) is a number of terms found in all training data with c category
count(c) is a number of terms in all training data with c category
| | is a number of terms in all training data
2.4</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Support Vector Machine (SVM)</title>
      <p>Han et al. (2012) defined Support Vector Machine is a classification method for finding the best separator to create
maximum separation for all classes or Maximum Marginal Hyperplane (MMH). Margin could be defined as the
shortest distance from a hyperplane to one side of margin is same as distance of a hyperplane to the other side of
margin, note that both margins are in a parallel position with the hyperplane. As shown in Figure 1. Then it can be
concluded that the greater the margin can be said to have higher accuracy.</p>
    </sec>
    <sec id="sec-7">
      <title>Optimal Hyperplane</title>
      <p>SVM uses training dataset in the form (Xi yi), where Xi is a tuple and yi is class label with I = 1...N, Xi ∈Rd
and yi ∈{−1, 1}. The purpose of SVM is to be able to form a classifier like Equation 4 below.
to form a hyperplane explained in the following equation,</p>
      <sec id="sec-7-1">
        <title>Where:</title>
        <p>2.5</p>
      </sec>
    </sec>
    <sec id="sec-8">
      <title>Confusion Matrix</title>
      <p>W
b
X
= Vector weight{W1, W2, W3,…,Wn}, n is the number of attributes
= Scalar, or commonly referred as bias
= training tuples</p>
      <p>≥0,  = +1
 (  ) = {&lt;0,  = −1
 . 
+</p>
      <p>= 0
Confusion matrix is a method that can calculate the accuracy of data mining. The results of the Confusion matrix
are accuracy, precision, and recall values. Accuracy is the level of closeness between the predicted value and the
actual value. Meanwhile, Precision is the level of accuracy between the information requested by users with the
answers provided by the system. In addition to accuracy and precision, there is Recall which is the success rate of
the system in rediscovering information.
(5)
(6)</p>
      <sec id="sec-8-1">
        <title>Precision calculations with the confusion matrix table are shown in the Equation 7:</title>
      </sec>
      <sec id="sec-8-2">
        <title>Recall calculations with the confusion matrix table are shown in the Equation 8:</title>
      </sec>
      <sec id="sec-8-3">
        <title>F1-Score calculations with the confusion matrix table are shown in the Equation 9:</title>
      </sec>
    </sec>
    <sec id="sec-9">
      <title>3. METHODOLOGY</title>
      <p>Precision =

 +
Recall =

 +
F1-Score = 2 ∗  
 
 
 
∗
+
(7)
(8)
(9)</p>
    </sec>
    <sec id="sec-10">
      <title>Aspect Based Sentiment Classification 3.1</title>
    </sec>
    <sec id="sec-11">
      <title>Problem Definition</title>
      <p>At this stage, observations were made at the Malang Bakso President restaurant and direct interviews with the
Bakso President Malang manager to find out the business processes. This step also explores the problems that exist
in the Bakso President related to extracting information in customer reviews.
3.2</p>
    </sec>
    <sec id="sec-12">
      <title>Literature Review</title>
      <p>This step is searching for the right theory and research that has been studied previously that have similar problems
domain to search for solutions carefully. References are obtained from the internet through books, journals, and
credible sites about Text Mining, Sentiment Analysis, Naive Bayes Classifier, and Food Service Dimension, and</p>
      <sec id="sec-12-1">
        <title>Dashboard visualization.</title>
        <p>3.3</p>
      </sec>
    </sec>
    <sec id="sec-13">
      <title>Data Acquisition</title>
      <p>Data collection was conducted on reviews given by customers of Malang Bakso President Malang restaurant
objects. Data sourced from the TripAdvisor site and Google Review using Web Scraping techniques. The tool
used is Selenium Python to retrieve data in Google Review and Chrome extension, webscraper.io, for the</p>
      <p>TripAdvisor website. The amount of data successfully scraped from TripAdvisor was 943 review data and from
Google Review 1,209 review data. In total there are 2,252 comments on Bakso President by the customers gathered
from 2012 to 2019.
3.4</p>
    </sec>
    <sec id="sec-14">
      <title>Aspect Categorization and Data Labelling</title>
      <p>Aspect categorization is useful for knowing what aspects are present in each customer review. The comments
organized into related aspects. After that, each of the comment proceeded with labeling sentiment data whether
positive or negative. The sentiment labeling is done manually. To minimize the subjectivity, the labeled sentiment
is checked by the owner to validate the sentiment labeling precision.
3.5</p>
    </sec>
    <sec id="sec-15">
      <title>Text Preprocessing</title>
      <p>Text Preprocessing is a phase that is carried out before doing Term Weighting and sentiment classification.
Preprocessing stages will be carried out using standard text processing. The steps are Formalization and
Translation, Remove Punctuation, Remove Number, Tokenization, Case Folding, Stop-word, Stemming, and</p>
      <sec id="sec-15-1">
        <title>Cleansing. The phases of text preprocessing are using library Scikit-learn and Sastrawi to stem the data.</title>
        <p>Formalization and Translation: Formalization is the process of changing words that were not standard into
standard words in accordance with the Kamus Besar Bahasa Indonesia (KBBI). This process is done manually
from scraping and then comparing it to the Indonesian dictionary. Translation aims to translate words from foreign
languages and regional languages into words that are in accordance with the Kamus Besar Bahasa Indonesia
(KBBI).</p>
        <p>Remove Punctuation: Remove punctuation is the process of removing punctuation in data. This process uses
the string library module in Python.</p>
        <p>Remove Number: Remove number is the process of clearing data from numeric characters. This process uses
the string library module in Python.</p>
        <p>Tokenization and Case Folding: Tokenization is the stage to separate sentences into parts of words called
tokens. The implementation of the toke</p>
        <p>Stopwords: Stopword is the step of eliminating words that are included in stopword categories such as ‘di’,
‘pada’, ‘dan’, ‘yang’, ‘hanya’, ‘tidak’ and other words that are included in the stopwords list library.</p>
        <p>Stemming: Stemming is a process of reducing a word to its word stem that affixes to suffixes and prefixes or
to the roots of words.</p>
        <p>Cleansing: Cleansing is a process to remove whitespace or empty tokens. The data used in this process is the
result of the previous stemming process.
3.6</p>
      </sec>
    </sec>
    <sec id="sec-16">
      <title>Term Weighting</title>
      <p>Term weighting is done by applying Term Frequency-Inverse Document Frequency (TF-IDF). Weighting is done
by utilizing a module in the Python Programming Language with the Scikit-learn module to weight the TF-IDF.</p>
      <sec id="sec-16-1">
        <title>An example of term weighting process is shown in Table 2 and Table 3.</title>
        <p>3.7</p>
      </sec>
    </sec>
    <sec id="sec-17">
      <title>Sentiment Classification</title>
      <p>In the sentiment classification stage, two documents are needed, namely the preprocessing document that has been
given a sentiment class label and Term Weighting from the previous stage. The sentiment class label is divided
into positive and negative. This sentiment classification is carried out on every aspect namely Food, Service, Price,
and Atmosphere aspects. Model selection is done in order to get the highest accuracy and the best data model for
each data. The selection of the model in this study uses Stratified Cross Validation, which is to divide the dataset
randomly with a balanced amount and prevent multiples or different amounts of data. The dataset is divided into
2 folds, so the ratio of training data and test data is 50:50. After that the test data and training data will be stored
into four variables namely X_train, Y_train, X_test and Y_test. The variables X_train and X_test are variables that
contain term weighting data which are divided into training data and test data. While the Y_train and Y_test
variables are variables that store label data. The next step is the sentiment classification used by the MultinomialNB
module to implement the Multinomial Naïve Bayes and LinearSVC module to implement Support Vector Machine
algorithm as a classifier.
3.8</p>
    </sec>
    <sec id="sec-18">
      <title>Analysis and Evaluation</title>
      <p>After obtaining the results of sentiment classification, the next phase is testing with 4 parameters, namely
Accuracy, Precision, Recall, and F1-Score. This process uses the library from Scikit-learn by using the
classification_report module. In this test also used a confusion matrix to see the comparison of the predicted
amount of data.
3.9</p>
    </sec>
    <sec id="sec-19">
      <title>Conclusion</title>
    </sec>
    <sec id="sec-20">
      <title>4. RESULTS 4.1</title>
    </sec>
    <sec id="sec-21">
      <title>Aspect Categorization Result</title>
      <p>The final step in this research is to draw conclusions based on the formulation of the problems that have been
obtained and focus on the results of sentiment analysis and provide suggestions for further research.
The food aspect defines the quality of food and drinks from Bakso President, both in terms of taste, presentation,
texture, portions, and also menu variations. The positive sentiment class on food aspect shows customer reviews
which give positive responses in the form of comments stating customer satisfaction with food and beverage
quality from Bakso President Malang. While in the negative sentiment class on this aspect of food shows negative
responses such as disappointment to the quality of food and drinks from Bakso President Malang. The total number
of customer reviews for food aspects is 1,927 reviews, with each number for positive sentiment classes 1,702
reviews and negative sentiment classes is 225 reviews.</p>
      <p>Service Aspect shows customer reviews that describe customer satisfaction with service quality from Bakso
President Malang. The service in question is employee performance such as employee attitudes and behavior in
treating customers. In this aspect not only shows how customers respond to Malang President's Meatballs
employees, but also about the condition of the cleanliness of food equipment, the ordering process, to the serving
of food. The positive sentiment class on the service aspect shows customer reviews that give positive responses in
the form of comments that indicate customer satisfaction with service quality from Bakso President Malang.
Whereas in the negative sentiment class on this aspect of food shows negative responses such as disappointment
to the quality of service from Bakso President Malang. The number of customer review data on service aspects
with negative sentiment was 383 reviews. Whereas for customer review data on service aspects with positive
sentiment was 114 reviews.</p>
      <p>The price aspect shows the customer's response regarding the price set by the Bakso President Malang. In a
positive sentiment class, customer reviews contain opinions that illustrate that the price set by the Bakso President
Malang is in accordance with or what has been obtained by customers. In addition, customer reviews that have
positive sentiment on this aspect also stated that the price set by Bakso President was cheap or still affordable. As
for customer reviews that have negative sentiment is when the customer's response to the price set by Bakso
President Malang is not in accordance with what is obtained by the customer and is expensive. The number of
customer review data on the price aspect with negative sentiment of 170 reviews. As for the customer review data
on the price aspect with positive sentiment is 217 reviews.</p>
      <p>This aspect of the atmosphere explains the atmosphere or environment of the President's Meatballs that are felt
by customers. Customer reviews in this aspect talk about Malang Bakso President's location conditions. In a review</p>
    </sec>
    <sec id="sec-22">
      <title>Food</title>
    </sec>
    <sec id="sec-23">
      <title>Service</title>
    </sec>
    <sec id="sec-24">
      <title>Price</title>
    </sec>
    <sec id="sec-25">
      <title>Atmosphere</title>
    </sec>
    <sec id="sec-26">
      <title>Food</title>
    </sec>
    <sec id="sec-27">
      <title>Service</title>
    </sec>
    <sec id="sec-28">
      <title>Price</title>
    </sec>
    <sec id="sec-29">
      <title>Atmosphere</title>
    </sec>
    <sec id="sec-30">
      <title>Negative</title>
    </sec>
    <sec id="sec-31">
      <title>Positive</title>
    </sec>
    <sec id="sec-32">
      <title>Negative</title>
    </sec>
    <sec id="sec-33">
      <title>Positive</title>
    </sec>
    <sec id="sec-34">
      <title>Negative</title>
    </sec>
    <sec id="sec-35">
      <title>Positive</title>
    </sec>
    <sec id="sec-36">
      <title>Negative</title>
    </sec>
    <sec id="sec-37">
      <title>Positive</title>
    </sec>
    <sec id="sec-38">
      <title>Negative</title>
    </sec>
    <sec id="sec-39">
      <title>Positive</title>
    </sec>
    <sec id="sec-40">
      <title>Negative</title>
    </sec>
    <sec id="sec-41">
      <title>Positive</title>
    </sec>
    <sec id="sec-42">
      <title>Negative</title>
    </sec>
    <sec id="sec-43">
      <title>Positive</title>
    </sec>
    <sec id="sec-44">
      <title>Negative Positive</title>
      <p>0.33
0.89
0.78
0.40
0.68
0.88
0.30
0.85
1.00
0.89
0.80
0.90
1.00
0.89
1.00
0.89
0.07
0.98
0.97
0.07
0.88
0.68
0.04
0.98
0.05
1.00
0.11
1.00
0.05
1.00
0.08
1.00
0.12
0.93
0.86
0.12
0.77
0.77
0.06
0.91
0.10
0.94
0.19
0.94
0.10
0.94
0.15
0.94
of customers who have positive sentiments stated that the Bakso President Malang environment is unique because
it is next to the railroad tracks so that customers can feel a pleasant atmosphere. While in customer reviews who
have negative sentiments stated that the Bakso President environment which is next to the railroad tracks is not
good so it is dusty and dangerous. The total number of customer reviews for Atmosphere aspects is 1,083 reviews,
with each number for positive sentiment classes 915 reviews and negative sentiment classes 168 reviews.
4.2</p>
    </sec>
    <sec id="sec-45">
      <title>Sentiment Analysis Classification</title>
      <p>This section describes the performance report and discussion of the sentiment analysis classification algorithms
on the customer reviews of Bakso President Malang. Table 4 shows testing results using Multinomial Naïve Bayes
in each aspect. Table 5 shows testing results using Support Vector Machine in each aspect.</p>
      <p>Sentiment analysis with Linear Support Vector Machines method showed better performances than
Multinomial Naïve Bayes. The accuracy of every aspect using Linear SVM shows the same value, which is 89%.
While the use of the MNB method shows different accuracy values for each aspect. The highest accuracy value in
MNB is in the aspect of food that is 88% which has a slight difference compared to Linear SVM. One of the factors
that influence a classification to get good accuracy results, can be seen from how man y combinations of training
data and test data generated in the selection of models. The more combination of training data, the better the
classification results will be produced because if more learning models are used in classifying training data, the
better the classification of testing data in recognizing existing patterns (Han et al., 2012). Thus, if the distribution
of training data and testing data uses more than 2-fold with a comparison of training data of more than 50%, it is
possible to get better accuracy results.</p>
    </sec>
    <sec id="sec-46">
      <title>5. CONCLUSION</title>
      <p>From the discussion that has been presented based on the research that has been done, some conclusions are
obtained. Web scraping techniques can be used to obtain customer review data needed to conduct sentiment
analysis on the aspect level of the TripAdvisor and Google Review sites. The total data obtained from the
implementation of Web Scraping in the form of customer review data on Bakso President Malang is 2,152 reviews
with a time span from 2012 to 2019. The results of the application of the research conducted using the Naïve Bayes
Classifier and Support Vector Machine method and Term Frequency-Inverse Document Frequency as Term
Weighting can be used as an option in solving problems in the classification analysis of aspects of sentiment level
for assessing customer satisfaction of Bakso President Malang. Algorithms that shows better performances from
accuracy results is Support Vector Machine.</p>
      <p>Future work is expected to use automated methods that are not manual in labeling data and extracting aspects,
such as CRF (Conditional Random Field), LDA (Latent Dirichlet Allocation) combined with feature extraction
like POS-Tagging, unigram, or bigram. The purpose of using those method so the labeling of 1 opinion is not
subjective.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Han</surname>
            ,
            <given-names>J</given-names>
          </string-name>
          .,
          <string-name>
            <surname>Pei</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Kamber</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>Data mining: concepts and techniques</article-title>
          . Morgan Kaufmann: Waltham MA.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Hu</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Liu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          (
          <year>2004</year>
          ).
          <article-title>Mining and summarizing customer reviews</article-title>
          ,
          <source>In: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining</source>
          , Kim,
          <string-name>
            <given-names>W.</given-names>
            ,
            <surname>Kohavi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            ,
            <surname>Gehrke</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          , and DuMouchel, W. (Eds.),
          <fpage>168</fpage>
          -
          <lpage>177</lpage>
          . ACM: New York.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Karki</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Panthi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2018</year>
          ).
          <article-title>How Food Quality, Price, Ambiance and Service Quality Effects Customer Satisfaction: A study on Nepalese Restaurants in Finland</article-title>
          .
          <source>BA Thesis</source>
          , Haaga-Helia University of Applied Sciences.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Kang</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Yoo</surname>
            ,
            <given-names>S.J.</given-names>
          </string-name>
          , and Han,
          <string-name>
            <surname>D</surname>
          </string-name>
          . (
          <year>2012</year>
          ).
          <article-title>Senti-lexicon and improved Naïve Bayes algorithms for sentiment analysis of restaurant reviews</article-title>
          .
          <source>Expert Systems with Applications</source>
          ,
          <volume>39</volume>
          (
          <issue>5</issue>
          ),
          <fpage>6000</fpage>
          -
          <lpage>6010</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Rahman</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ,
          <string-name>
            <surname>Wiranto</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Doewes</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          (
          <year>2017</year>
          ).
          <article-title>Online News Classification Using Multinomial Naive Bayes</article-title>
          .
          <source>ITSMART: Jurnal Teknologi dan Informasi</source>
          ,
          <volume>6</volume>
          (
          <issue>1</issue>
          ),
          <fpage>32</fpage>
          -
          <lpage>38</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Shi</surname>
            ,
            <given-names>H.-X.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Li</surname>
            ,
            <given-names>X.-J.</given-names>
          </string-name>
          (
          <year>2011</year>
          ).
          <article-title>A sentiment analysis model for hotel reviews based on supervised learning</article-title>
          ,
          <source>In: 2011 International Conference on Machine Learning and Cybernetics</source>
          ,
          <volume>3</volume>
          ,
          <fpage>950</fpage>
          -
          <lpage>954</lpage>
          . IEEE: New York.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Vargiu</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Urru</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          (
          <year>2013</year>
          ).
          <article-title>Exploiting web scraping in a collaborative filtering-based approach to web advertising</article-title>
          .
          <source>Artif. Intell. Research</source>
          ,
          <volume>2</volume>
          (
          <issue>1</issue>
          ),
          <fpage>44</fpage>
          -
          <lpage>54</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Wulandini</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          , and
          <string-name>
            <surname>Nugroho</surname>
            ,
            <given-names>A.S.</given-names>
          </string-name>
          (
          <year>2009</year>
          ).
          <article-title>Text Classification Using Support Vector Machine for Webmining Based Spatio Temporal Analysis of the Spread of Tropical Diseases</article-title>
          ,
          <source>In: International Conference on Rural Information and Communication Technology</source>
          <year>2009</year>
          ,
          <fpage>189</fpage>
          -
          <lpage>192</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>