=Paper=
{{Paper
|id=Vol-3395/T2-9
|storemode=property
|title=Sentiment Analysis and Homophobia detection of YouTube comments
|pdfUrl=https://ceur-ws.org/Vol-3395/T2-9.pdf
|volume=Vol-3395
|authors=Sushil Ugursandi,Anand Kumar M
|dblpUrl=https://dblp.org/rec/conf/fire/UgursandiM22
}}
==Sentiment Analysis and Homophobia detection of YouTube comments==
<pdf width="1500px">https://ceur-ws.org/Vol-3395/T2-9.pdf</pdf>
<pre>
Sentiment Analysis and Homophobia Detection of
YouTube Comments
Sushil Ugursandi, Anand Kumar M.
Department of Information Technology, National Institute of Technology Karnataka, Surathkal, 575025


                                         Abstract
                                         Sentiment analysis identifies a graded scale of opinions or emotional responses to a particular subject.
                                         Many industries and organisations have been actively researching this area for more than 20 years.
                                         The key to understand a user’s behaviour while responding on a social media site is to understand
                                         their feelings. In contemporary research, a sentence’s content is evaluated, the emotion predicted, that
                                         helps researchers gain an insight on the reaction of an individual towards a social media topic. Here,
                                         a sentence’s text data is analysed using several Natural Language Processing techniques before being
                                         utilised to categorise this multi-class issue. The detection of homophobia and transphobia in comments
                                         on YouTube or other social media sites is second objective of this work. Anger, discomfort, or suspicion
                                         against Lesbian, Gay, Bisexual and Transgender people is known as homophobia. It can incite individuals
                                         to feel panic, dislike, disrespect, aggression, or wrath. By identifying such occurrences on social media,
                                         we can better understand how society works and how people behave. The goal of this work is to analyze
                                         social media texts such as comments from YouTube and detect homophobic sentiments using deep
                                         learning or machine learning models. In this work 6-layer classification model is used, the F1-Score for
                                         sentiment identification using the proposed model in this study was 0.5 on multi-class classification
                                         and 0.97 on homophobic/transphobic classification and achieved 1st rank on Homophobic detection in
                                         Malayalam language and 4th rank for sentiment analysis in Kannada language.

                                         Keywords
                                         Deep Learning, Homophobic/Transphobic Detection, Neural Networks, Sentiment Analysis, YouTube
                                         comments Analysis


1. Introduction
Sentiment analysis uses views acquired from people to determine emotions. With the increase
in the reach and availability of the Internet, people increasingly express their thoughts on
social media sites like Twitter, Facebook, and YouTube. There is a need to comprehend people’s
perspectives due to an increase in social media data and Internet users and to understand the
emotion within the text, Sentiment Analysis is used. The negative attitude or dislike toward
LGBT individuals is referred to as homophobia. LGBT refers to Lesbian, Gay, Bisexual and
Transgender people.
   Nobody has the right to act aggressively or harshly toward another person. This type of
activity or misbehavior can be identified on social media sites using machine learning and deep

FIRE 2022: Forum for Information Retrieval Evaluation, December 9-13, 2022, India
∗
    Corresponding author.
†
     These authors contributed equally.
Envelope-Open sushil.212it032@nitk.edu.in (S. Ugursandi); m_anandkumar@nitk.edu.in (A. K. M.)
                                       © 2022 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
    CEUR
    Workshop
    Proceedings
                  http://ceur-ws.org
                  ISSN 1613-0073
                                       CEUR Workshop Proceedings (CEUR-WS.org)
learning algorithms. This can be used to flag or prohibit posts that can be dangerous and thus
can be used to prevent violence in the community. The practice of violent activities on social
media platforms has a negative effect on internet users. Social media is essential for online
communication in the digital era because it provides users the flexibility to create, share, and
debate anything they choose. Understanding and filtering the content people express on social
media is crucial as online communication has grown across languages used across the world.
   Main goal of this work is to classify YouTube comments into good, negative, neutral, or
mixed categories based on their message polarity on language belongs combinations, especially
Malayalam-English, Tamil-English, and Kannada-English. The second objective of this work
is to detect homophobic/transphobic nature of comments from social media. The collection
includes properly identified homophobic and transphobic language. The training and validation
(dev) datasets are in the English, Tamil and Malayalam languages. This work intends to support
further research into the detection of homophobic and transphobic content in postings on social
media that are published in Tamil, Kannada and Malayalam languages. The dataset for the
training, development and testing is obtained from DravidianLangTech 2022.
   This competition was organized by DravidianLangTech 2022 [1] and consisted of the following
shared task :

    • Task A: This is a message-level polarity classification task. Given a Youtube comment,
      systems have to classify it into positive, negative, neutral, or mixed emotions. The
      participants will be provided development, training, and test dataset code-mixed text in
      Dravidian languages (Tamil-English, Malayalam-English, and Kannada-English)
    • Task B: In this shared task, participants will be provided with comments extracted
      from social media platforms and are expected to develop and submit systems to predict
      whether it is homophobic/transphobic in nature. The seed data for this task is the
      Homophobia/Transphobia Detection dataset, a collection of comments from YouTube.
      This dataset consists of manually annotated comments indicating whether if the text is
      homophobic/transphobic or not. The participants will be provided development, training,
      and test dataset in English, Malayalam, and Tamil.


2. Related Work
The below literature survey conducted in the area of Sentiment Analysis and Homophobic
detection. People express their ideas on a range of subjects via social media such as YouTube,
Facebook, and Twitter according to Jelodar et al. In their work, Jelodar et al. [2] performed
Sentiment analysis of YouTube comments on Oscar-nominated movie trailers. Video-sharing
platforms like YouTube encourage audience interaction by enabling viewers to rate movie
trailers [3]. They organised the movie trailer comments on YouTube in this article. Using these
comments, they created three different Sentiment Indexes that gauge the sentiment of movie
reviews. Then, projections for the movie’s box office take were produced using this sentiment
index. Distributors and producers can anticipate audience reaction to a film by examining
comments on the trailers. This also leads to a forecast of the profits on the day of release by
taking into account existing attitudes. In this work, the collection of movies was predicted
using a unique sentiment analysis technique and sentiment index. Latent topic detection and
fuzzy lattice reasoning were used.
   G. Prasad et al. [4] analysed the sentiment of cryptocurrency related comments on YouTube.
The architecture employed in this study is an ensemble model with stacked KNN, Decision Tree,
XG Boost, and Random Forest Classifier components. The meta/base classifier used was logistic
regression. T. Mehta et al.[5] used Support Vector Machine, Decision Tree, Linear Regression,
Artificial Neural Networks, and Random Forest, predicts the feelings of YouTube Ad View. From
a massive collection of comments many of which offer helpful information that raises the posted
content’s rating levels [6]. 2022 Vinit kumar et al. [7] worked to evaluate significant elements
of YouTube videos for the Koha and DSpace apps. Information searchers may now substantially
benefit from this since YouTube allows content providers the chance to share their expertise and
experiences via their work. These days, individuals routinely search YouTube for answers to
their queries or tutorials that might help them better comprehend certain concepts. Analyzing
the reactions of the viewers to the videos was another aim of their study.
   Homophobia detection involves detecting whether homophobic or not [8]. N. Ashraf et al.
[9] involved the use of TF/IDF along with a bigram model. This helped in the vectorization of
comments. After this, Support Vector Machines were used. D. Nozza et al. [10] made use of
ensemble modeling and data augmentation for high class imbalance. Fine-tuning was used on
BERT, RoBERTa and HateBERT. A weighted majority vote was used on the predictions done. A
popular topic of discussion is the private American company’s launch of the Starlink satellite in
[11] by A.M. Putri et al. The satellite launch footage was disseminated via the YouTube channel.
Many people commented on the video of the satellite launch, and the comments section on
the video had a wide range of opinions. Therefore, this study employed deep learning-based
sentiment analysis to look at how internet users responded to the Starlink satellite launch in
their YouTube comments. This study comprised 22,000 YouTube comments in total. It uses the
Long Short Term Memory model (LSTM). This study produced an accuracy value for the LSTM
model utilising various activation and optimization techniques. This study [11] found that the
maximum accuracy was 86% when using the LSTM model, Softmax activation function, and
Adam’s optimization.
   H. Bhuiyan et al. [12] claims that YouTube is one of the most well-known social networking
sites where users can post, like, comment on, and watch videos. This ranking often preserves
the popularity, applicability, and quality of the video. Unrelated or subpar films frequently
appear higher in search results due to the quantity of views or likes, which is absurd. To
address this issue, they provided a sentiment analysis technique based on Natural Language
Processing (NLP) for user comments. This method assisted in identifying the most relevant
and popular YouTube video for the specified search. An analysis of the performance of the
proposed method in terms of its precision in identifying relevant, popular, and high-quality
videos. R. F. Alhujaili at al. [6] identified the sentiments by machine learning and Natural
Language Processing algorithms. Numerous academic endeavours using two classes—positive
or negative, three classes—two with neutral , or multiple classes have been made (happy, sad,
fear, surprise, and anger). However, picking the most precise model might be challenging.
As a result, efforts have been undertaken to use sentiment analysis of YouTube comments to
determine the polarity. This work examines the methods and techniques for sentiment analysis
that may be used on YouTube videos. It also lists and categorises a variety of strategies that are
beneficial for sentiment analysis and data mining investigations.


3. Methodology
3.1. Dataset
Data is obtained from DravidianLangTech 2022 [13] for the following shared tasks[1].

    • Task A: To find positive, negative, neutral or mixed emotion for the given YouTube
      comment in Dravidian language.
    • Task B: To predict homophobic/transophobic nature of comments extracted from social
      media.

   Code-Mixed Dravidian languages used are Kannada-English, Tamil-English and Malayalam-
English. The Kannada-English sample dataset is displayed in the Fig 1 along with their class
label.


Figure 1: Kannada-English dataset for Sentiment Analysis


  Fig 2 describes the training dataset in Tamil-English language for sentiment analysis.
  Similarly the another dataset sample in Malayalam-English language consists of 3 classes
’Non-anti-LGBT+ content’, ’Homophobic’ and ’Transphobic’.
  In Figures 3 and 4 are the training dataset for the second objective of this work which is
Homophobic/Transphobic detection on multilingual social media comments.
  The overall count for each class label for sentiment analysis and homophobic detection,
respectively, is described in Table 1 and Table 2 for all languages.
Figure 2: Tamil-English dataset for Sentiment Analysis


Table 1
Count of instance for each class present in training dataset for Sentiment Analysis
             Kannada-English              Tamil-English             Malayalam-English
              Class       Count           Class        Count          Class      Count
          Unknown_state    711        Unknown_state     5628      Unknown_state    5279
             Positive      2823          Positive      20070         Positive      6421
             Negative      1188         Negative        4271         Negative      2105
          Mixed_feelings   1722       Mixed_feelings    4020      Mixed_feelings    926
           Not_Kannada     916          Not_Tamil       1667      Not-Malayalam    1157

Table 2
Count of instance for each class present in training dataset for Homophobic/Transphobic detection
                        Tamil-English                     Malayalam-English
                        Class              Count             Class          Count
               Non-anti-LGBT+ content       3438    Non-anti-LGBT+ content   692
                    Homophobic               311         Homophobic          133
                    Transphobic              112         Transphobic         41


4. Proposed Model
The algorithm used to generate the categorization report is is illustrated by Fig 5. More details
about the precise functioning of the model are given in the subsequent sections of this work.
Figure 3: Tamil-English dataset for Homophobic/Transphobic detection


Figure 4: Malayalam-English dataset for Homophobic/Transphobic detection


4.1. Pre-processing
The text data is not readable by system so it needs to be converted into numeric data to make it
understandable by system and classify accordingly. The model which is used for classification of
sentiments and homophobic detection requires numerical value. Tokenizer is a Python package
that turns all text into discrete integer values after cleaning the text by removing stopwords and
Figure 5: Proposed Model


symbols. This is done as part of the dataset’s pre-processing. To fit on the specified input size,
post-tokenization padding is used to lengthen each sentence to the same number of characters.
The class is one hot encoded in the same manner, creating distinct numerical values according
to the associated class label.
   Tokenizer library from keras framework is used in this work to tokenize the words using the
dictionary size 20000. Padding is applied to make all sentence of same length. Post padding
is used with padding size of 94 which fills multiple zeros at end to until it’s length reaches to
required number.

4.2. Layers
A 6-layer deep learning model is used for this classification problem as shown in Fig 6 The first
layer is embedding layer it is used to translate categorical information into integers, we employ
one-hot encoding. In order to do this, It generates sample features for each category and fill
them with 0s and 1s which generates a vector of results, this layer generates a new layer input
over a particular geographic dimension, this process is done by convolutional layer. There is a
kernel of fixed size which computes the vector data on that region. After computing the vector
data, a fixed size kernel of one dimension iterates over the entire vector and creates a new layer.
The third layer is a Max Pooling layer which is used to calculate the maximum value in a vector
from each feature value in a matrix. Further the two dimensional matrix data are subsequently
transformed into one dimensional values using a flatten layer. The Dropout layer is just a filter
which leaves all other neurons unaltered while eliminating particular neuron contributions to
the subsequent layer.

4.2.1. Embedding layer
The Embedding layer looks for the embedding vector for each word-index using the vocabulary
that has been integer-encoded. The Embedding layer is used from keras framework with the
embedding dimension of 64 and sequence length of 270.

4.2.2. Hyperparameters used
For training of the model the batch size used is 64, using Nadam optimizer and the loss function
from keras library CategoricalCrossentropy is applied. Total 50 epochs were used to train this
model.


Figure 6: 6-Layer Classification Model


5. Results and Discussion
In this section the results of experiment performed for ”Sentiment analysis and homophobic/-
transphobic detection in YouTube comments” is shown.

5.1. Results on validation dataset
Metrics on the prediction from model (Precision, Recall and F-1 Score) is used to display the
classification report on validation dataset. Table 3, Table 4 and Table 5 shows the results of
Sentiment Analysis on validation dataset for Kannada-English, Tamil-English and Malayalam-
English respectively.
   These are the results obtained on the validation dataset shown on Table 5 on Malayalam-
English language for Sentiment analysis.
   For the task of Homophobic detection, the classification results on validation dataset for
Tamil language are shown in Table 6 and Malayalam language is shown in Table 7.
Table 3
Result on Sentiment Analysis - (Kannada-English)
                            Class          Precision   Recall   F1-Score
                        Unknown_state         0.38      0.48      0.42
                           Positive           0.69      0.71      0.70
                           Negative           0.62      0.45      0.52
                        Mixed_feelings        0.21      0.17      0.19
                         Not_Kannada          0.54      0.63      0.58
                          macro avg           0.49      0.49      0.48
                         weighted avg         0.58      0.58      0.58
                          Accuracy             –         –        0.58

Table 4
Result on Sentiment Analysis - (Tamil-English)
                            Class          Precision   Recall   F1-Score
                        Unknown_state         0.36      0.41      0.38
                           Positive           0.71      0.75      0.73
                          Negative            0.40      0.37      0.38
                        Mixed_feelings        0.25      0.16      0.20
                          Not_Tamil           0.25      0.16      0.20
                          macro avg           0.46      0.43      0.44
                         weighted avg         0.56      0.57      0.57
                          Accuracy             –         –        0.57

Table 5
Result on Sentiment Analysis - (Malayalam-English)
                            Class          Precision   Recall   F1-Score
                        Unknown_state         0.71      0.65      0.68
                           Positive           0.71      0.77      0.74
                           Negative           0.60      0.55      0.57
                        Mixed_feelings        0.40      0.41      0.41
                        Not_Malayalam         0.73      0.72      0.73
                          macro avg           0.63      0.62      0.63
                         weighted avg         0.68      0.68      0.68
                           Accuracy            –         –        0.68


5.2. Result by Organizers
Among all these experiment on different languages, three of the systems experimented on were
submitted. The result obtained from them is shown in Table 8. Using this model 4th rank is
achieved for first task on Kannada-English language. For the same problem on Malayalam-
English language, the output of this model got 5th rank. For second task, Homophobic/Trans-
phobic detection, 1st Rank is achieved using this model on Malayalam language.
Table 6
Result on Homophobic detection - (Tamil-English)
                            Class             Precision    Recall   F1-Score
                         Homophobic              0.34       0.27      0.30
                         Transphobic             0.62       0.26      0.37
                    Non-anti-LGBT+content        0.92       0.96      0.94
                          macro avg              0.63       0.50      0.54
                        weighted avg             0.87       0.89      0.87
                          Accuracy                –          –        0.89

Table 7
Result on Homophobic detection - (Malayalam-English)
                            Class             Precision    Recall   F1-Score
                         Homophobic              0.97       0.95      0.96
                         Transphobic             0.99       0.99      0.99
                    Non-anti-LGBT+content        0.98       1.00      0.99
                          macro avg              0.98       0.98      0.98
                        weighted avg             0.99       0.99      0.99
                          Accuracy                –          –        0.99

Table 8
Results published by organizers
                    Objective                      Precision    Recall   F1-Score   Rank
      Homophobic Detection (Malayalam-English)         –          –       0.9744      1
        Sentiment Analysis (Kannada-English)          0.48       0.5       0.48       4
       Sentiment Analysis (Malayalam-English)         0.5        0.5        0.5       5


5.3. Confusion Matrix
For measurement of performance of the model the confusion matrix is shown in Table 9 for
Task A in Malayalam language and in Table 10 for Task B in Malayalam language.

Table 9
Confusion Matrix for Sentiment Analysis in Malayalam language
                                                             Predicted
                             Unknown_state    Positive    Negative Mixed_feelings   Not-Malayalam
          Unknown_state           396           122          37         14               17
             Positive             105           531          29         25               16
 Actual      Negative             49            41          127         16                4
          Mixed_feelings          21            28          12          41                0
          Not-Malayalam           16            22           4           1               98
Table 10
Confusion Matrix for Homophobic detection in Malayalam language
                                                               Predicted
                                            Homophobic       Non-anti LGBT       Transphobic
                         Homophobic            126                 7                  0
              Actual    Non-anti LGBT           3                 685                 4
                         Transphobic            0                  0                 41


6. Conclusion and Future work
Sentiment Analysis and Homophobic/Transphobic detection were performed. The proposed
model with 6 layers was used in this work. The results obtained were calculated using metrics
like Precision, Recall and F-1 score. Sentiment analysis was performed on Malayalam-English,
Tamil-English and Kannada-English datasets. The Homophobic/Transphobic detection was
performed for the languages Tamil-English and Malayalam-English. The accuracy obtained was
58%, 68% and 57% for Sentiment analysis task in Kannada, Malayalam and Tamil respectively
and 89% and 99% for Homophobic detection on Tamil and Malayalam language respectively. The
proposed one dimensional model predicted good result for both of the Sentiment Analysis and
Homophobic detection. Homophobic detection had 3 classes whereas first objective of Sentiment
analysis had 5 class-labels and model predicted better result on Homophobic detection. Users
from various region express their opinion in their regional language, the future focus of this
research will be based on the intermixed language on different regions.


References
 [1] K. Shumugavadivel, M. Subramanian, P. K. Kumaresan, B. R. Chakravarthi, B. B, S. Chin-
     naudayar Navaneethakrishnan, L. S.K, T. Mandl, R. Ponnusamy, V. Palanikumar, M. Balaji J,
     Overview of the Shared Task on Sentiment Analysis and Homophobia Detection of YouTube
     Comments in Code-Mixed Dravidian Languages, in: Working Notes of FIRE 2022 - Forum
     for Information Retrieval Evaluation, CEUR, 2022.
 [2] H. Jelodar, Y. Wang, M. Rabbani, S. B. B. Ahmadi, L. Boukela, R. Zhao, R. S. A. Larik, A nlp
     framework based on meaningful latent-topic detection and sentiment analysis via fuzzy
     lattice reasoning on youtube comments, Multim. Tools Appl. 80 (2021) 4155–4181.
 [3] H. Timani, P. Shah, M. Joshi, Predicting success of a movie from youtube trailer com-
     ments using sentiment analysis, in: 2019 6th International Conference on Computing for
     Sustainable Global Development (INDIACom), 2019, pp. 584–586.
 [4] Divesh, G. Prasad, G. Sharma, D. K. Vishwakarma, Sentiment analysis on cryptocurrency
     using youtube comments, in: 2022 6th International Conference on Computing Method-
     ologies and Communication (ICCMC), 2022, pp. 730–733. doi:1 0 . 1 1 0 9 / I C C M C 5 3 4 7 0 . 2 0 2 2 .
     9753723.
 [5] T. Mehta, G. Deshmukh, YouTube ad view sentiment analysis using deep learning and
     machine learning, International Journal of Computer Applications 184 (2022) 10–14. URL:
     https://doi.org/10.5120%2Fijca2022922078. doi:1 0 . 5 1 2 0 / i j c a 2 0 2 2 9 2 2 0 7 8 .
 [6] R. F. Alhujaili, W. M. Yafooz, Sentiment analysis for youtube videos with user comments:
     Review, in: 2021 International Conference on Artificial Intelligence and Smart Systems
     (ICAIS), 2021, pp. 814–820. doi:1 0 . 1 1 0 9 / I C A I S 5 0 9 3 0 . 2 0 2 1 . 9 3 9 6 0 4 9 .
 [7] M. Deori, V. Kumar, M. K. Verma, Analysis of youtube video contents on koha and dspace,
     and sentiment analysis of viewers’ comments, Library Hi Tech (2021).
 [8] B. R. Chakravarthi, R. Priyadharshini, R. Ponnusamy, P. K. Kumaresan, K. Sampath,
     D. Thenmozhi, S. Thangasamy, R. Nallathambi, J. P. McCrae, Dataset for identifica-
     tion of homophobia and transophobia in multilingual youtube comments, 2021. URL:
     https://arxiv.org/abs/2109.00227. doi:1 0 . 4 8 5 5 0 / A R X I V . 2 1 0 9 . 0 0 2 2 7 .
 [9] N. Ashraf, M. Taha, A. Abd Elfattah, H. Nayel, NAYEL @LT-EDI-ACL2022: Homophobia/-
     transphobia detection for equality, diversity, and inclusion using SVM, in: Proceedings
     of the Second Workshop on Language Technology for Equality, Diversity and Inclu-
     sion, Association for Computational Linguistics, Dublin, Ireland, 2022, pp. 287–290. URL:
     https://aclanthology.org/2022.ltedi-1.42. doi:1 0 . 1 8 6 5 3 / v 1 / 2 0 2 2 . l t e d i - 1 . 4 2 .
[10] D. Nozza, Nozza@LT-EDI-ACL2022: Ensemble modeling for homophobia and transphobia
     detection, in: Proceedings of the Second Workshop on Language Technology for Equality,
     Diversity and Inclusion, Association for Computational Linguistics, Dublin, Ireland, 2022,
     pp. 258–264. URL: https://aclanthology.org/2022.ltedi-1.37. doi:1 0 . 1 8 6 5 3 / v 1 / 2 0 2 2 . l t e d i - 1 .
     37.
[11] A. M. Putri, D. A. P. Basya, M. T. Ardiyanto, I. Sarathan, Sentiment analysis of youtube
     video comments with the topic of starlink mission using long short term memory, in: 2021
     International Conference on Artificial Intelligence and Big Data Analytics, 2021, pp. 28–32.
     doi:1 0 . 1 1 0 9 / I C A I B D A 5 3 4 8 7 . 2 0 2 1 . 9 6 8 9 7 5 4 .
[12] H. Bhuiyan, J. Ara, R. Bardhan, M. R. Islam, Retrieving youtube video by sentiment analysis
     on user comment, in: 2017 IEEE International Conference on Signal and Image Processing
     Applications (ICSIPA), 2017, pp. 474–478. doi:1 0 . 1 1 0 9 / I C S I P A . 2 0 1 7 . 8 1 2 0 6 5 8 .
[13] B. R. Chakravarthi, R. Priyadharshini, R. Ponnusamy, P. K. Kumaresan, K. Sampath, D. Then-
     mozhi, S. Thangasamy, R. Nallathambi, J. P. McCrae, Dataset for identification of homopho-
     bia and transophobia in multilingual youtube comments, arXiv preprint arXiv:2109.00227
     (2021).

</pre>