<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Filtering opinions in Spanish with Topics of Tourist Interest for the Sentiment Analysis Task</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>José de Jesús Ceballos-Mejía</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Martha Angélica Parra-Urías</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Miguel Ángel Álvarez-Carmona</string-name>
          <email>miguel.alvarez@cimat.mx</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centro de Investigación en Matemáticas (CIMAT)</institution>
          ,
          <addr-line>Sede Monterrey</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Consejo Nacional de Humanidades, Ciencias y Tecnologías (CONAHCYT)</institution>
          ,
          <addr-line>CDMX</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Instituto Tecnológico de Tepic (ITT)</institution>
          ,
          <addr-line>Nayarit</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <abstract>
        <p>This paper presents a proposal by the ITT Team to participate in the Rest-Mex 2023 forum, focusing on ifltering opinions in Spanish with topics of tourist interest for the Sentiment Analysis task. Our objective is to conduct a sentiment analysis that is tailored to the key areas that significantly impact tourists' experiences. We have identified 11 specific topics: Activities, Friendliness, Climate, Food, Insecurity, Cleanliness, Nature, Pandemic, Prices, Transportation, and Location, which encompass a wide range of factors critical for tourists when evaluating and making decisions about their travel experiences. By leveraging topic modeling techniques, we aim to enhance the accuracy and granularity of sentiment analysis by considering these specific areas of interest. Our proposed approach combines machine learning methods and feature engineering to classify opinions as positive, negative, or neutral based on their sentiment towards these topics. The results demonstrate the efectiveness of our approach in ifltering Spanish opinions related to tourist topics and provide valuable insights for tourism-related decision-making processes.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        The advent of social media and online platforms has transformed the way people communicate
and share their opinions, experiences, and feedback [
        <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
        ]. This shift has had a profound impact
on various industries, including tourism, where customer opinions play a vital role in shaping
services and improving customer satisfaction [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ]. In the digital era, sentiment analysis has
emerged as a powerful tool for extracting insights from the vast amount of user-generated
content available [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. However, in the context of the tourism domain, a more focused and
topic-specific approach to sentiment analysis is necessary to better understand and address the
needs and preferences of tourists.
      </p>
      <p>
        In this paper, we present a proposal to participate in the Rest-Mex 2023 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] forum as the ITT
Team. We propose to filter opinions in Spanish using specific topics of interest related to tourism.
Our aim is to conduct sentiment analysis that is tailored to the key areas that significantly
impact tourists’ experiences. We focus on 11 specific topics: Activities, Friendliness, Climate,
Food, Insecurity, Cleanliness, Nature, Pandemic, Prices, Transportation, and Location. These
topics encompass a wide range of factors that are critical for tourists when evaluating and
making decisions about their travel experiences.
      </p>
      <p>
        The study of sentiment analysis within the tourism domain is not novel, as previous research
has explored general sentiment classification in this context[
        <xref ref-type="bibr" rid="ref10 ref6 ref7 ref8 ref9">6, 7, 8, 9, 10</xref>
        ]. However, limited
attention has been given to topic-based sentiment analysis that specifically targets areas of
interest relevant to tourists. By focusing on these specific topics, we aim to provide a more
nuanced understanding of sentiment distribution and customer preferences within the Spanish
tourism industry.
      </p>
      <p>To achieve these objectives, we outline a methodology that encompasses data collection,
preprocessing, topic classification, sentiment classification, and model evaluation.</p>
      <p>
        To accomplish this, the Rest-Mex dataset is utilized [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. In the 2023 edition, the organizers
have introduced an extension to the sentiment analysis task that has been in development since
2021 [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. The objective is to predict three aspects based on a tourist’s opinion about a place:
1. The polarity of the opinion, represented as an integer value ranging from 1 to 5.
2. The type of tourist place, which can be categorized as an attraction, a hotel, or a restaurant.
3. The country where the tourist place is located, which can be Mexico, Cuba, or Colombia.
      </p>
      <p>
        The training data for the polarity task this year comprises over 250,000 opinions, while the
test data consists of more than 100,000 opinions. Typically, three separate models would be
constructed using this data, one for each aspect to be predicted (polarity, type, and country).
However, given the substantial size of the data collection, training and testing three distinct
models could be time-consuming [
        <xref ref-type="bibr" rid="ref11 ref6">6, 11, 12</xref>
        ].
      </p>
      <p>By filtering opinions in Spanish with a focus on topics of tourist interest, we aim to contribute
to the advancement of sentiment analysis in the tourism domain and assist industry stakeholders
in better understanding and catering to the needs and preferences of their customers.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Sentiment Analysis in Tourism</title>
      <p>
        Sentiment analysis, also known as opinion mining, is a subfield of natural language processing
(NLP) that focuses on extracting and understanding sentiments, opinions, and attitudes expressed
in textual data. In the context of the tourism industry, sentiment analysis plays a crucial role in
analyzing customer opinions and feedback to gain insights into their experiences, preferences,
and satisfaction levels[
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. By automatically classifying sentiments as positive, negative, or
neutral, sentiment analysis provides valuable information for businesses to improve their
services and make data-driven decisions.
      </p>
      <p>
        In recent years, sentiment analysis has gained significant attention in the tourism domain, as
online platforms and social media have become popular platforms for travelers to share their
opinions and experiences. The analysis of sentiment in tourism texts provides valuable insights
into various aspects, such as destination preferences, hotel experiences, restaurant reviews, and
attraction satisfaction. However, performing sentiment analysis in the tourism domain poses
specific challenges that need to be addressed for accurate and meaningful results [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ].
      </p>
      <sec id="sec-2-1">
        <title>2.1. Challenges in Sentiment Analysis in Tourism</title>
        <p>
          Subjectivity and Context: Tourism-related opinions are inherently subjective and
contextdependent. The sentiment expressed in a review can vary based on individual preferences,
cultural diferences, and personal experiences. Interpreting sentiment accurately requires
considering the context in which the opinions are expressed, including the specific tourist place,
the reviewer’s background, and the overall experience [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ].
        </p>
        <p>Domain-specific Language and Slang : Tourism texts often include domain-specific
language, jargon, and slang. For example, travelers may use specific terms to describe activities,
attractions, or local customs. Understanding and appropriately handling these linguistic nuances
is crucial for accurate sentiment classification[ 13].</p>
        <p>Aspect-based Analysis: Sentiment analysis in tourism should go beyond general sentiment
classification and delve into aspect-based analysis. Aspects or topics of interest, such as activities,
friendliness, cleanliness, and prices, have a significant impact on tourists’ opinions. Analyzing
sentiments at the aspect level provides deeper insights into specific areas of interest and helps
identify strengths and weaknesses in tourism oferings[ 14].</p>
        <p>Multilingual Analysis: The tourism industry attracts a diverse range of travelers from
various countries and cultures, resulting in the need for sentiment analysis in multiple languages.
Diferent languages may require specific language models, preprocessing techniques, and
sentiment lexicons for accurate sentiment classification [ 15].</p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Methodologies for Sentiment Analysis in Tourism</title>
        <p>Various methodologies have been employed for sentiment analysis in the tourism domain,
ranging from traditional machine learning approaches to more advanced deep learning techniques.
Common steps in sentiment analysis include data collection, preprocessing, feature extraction,
sentiment classification, and evaluation. However, considering the challenges specific to tourism
sentiment analysis, certain adaptations and techniques can be employed:</p>
        <p>Aspect-based Sentiment Analysis: To capture sentiments related to specific topics of
interest in tourism, such as activities, friendliness, climate, and prices, aspect-based sentiment
analysis techniques can be employed. This involves identifying and classifying sentiments for
each aspect separately, providing a more granular understanding of customer opinions.</p>
        <p>Domain-specific Lexicons : Building or utilizing domain-specific sentiment lexicons can
enhance sentiment analysis accuracy in tourism. These lexicons contain domain-specific terms
and their associated sentiment polarities, enabling better recognition of sentiment-bearing
words and phrases in tourism texts.</p>
        <p>Machine Learning and Deep Learning Approaches: Traditional machine learning
algorithms, such as Support Vector Machines (SVM), Naïve Bayes, and Random Forests, have
been widely used for sentiment classification. Additionally, deep learning techniques, including
recurrent neural networks (RNNs) and transformer-based models, such as BERT[16].</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Rest-Mex Corpus</title>
      <p>The Rest-Mex 2023 corpus, curated by the organizers, comprises a total of 251,702 opinions
collected from TripAdvisor. This dataset serves as the foundation for training and evaluating
sentiment analysis models in the context of tourism.</p>
      <p>For the task of Polarity classification, the opinions are categorized into five classes, ranging
from the worst polarity (class 1) to the best polarity (class 5). Figure 1a illustrates the distribution
of these classes, revealing a clear imbalance within the dataset.</p>
      <p>(a) Distribution of Polarity
(b) Distribution of Country
(c) Distribution of Country</p>
      <p>To determine the Type of place, the opinions are classified into three categories: Attractive,
Hotel, and Restaurant. Figure 1b presents the distribution of these classes. While there is not as
pronounced an imbalance as in the Polarity classification, some variations can still be observed.</p>
      <p>Lastly, the opinions are labeled with the Country of origin of the place the tourist visited,
resulting in three classes: Mexico, Cuba, and Colombia. The distribution of these classes is
shown in Figure 1c.</p>
      <p>From the distributions presented, it is evident that the Polarity classification exhibits the
greatest imbalance among the three traits. Conversely, the Type classification demonstrates
a relatively more balanced distribution, while the Country classification falls somewhere in
between. These class distribution variations must be considered when developing and evaluating
sentiment analysis models using the Rest-Mex corpus.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Proposal</title>
      <p>The hypothesis of this work is that by using only opinions where words from certain themes
appear, it is possible to classify the features of polarity, type, and country with competitive
results.</p>
      <p>To test this hypothesis we propose 3 phases. First, define the topics of interest, second, extract
representative words from each topic, and third, propose an algorithm that filters opinions from
the corpus.</p>
      <p>Each of the 3 phases is described below.</p>
      <sec id="sec-4-1">
        <title>4.1. Important topics for tourism</title>
        <p>Tourism is a thriving industry that attracts millions of visitors each year. When choosing a
travel destination, tourists consider various factors to ensure an enjoyable and memorable
experience. The following topics play a crucial role in determining the appeal and success of a
tourist destination.</p>
        <sec id="sec-4-1-1">
          <title>4.1.1. Activities</title>
          <p>Engaging activities and attractions are vital for tourism. Tourists seek destinations that ofer
a diverse range of recreational options, such as adventure sports, cultural events, historical
landmarks, museums, and natural wonders. The availability of exciting activities enhances the
overall experience and creates lasting memories for travelers.</p>
        </sec>
        <sec id="sec-4-1-2">
          <title>4.1.2. Friendliness</title>
          <p>Friendliness and hospitality of the local population significantly impact the tourism industry.
Travelers often prefer destinations where they feel welcomed and can interact with friendly
locals. Warm and welcoming communities create a positive environment, fostering cultural
exchange and leaving tourists with a sense of belonging.
4.1.3. Climate
4.1.4. Food
Climate plays a pivotal role in choosing a travel destination. Diferent individuals have varied
preferences, but favorable weather conditions are generally preferred. Whether it’s a tropical
paradise, a winter wonderland, or a moderate climate, the suitability of the weather for outdoor
activities greatly influences tourists’ decisions.</p>
          <p>Food is an integral part of the tourism experience. Culinary delights and local cuisine are major
attractions for travelers. The availability of diverse food options, ranging from street food to
ifne dining experiences, allows tourists to explore the unique flavors and culinary traditions of
a particular region.</p>
        </sec>
        <sec id="sec-4-1-3">
          <title>4.1.5. Insecurity</title>
        </sec>
        <sec id="sec-4-1-4">
          <title>4.1.6. Cleanliness</title>
          <p>Tourist destinations must prioritize safety and security. Travelers are more likely to choose
locations with low crime rates and efective security measures. Providing a sense of safety and
peace of mind allows tourists to relax and enjoy their visit without concerns about personal
well-being.</p>
          <p>Cleanliness and hygiene are critical aspects that influence tourists’ perceptions of a destination.
Maintaining cleanliness in public spaces, accommodations, and tourist attractions creates a
4.1.7. Nature</p>
        </sec>
        <sec id="sec-4-1-5">
          <title>4.1.8. Pandemic 4.1.9. Prices</title>
          <p>positive image and promotes visitor satisfaction. Well-maintained environments contribute to a
pleasant and enjoyable stay.</p>
          <p>The natural beauty and conservation eforts of a destination attract eco-tourists and nature
enthusiasts. Scenic landscapes, national parks, wildlife reserves, and opportunities for outdoor
activities like hiking, birdwatching, and nature photography contribute to the overall appeal of
a tourist destination.</p>
          <p>In light of the ongoing global pandemic, tourists prioritize destinations that prioritize health
and safety measures. Transparent communication about vaccination requirements, testing
protocols, and adherence to public health guidelines instills confidence in travelers and ensures
their well-being during their visit.</p>
          <p>Afordability is a significant consideration for tourists. Reasonable prices for accommodations,
transportation, attractions, and dining options make a destination more accessible and appealing
to a broader range of travelers. Competitive pricing ensures a competitive edge in attracting
visitors.</p>
        </sec>
        <sec id="sec-4-1-6">
          <title>4.1.10. Transportation</title>
          <p>Eficient transportation systems are crucial for the success of tourism. Easy accessibility and
connectivity within a destination, as well as convenient modes of transportation, including
airports, public transit, and reliable road networks, enhance the overall travel experience and
make it more convenient for tourists to explore diferent attractions.
4.1.11. Location
The geographical location of a tourist destination plays a vital role. Proximity to other attractions,
accessibility from major cities or transportation hubs, and unique geographical features all
contribute to the attractiveness of a location. A desirable location can make a destination more
appealing and increase its competitiveness in the tourism market.</p>
        </sec>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Found important words</title>
        <p>With the important topics definition, the question is how we can find descriptive words for
each topic. For this, we propose to use similar words to the name of each topic. We use the
online system named RelatedWords. This system receives as input a word and returns a set of
words related to the input. For this work we use the top 20 words related to each topic.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Filtering opinions</title>
        <p>To filter the opinions, we will only use those with a number greater than k words related to the
topics.</p>
        <p>For that we propose the algorithm ??. In this algorithm, it can be seen that we pass a diferent
parameter  for each polarity class (between 1 and 5). The function c returns the polarity class
of an opinion  .</p>
        <p>Algorithm 1 Filtering opinions
InputInput OutputOutput Filtering opinionsFiltering opinions
opinions, topics, k opinionsFiltered
words = []
t in topics words += topics[t]
opinionesFiltered = []
i in range(len(opinions))
text = opinions[i]
len(set(text.lower().split()) ∩words) &gt; k[c(opinions[i])]
opinionesFiltered.append(text)
opinionesFiltered</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Text classification based on Beto</title>
        <p>The Beto classifier is a state-of-the-art natural language processing (NLP) model developed by
the research team at the University of Chile. It is specifically designed for processing text in
the Spanish language. Beto is built upon the Transformer architecture, which has been widely
successful in various NLP tasks.</p>
        <p>The Transformer architecture, originally introduced in the ”Attention is All You Need” paper
by Vaswani et al., revolutionized NLP by replacing recurrent neural networks (RNNs) with
attention mechanisms. Transformers excel in capturing long-range dependencies in sequences,
making them highly efective in language modeling and other text-related tasks.</p>
        <p>The Beto model, like its English counterpart BERT (Bidirectional Encoder Representations
from Transformers), is a variant of the Transformer model. It utilizes a bidirectional architecture
that allows the model to capture both the left and right context of each word or token in a
sentence. This enables the model to understand the meaning and context of words based on
their surrounding words, greatly improving its ability to grasp the semantics of the text.</p>
        <p>Beto is pre-trained on a massive amount of Spanish text data, typically using a masked
language modeling objective. During pre-training, the model learns to predict missing words
in a sentence based on the surrounding context. This process helps the model develop a deep
understanding of the Spanish language.</p>
        <p>After pre-training, Beto can be fine-tuned for specific downstream tasks such as text
classification, named entity recognition, sentiment analysis, and more. Fine-tuning involves training
the model on a smaller, task-specific dataset to adapt it to a particular NLP task. By fine-tuning
Beto on a specific task, it can leverage its pre-trained knowledge to achieve high performance
and accuracy in that task.</p>
        <p>The Beto classifier has gained popularity and achieved excellent results in various Spanish
NLP benchmarks and competitions. Its ability to capture context, semantics, and syntactic
information makes it a powerful tool for analyzing and understanding Spanish text, opening
up numerous possibilities for applications in areas such as sentiment analysis, document
classification, and information retrieval.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>of the data of 70 % for training and the rest for testing.</p>
      <p />
      <p>Observing the results, we can make the following observations:</p>
      <p>
        ranges from 0.35 to 0.56, indicating the classifier’s ability to correctly
identify the polarity (positive, negative, or neutral) of the text. The highest value of 0.56 is
achieved when using the  value [
        <xref ref-type="bibr" rid="ref2 ref3 ref4">0, 0, 2, 3, 4</xref>
        ]. That is, the more instances of a class are required
to filter the opinions with more words.
      </p>
      <p>
        ranges from 0.90 to 0.97, indicating the classifier’s performance in correctly
identifying the type of text. The highest value of 0.97 is achieved with the  value [
        <xref ref-type="bibr" rid="ref1 ref1 ref1 ref1 ref1">1, 1, 1, 1, 1</xref>
        ].
      </p>
      <p>The  
1, 1, 1].
is crucial.</p>
      <p>ranges from 0.80 to 0.89, representing the classifier’s ability to identify
the country associated with the text. The highest value of 0.89 is achieved with the  value [1, 1,</p>
      <p>Overall, we can observe that diferent  values have an impact on the performance of the
classifier for each evaluation metric. It is important to note that the choice of  values depends
on the specific requirements and objectives of the task at hand. A higher  - 
value
indicates better performance, so selecting the appropriate  values based on the desired outcome</p>
      <sec id="sec-5-1">
        <title>5.1. Test partition results</title>
        <p>For this edition, the organizers of Rest-Mex propose some evaluation metrics that give greater
weight to correctly classify the negative classes of polarity.</p>
        <p>To assess the efectiveness of the polarity classifier, the organizers propose the equation
1.</p>
        <p>This metric gives the additive inverse of importance according to the percentage of instances of
a class in the test collection.</p>
        <p>() =
∑|=| 1 ((1 −
 
   ) ∗   ())
∑|=| 1 1 −</p>
        <p>Finally, to obtain a unique value per participant, they propose a combination of the results as
indicated by the equation 2. It is important to mention that in the same way, greater weight is
given to the result of polarity than to the other two traits.</p>
        <p>() =
2 ∗ 
 () +   () +   ()
4
Observing the results, we can make the following observations:
The ()</p>
        <p>column shows the sentiment score associated with each  value. The scores
range from 0.40 to 0.68, indicating the overall sentiment captured by the classifier for each
parameter setting. Higher scores generally indicate more positive sentiment.</p>
        <p>
          The     - ranges from 0.27 to 0.48, indicating the classifier’s ability to correctly
identify the polarity (positive, negative, or neutral) of the text. The highest value of 0.48 is
achieved with the  value [
          <xref ref-type="bibr" rid="ref1 ref1 ref1 ref1 ref1">1, 1, 1, 1, 1</xref>
          ].
        </p>
        <p>
          The     - ranges from 0.66 to 0.96, representing the classifier’s performance in
correctly identifying the type of text. The highest value of 0.96 is achieved with the  value [
          <xref ref-type="bibr" rid="ref1 ref1 ref1 ref1 ref1">1, 1, 1,
1, 1</xref>
          ].
        </p>
        <p>
          The    - ranges from 0.55 to 0.87, indicating the classifier’s ability to identify the
country associated with the text. The highest value of 0.87 is achieved with the  value [
          <xref ref-type="bibr" rid="ref1 ref1 ref1 ref1 ref1">1, 1, 1,
1, 1</xref>
          ].
        </p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions</title>
      <p>In this paper, we have presented a proposal to participate in the Rest-Mex 2023 forum as
the ITT Team, with a focus on filtering opinions in Spanish using specific topics of interest
related to tourism. Our objective is to conduct sentiment analysis that is tailored to the key
areas that significantly impact tourists’ experiences. We have identified 11 specific topics,
including Activities, Friendliness, Climate, Food, Insecurity, Cleanliness, Nature, Pandemic,
Prices, Transportation, and Location, which encompass a wide range of factors critical for
tourists when evaluating and making decisions about their travel experiences.</p>
      <p>To identify the relevant topics within the opinions, we employed a simple approach. This
allowed us to uncover the underlying themes and subjects discussed in the opinions, enabling
us to focus on specific aspects of interest to tourists. By associating each opinion with the most
relevant topic, we enhanced the sentiment analysis process and captured the nuanced sentiment
expressed in the context of these key areas.</p>
      <p>The results of our experiments demonstrated the efectiveness of our proposed approach
in filtering opinions related to tourist topics in Spanish. By considering the specific aspects
of interest to tourists, our method provided more targeted and relevant sentiment analysis
results compared to traditional approaches that do not incorporate topic information. The
inclusion of specific topics allowed for a comprehensive understanding of tourists’ experiences
and facilitated more informed decision-making for potential travelers.</p>
      <p>In conclusion, our proposed approach ofers a valuable contribution to the field of sentiment
analysis by presenting a tailored method for filtering opinions in Spanish with specific topics of
tourist interest. By focusing on the key areas that significantly impact tourists’ experiences, we
enhance the accuracy and granularity of sentiment analysis, providing more targeted insights
for tourism-related decision-making. We anticipate that our approach can be further extended
and applied to other languages and domains, promoting improved sentiment analysis in various
contexts.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>The authors thank the Mexican Academy of Tourism Research (AMIT) for their support of
the project ”Creation of a labeled database related to tourist destinations for training artificial
intelligence models for classifying relevant topics” through the call ”I Research Projects 2022”
[12] L. Bustio-Martínez, M. A. Álvarez-Carmona, V. Herrera-Semenets, C. Feregrino-Uribe,
R. Cumplido, A lightweight data representation for phishing urls detection in iot
environments, Information Sciences 603 (2022) 42–59.
[13] M. A. Álvarez-Carmona, M. Franco-Salvador, E. Villatoro-Tello, M. Montes-y Gómez,
P. Rosso, L. Villaseñor-Pineda, Semantically-informed distance and similarity measures
for paraphrase plagiarism identification, Journal of Intelligent &amp; Fuzzy Systems 34 (2018)
2983–2990.
[14] M. A. Álvarez-Carmona, R. Aranda, R. Guerrero-Rodrıguez, A. Y. Rodrıguez-González,
A. P. López-Monroy, A combination of sentiment analysis systems for the study of online
travel reviews: Many heads are better than one, Computación y Sistemas 26 (2022).
doi:https://doi.org/10.13053/CyS- 26- 2- 4055.
[15] M. Á. Álvarez-Carmona, E. Villatoro-Tello, L. Villaseñor-Pineda, M. Montes-y Gómez,
Classifying the social media author profile through a multimodal representation, in:
Intelligent Technologies: Concepts, Applications, and Future Directions, Springer, 2022,
pp. 57–81.
[16] M. Á. Álvarez-Carmona, R. Aranda, A. Y. Rodríguez-González, L. Pellegrin, H.
Carlos, Classifying the mexican epidemiological semaphore colour from the covid-19 text
spanish news, Journal of Information Science (2022). doi:https://doi.org/10.1177/
01655515221100952.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>E.</given-names>
            <surname>Olmos-Martínez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Aranda</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Díaz-Pacheco</surname>
          </string-name>
          ,
          <article-title>What does the media tell us about a destination? the cancun case, seen from the usa, canada, and mexico</article-title>
          ,
          <source>International Journal of Tourism Cities</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodriguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Aranda</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          <string-name>
            <surname>López-Monroy</surname>
          </string-name>
          ,
          <article-title>Studying online travel reviews related to tourist attractions using nlp methods: the case of guanajuato, mexico</article-title>
          ,
          <source>Current issues in tourism 26</source>
          (
          <year>2023</year>
          )
          <fpage>289</fpage>
          -
          <lpage>304</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Alvarez-Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rodriguez-Gonzalez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fajardo-Delgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. G. A.</given-names>
            <surname>Sanchez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Perez-Espinosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Martinez-Miranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodriguez</surname>
          </string-name>
          , L. BustioMartinez,
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Pacheco</surname>
          </string-name>
          ,
          <article-title>Natural language processing applied to tourism research: A systematic review and future research directions</article-title>
          ,
          <source>Journal of King</source>
          Saud University-Computer and Information Sciences (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Diaz-Pacheco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Guerrero-Rodríguez</surname>
            ,
            <given-names>L. A. C.</given-names>
          </string-name>
          <string-name>
            <surname>Chávez</surname>
            ,
            <given-names>A. Y.</given-names>
          </string-name>
          <string-name>
            <surname>Rodríguez-González</surname>
            ,
            <given-names>J. P.</given-names>
          </string-name>
          <string-name>
            <surname>Ramírez-Silva</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Aranda</surname>
          </string-name>
          ,
          <article-title>Artificial intelligence methods to support the research of destination image in tourism. a systematic review</article-title>
          ,
          <source>Journal of Experimental &amp; Theoretical Artificial Intelligence</source>
          (
          <year>2022</year>
          )
          <fpage>1</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <article-title>Álvarez-Carmona, Á</article-title>
          . Díaz-Pacheco,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Rodríguez-González</surname>
          </string-name>
          , L. BustioMartínez, V.
          <string-name>
            <surname>Muñis-Sánchez</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          <string-name>
            <surname>Pastor-López</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Sánchez-Vega</surname>
          </string-name>
          ,
          <article-title>Overview of rest-mex at iberlef 2023: Research on sentiment analysis task for mexican tourist texts</article-title>
          ,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>71</volume>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Alvarez-Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>López-Monroy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Montes-y Gómez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Villasenor-Pineda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Jair-Escalante</surname>
          </string-name>
          ,
          <article-title>Inaoe's participation at pan'15: Author profiling task</article-title>
          ,
          <source>Working Notes Papers of the CLEF</source>
          <volume>103</volume>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Guzmán-Falcón</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Montes-y Gómez</surname>
            ,
            <given-names>H. J.</given-names>
          </string-name>
          <string-name>
            <surname>Escalante</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Villasenor-Pineda</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Reyes-Meza</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Rico-Sulayes</surname>
          </string-name>
          ,
          <article-title>Overview of mex-a3t at ibereval 2018: Authorship and aggressiveness analysis in mexican spanish tweets, in: Notebook papers of 3rd sepln workshop on evaluation of human language technologies for iberian languages (ibereval), seville, spain</article-title>
          , volume
          <volume>6</volume>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M. E.</given-names>
            <surname>Aragón</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A. A.</given-names>
            <surname>Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Montes-y Gómez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. J.</given-names>
            <surname>Escalante</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. V.</given-names>
            <surname>Pineda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Moctezuma</surname>
          </string-name>
          ,
          <article-title>Overview of mex-a3t at iberlef 2019: Authorship and aggressiveness analysis in mexican spanish tweets</article-title>
          ., in: IberLEF@ SEPLN,
          <year>2019</year>
          , pp.
          <fpage>478</fpage>
          -
          <lpage>494</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Aranda</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Arce-Cardenas</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Fajardo-Delgado</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>GuerreroRodríguez</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          <string-name>
            <surname>López-Monroy</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Martínez-Miranda</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Pérez-Espinosa</surname>
            ,
            <given-names>A. Y.</given-names>
          </string-name>
          <article-title>RodríguezGonzález, Overview of rest-mex at iberlef 2021: recommendation system for text mexican tourism 67 (</article-title>
          <year>2021</year>
          ). doi:https://doi.org/10.26342/2021- 67- 14.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <article-title>Álvarez-Carmona, Á</article-title>
          . Díaz-Pacheco,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Rodríguez-González</surname>
          </string-name>
          ,
          <string-name>
            <surname>D. FajardoDelgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodríguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bustio-Martínez</surname>
          </string-name>
          ,
          <article-title>Overview of rest-mex at iberlef 2022: Recommendation system, sentiment analysis and covid semaphore prediction for mexican tourist texts</article-title>
          ,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>69</volume>
          (
          <year>2022</year>
          )
          <fpage>289</fpage>
          -
          <lpage>299</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.</given-names>
            <surname>Arce-Cardenas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fajardo-Delgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>J. P.</given-names>
          </string-name>
          <string-name>
            <surname>Ramírez-Silva</surname>
          </string-name>
          ,
          <article-title>A tourist recommendation system: a study case in mexico</article-title>
          ,
          <source>in: Advances in Soft Computing: 20th Mexican International Conference on Artificial Intelligence, MICAI</source>
          <year>2021</year>
          ,
          <string-name>
            <given-names>Mexico</given-names>
            <surname>City</surname>
          </string-name>
          , Mexico,
          <source>October 25-30</source>
          ,
          <year>2021</year>
          , Proceedings,
          <source>Part II 20</source>
          , Springer,
          <year>2021</year>
          , pp.
          <fpage>184</fpage>
          -
          <lpage>195</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>