<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Multi-Task Text Classification of Tourist Reviews Using Doc2Vec</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Lorena Vazquez</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jean Arreola</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Center for Research in Mathematics</institution>
          ,
          <country country="MX">Mexico</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>National Polytechnic Institute</institution>
          ,
          <country country="MX">Mexico</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <abstract>
        <p>Understanding tourist experiences through online reviews ofers valuable insights for enhancing destination management and marketing, particularly in culturally rich regions such as Mexico's Pueblos Mágicos. In this work, we present a multi-task natural language processing (NLP) approach to automatically analyze user-generated reviews by predicting three key attributes: sentiment polarity (on a 1-5 scale), review category (restaurant, hotel, or attractive), and the corresponding tourist destination. Our model leverages Doc2Vec, an unsupervised algorithm that learns fixed-length vector representations of variable-length texts, allowing us to capture semantic and contextual information from Spanish-language reviews. These document embeddings are then used as input features for a set of supervised classifiers tailored to each task. The proposed pipeline was developed and evaluated in the context of the Rest-Mex challenge, where multilingual, informal, and domain-specific language posed unique modeling challenges. Results demonstrate that Doc2Vec embeddings provide a strong foundation for robust document-level classification across multiple tasks, enabling scalable analysis of subjective tourist feedback. This research contributes a replicable framework for semantic understanding of tourism narratives in regional and linguistically diverse settings.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Embedding</kwd>
        <kwd>Mexican tourism</kwd>
        <kwd>Magical Town</kwd>
        <kwd>Doc2Vec</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Tourism today is shaped not just by brochures and guidebooks, but by the voices of travelers who
share their experiences online. Reviews posted by tourists ofer valuable insights into what they loved,
what could be improved, and how they felt during their visit [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ]. In Mexico, the “Pueblos Mágicos”
(Magical Towns) program highlights destinations known for their cultural richness, natural beauty, and
historical significance [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Understanding how visitors talk about these places can help improve the
tourist experience and support local businesses and communities.
      </p>
      <p>
        As part of the REST-MEX [
        <xref ref-type="bibr" rid="ref5 ref6 ref7 ref8">5, 6, 7, 8</xref>
        ] [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] event, it was developed a system composed of two machine
learning models and an heuristic, designed to analyze tourist reviews and extract key information
automatically. The system focuses on three main tasks: (1) identifying the sentiment or "polarity" of a
review on a scale from 1 (very negative) to 5 (very positive), (2) recognizing the type of review—whether
it’s about a restaurant, a hotel, or a local attraction—and (3) determining which Pueblo Mágico the
review refers to.
      </p>
      <p>To achieve this, natural language processing (NLP) techniques were employed to train the system
to understand and interpret text written in Spanish, which is often informal and rich in regional
expressions. By combining linguistic analysis with machine learning, the system is able to process
thousands of reviews quickly and consistently, providing a powerful tool for decision-makers in the
tourism sector.</p>
      <p>This paper explains how the system was built, the challenges with the data, and how the results can
help guide smarter tourism strategies that reflect the real voices of travelers.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Dataset</title>
      <p>The dataset used in this work was provided by the contest organizers. It consists of 208,051 reviews,
each consisting of:
• Title: The title that the tourist assigned to their opinion
• Review: The opinion issued by the tourist.
• Polarity: The label representing the sentiment polarity of the opinion (1, 2, 3, 4, 5).
• Town: The town where the review is focused.
• Region: The region (state in Mexico) where the town is located.</p>
      <p>• Type: The type of place the review refers to ( Hotel, Restaurant, Attractive).</p>
    </sec>
    <sec id="sec-3">
      <title>3. Modelling Approach</title>
      <sec id="sec-3-1">
        <title>3.1. Data preprocessing</title>
        <p>A critical step in analyzing the reviews involved understanding their structure and identifying potential
factors that could impact the embeddings, such as multilingual content, spelling errors, and inconsistent
use of uppercase and lowercase letters.</p>
        <p>
          To mitigate these issues, a series of data cleaning and preprocessing steps were applied (similar to
[
          <xref ref-type="bibr" rid="ref10">10</xref>
          ]):
1. Selection of an appropriate file reading and text encoding method.
2. Customization of the stopword list by modifying spaCy’s [11] Spanish stopword set, excluding
adverbs, adjectives, verbs, and auxiliary verbs.
3. Standardization of spelling and conversion of all text to lowercase.
        </p>
        <p>4. Removal of punctuation, numbers, and emojis, along with normalization of whitespace.
3.2. Doc2Vec
Doc2Vec [12] is an extension of the Word2Vec framework, designed to learn fixed-length vector
representations for variable-length pieces of text such as sentences, paragraphs, or entire documents. It
was introduced by Le and Mikolov in 2014 under the name Paragraph Vector.</p>
        <p>Whereas Word2Vec learns word embeddings based on local context windows (using models such as
CBOW or Skip-gram), Doc2Vec adds an additional mechanism to capture the semantics of larger text
units by incorporating a document identifier or tag during training. The resulting vector embeddings
capture not only word-level semantics but also higher-level structure and meaning across longer
sequences.</p>
        <sec id="sec-3-1-1">
          <title>Doc2Vec comes in two primary variants:</title>
          <p>Distributed Memory Model of Paragraph Vectors (PV-DM): In this model, the document is
mapped to a unique vector (document ID), which is concatenated or averaged with word vectors from
the context window to predict the next word. This setup is analogous to the CBOW model in Word2Vec.
• It captures semantics and word order.
• The document vector acts as a sort of memory that retains what is missing from the current
context.</p>
          <p>Distributed Bag of Words version of Paragraph Vector (PV-DBOW): This model ignores the
context words in the input and uses only the document vector to predict words sampled from the
document.</p>
          <p>• This approach is more akin to the Skip-gram model.</p>
          <p>• It tends to capture topic-level information better.</p>
          <p>• Faster and simpler than PV-DM, but potentially less precise for fine-grained semantics.
In practice, both models are often trained together to leverage their complementary strengths.</p>
          <p>In Doc2Vec, tags are essential because they uniquely identify each document in the training set. They
serve two primary purposes:
• During training, each tag is associated with a trainable vector that is updated alongside word
vectors. This tag vector becomes the document embedding.
• During inference, a new document (unseen during training) can be assigned a new tag, and its
vector can be inferred by fixing the word vectors and optimizing only the new document vector.</p>
          <p>Tags are not limited to single IDs—they can also include multiple labels or metadata (e.g., genre,
author, source) if richer supervision is needed [13]. In the case of the polarity and type model, tags
were obtained for each of the polarity and types categories.</p>
          <p>Finally, it is important to highlight the advantages of the Doc2Vec model over its predecessor
approaches, as it has demonstrated strong performance in capturing the semantic structure of texts.</p>
        </sec>
      </sec>
      <sec id="sec-3-2">
        <title>3.3. Polarity Model</title>
        <p>In this study, document embeddings for each review were generated using the Doc2Vec model.
Additionally, the Type, Town, Region, and Polarity attributes were incorporated as tags.</p>
        <p>For the classification task, an XGBoost model was trained using Polarity as the target variable.
Hyperparameter optimization was carried out with the Optuna [14] library, with the objective of
addressing class imbalance—an essential factor in enhancing model performance metrics.</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.4. Type of Review Model</title>
        <p>For this model, document embeddings were obtained for each review, and for each "tag" of the type of
review ("Restaurant", "Hotel", "Attractive"). Similarity was obtained between the review embedding,
and each of the tag embedding. The three similarities were concatenated to the review embedding.</p>
        <p>A xgboost model was trained using the class weight of each of the categories [15].</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.5. Place Recognition Model</title>
        <p>For the place recognition Model, a classification model was discarded, because of the number of
categories.</p>
        <sec id="sec-3-4-1">
          <title>An heuristic approach was used, following the next steps:</title>
          <p>• A curated list of tourist destinations was compiled based on selections from [16], as well as
recommendations generated by a GPT model specifically trained to retrieve information about
Mexico’s Magical Towns. This model provided not only key attractions of each location but also
commonly recommended hotels and restaurants. Additionally, the list included the colloquial
or promotional nicknames associated with each destination. Consequently, when a particular
location was referenced in a review, it was matched with the corresponding Magical Town based
on these associations.
• Average embeddings for each type of review were obtained for every town.
• Similarity between the review embedding and all the place-type average embeddings was obtained,
acordingly to the type of the review. The town with the highest similarity was matched.</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results and Discussion</title>
      <p>The results of the systems and the baseline of the contest are presented in the tables below.</p>
      <p>System
Polarity Model
Baseline Model
Top Ranked Model</p>
      <p>Macro F1
0.19
0.158
0.987</p>
      <p>Accuracy
36.91
65.53
78.52</p>
      <p>Avg Precision
0.2
0.2
0.62</p>
      <p>Avg Recall
0.2
0.131
0.66</p>
      <p>MAE
1.044
0.55
0.22</p>
      <p>The results obtained across the three tasks of the Rest-Mex task (polarity detection, review type
classification, and place recognition) highlight both the strengths and limitations of our proposed
models in comparison to the Top Ranked Model.</p>
      <p>In the polarity detection task, our model slightly surpassed the baseline in terms of Macro F1 and
average recall. However, its mean absolute error (MAE) was notably high, and performance remained
distant from that of the Top Ranked Model, which achieved near-perfect metrics. This discrepancy
suggests that our approach may be overly sensitive to class imbalance or error propagation in sentiment
boundaries. The high MAE also indicates that misclassifications were often far from the true label in
ordinal space, suggesting the need to explore ordinal regression or class-weighted losses to better align
training objectives with evaluation metrics.</p>
      <p>For the review-type classification task, our model showed solid improvement over the baseline,
especially in Macro F1 and recall. Yet, its performance still fell short of the Top Ranked Model, which
achieved almost flawless results. This sharp contrast implies that our current feature representations
and modeling strategy may be insuficient to capture the nuanced semantic distinctions between
review types. The superior performance of the Top Ranked Model hints at the potential benefits of
using domain-adapted language models, attention mechanisms, or even hierarchical classification
architectures.</p>
      <p>The place recognition task (Town Model) posed the greatest challenge, as reflected by the modest
accuracy and F1 score achieved. Although our model outperformed the baseline, it was substantially
outperformed by the top-ranked model. Place names in user-generated content are often noisy,
misspelled, or contextually ambiguous, which may explain our model’s limited ability to disambiguate
them efectively. Potential improvements include integrating spelling normalization, geographical
knowledge bases, and context-aware entity linking methods that can better resolve ambiguous mentions.</p>
      <p>Across all tasks, the Top Ranked Model established a clear performance ceiling, demonstrating
the efectiveness of advanced modeling strategies. Their results suggest that a more holistic
approach—including pre-trained transformer models, feature-rich inputs, and possibly ensemble
techniques—is critical for achieving state-of-the-art performance in this domain.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions</title>
      <p>Our models for the Rest-Mex challenge represent a significant step forward from baseline
systems, particularly in tasks involving sentiment and type classification. However, the performance
gap relative to the Top Ranked Model highlights key areas where our current pipeline could be improved.</p>
      <p>Moving forward, we identify the following priorities.</p>
      <p>• Mitigating class imbalance [15], especially in polarity prediction, through loss reweighting, data
augmentation, or sampling strategies.
• Adopt transformer-based embeddings fine-tuned on tourism-specific corpora to improve semantic
understanding.
• Incorporating domain knowledge (e.g., gazetteers, spatial hierarchies) to enrich context in place
recognition tasks.
• Leveraging interpretability tools to analyze model errors and iteratively refine feature sets.</p>
      <p>Exploring multitask and ensemble learning frameworks to capture the interdependence between
tasks and diversify model reasoning.</p>
      <p>In summary, while our models demonstrate encouraging progress, achieving top-tier performance in
review understanding for tourism contexts will require a richer integration of linguistic, geographic,
and contextual signals, supported by more sophisticated and adaptive learning architectures.</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>We declare that the present manuscript has been written entirely by the authors and that no generative
artificial intelligence tools were used in its preparation, drafting, or editing.
[11] M. Honnibal, I. Montani, S. Van Landeghem, A. Boyd, spacy: Industrial-strength natural language
processing in python, Zenodo (2020). URL: https://doi.org/10.5281/zenodo.1212303. doi:10.5281/
zenodo.1212303.
[12] Q. V. Le, T. Mikolov, Distributed representations of sentences and documents, in: Proceedings of
the 31st International Conference on Machine Learning (ICML), PMLR, 2014, pp. 1188–1196.
[13] S. Chen, A. Soni, A. Pappu, Y. Mehdad, Doctag2vec: An embedding based multi-label learning
approach for document tagging, in: Proceedings of the 2nd Workshop on Representation Learning
for NLP, Association for Computational Linguistics, Vancouver, Canada, 2017, pp. 111–120. URL:
https://aclanthology.org/W17-2614/. doi:10.18653/v1/W17-2614.
[14] T. Akiba, S. Sano, T. Yanase, T. Ohta, M. Koyama, Optuna: A next-generation hyperparameter
optimization framework, in: Proceedings of the 25th ACM SIGKDD International Conference on
Knowledge Discovery &amp; Data Mining (KDD), ACM, 2019, pp. 2623–2631. URL: https://arxiv.org/
abs/1907.10902. doi:10.1145/3292500.3330701.
[15] A. Allawala, A. Ramteke, P. Wadhwa, Performance impact of minority class reweighting on
xgboost-based anomaly detection, International Journal of Machine Learning and Computing 12
(2022) 143–148. URL: http://www.ijmlc.org/vol12/1093-IJMLC-3012.pdf. doi:10.18178/ijmlc.
2022.12.4.1093.
[16] Secretaría de Turismo de México, Pueblos mágicos, https://www.sectur.gob.mx/gobmx/
pueblos-magicos/, 2025. Accessed: 2025-04-01.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodriguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Álvarez</surname>
          </string-name>
          <string-name>
            <surname>Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>López-Monroy</surname>
          </string-name>
          ,
          <article-title>Studying online travel reviews related to tourist attractions using nlp methods: the case of guanajuato, mexico</article-title>
          ,
          <source>Current Issues in Tourism</source>
          <volume>26</volume>
          (
          <year>2023</year>
          )
          <fpage>289</fpage>
          -
          <lpage>304</lpage>
          . URL: https://doi.org/10.1080/13683500.
          <year>2021</year>
          .
          <volume>2007227</volume>
          . doi:
          <volume>10</volume>
          .1080/13683500.
          <year>2021</year>
          .
          <volume>2007227</volume>
          . arXiv:https://doi.org/10.1080/13683500.
          <year>2021</year>
          .
          <volume>2007227</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodríguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Álvarez-Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          , et al.,
          <article-title>Big data analytics of online news to explore destination image using a comprehensive deep-learning approach: a case from mexico</article-title>
          ,
          <source>Information Technology &amp; Tourism</source>
          <volume>26</volume>
          (
          <year>2024</year>
          )
          <fpage>147</fpage>
          -
          <lpage>182</lpage>
          . URL: https://doi.org/10.1007/ s40558-023-00278-5. doi:
          <volume>10</volume>
          .1007/s40558-023-00278-5.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Álvarez-Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Rodríguez-Gonzalez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fajardo-Delgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. G.</given-names>
            <surname>Sánchez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Pérez-Espinosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Martínez-Miranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodríguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bustio-Martínez</surname>
          </string-name>
          ,
          <article-title>Ángel DíazPacheco, Natural language processing applied to tourism research: A systematic review and future research directions</article-title>
          ,
          <source>Journal of King Saud University - Computer and Information Sciences</source>
          <volume>34</volume>
          (
          <year>2022</year>
          )
          <fpage>10125</fpage>
          -
          <lpage>10144</lpage>
          . URL: https://www.sciencedirect.com/science/article/pii/S1319157822003615. doi:https://doi.org/10.1016/j.jksuci.
          <year>2022</year>
          .
          <volume>10</volume>
          .010.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J. L.</given-names>
            <surname>García</surname>
          </string-name>
          , M. d. C. Hernández, El estudio de los pueblos mágicos:
          <article-title>Una revisión a casi 20 años de la implementación del programa</article-title>
          ,
          <source>Dimensiones Turísticas</source>
          <volume>5</volume>
          (
          <year>2021</year>
          )
          <fpage>9</fpage>
          -
          <lpage>38</lpage>
          . URL: https://dimensionesturisticas.amiturismo.org/wp-content/uploads/2021/04/
          <string-name>
            <surname>DT-V5N8-Art-</surname>
          </string-name>
          1
          <string-name>
            <surname>-</surname>
          </string-name>
          El-estudio-de-los-pueblos-magicos-9-38.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Aranda</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Arce-Cárdenas</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Fajardo-Delgado</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Guerrero-Rodríguez</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          <string-name>
            <surname>López-Monroy</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Martínez-Miranda</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Pérez-Espinosa</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Rodríguez-González</surname>
          </string-name>
          ,
          <article-title>Overview of rest-mex at iberlef 2021: Recommendation system for text mexican tourism</article-title>
          ,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>67</volume>
          (
          <year>2021</year>
          ). doi:https://doi.org/10.26342/2021-67-14.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <article-title>Álvarez-Carmona, Á</article-title>
          . Díaz-Pacheco,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Rodríguez-González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fajardo-Delgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodríguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bustio-Martínez</surname>
          </string-name>
          ,
          <article-title>Overview of rest-mex at iberlef 2022: Recommendation system, sentiment analysis and covid semaphore prediction for mexican tourist texts</article-title>
          ,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>69</volume>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <article-title>Álvarez-Carmona, Á</article-title>
          . Díaz-Pacheco,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Rodríguez-González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bustio-Martínez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Muñis-Sánchez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>Pastor-López</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Sánchez-Vega</surname>
          </string-name>
          ,
          <article-title>Overview of rest-mex at iberlef 2023: Research on sentiment analysis task for mexican tourist texts</article-title>
          ,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>71</volume>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <article-title>Álvarez-Carmona, Á</article-title>
          . Díaz-Pacheco,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Rodríguez-González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bustio-Martínez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Herrera-Semenets</surname>
          </string-name>
          ,
          <article-title>Overview of rest-mex at iberlef 2025: Researching sentiment evaluation in text for mexican magical towns</article-title>
          , volume
          <volume>75</volume>
          ,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>González-Barba</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Chiruzzo</surname>
            ,
            <given-names>S. M.</given-names>
          </string-name>
          <string-name>
            <surname>Jiménez-Zafra</surname>
          </string-name>
          ,
          <article-title>Overview of IberLEF 2025: Natural Language Processing Challenges for Spanish and other Iberian Languages, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2025), co-located with the 41st Conference of the Spanish Society for Natural Language Processing (SEPLN 2025), CEUR-WS</article-title>
          . org,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>J.</given-names>
            <surname>Arreola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Garcia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Ramos-Zavaleta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rodrıguez</surname>
          </string-name>
          ,
          <article-title>An embeddings based recommendation system for mexican tourism. submission to the rest-mex shared task at iberlef</article-title>
          <year>2021</year>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>