<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Beto to Classify Spanish Tourist Opinions Through the Random Instances Selection</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Juan David Jurado-Buch</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ever Sebastian Minayo-Díaz</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jhony Alexander Tello</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kaily Estefanía Chaucanes</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Laura Valentina Salazar</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mauricio Daniel Oquendo-Coral</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Miguel Ángel Álvarez-Carmona</string-name>
          <email>miguel.alvarez@cimat.mx</email>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Centro de Investigación en Matemáticas (CIMAT)</institution>
          ,
          <addr-line>Sede Monterrey</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Consejo Nacional de Humanidades, Ciencias y Tecnologías (CONAHCYT)</institution>
          ,
          <addr-line>CDMX</addr-line>
          ,
          <country country="MX">Mexico</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Servicio Nacional de Aprendizaje Centro Sur Colombiano (SENA)</institution>
          ,
          <addr-line>Nariño</addr-line>
          ,
          <country country="CO">Colombia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2023</year>
      </pub-date>
      <abstract>
        <p>In this study, we developed a single Beto classification model capable of predicting 45 diferent classes that describes the polarity, type, and country of tourist opinions in Spanish. In addition, we proposed a novel function to balance the imbalanced database of tourist opinions, allowing us to achieve better results with reduced data. Specifically, we show that using only 27 % of the total training data leads to better results compared to using the entire dataset, with even competitive results obtained using just 2 % of the data. Notably, our proposed method achieved a top 8 ranking at the Rest-Mex 2023 forum. Overall, our results highlight the efectiveness of our proposed function in improving the performance of machine learning models trained on imbalanced datasets, especially in the context of tourist opinions.</p>
      </abstract>
      <kwd-group>
        <kwd>Rest-Mex</kwd>
        <kwd>Sentiment Analysis</kwd>
        <kwd>Beto</kwd>
        <kwd>Type prediction</kwd>
        <kwd>Country prediction</kwd>
        <kwd>Mexican tourism</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Opinion classification is a major problem in the field of text mining [
        <xref ref-type="bibr" rid="ref1 ref2 ref3">1, 2, 3</xref>
        ]. In particular, in
the field of tourism [
        <xref ref-type="bibr" rid="ref4 ref5">4, 5</xref>
        ], the classification of opinions can provide valuable information for
decision-making in the tourism industry and improve the experience of tourists [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. However,
this problem is complex, especially when you have multiple classes and want to predict diferent
aspects of sentiment.
      </p>
      <p>
        As a result, the Rest-Mex initiative emerged [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Rest-Mex is an international evaluation
forum specialized in Natural Language Processing applied to the tourism sector.
      </p>
      <p>
        For the 2023 edition [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], the organizers have proposed an extension to the sentiment analysis
task that has been developed since 2021 [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. On this occasion, the task is to predict, given an
opinion of a tourist place, the:
1. Polarity of the opinion: an integer value between 1 and 5.
      </p>
      <p>2. Type of tourist place: it can be an attraction, a hotel, or a restaurant.
3. Country of the tourist place: it can be Mexico, Cuba, or Colombia.</p>
      <p>
        This year’s train data collection for the polarity task has more than 250,000 opinions and the
test one with more than 100,000. Typically, 3 diferent models would be built with the data, one
for each trait to be predicted (polarity, type, and country). However, with a collection in the
order of hundreds of thousands of data, the training and test phase for 3 diferent models could
be very slow [
        <xref ref-type="bibr" rid="ref10 ref11 ref12">10, 11, 12</xref>
        ].
      </p>
      <p>In this work, we focus on the construction of a textual classifier of tourism opinions of 45
classes, where the polarity, the type of place, and the country of origin of the tourist place are
predicted. One of the most important challenges when building a textual opinion classifier
is the imbalance in the distribution of classes [13]. In many cases, some classes have too few
instances, which can negatively afect the classifier’s ability to correctly identify those classes
in a single model.</p>
      <p>Another important problem is class imbalance. In this way, it is necessary to apply an instance
selection method to the data [14].</p>
      <p>To address this problem, we propose a random instance selection method to balance data.
The approach consists of randomly selecting an equal number of instances of each class in the
training set. The idea is that, by balancing the number of instances of each class, the classifier
has more opportunities to learn the characteristics of the less represented classes and, therefore,
improves its generalizability [15, 16].</p>
      <p>To assess the efectiveness of our method, we conducted experiments using a Rest-Mex
dataset. The results obtained show that our random instance selection method improves the
evaluation of the classifier for all classes, especially those with a small number of instances.</p>
      <p>In summary, our work presents a solution to the instance selection challenge in the context
of multi-class tourism opinion classification. The results obtained suggest that our method can
be useful in diferent opinion analysis applications in the tourism industry.</p>
      <p>The rest of the paper is organized as follows: Section 2 describes the Rest-Mex corpus for
the 2023 edition. Section 3 shows the methodology proposed in this work. 4 shows the results
obtained with this proposal. Finally, 5 lists the conclusion of this work.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Dataset</title>
      <p>The train collection built by the organizers of Rest-Mex 2023 consists of 251,702 opinions from
TripAdvisor.</p>
      <p>For the Polarity classification, there are 5 classes, where class 1 represents the worst polarity
and 5 is the best polarity. In Figure 1 it is possible to observe the distribution of these classes.
This figure shows a clear imbalance. To measure this imbalance we use the same method used
in [17]. Following this same method, an imbalance value of 56,896.36 is obtained.</p>
      <p>To determine the Type of place there are 3 classes: Attractive, Hotel, and Restaurant. Figure
2 shows the distribution of this trait. In this case, there is not an imbalance as marked as for
polarity, however, it is possible to appreciate that there is no balancing. Its imbalance value is
19,864.79.</p>
      <p>Finally, to classify the Country of origin of the place that the tourist visited, there are 3 classes
in the collection: Mexico, Cuba, and Colombia. In Figure 3 its distribution is presented. The
degree of imbalance of the Country trait is 24,661.36.</p>
      <p>In this way, it can be seen that the trait with the least imbalance is that of Type, and that of
polarity is significantly greater than the other two.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Proposed Methodology</title>
      <p>The proposed methodology consists of 3 key steps. First, combine the classes of the 3 traits
(Polarity, Type, and Country) to generate a single model. Second, generate a function that
returns the number of instances to choose for trying to balance the database. Finally, classify
the data.</p>
      <p>Each of the 3 steps is described below.</p>
      <sec id="sec-3-1">
        <title>3.1. Combining classes</title>
        <p>As mentioned in the 1 section, the idea of this work is to generate a single model to classify the
3 traits involved in the Rest-Mex sentiment analysis task. The reason is to propose a simpler
system that has acceptable results than 3 diferent models.</p>
        <p>For this, it is proposed to generate all the possible combinations for the 3 diferent traits of
the collection. This is combining the 5 Polarity classes, the 3 Type classes, and the 3 Country
classes. Given these numbers, there are 45 diferent combinations.</p>
        <p>Table 1 shows the 45 possible classes ordered by the number of instances each combination
has.</p>
        <p>In this way, it has a diferent distribution of a single trait with 45 diferent classes. However,
although it may be easier to classify instances with a model of 45 classes than 3 models of 5, 3,
and 3 classes respectively, the class imbalance problem has not yet been resolved. If we measure
the class imbalance with the new collection of Table 1 we obtain a degree of 7,532.97. This
degree is lower than that of the 3 individual traits, but some balancing is still necessary.</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Instances selection</title>
        <p>To make a balance of the data, it is proposed to make a random selection of the data.</p>
        <p>The idea is to take the number of instances of a class  as parameter  as a reference and from
this value, for all classes with a number of instances greater than  select randomly  instances.</p>
        <p>To select the reference value, the equation 1 is proposed.</p>
        <p>(1)
is a
(, )
= 
(,
(())</p>
        <p>2
)</p>
        <p>Where () is a function that takes a database  and returns an ordered list of the classes
within  . ()</p>
        <p>is a function that returns the length of a class list of  . (,  )
function that returns the number of instances of the  − th class in an ordered list of classes in a
data set  .</p>
        <p>The parameter  is used to regulate the degree of balancing, if  = 0 then the class with the
largest number of instances will be taken as a reference value, which would make the database
maintain the same instances, if otherwise if  is large enough to take the minority class, all other
classes would randomly select as many instances as the class with the fewest instances.</p>
        <p>In the case of the distribution of Table 1, if  = 6 is taken, 115 instances of each class would
be randomly selected. This would generate a new fully balanced database with 5175 instances.</p>
        <p>
          It is proposed to experiment with all possible values of  , that is, with  ∈ [
          <xref ref-type="bibr" rid="ref6">0, 6</xref>
          ] and with  ∈ ℤ .
        </p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Classifier</title>
        <p>such as Beto.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Results</title>
      <p>To obtain the results within the training database, it is proposed to make a 70/30 partition for
training and testing respectively. This partition was made respecting the distribution of the
original partition for each value of  .</p>
      <p>The idea is to be able to observe how the diferent values of  within the ordering afect the
classification performance.</p>
      <p>Table 3 shows the results for each value of  . It can be seen as the used percentage of the
original database, with  = 1 drops to 27% of the original data. This value reaches a little more
than 2 % for higher values of  .</p>
      <p>It can be seen how the imbalance value decreases as the value of  increases. However, this
is also reflected when the Accuracy of the models is calculated. If the entire database is used,
values of more than 63 for Accuracy and 0.38 for F-measure are achieved. This diference is
very likely due to the degree of imbalance in the data.</p>
      <p>It should be noted that, when  takes the values 1 and 2, the F-measure rises to 0.41, although
the Accuracy is better for  = 1 . Even with  = 3 the result of F-measure is very close to that
obtained with  = 0 . This is an interesting result since with a small percentage of data it is
possible to arrive at similar results. In addition, they are competitive results taking into account
that there are 45 classes.</p>
      <p>Finally, when  &gt; 4 the diference in the results is larger.</p>
      <sec id="sec-4-1">
        <title>4.1. Test partition results</title>
        <p>For this edition, the organizers of Rest-Mex propose some evaluation metrics that give greater
weight to correctly classify the negative classes of polarity.</p>
        <p>To assess the efectiveness of the polarity classifier, the organizers propose the equation 2.
This metric gives the additive inverse of importance according to the percentage of instances of
a class in the test collection.</p>
        <p>() =
∑|=| 1 ((1 −   ) ∗   ())</p>
        <p>∑|=| 1 1 −   
 
(2)</p>
        <p>To evaluate Type and Country traits, they propose the equations 3 and 4. These metrics are
macro F-measures of each trait.
(3)
(5)
  () =
  () +   () +   ()</p>
        <p>3
  () =   () +   () +   () (4)
3</p>
        <p>Finally, to obtain a unique value per participant, they propose a combination of the results as
indicated by the equation 5. It is important to mention that in the same way, greater weight is
given to the result of polarity than to the other two traits.</p>
        <p>() =
2 ∗   () + 
 () +</p>
        <p>()
4</p>
        <p>This way of evaluating the results seems ideal for the method proposed in this work, since
the main objective is to find the best possible result, balancing the corpus to a certain degree.</p>
        <p>Figure 4 shows a summary of the results obtained in the test partition of the forum.</p>
        <p>The bars represent the value obtained with the equation 5 for each value of  .</p>
        <p>As you can see, the best result obtained is 0.70, which is achieved when  = 1 . That is when
we worked with 27% of the training data. It is also important to note that when  &gt; 1 the results
remain competitive at the value obtained when  = 0 , which is 0.69.</p>
        <p>The worst result obtained is when  = 6 with 0.62, which is a competitive result considering
that it only uses 2 % of the training data.</p>
        <p>Within the figure, it can also see a red curve, which represents the percentage used for
training. The yellow curve represents the diference between the Accuracy and the F-Measure,
which tends to decrease as  is higher. The green line and the orange line represent the baselines
proposed by the organizers.
result for accuracy is obtained with  = 0 for the three traits, however, when  = 1 a better result
is obtained for F-measure for Polarity, in addition to obtaining the same result for Type. Only
the Country trait, obtains better results from F-measure when  = 0 .
results than using 100 %.
results for Polarity, Type, and Country.
in Rest-Mex 2023.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
    </sec>
    <sec id="sec-6">
      <title>5. Conclusions</title>
      <p>This work presents a single model for sentiment analysis. The data is labeled to classify
the Polarity, Type, and Country from a tourist opinion. The main idea is to generate the 45
combinations to build a model from these classes.</p>
      <p>Our main proposal is a function that selects a number of opinions to balance the data by
randomly selecting instances from the database.</p>
      <p>Experiments give evidence that using 27 % of the total training data generates equal or better
Also, it is concluded that it is possible to use only 2 % of the data and reach competitive
With this method, it was possible to obtain place 8 out of a total of 17 systems participating
The authors thank the Mexican Academy of Tourism Research (AMIT) for their support of
the project ”Creation of a labeled database related to tourist destinations for training artificial
intelligence models for classifying relevant topics” through the call ”I Research Projects 2022”
ments, Information Sciences 603 (2022) 42–59.
[13] M. Á. Álvarez-Carmona, E. Guzmán-Falcón, M. Montes-y Gómez, H. J. Escalante,
L. Villasenor-Pineda, V. Reyes-Meza, A. Rico-Sulayes, Overview of mex-a3t at ibereval
2018: Authorship and aggressiveness analysis in mexican spanish tweets, in: Notebook
papers of 3rd sepln workshop on evaluation of human language technologies for iberian
languages (ibereval), seville, spain, volume 6, 2018.
[14] M. E. Aragón, M. A. A. Carmona, M. Montes-y Gómez, H. J. Escalante, L. V. Pineda,
D. Moctezuma, Overview of mex-a3t at iberlef 2019: Authorship and aggressiveness
analysis in mexican spanish tweets., in: IberLEF@ SEPLN, 2019, pp. 478–494.
[15] M. A. Álvarez-Carmona, M. Franco-Salvador, E. Villatoro-Tello, M. Montes-y Gómez,
P. Rosso, L. Villaseñor-Pineda, Semantically-informed distance and similarity measures
for paraphrase plagiarism identification, Journal of Intelligent &amp; Fuzzy Systems 34 (2018)
2983–2990.
[16] M. E. Villa-Pérez, M. A. Alvarez-Carmona, O. Loyola-Gonzalez, M. A. Medina-Pérez,
J. C. Velazco-Rossell, K.-K. R. Choo, Semi-supervised anomaly detection algorithms: A
comparative summary and future research directions, Knowledge-Based Systems 218
(2021) 106878.
[17] M. Á. Álvarez-Carmona, R. Aranda, A. Y. Rodríguez-González, L. Pellegrin, H.
Carlos, Classifying the mexican epidemiological semaphore colour from the covid-19 text
spanish news, Journal of Information Science (2022). doi:https://doi.org/10.1177/
01655515221100952.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Álvarez-Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodrıguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Rodrıguez-González</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>López-Monroy</surname>
          </string-name>
          ,
          <article-title>A combination of sentiment analysis systems for the study of online travel reviews: Many heads are better than one</article-title>
          ,
          <source>Computación y Sistemas</source>
          <volume>26</volume>
          (
          <year>2022</year>
          ). doi:https://doi.org/10.13053/CyS- 26- 2- 4055.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>A.</given-names>
            <surname>Diaz-Pacheco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Guerrero-Rodríguez</surname>
            ,
            <given-names>L. A. C.</given-names>
          </string-name>
          <string-name>
            <surname>Chávez</surname>
            ,
            <given-names>A. Y.</given-names>
          </string-name>
          <string-name>
            <surname>Rodríguez-González</surname>
            ,
            <given-names>J. P.</given-names>
          </string-name>
          <string-name>
            <surname>Ramírez-Silva</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Aranda</surname>
          </string-name>
          ,
          <article-title>Artificial intelligence methods to support the research of destination image in tourism. a systematic review</article-title>
          ,
          <source>Journal of Experimental &amp; Theoretical Artificial Intelligence</source>
          (
          <year>2022</year>
          )
          <fpage>1</fpage>
          -
          <lpage>31</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Alvarez-Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rodriguez-Gonzalez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fajardo-Delgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. G. A.</given-names>
            <surname>Sanchez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Perez-Espinosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Martinez-Miranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodriguez</surname>
          </string-name>
          , L. BustioMartinez,
          <string-name>
            <given-names>A. D.</given-names>
            <surname>Pacheco</surname>
          </string-name>
          ,
          <article-title>Natural language processing applied to tourism research: A systematic review and future research directions</article-title>
          ,
          <source>Journal of King</source>
          Saud University-Computer and Information Sciences (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodriguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Aranda</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          <string-name>
            <surname>López-Monroy</surname>
          </string-name>
          ,
          <article-title>Studying online travel reviews related to tourist attractions using nlp methods: the case of guanajuato, mexico</article-title>
          ,
          <source>Current issues in tourism 26</source>
          (
          <year>2023</year>
          )
          <fpage>289</fpage>
          -
          <lpage>304</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>E.</given-names>
            <surname>Olmos-Martínez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Aranda</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Díaz-Pacheco</surname>
          </string-name>
          ,
          <article-title>What does the media tell us about a destination? the cancun case, seen from the usa, canada, and mexico</article-title>
          ,
          <source>International Journal of Tourism Cities</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>G.</given-names>
            <surname>Carmona-Sanchez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Alvarez-Carmona</surname>
          </string-name>
          ,
          <article-title>Naive features for sentiment analysis on mexican touristic opinions texts</article-title>
          ., in: IberLEF@ SEPLN,
          <year>2021</year>
          , pp.
          <fpage>118</fpage>
          -
          <lpage>126</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <article-title>Álvarez-Carmona, Á</article-title>
          . Díaz-Pacheco,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Rodríguez-González</surname>
          </string-name>
          ,
          <string-name>
            <surname>D. FajardoDelgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodríguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bustio-Martínez</surname>
          </string-name>
          ,
          <article-title>Overview of rest-mex at iberlef 2022: Recommendation system, sentiment analysis and covid semaphore prediction for mexican tourist texts</article-title>
          ,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>69</volume>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <article-title>Álvarez-Carmona, Á</article-title>
          . Díaz-Pacheco,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Rodríguez-González</surname>
          </string-name>
          , L. BustioMartínez, V.
          <string-name>
            <surname>Muñis-Sánchez</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          <string-name>
            <surname>Pastor-López</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Sánchez-Vega</surname>
          </string-name>
          ,
          <article-title>Overview of rest-mex at iberlef 2023: Research on sentiment analysis task for mexican tourist texts</article-title>
          ,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>71</volume>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Aranda</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Arce-Cardenas</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <string-name>
            <surname>Fajardo-Delgado</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>GuerreroRodríguez</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          <string-name>
            <surname>López-Monroy</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Martínez-Miranda</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          <string-name>
            <surname>Pérez-Espinosa</surname>
            ,
            <given-names>A. Y.</given-names>
          </string-name>
          <article-title>RodríguezGonzález, Overview of rest-mex at iberlef 2021: recommendation system for text mexican tourism 67 (</article-title>
          <year>2021</year>
          ). doi:https://doi.org/10.26342/2021- 67- 14.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Alvarez-Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>López-Monroy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Montes-y Gómez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Villasenor-Pineda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Jair-Escalante</surname>
          </string-name>
          ,
          <article-title>Inaoe's participation at pan'15: Author profiling task</article-title>
          ,
          <source>Working Notes Papers of the CLEF</source>
          <volume>103</volume>
          (
          <year>2015</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>S.</given-names>
            <surname>Arce-Cardenas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fajardo-Delgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Á</surname>
          </string-name>
          .
          <string-name>
            <surname>Álvarez-Carmona</surname>
            ,
            <given-names>J. P.</given-names>
          </string-name>
          <string-name>
            <surname>Ramírez-Silva</surname>
          </string-name>
          ,
          <article-title>A tourist recommendation system: a study case in mexico</article-title>
          ,
          <source>in: Advances in Soft Computing: 20th Mexican International Conference on Artificial Intelligence, MICAI</source>
          <year>2021</year>
          ,
          <string-name>
            <given-names>Mexico</given-names>
            <surname>City</surname>
          </string-name>
          , Mexico,
          <source>October 25-30</source>
          ,
          <year>2021</year>
          , Proceedings,
          <source>Part II 20</source>
          , Springer,
          <year>2021</year>
          , pp.
          <fpage>184</fpage>
          -
          <lpage>195</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>L.</given-names>
            <surname>Bustio-Martínez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Álvarez-Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Herrera-Semenets</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Feregrino-Uribe</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cumplido</surname>
          </string-name>
          ,
          <article-title>A lightweight data representation for phishing urls detection in iot environ-</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>