<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Zaragoza, Spain
$ memoreno@utb.edu.co (M. M. Novoa); jserrano@utb.edu.co (J. Serrano); jcmartinezs@utb.edu.co (J. C. Martinez-Santos);
epuerta@utb.edu.co (E. Puertas)</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>VerbaNexAI at ASQP-PT 2025: Robust Detection of Tourism Aspects Using Pretrained Models and BIO Tagging in Portuguese</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Melissa Moreno Novoa</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jairo Serrano</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Juan Carlos Martinez-Santos</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Edwin Puertas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Universidad Tecnologica de Bolivar</institution>
          ,
          <addr-line>Cartagena</addr-line>
          ,
          <country country="CO">Colombia</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2025</year>
      </pub-date>
      <volume>000</volume>
      <fpage>0</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>This paper presents the system developed for the ASQP-PT 2025 shared task, explicitly addressing the Aspect Term Extraction (ATE) subtask in Portuguese tourism reviews. We used a pre-trained BERT model, which was ifne-tuned with TripAdvisor reviews annotated with aspect categories, opinion terms, and sentiment polarity. Our contribution's novelty lies in designing and implementing a domain-adapted pipeline that integrates grammatical ifltering, precise BIO tag generation with token alignment, and stratified evaluation using cross-validation and per-class metrics. Experimental results show that our system achieves an F1 score of 0.6108 on the ATE test set, demonstrating its ability to extract explicit aspect terms despite the challenges posed by colloquial and noisy language. Moreover, the pipeline's modular architecture allows for future extensions toward whole-aspectopinion-category-polarity prediction. This work represents a technically robust and linguistically grounded contribution to sentiment analysis in underrepresented languages, ofering a reproducible and practical solution for real-world applications in the tourism domain.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Aspect Term Extraction</kwd>
        <kwd>ATE</kwd>
        <kwd>Portuguese</kwd>
        <kwd>Tourism</kwd>
        <kwd>BERT fine-tuning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Tourism is a social and economic phenomenon that generates millions of opinions on social media and
specialized platforms such as TripAdvisor. These reviews provide valuable information on lodging, food
services and other tourist-related services, which are essential for agencies, operators, and destination
managers to identify trends, pinpoint areas for improvement, and design strategies based on actual
user feedback [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. In this context, sentiment analysis becomes a key tool for interpreting user opinions
and detecting emotional patterns. However, the growing volume and speed at which people generate
these opinions pose significant challenges for manual analysis [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], highlighting the need for automatic
systems capable of accurately and scalably processing large volumes of text. Thus, natural language
processing (NLP) focused on sentiment analysis directly improves the quality of tourism services and
strengthens data-driven decision-making processes [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        Although sentiment analysis has seen considerable progress in other languages, particularly English,
several limitations remain in Portuguese. Most eforts in NLP applied to the Portuguese language
have focused on document-level sentiment analysis or tasks such as Aspect-Based Sentiment Analysis
(ABSA), including the subtasks of Aspect-Based Sentiment Analysis in Portugues ABSAPT-2022 [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] and
ABSAPT-2024 [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] within IberLEF. However, these approaches only partially address the complexity of
sentiment analysis: they extract aspect terms or classify sentence polarity, but do not comprehensively
link each aspect to its category, opinion terms, and the resulting polarity [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ].
      </p>
      <p>
        Although resources for Aspect Sentiment Quad Prediction (ASQP) already exist in English - for
example, new data sets that include aspects, categories, opinions, and polarity in a single step [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]
there are still few ASQP data sets available in Portuguese. This gap underscores the need to explore
a specific corpus for the subtasks of Aspect Term Extraction (ATE), Opinion Term Extraction (OTE),
Aspect Category Detection (ACD), and Aspect Sentiment Quad Prediction (ASQP) in user-generated
texts in Portuguese, to lay the groundwork for future studies and evaluations.
      </p>
      <p>Portuguese reviews on TripAdvisor often exhibit colloquial variability that significantly hinders
automatic processing: words with elongated vowels such as "péssimooo" to emphasize dissatisfaction
or "legalzz" to express enthusiasm; abbreviations like "qnd" instead of "quando", "tb" for "também", or
"mto" in place "of muito"; as well as emoticons and symbols that introduce noise into the text. These
nonstandard forms prevent tokenizers from reliably recognizing lexical roots and syntactic markers,
complicate text normalization, and obstruct accurately identifying relevant nominal fragments (aspects)
and correctly classifying their categories and polarities.</p>
      <p>
        Based on these observations, our work focuses on the ATE subtask within the ASQP-PT 2025 [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]
framework, which is a shared task organized under the IberLEF [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] initiative. This study proposes and
evaluates a robust pipeline for aspect-term extraction in Portuguese tourism reviews as a foundation
for future full-stimulation quad prediction. In later stages, we propose a pipeline that will serve as a
solid foundation for extending the analysis to complete sentiment quad prediction, ensuring that each
extracted aspect term acts as a reliable anchor for the assignment of category and polarity.
      </p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Work</title>
      <p>Analyzing the millions of reviews generated on platforms such as TripAdvisor allows one to identify the
strengths and weaknesses of tourist destinations, providing critical information on culture, gastronomy,
and services that help to consolidate a positive image of a destination [10]. In this context, ASQP
emerges as an advanced task within ABSA, whose objective is to extract the aspect term, the opinion
term, the polarity of the feelings and the aspect category simultaneously [11]. ASQP has gained traction
in multiple languages; in particular, Portuguese studies were scarce until 2020 [12]. Between 2020 and
2025, approaches ranging from rule- and lexicon-based methods to transformer-based models have been
introduced, along with newly annotated corpora in domains such as tourism, politics and e-commerce,
yielding significant improvements in aspect extraction and classification metrics.</p>
      <p>To organize and compare the various ABSA approaches in Portuguese, Table 1 groups the studies
according to the type of subtask and the technique used without implying a strict chronological
progression. -ABSA (E2E -ABSA) schemes based on fine-tuning of lightweight models with automatic
annotation; classical ATE methods using Conditional Random Fields (CRF) with post-processing;
hybrid proposals that combine ATE with Sentiment Orientation Extraction (SOE) through transformer
ensembles and conditional text generation; and frameworks that, building on ATE, apply Aspect
Sentiment Classification (ASC) via BERT variants and data augmentation strategies employing ChatGPT.
It also includes extensions to non-tourism domains that contrast rule-based and heuristic named entity
recognition (NER) techniques with generative models. Thus, the table provides a thematic overview of
the most relevant methods in the field.</p>
    </sec>
    <sec id="sec-3">
      <title>3. Data</title>
      <p>The dataset employed in the present study consists of thousands of reviews of hotel establishments
located in Paris, Las Vegas, and New York City, extracted from the TripAdvisor platform[20]. Based on
the existing annotations from the ABSAPT-2024 challenge, we expanded the corpus by incorporating new
annotations covering aspect terms, associated opinion terms, aspect categories, and their corresponding
polarities. Each review may contain multiple quadruples, understood as combinations of aspect,
sentiment term, category, and polarity, all explicitly identifiable within the text.</p>
      <p>
        We stratified data segmentation to preserve the balance between categories and polarities, allocating
60% for training. Figure 1 provides a comprehensive and precise representation of the distribution of
aspects, opinions, polarities, and categories within the dataset. These data are the starting point for
modeling and evaluating the ABSA system. The remaining 40% was reserved for testing, subdivided
Resplande Sant et al. ATE and Sentiment Orientation
[
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] Extraction (SOE)
Thurow Bender et al. ATE and Aspect Sentiment
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] Classification (ASC)
Seno et al. [15]
Zhao et al. [16]
Lin et al. [17]
Aziz et al. [18]
Li et al. [19]
      </p>
      <p>Aspect Detection and Polarity
Classification in political comments
Unified multi-task extraction of aspect
terms, ASTE in single sentences
Cross-lingual ABSA with adaptive
rebalancing for class-imbalance issues
Unified ABSA framework covering
multiple subtasks (ATE, ASC)
Implicit aspect-level sentiment
analysis</p>
      <p>Technique Used
Fine-tuning with PTT5, FLAN-T5, mT0
using instruction tuning; GPT-3.5 for
automatic annotation
Conditional Random Fields (CRF) with
BIO tags; POS features; post-processing
with lemmatization
Ensemble of Transformers (RoBERTa,
mDeBERTa); conditional text generation
with PTT5 Large
BERTimbau Large for ATE (token
classification); BERT fine-tuned with
ChatGPT-augmented data for ASC
ChatGPT vs. rule-based methods, NER
heuristics, and BERT; best performance in
polarity classification (PC) with ChatGPT
Dependency-Enhanced GCN (DE-GCN)
with location-aware graphs and
span-sharing joint extraction
Equi-XABSA framework using dynamic
weighted loss and anti-decoupling
mechanisms
BERT encoding with bi-afine attention
and multi-layer GCN for modeling
aspect–opinion relations
Generative T5 model with integrated
GNN and multi-prompt fusion strategy
into 20% for ATE and OTE, and 20% for ACD and ASQP. Consequently, the primary objective of this
study is to evaluate the precision, recall, and F1 score of the predictions, considering as correct only
those that exactly match all four components.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Architecture</title>
      <p>We structured the architecture of the aspect term extraction and Beginning-Inside-Outside (BIO)
classification system into five sequential phases, ranging from ingesting the input data to generating
results with exact positional references. Each phase is described in the following, following the structure
presented in Figure 2.</p>
      <sec id="sec-4-1">
        <title>4.1. Pre-Processing</title>
        <p>In this initial stage, the repository is mounted, and the file containing the raw reviews is loaded. Each
text then undergoes a cleaning process that removes special characters, punctuation marks, and digits
and converts all content to lowercase to standardize the representation. Finally, we leverage spacy
capabilities to reconstruct the reviews using their noun chunks, allowing us to retain primarily nominal
entities and reduce lexical noise.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Feature Extraction</title>
        <p>With the preprocessed text, we extract the essential features the model requires. First, we reorganize
the dataset columns to obtain the aspect term and its corresponding start and end positions within
(a) Aspect Terms
(b) Opinion Sentiments
(c) Sentiment Polarity
(d) Aspect Categories
the original string. We then convert the list of associated opinion terms into a single text string. To
generate the BIO labels, we tokenize the preprocessed text using spacy and assign each token a B-TERM,
I-TERM, or O label depending on whether it matches the position of the aspect term, as illustrated in
Table 2. Finally, we use a configured BertTokenizerFast to align these word-level tokens with the BERT
subtokens, ensuring a precise correspondence between the labels and the model input.</p>
      </sec>
      <sec id="sec-4-3">
        <title>4.3. Regularization</title>
        <p>Since the O label is overwhelmingly predominant compared to B-TERM and I-TERM, we randomly split
the dataset into 80% for training and 20% for testing. We then compute class weights using only the
labels from the training set. We incorporated these weights into the loss function via a custom Trainer.
Hence, errors in the much less frequent B-TERM and I-TERM classes are more important.</p>
      </sec>
      <sec id="sec-4-4">
        <title>4.4. Classifiers</title>
        <p>We conducted the classification phase by fine-tuning the BERT base Portuguese case, known as
BERTimbau [21], for a token classification task with three labels. We define training arguments with a batch
size of 8, a learning rate of 2 × 10− 5, a warm-up of 200 steps, and checkpoint saving at the end of each
epoch. We apply 3-fold cross-validation (K-Fold) to ensure model robustness and include a callback
mechanism with three-epoch iterations to prevent overfitting.</p>
      </sec>
      <sec id="sec-4-5">
        <title>4.5. Evaluation</title>
        <p>After each validation fold, we compute the accuracy, precision, recall, and F1 scores (macro- and
per-label) on the validation set. We select the model that maximizes the macro F1 score and apply it
to the entire set of tests. Based on the predicted BIO labels, we reconstruct the aspect spans within
the original text and compare them with the reference annotations. Finally, we exported all results
to an Excel file, including text, tokens, gold and predicted labels, extracted terms, and their positions,
enhancing interpretability.</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Experimental Results</title>
      <p>During training, the model produced standard warnings about overlapping faces with respect to
initialized weights, which did not prevent convergence. In each fold, we calculated general metrics
(accuracy, precision, recall, and macro-F1) and label-specific metrics (O, B-TERM, and I-TERM) to assess
the model’s ability to identify aspect terms and their continuations correctly.</p>
      <p>Table 5 reports the most representative metrics for the first two evaluated folds. As observed, although
the overall accuracy metrics remain above 87%, the performance in the B-TERM and I-TERM classes
presents low F1 scores, particularly due to still limited precision. However, the recall for both labels is
high, indicating that the model successfully identifies a significant proportion of relevant terms, albeit
with some confusion in their exact boundaries.</p>
      <p>The confusion matrix obtained in Figure 3 shows that although the model achieves solid performance
in classifying the majority class O - tokens that do not represent aspect terms - it faces challenges
in correctly identifying the minority classes B-TERM and I-TERM. Specifically, out of 91,821 tokens
truly labeled as ’O’, 86,384 were correctly classified, while 3,695 and 1,742 were incorrectly assigned to
’B-TERM’ and ’I-TERM’, respectively. This indicates a tendency of the model to produce false positives
in the detection of aspect terms. On the other hand, out of 915 true ’B-TERM’ tokens, the model correctly
identified 709, with an approximate recall of 77.5%, but with low precision (16%) due to the large number
of false positives originating from the ’O’ class. Similarly, for the ’I-TERM’ class, out of 528 true tokens,
only 330 were correctly classified, with an approximate recall of 62.5% and equally low precision (16%).</p>
      <p>These results demonstrate that the model behaves consistently when identifying tokens of the O class
(noun chunks) yet still faces challenges in precisely delimiting compound terms (B-TERM, I-TERM). The
high recall in these labels suggests that the model can serve as a foundation for post-processing systems
aimed at refining initial predictions. Additionally, we explored the model to identify noun phrases
that could serve as initial candidates for aspect terms. Although the model enabled the construction of
structured lists, including noun chunks, head nouns, their grammatical functions, and related words,
the manual analysis revealed unreliable coverage.</p>
      <p>For example, when applied to an extended review containing multiple evaluative expressions, spaCy
identified phrases such as "infraestruturas bastante degradadas", "casa de banho partilhada", or "a cortina",
but also registered ambiguous or irrelevant chunks such as "que", "piso", "alguém" or "convívio", many
of which do not represent analyzable aspects nor are associated with consistent semantic judgments.</p>
      <p>It suggests that although spaCy processing provides a practical lexical foundation, its direct
applicability for automatic BIO label generation requires subsequent filtering based on grammatical rules or
supervised training, especially in contexts involving complex and lengthy opinions, such as TripAdvisor
reviews in Portuguese.</p>
      <p>Thus, once we completed the training and cross-validation process, without applying noun fragments
but instead using preprocessed data as outlined in Section 4.1, the model with the best macro-F1
performance was selected and applied to the complete test set. Table 3 summarizes the system’s final
performance on the aspect term extraction task (ATE). During training, the model achieved a precision
of 0.4135, a recall of 0.8697, and an F1 score of 0.4559, highlighting its ability to cover many relevant
terms, although with moderate precision.</p>
      <p>We implemented a validation and inference pipeline on an unseen test set to evaluate the model’s
generalization capability. First, we cleaned and normalized each review by removing punctuation via
regular expressions to ensure consistency with the training format. Next, we tokenized the preprocessed
texts once using BertTokenizerFast, preserving their associated BIO labels, and fed them directly
into the trained model, which returned a sequence of predicted labels (B-TERM, I-TERM, or O). No
hyperparameter search was performed.</p>
      <p>Subsequently, we applied a decoding step to reconstruct aspect terms from the labelled sequences.
This step grouped consecutive tokens labelled as B-TERM and I-TERM, determining their exact
characterlevel positions within the original text. In this way, compound terms, even when they spanned multiple
tokens, could be accurately recovered.</p>
      <p>The decoding function iterates over each token along with its predicted label, recording each term’s
start and end position. When a B-TERM label is detected, we initiate a new term; if followed by an
I-TERM label, we extend the term. Upon encountering an O label or reaching the end of the sequence,
the term is finalized and stored. The extracted terms include both the textual content and their positions.
Therefore, we stored the reconstructed information in an organized structure, and each review is
associated with its original text and a list of identified aspect terms. Each term includes the lexical
content and its coordinates within the text. This final set of predictions is exported to an Excel file,
facilitating manual inspection and further analysis.</p>
      <p>In the final evaluation of the test set, our system achieved a precision of 0.6125, a recall of 0.6091,
and an F1 score of 0.6108. Although the best performing task system reached a precision of 0.6898, a
recall of 0.7305, and an F1 score of 0.7096, see Table 4, our approach still demonstrates competitive
aspect detection and maintains balanced performance when generalizing to unseen data. These results
underscore the practical applicability of the model for Portuguese-language sentiment analysis in the
tourism domain, particularly given that aspect terms may span multiple words and be expressed in
varied ways, and pave the way for future enhancements.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Future Work</title>
      <p>We recommend prioritizing linguistic data augmentation strategies for future iterations, such as synonym
replacement or back-translation, especially given the limited coverage observed for infrequent aspects.
These techniques enhance the model’s generalization capabilities when encountering lexical variations
and less common expressions.</p>
      <p>Although adjustments to class weights were implemented to address the observed class imbalance,
the results obtained reveal ongoing precision challenges, particularly in correctly identifying aspect
terms within minority classes (’B-TERM’ and ’I-TERM’). This suggests that class weighting alone did not
fully resolve the problem, indicating that additional factors, such as semantic similarity between labeled
and unlabeled terms or significant structural class imbalance, require more sophisticated strategies.
Therefore, future research should evaluate complementary methods, synthetic oversampling techniques,
or hybrid approaches incorporating linguistic rules to mitigate observed false positives.</p>
      <p>The visualization of the predicted aspects (see Figure 4) clearly highlights the challenges of the
current system. The word cloud generated from these predictions demonstrates a need for refinement
in aspects such as lexical unification, error reduction in aspect detection, and consistent handling of
capitalization and segmentation.</p>
      <p>Finally, given the model’s sensitivity to specific parameters during training, we recommend
conducting a more exhaustive hyperparameter search, including learning rate, batch size, and number of training
epochs. Furthermore, we suggest exploring multitask learning approaches capable of simultaneously
detecting aspects and opinion terms, or reframing the problem as a multi-label classification task to
capture overlapping sentiment expressions. In the long term, we advise extending the study towards
a more comprehensive framework encompassing the remaining subtasks of aspect sentiment quad
prediction to enable a more robust evaluation of the model’s performance in real-world scenarios.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusion</title>
      <p>The results of this study demonstrate that combining rigorous linguistic preprocessing with the
finetuning of a BERT model for Portuguese enables accurate and robust identification of aspect terms
in tourism reviews. This approach validates the potential of contextual language models to address
specialized tasks in underrepresented languages. It underscores the importance of adapting natural
language processing techniques to the lexical and syntactic particularities of the tourism domain.</p>
      <p>This work’s main contribution lies in the design and implementation of a comprehensive pipeline for
aspect term extraction in Portuguese capable of handling the informal and noisy language frequently
found on platforms such as TripAdvisor. Our proposal integrates strategies such as grammatical filtering,
precise BIO tag generation with token alignment, and detailed evaluation through cross-validation and
per-class metrics. Together, these components result in an adaptable, transparent, and high-performance
system for the ATE subtask of the ASQP-PT 2025 challenge.</p>
      <p>Furthermore, the model efectively performs on the training data and maintains a balanced
performance when generalizing to unseen texts. It is a practical tool for opinion analysis systems and
personalized recommendation services in the tourism sector. Its architecture also allows for future
extensions to more complex tasks, such as predicting whole aspect–opinion–category–polarity
quadruples.</p>
      <p>Nonetheless, limitations related to corpus size and the dificulty of capturing infrequent or implicit
aspects are acknowledged. As a result, future work will explore active learning techniques, multilingual
transfer learning, and linguistic data augmentation. In summary, this work provides a technically robust
and scientifically relevant solution for aspect-based analysis in Portuguese, laying the groundwork for
developing more inclusive, eficient, and practically applicable ABSA systems.</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this investigation, ChatGPT (OpenAI) was used for the revision of translations
into English, as well as for grammatical and spelling correction. After using this tool, the content was
reviewed and edited as necessary, and full responsibility for the content of the publication is assumed.</p>
    </sec>
    <sec id="sec-9">
      <title>Acknowledgments</title>
      <p>The authors express their gratitude to the Call 933 “Training in National Doctorates with a Territorial,
Ethnic and Gender Focus in the Framework of the Mission Policy — 2023” of the Ministry of Science,
Technology and Innovation (Minciencia). In addition, we thank the team of the Artificial Intelligence
Laboratory VerbaNex (https://github.com/VerbaNexAI), afiliated with the Universidad Tecnológica de
Bolívar (UTB), for their contributions to this project.
[10] R. M. L. Freire, K. F. B. Maracajá, V. Valduga, A. B. F. M. do Nascimento, Análise da imagem
afetiva dos turistas no destino vale dos vinhedos - rs, Turismo: Visão e Ação 25 (2023) 114–133.
doi:10.14210/rtva.v25n1.p114-133.
[11] A. Wang, J. Jiang, Y. Ma, A. Liu, N. Okazaki, Generative data augmentation for aspect sentiment
quad prediction, in: A. Palmer, J. Camacho-collados (Eds.), Proceedings of the 12th Joint Conference
on Lexical and Computational Semantics (*SEM 2023), Association for Computational Linguistics,
Toronto, Canada, 2023, pp. 128–140. URL: https://aclanthology.org/2023.starsem-1.12/. doi:10.
18653/v1/2023.starsem-1.12.
[12] D. A. Pereira, A survey of sentiment analysis in the portuguese language, Artificial Intelligence</p>
      <p>Review 54 (2021) 1087–1115. doi:10.1007/s10462-020-09870-1.
[13] G. Pereira, L. Barbosa, J. Moreira, T. Melo, A. Silva, Enhancing aspect-based sentiment analysis for
portuguese using instruction tuning, in: Anais do XXI Encontro Nacional de Inteligência Artificial
e Computacional (ENIAC 2024), Sociedade Brasileira de Computação - SBC, 2024, pp. 990–1001.
doi:10.5753/eniac.2024.245109.
[14] M. T. Machado, T. A. S. Pardo, Nilc at absapt 2022: aspect extraction for portuguese, in: Conference
of the Spanish Society for Natural Language Processing - SEPLN, CEUR-WS, 2022.
[15] E. Seno, L. Silva, F. Anno, F. Rocha, H. Caseli, Aspect-based sentiment analysis in comments on
political debates in Portuguese: evaluating the potential of ChatGPT, in: P. Gamallo, D. Claro,
A. Teixeira, L. Real, M. Garcia, H. G. Oliveira, R. Amaro (Eds.), Proceedings of the 16th International
Conference on Computational Processing of Portuguese - Vol. 1, Association for Computational
Lingustics, Santiago de Compostela, Galicia/Spain, 2024, pp. 312–320. URL: https://aclanthology.
org/2024.propor-1.32/.
[16] M. Zhao, J. Yang, F. Shang, Dependency-enhanced graph convolutional networks for aspect-based
sentiment analysis, Neural Computing and Applications 35 (2023) 14195–14211. doi:10.1007/
S00521-023-08384-5.
[17] J. Šmíd, P. Král, Cross-lingual aspect-based sentiment analysis: A survey on tasks, approaches, and
challenges, Information Fusion 120 (2025) 103073. URL: https://www.sciencedirect.com/science/
article/pii/S1566253525001460. doi:10.1016/J.INFFUS.2025.103073.
[18] K. Aziz, D. Ji, P. Chakrabarti, T. Chakrabarti, M. S. Iqbal, R. Abbasi,
Unifying aspect-based sentiment analysis bert and multi-layered graph convolutional
networks for comprehensive sentiment dissection, Scientific Reports 14 (2024) 1–22. URL:
https://www.nature.com/articles/s41598-024-61886-7. doi:10.1038/S41598-024-61886-7;
SUBJMETA=114,1305,2164,631;KWRD=DATA+MINING,MACHINE+LEARNING.
[19] X. Li, X. Wang, C. Yao, Y. Li, Graph-enhanced implicit aspect-level sentiment
analysis based on multi-prompt fusion, Scientific Reports 15 (2025) 1–19. URL:
https://www.nature.com/articles/s41598-025-02609-4. doi:10.1038/S41598-025-02609-4;
SUBJMETA=2811,477,631;KWRD=HUMAN+BEHAVIOUR,PSYCHOLOGY.
[20] N. G. Ángel, N. N. Gómez, M. D. P. Esteban, I. M. Rubio, D. P. Góngora, Emprendimiento social
en época de pandemia: Revisión bibliográfica, Miradas sobre el emprendimiento ante la crisis
del coronavirus, 2022, ISBN 978-84-1377-995-9, págs. 528-533 (2022) 528–533. URL: https://dialnet.
unirioja.es/servlet/autor?codigo=4224094.
[21] F. Souza, R. Nogueira, R. Lotufo, BERTimbau: pretrained BERT models for Brazilian Portuguese,
in: 9th Brazilian Conference on Intelligent Systems, BRACIS, Rio Grande do Sul, Brazil, October
20-23 (to appear), 2020.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>M.</given-names>
            <surname>Chu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <article-title>Language interpretation in travel guidance platform: Text mining and sentiment analysis of tripadvisor reviews</article-title>
          ,
          <source>Frontiers in Psychology</source>
          <volume>13</volume>
          (
          <year>2022</year>
          )
          <article-title>1029945</article-title>
          . URL: https://pmc.ncbi.nlm.nih.gov/articles/PMC9607914/. doi:
          <volume>10</volume>
          .3389/FPSYG.
          <year>2022</year>
          .
          <volume>1029945</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Álvarez-Carmona</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Aranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. Y.</given-names>
            <surname>Rodríguez-Gonzalez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Fajardo-Delgado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. G.</given-names>
            <surname>Sánchez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Pérez-Espinosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Martínez-Miranda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Guerrero-Rodríguez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Bustio-Martínez</surname>
          </string-name>
          ,
          <article-title>Ángel DíazPacheco, Natural language processing applied to tourism research: A systematic review and future research directions</article-title>
          ,
          <source>Journal of King Saud University - Computer and Information Sciences</source>
          <volume>34</volume>
          (
          <year>2022</year>
          )
          <fpage>10125</fpage>
          -
          <lpage>10144</lpage>
          . doi:
          <volume>10</volume>
          .1016/j.jksuci.
          <year>2022</year>
          .
          <volume>10</volume>
          .010.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>K. R.</given-names>
            <surname>Chowdhary</surname>
          </string-name>
          ,
          <source>Natural language processing</source>
          ,
          <source>Fundamentals of Artificial Intelligence</source>
          (
          <year>2020</year>
          )
          <fpage>603</fpage>
          -
          <lpage>649</lpage>
          . URL: https://link.springer.com/chapter/10.1007/
          <fpage>978</fpage>
          -81-322-3972-7_
          <fpage>19</fpage>
          . doi:
          <volume>10</volume>
          .1007/
          <fpage>978</fpage>
          -81-322-3972-7_
          <fpage>19</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>J. R. S.</given-names>
            <surname>Gomes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. A. S.</given-names>
            <surname>Garcia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. F. B.</given-names>
            <surname>Junior</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. C.</given-names>
            <surname>Rodrigues</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. F. C.</given-names>
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D. F.</given-names>
            <surname>Maia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. F. F.</given-names>
            da
            <surname>Silva</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. R. G.</given-names>
            <surname>Filho</surname>
          </string-name>
          ,
          <string-name>
            <surname>A.</surname>
          </string-name>
          <article-title>da Silva Soares, Deep learning brasil at absapt 2022: Portuguese transformer ensemble approaches</article-title>
          ,
          <year>2023</year>
          . URL: https://arxiv.org/abs/2311.05051. arXiv:
          <volume>2311</volume>
          .
          <fpage>05051</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A. T.</given-names>
            <surname>Bender</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. A.</given-names>
            <surname>Gomes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. P.</given-names>
            <surname>Lopes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. M.</given-names>
            <surname>Araújo</surname>
          </string-name>
          , L. A. de Freitas, U. B.
          <string-name>
            <surname>Corrêa</surname>
          </string-name>
          , Overview of absapt at iberlef 2024:
          <article-title>Overview of the task on aspect-based sentiment analysis in portuguese, Procesamiento del lenguaje natural</article-title>
          ,
          <source>ISSN 1135-5948</source>
          , Nº.
          <volume>73</volume>
          ,
          <year>2024</year>
          , págs.
          <fpage>315</fpage>
          -
          <lpage>322</lpage>
          (
          <year>2024</year>
          )
          <fpage>315</fpage>
          -
          <lpage>322</lpage>
          . URL: https://dialnet.unirioja.es/servlet/articulo?codigo=9736870&amp;info=resumen&amp;idioma=ENGhttps: //dialnet.unirioja.es/servlet/articulo?codigo=9736870&amp;info=resumen&amp;idioma=SPAhttps: //dialnet.unirioja.es/servlet/articulo?codigo=
          <fpage>9736870</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R. R. L.</given-names>
            <surname>Barbosa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Sánchez-Alonso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. A.</given-names>
            <surname>Sicilia-Urban</surname>
          </string-name>
          ,
          <article-title>Evaluating hotels rating prediction based on sentiment analysis services</article-title>
          ,
          <source>Aslib Journal of Information Management</source>
          <volume>67</volume>
          (
          <year>2015</year>
          )
          <fpage>392</fpage>
          -
          <lpage>407</lpage>
          . URL: https://sites.google.com/inf.ufpel.edu.br/asqp-pt-2025/description. doi:
          <volume>10</volume>
          .1108/ AJIM-01-2015-0004.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>He</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Mou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <article-title>A unified one-step solution for aspect sentiment quad prediction, Proceedings of the Annual Meeting of the Association for Computational Linguistics (</article-title>
          <year>2023</year>
          )
          <fpage>12249</fpage>
          -
          <lpage>12265</lpage>
          . URL: https://arxiv.org/pdf/2306.04152. doi:
          <volume>10</volume>
          .18653/v1/
          <year>2023</year>
          . findings-acl.
          <volume>777</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>E. P.</given-names>
            <surname>Lopes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Gomes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. T.</given-names>
            <surname>Bender</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. M.</given-names>
            <surname>Araújo</surname>
          </string-name>
          , L. A. de Freitas, U. B.
          <string-name>
            <surname>Corrêa</surname>
          </string-name>
          ,
          <article-title>Overview of asqp-pt at iberlef 2025: Overview of the task on aspect-sentiment quadruple prediction in portuguese</article-title>
          ,
          <source>Procesamiento del Lenguaje Natural</source>
          <volume>75</volume>
          (
          <year>2025</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <surname>González-Barba</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Ángel</surname>
            , Chiruzzo, Luis, Jiménez-Zafra,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>María</surname>
          </string-name>
          , Overview of IberLEF 2025:
          <article-title>Natural Language Processing Challenges for Spanish and other Iberian Languages, in: Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2025), co-located with the 41st Conference of the Spanish Society for Natural Language Processing (SEPLN 2025), CEUR-WS</article-title>
          . org,
          <year>2025</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>