<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Winterthur, Switzerland, April</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>RE-Miner: Mining Mobile User Reviews with Feature Extraction and Emotion Classification</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Quim Motger</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Max Tiessler</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Marc Oriol</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Irene Bertolín</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Service and Information System Engineering, Universitat Politècnica de Catalunya</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2024</year>
      </pub-date>
      <volume>8</volume>
      <issue>2024</issue>
      <fpage>0000</fpage>
      <lpage>0002</lpage>
      <abstract>
        <p>In the context of app stores, user reviews are pivotal on supporting multiple requirements engineering tasks. Among these, feature extraction and emotion classification play a crucial role in requirements prioritization, feedback gathering and release planning. Empirical evaluation of these techniques is impeded by data collection complexities and a lack of reproducible methods and available tools. Furthermore, existing studies often focus on isolated tasks, hindering a comprehensive analysis of user perceptions. This paper introduces RE-Miner, a work-in-progress tool integrating multiple feature extraction and emotion classification innovative methods, enabling a detailed analysis of feature-oriented user feedback. RE-Miner comprises a web-based service for task integration and comparison, and a web application for persistent storage and analytical visualization of reviews. As a result, RE-Miner provides a platform for seamless integration, replication, and comparison of review mining techniques, fostering advancements in feature extraction and emotion classification understanding for requirements engineering. A demo of the tool is showcased here: https://youtu.be/PFNCbborPuU.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;mobile app reviews</kwd>
        <kwd>feature extraction</kwd>
        <kwd>emotion classification</kwd>
        <kwd>natural language processing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        User reviews ofer a wealth of information to support multiple requirements engineering
tasks [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Elicitation of feature requests [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ], identification of bugs or issues [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], user feedback
gathering and analysis [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], and release planning or prioritization [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] are a few examples of
the most popular use cases for automated review processing. Among these, feature extraction
(i.e., extracting mentions to functional aspects of an app [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]) and emotion classification (i.e.,
extracting the sentiments or emotions in a text [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]) are two of the most popular techniques.
While they have undergone intense study [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ], innovations in the landscape of natural language
processing triggered by deep learning and large language models are promoting accuracy
improvements in both feature extraction [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] and emotion classification [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Meanwhile, some
challenges like disambiguation, domain-specific adaptation and precision of negative emotion
detection still remain [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>
        Evaluating these methods poses scientific challenges like data collection [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ], replication
and deployment of resource-intensive services [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ]. Additionally, from a user perspective,
comparison and selection of the most suitable approach for a given domain is problematic,
undermined or even neglected [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ]. Furthermore, most research is dedicated to the isolated
use of these tasks [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Combining feature and emotion descriptors allows a fine-granularity
analysis on the user perception to a particular feature or a cluster of features. This knowledge
is valuable to support single-feature emotion-oriented analysis and filtering of non-relevant
content [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. While there is some existing work proposing a combined analysis of these tasks
(see Section 6), their replication is problematic due to the lack of open source tools and reusable
frameworks [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ].
      </p>
      <p>This paper introduces RE-Miner, a work-in-progress software tool designed for replication
and comparative analysis in review-based feature extraction and emotion classification.
REMiner consists of a web-based service for integrating and comparing multiple review mining
tasks, and a web application to support the visual analysis of user reviews, incorporating
statistical data on the features and emotions derived from these reviews. We envisage that our
contribution will assist researchers and app stakeholders in the selection, replication, integration
and comparison of review mining techniques.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Background</title>
      <p>
        While the field of natural language processing for requirements engineering (NLP4RE) is
not novel, availability of reusable tools or even full descriptions to allow replication of such
techniques is scarce. Zhao et al. surveyed 130 NLP4RE tools [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], from which only 17 were
available for download. Furthermore, they claimed this scarcity to be particularly highlighted
in novel NLP techniques and specialized tools integrating deep learning strategies.
      </p>
      <p>
        Focusing on feature extraction tasks, the SAFE tool is considered one of the standard methods,
based on syntactic-based pattern matching techniques complemented with semantic similarity
for feature candidate linking [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. However, not only its accuracy in the analysis of reviews
is limited [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ], but its source code is not available and its replication requires design
assumptions [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Similar approaches like GuMa [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] and ReUS [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] also focus on syntactic aspects.
Beyond this formulation of the problem, KEFE proposes a deep learning classifier designed to
sift through syntactically extracted features and exclude non-relevant expressions [
        <xref ref-type="bibr" rid="ref17">17</xref>
        ].
      </p>
      <p>
        On the other hand, emotion classification aims at detecting the emotion in accordance with
a specified emotional model. This contrasts to traditional sentiment analysis techniques that
just aim at measuring the positive or negative orientation of a text [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. In our proposal, we
have adopted one of the most widely embraced emotion models, proposed by P. Ekman [
        <xref ref-type="bibr" rid="ref19">19</xref>
        ].
This model classifies the emotions as: sadness, fear, happiness, anger, surprise, and disgust.
Complementary, the neutral feeling is also used for non-specific emotions. Automatic emotion
identification methods from textual content include lexicon-based techniques, classical machine
learning models (e.g. SVM, bayes,...) or deep learning models [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ]. More recently,
transformerbased models and LLMs have significantly advanced the field of emotion classification [
        <xref ref-type="bibr" rid="ref20">20</xref>
        ].
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Tool description</title>
      <p>The main goal of RE-Miner is to provide a user-friendly, easily accessible and reusable software
tool for both researchers and mobile app developers. It is designed to facilitate replication
studies and comparative analyses in the field of review-based natural language processing
tasks, with a particular emphasis on feature extraction and emotion classification. The tool
is composed of two software-based contributions: (1) RE-Miner-Hub1, a web-based service
supporting the integration and comparison of feature extraction and emotion classification
tasks, using various NLP models and providers; and (2) RE-Miner-Dashboard2, a web-based
application to support user management, app and review persistence, automatic processing and
analytical visualization of a batch of user reviews combining features and emotions emerged
from these reviews. Figure 1 provides a high-level overview of RE-Miner architecture.</p>
      <sec id="sec-3-1">
        <title>3.1. RE-Miner-Hub</title>
        <p>
          RE-Miner-Hub is a web-service system designed to empower researchers with the capability to
conduct feature extraction and emotion classification tasks using a common API syntax.
REMiner-Hub serves as a centralized orchestration service of heterogeneous software components
(both from a logic and physical point of view), each of them deployed as decoupled, decentralized
software resources. This architecture facilitates re-usability of third-party methods, which can
extend RE-Miner set of tasks by either replicating and embedding these techniques as a new
RE-Miner software module or simply by using available services from the web. For feature
extraction, current version of RE-Miner-Hub employs two methods based on our previous work:
• TransFeatEx integrates a RoBERTa-based pre-trained model used to leverage syntactic
patterns and semantic annotations (e.g., polarity score) to identify feature expressions [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ].
TransFeatEx is developed as a Python-based web-service, and it is deployed as a standalone
web application and accessed using HTTP-based communication through an API. We use
its default configuration, as described in the tool repository and in the original paper [
          <xref ref-type="bibr" rid="ref21">21</xref>
          ].
• T-FREX gives access to a suite of Transformer-based models (i.e., BERT, RoBERTa and
        </p>
        <p>
          XLNet) fine-tuned for Named-Entity Recognition using crowdsourced feature annotations
1Source code and API Swagger documentation available at: https://github.com/gessi-chatbots/RE-Miner-Hub.
2Source code and sample dataset available at: https://github.com/gessi-chatbots/RE-Miner-Dashboard.
generated by users in software recommendation platforms [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ]. T-FREX models are
available on HuggingFace3, allowing their integration by simply adding these models for
inference through an NLP pipeline using the Transformers Python module.
For emotion classification, we developed and employed the following methods:
• Encoder Classifier focuses on fine-tuning encoder-only LLMs (i.e., BERT, BETO) with
a document classification layer on top [
          <xref ref-type="bibr" rid="ref22">22</xref>
          ]. These models were fine-tuned using 2,000
annotated user-generated microblogs (e.g., tweets). This component weights each emotion
class, based on the probability distribution of a given review to reflect said property.
• Generative Classifier utilizes a GPT-3.5-Turbo model fine-tuned on a few-shot learning
setting using prompt engineering to extract emotions from mobile application reviews.
We used a reduced sample dataset of 100 internally annotated app reviews for the
finetuning process4. Contrarily to the Encoder Classifier, the outcome of this method is
restricted to the most probable emotion for a given review.
        </p>
        <p>For each task, users have the freedom to choose models that align with their preferences (i.e.,
context, dataset, computational resources...). RE-Miner-Hub exposes simple API methods to
perform both feature extraction and/or emotion classification on a batch of reviews.</p>
        <p>The modular design of RE-Miner-Hub provides flexibility in the list of available models,
enabling scalable integration of new models for both sentiment analysis and feature extraction
tasks. Moreover, within the RE-Miner ecosystem, a unified data model for reviews is used. This
ensures scalability when integrating other software components that operate with the same
data model. Lastly, the Hub acts as a flexible middle-ware between the RE-Miner-Dashboard
and various third-party APIs and software components. The only requirement for integrating
a new model into the RE-Miner-Hub is that it must be accessible via API (either the Hugging
Face inference API or a traditional REST API).</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. RE-Miner-Dashboard</title>
        <p>The RE-Miner-Dashboard is primarily designed as a visualization and analytical software
component. The dashboard, encompasses several key components: (1) a React front-end
application; (2) an authorization and authentication system; (3) an API Gateway responsible
for managing the access to the APIs; (4) a backend consisting of two APIs (handling reviews
and mobile apps, respectively) and a module dedicated to creating new user entities within the
database; and (5) a NoSQL document-based database.</p>
        <p>Upon user creation and the corresponding database entry, access permissions to the
application and associated APIs are granted. Users can upload individual applications or batches
of them, along with the reviews, which are then stored in the database. When users want to
analyze a review or a batch of reviews, the RE-Miner-Dashboard application sends a request
to the RE-Miner-Hub, specifying the task (e.g. feature extraction and/or sentiment analysis).
When receiving results from the RE-Miner-Hub system, they are stored in the database. This
3E.g.: https://huggingface.co/quim-motger/t-frex-bert-base-uncased (T-FREX models are referenced in model card).
4Prompt is available in GitHub repository. Full evaluation is yet to be conducted as depicted in Section 5.
seamless integration empowers users to use the analytical dashboard for engaging visual
analyses of their data, extracting meaningful insights from the reviews. Details on the use cases and
RE-Miner-Dashboard views are showcased in Section 4.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. User workflow</title>
      <p>The RE-Miner-Dashboard presents two main use cases for review analysis. These require users
to have completed the sign-up, login and upload of apps and reviews. Figure 2 illustrates a
snapshot of both use cases.</p>
      <p>1. Single-review analysis
a) Users can select a review for analysis from the Reviews &gt; View Reviews tab. Initiation
of the analysis process can be done by clicking the Process review button.
b) A modal wizard appears, guiding the user through the following steps:
i. The user selects the task/s and the method used for each task (e.g., GPT-3.5 for
emotion classification; T-FREX BERT base for feature extraction).</p>
      <p>ii. The reviews are submitted to the RE-Miner-Hub to initiate the review analysis.
c) After analysis, an icon will appear next to the processed review. By clicking it,
the user opens the Review Analyzer view, displaying the analysis results, including
detected emotions, emotion-marked sentences, and identified features within the
review text.
2. Batch-review analysis and visual analytics
a) The user should repeat steps 1.a) and 1.b) while selecting multiple (or all) uploaded
app reviews.
b) By navigating to the Dashboard tab, the following analytical charts can be found:
i. Sentiment Polar Area: aggregated sum of each emotion across all reviews.
ii. Top Features Histogram: aggregated sum of the most frequent extracted features.
iii. Features Over Time Chart: distribution of feature mentions over a time window.
iv. Sentiment Histogram: distribution of emotions (displayed in a stacked layout
based on frequency for each emotion class) over a time window.
v. Features/Emotions Chart: combined distribution of a set of feature mentions
with their associated emotions over a time window.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Evaluation plan</title>
      <p>
        Below we summarize the main steps of the planned (and ongoing) evaluation:
1–8
• Data collection and annotation. We built on our previous work on mobile app
repository mining to collect multiple reviews for a given domain [
        <xref ref-type="bibr" rid="ref23">23</xref>
        ], filtered and refined for the
feature extraction task [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ]. This dataset consists of 468 apps with 23,816 reviews within a 1
year time window, each with at least 1 crowdsourced annotated feature. We used a subset
of this data set to showcase the diferent use cases of RE-Miner as depicted in Section 4
and in the video demonstration. For emotion classification, we plan on conducting an
internal iterative annotation process of a subset of reviews through structured guidelines,
measuring annotation agreement and establishing solid evaluation criteria.
• Experimentation. We plan to conduct an empirical evaluation of all methods in
Section 3.1 by combining multiple cross-validation analyses using the complete data set
annotated with features and emotions. This entails quantitative ground-truth evaluation
to assess and compare the efectiveness of each technique. Additionally, we intend to
apply search-based algorithms (e.g., clustering) to infer how the aggregated analysis of
features and emotions can support requirements engineering tasks. For assessing the
overall software product quality of the tool, we will focus on performance eficiency and
usability as defined in ISO/IEC 25010 [
        <xref ref-type="bibr" rid="ref24">24</xref>
        ]. For performance eficiency, we will measure the
tool’s response time and scalability under varying workloads for data upload and feature
and emotion extraction tasks. For usability assessment, we plan to conduct user studies
to evaluate ease of interaction, explainability of results, and overall user satisfaction with
potential stakeholders.
      </p>
    </sec>
    <sec id="sec-6">
      <title>6. Related work</title>
      <p>
        D¸abrowski et al. recently concluded that automated combined analysis of feature extraction and
emotion classification is limited, especially in the use of innovative NLP techniques [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. Guzman
and Maleej described a syntactic and semantic based technique combining feature extraction
with lexicon-based polarity extraction on a sentence-level [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ]. This process is consolidated
through an LDA topic modelling method to infer high-level features and average sentiments
(i.e., positive vs. negative). A similar approach is depicted by Dragoni et al. [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ], depicting
a pipeline for the streamlined automated collection and analysis of user reviews to extract a
normalized polarity score. Beyond methods surveyed by D¸abrowski et al. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ], Gunaratman et al.
propose an automated app rating mechanism by weighting features and associated sentiments
on a feature level [
        <xref ref-type="bibr" rid="ref25">25</xref>
        ]. Finally, TransFeatEx integrates a sentiment analysis filter, whose use is
limited to filtering out extremely polarized reviews to avoid biased representation [
        <xref ref-type="bibr" rid="ref21">21</xref>
        ].
      </p>
      <p>In comparison with RE-Miner, main limitations of these approaches include: (1)
unavailability of full source code or distributed software; (2) limitations for full replication; (3) use of
traditional NLP techniques with respect to innovative methods; and (4) lack of data analytics
visualization beyond finer granularity analysis on a review level. To overcome these limitations,
RE-Miner software components are distributed including source code and packaged web
services. README files include instructions to install, deploy and integrate these components, as
well as a sample dataset to replicate the demo depicted in Section 4. Finally, while RE-Miner
does not exclude the use of traditional NLP methods, the current version integrates multiple
Transformer-based approaches for both feature extraction and sentiment analysis tasks.</p>
    </sec>
    <sec id="sec-7">
      <title>7. Conclusions and future work</title>
      <p>RE-Miner contributions are three-fold. First, we aim at providing a reusable tool to facilitate
integration and extension of NLP4RE tasks in the context of app review mining. Second, we
distribute software components as source code and as a standalone web application to integrate
and run review mining processes to analyze the output of these methods on a fine granularity
(i.e., review and sentence level) basis. Finally, we introduce a sample of simple data analytics to
support and evolve the evolution of a dashboard to visualize statistics on review descriptors
(i.e., features and emotions), both in isolation and combined.</p>
      <p>As ongoing future work, we are working on extending the analytical dashboard with clustering
and topic modelling techniques to provide higher levels of abstraction of clusters of features
and emotions. This visualization will be used to support user trend analysis, involving dense
centroids associated to a particular emotion or set of features. As a next action point, we plan
to extend current tasks by including content classification of these reviews as an additional
descriptor, focused on topic modelling and type of review (e.g., feature request, bug report,
praise...). Finally, from a maintainability perspective, we plan on extending each task with new
methods in the field, to facilitate replication studies following open-science principles.</p>
    </sec>
    <sec id="sec-8">
      <title>Acknowledgments</title>
      <p>With the support from the Secretariat for Universities and Research of the Ministry of Business
and Knowledge of the Government of Catalonia and the European Social Fund. This paper has
been funded by the Spanish Ministerio de Ciencia e Innovación under project / funding scheme
PID2020-117191RB-I00 / AEI/10.13039/501100011033.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>J.</given-names>
            <surname>Dąbrowski</surname>
          </string-name>
          , et al.,
          <article-title>Analysing app reviews for software engineering: a systematic literature review, Empirical Softw</article-title>
          .
          <source>Engg</source>
          .
          <volume>27</volume>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>C.</given-names>
            <surname>Iacob</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Harrison</surname>
          </string-name>
          ,
          <article-title>Retrieving and analyzing mobile apps feature requests from online reviews</article-title>
          ,
          <source>in: Int. Working Conference on Mining Software Repositories</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>W.</given-names>
            <surname>Maalej</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Nabil</surname>
          </string-name>
          ,
          <article-title>Bug report, feature request, or simply praise? On automatically classifying app reviews</article-title>
          ,
          <source>in: Int. Requirements Engineering Conference</source>
          ,
          <year>2015</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>D.</given-names>
            <surname>Pagano</surname>
          </string-name>
          ,
          <string-name>
            <surname>W. Maalej,</surname>
          </string-name>
          <article-title>User feedback in the appstore: An empirical study</article-title>
          ,
          <source>in: Int. Requirements Engineering Conference</source>
          ,
          <year>2013</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>L.</given-names>
            <surname>Villarroel</surname>
          </string-name>
          , et al.,
          <article-title>Release planning of mobile apps based on user reviews</article-title>
          ,
          <source>in: Proceedings - Int. Conference on Software Engineering</source>
          ,
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>T.</given-names>
            <surname>Johann</surname>
          </string-name>
          , et al.,
          <article-title>SAFE: A Simple Approach for Feature Extraction from App Descriptions and App Reviews</article-title>
          , in: Int. Requirements Engineering Conference,
          <year>2017</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>B.</given-names>
            <surname>Lin</surname>
          </string-name>
          , et al.,
          <article-title>Sentiment analysis for software engineering: How far can we go?</article-title>
          ,
          <source>in: Int. Conf. of Software Engineering</source>
          ,
          <year>2018</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <surname>J.</surname>
          </string-name>
          <article-title>D¸abrowski</article-title>
          , et al.,
          <article-title>Mining and searching app reviews for requirements engineering: Evaluation and replication studies</article-title>
          ,
          <source>Information Systems</source>
          <volume>114</volume>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Motger</surname>
          </string-name>
          , et al.,
          <string-name>
            <surname>T-FREX</surname>
          </string-name>
          :
          <article-title>A Transformer-based Feature Extraction Method from Mobile App Reviews</article-title>
          ,
          <source>in: Conference on Software Analysis, Evolution and Reengineering</source>
          ,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>S. L.</given-names>
            <surname>Ramaswamy</surname>
          </string-name>
          , J. Chinnappan, RecogNet-LSTM+
          <article-title>CNN: a hybrid network with attention mechanism for aspect categorization and sentiment classification</article-title>
          ,
          <source>Intel. Information Systems</source>
          <volume>58</volume>
          (
          <year>2022</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>F.</given-names>
            <surname>Palomba</surname>
          </string-name>
          , et al.,
          <article-title>Crowdsourcing user reviews to support the evolution of mobile apps</article-title>
          ,
          <source>Journal of Systems and Software</source>
          <volume>137</volume>
          (
          <year>2018</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>R.</given-names>
            <surname>Jongeling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Sarkar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Datta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Serebrenik</surname>
          </string-name>
          ,
          <article-title>On negative results when using sentiment analysis tools for software engineering research</article-title>
          ,
          <source>Empirical Software Engineering</source>
          <volume>22</volume>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhao</surname>
          </string-name>
          , et al.,
          <article-title>Natural language processing for requirements engineering: A systematic mapping study</article-title>
          ,
          <source>ACM Comput. Surv</source>
          .
          <volume>54</volume>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>E.</given-names>
            <surname>Guzman</surname>
          </string-name>
          , W. Maalej,
          <article-title>How do users like this feature? A fine grained sentiment analysis of App reviews</article-title>
          ,
          <source>in: Int. Requirements Engineering Conference</source>
          ,
          <year>2014</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>F. A.</given-names>
            <surname>Shah</surname>
          </string-name>
          , et al.,
          <article-title>Is the SAFE Approach Too Simple for App Feature Extraction? A Replication Study</article-title>
          , Lecture Notes in Computer Science (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M.</given-names>
            <surname>Dragoni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Federici</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Rexha</surname>
          </string-name>
          ,
          <article-title>An unsupervised aspect extraction strategy for monitoring real-time reviews stream</article-title>
          ,
          <source>Information Processing &amp; Management</source>
          <volume>56</volume>
          (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>H.</given-names>
            <surname>Wu</surname>
          </string-name>
          , et al.,
          <article-title>Identifying key features from app user reviews</article-title>
          ,
          <source>in: ICSE</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>P.</given-names>
            <surname>Nandwani</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Verma</surname>
          </string-name>
          ,
          <article-title>A review on sentiment analysis and emotion detection from text</article-title>
          ,
          <source>Social Network Analysis and Mining</source>
          <volume>11</volume>
          (
          <year>2021</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>P.</given-names>
            <surname>Ekman</surname>
          </string-name>
          , Basic Emotions, John Wiley &amp; Sons, Ltd,
          <year>1999</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          [20]
          <string-name>
            <given-names>D.</given-names>
            <surname>Carneros-Prado</surname>
          </string-name>
          , et al.,
          <article-title>Comparative Study of Large Language Models as Emotion and Sentiment Analysis Systems: A Case-Specific Analysis of GPT vs</article-title>
          .
          <source>IBM Watson, in: Int. Conference on Ubiquitous Computing &amp; Ambient Intelligence</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          [21]
          <string-name>
            <given-names>A.</given-names>
            <surname>Gállego</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Motger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Franch</surname>
          </string-name>
          ,
          <string-name>
            <surname>J. Marco,</surname>
          </string-name>
          <article-title>TransFeatEx: a NLP pipeline for feature extraction</article-title>
          ,
          <source>in: Joint proceedings of REFSQ-2023</source>
          ,
          <article-title>CEUR-WS</article-title>
          .org,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          [22]
          <string-name>
            <given-names>A. d.</given-names>
            <surname>Arriba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Oriol</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Franch</surname>
          </string-name>
          ,
          <article-title>Applying transfer learning to sentiment analysis in social media</article-title>
          ,
          <source>in: Int. Requirements Engineering Conference Workshops (REW)</source>
          ,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          [23]
          <string-name>
            <given-names>Q.</given-names>
            <surname>Motger</surname>
          </string-name>
          , et al.,
          <article-title>Mobile feature-oriented knowledge base generation using knowledge graphs</article-title>
          ,
          <source>in: New Trends in Advanced Database and Information Systems</source>
          ,
          <year>2023</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          [24]
          <article-title>ISO/IEC, Systems and software engineering - Systems and software Quality Requirements and Evaluation (SQuaRE) - System and software quality models ISO/IEC 25010</article-title>
          ,
          <string-name>
            <surname>Technical</surname>
            <given-names>Report</given-names>
          </string-name>
          ,
          <year>2011</year>
          . URL: https://iso25000.com/index.php/en/iso-25000-standards/iso-25010.
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          [25]
          <string-name>
            <given-names>I.</given-names>
            <surname>Gunaratnam</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Wickramarachchi</surname>
          </string-name>
          ,
          <article-title>Computational model for rating mobile applications based on feature extraction</article-title>
          ,
          <source>in: Int. Conference on Advancements in Computing</source>
          ,
          <year>2020</year>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>