<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Symbolic/Sub-Symbolic-Based Framework for Indicators Management⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Cristina Rossetti</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Politecnico di Torino, Corso Duca degli Abruzzi</institution>
          ,
          <addr-line>24, 10129 Torino</addr-line>
          ,
          <country country="IT">ITALIA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Università Politecnica delle Marche</institution>
          ,
          <addr-line>Piazza Roma, 22, 60121 Ancona</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2026</year>
      </pub-date>
      <abstract>
        <p>Indicators are a key tool for measuring and monitoring performances in several dimensions, such as economic, social and environmental areas. The integration and analysis of indicators data, collected from heterogeneous and distributed sources, pose considerable challenges. In this paper, a unified indicators management framework is proposed, which extends and enhances an existing Semantic Data Lake architecture. An automatic integration mechanism resolves heterogeneities by mapping data sources to a global schema, while a query-driven discovery system facilitates the exploration of the repository to locate information of interest. The data analysis process is further enriched with the explanation of query results and the inclusion of new functionalities for indicator analysis. In this regard, Symbolic Regression is employed to discover relationships between indicators and to support robustness analysis in the construction of composite indicators.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Indicators management</kwd>
        <kwd>Symbolic Regression</kwd>
        <kwd>Semantic Data Lake</kwd>
        <kwd>Robustness Analysis</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Indicators are essential tools for monitoring performances and supporting decision-making processes
across diferent areas, such as economic, social and environmental. The use of indicators by
organisations, institutions or countries allows them to measure the achievement of strategic goals in order to
make informed decisions [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ]. For instance, the United Nations (UNs) developed a framework based on
231 unique indicators aimed to monitor global progress towards the Sustainable Development Goals
(SDGs), gathering data from diverse institutions and organizations worldwide 1. Efectively managing
indicator data presents several challenges, starting with data collection. Given that indicators originate
from heterogeneous and scattered sources and are often produced using diferent methodologies, data
integration becomes a complex task. A related issue concerns the standardization of indicators. In fact,
stakeholders involved in data collection and decision-making processes could interpret the meaning of
certain indicators diferently, leading to ambiguity. For this purpose, semantic technologies, such as
ontologies, may serve as models to formally represent and define indicators and their components [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
Another major challenge lies in the analysis of indicators, which is essential to monitor their dynamics,
the identification of hidden patterns, the rank of entities, the discovery of analytical relationships
between diferent indicators, and so on. In this context, the use of advanced data analytics tools may
help making the most of indicators data. As such, data management systems as Data Warehouses (DWs)
or Data Lakes (DLs) provide ad hoc solutions to overcome data integration, exploration, and analysis
challenges. These systems can be further enhanced to support the entire data analytics pipeline—from
data ingestion to sophisticated analysis techniques, also with the integration of semantic approaches. A
particularly important aspect of indicator-based analysis is the construction of composite indicators
(CIs). When dealing with multidimensional phenomena, relying on diferent multiple indicators can
be impractical. Instead, these indicators can be aggregated into a single synthetic index, simplifying
comparisons and enhancing decision-making. The building of CIs has gained widespread adoption,
with the OECD establishing a ten-step framework [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ].
      </p>
      <p>
        The main objective of my research is to build a unified and global framework for the management
and analysis of (composite) indicators. In this regard, the idea is to rely on a Semantic Data Lake
(SDL) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ], a data management platform endowed with semantic technologies. The architecture is
provided with i) an automatic mapping mechanism for integrating heterogenous data sources; ii) a
query-driven discovery system for the retrieval of relevant information; iii) indicator management
functionalities. My research has contributed to improve the mapping process with neural symbolic
architectures and to extend the query answering system providing the user a better understanding of
the results. Despite existing modules in the SDL for indicator management, there is still limited support
for advanced indicator analysis and the construction of CIs. My current focus is on addressing these
gaps by employing Symbolic Regression (SR), which ofers two key benefits in this context. First, SR
algorithms enable the discovering of complex (non-linear) relationships between individual indicators.
Moreover, SR can enhance the CIs building, in particular the robustness analysis phase, in order to
assess variables importance.
      </p>
      <p>This paper is structured as follows: in Section 2 related works are discussed; the Section 3
presents the current state of the indicators management framework, briefly describing the SDL (3.1)
and the most recent developments (3.2); my main contributions regarding indicators’ analysis and
robustness analysis of CIs are addressed in Section 4; in Section 5 are the conclusions and future
directions.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Related Works</title>
      <p>
        Indicator management requires the development of ad hoc frameworks and architectures to meet the
challenges of data integration, discovery and analysis. A few works proposed implementations of
indicators frameworks with data management architectures (e.g., [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ], [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]). An important challenge
related to the management of indicators is to provide a common representation of them, thus facilitating
their standardization and usability. For this purpose, semantic technologies, such as ontologies and
Knowledge Graph (KG) have demonstrated their usefulness. For example, [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] enhanced a DL architecture
with the use of ontologies for policy support systems towards SDGs. The UNs [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] proposed an
ontologybased organization system for the representation and analysis of SDGs indicators. [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ] presented a
comprehensive survey on the use of semantic approaches in indicators management frameworks.
The authors claim that more eforts are needed to develop indicators management tools. Also, the
results of the survey show a lack of data management systems-based frameworks. For example, [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]
proposed an ontology-based DW system for business intelligence applications; [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] developed a smart
city ontology to support indicators storing and exploration in a DL. From this perspective, the SDL
proposed by [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] is a semantic-based DL with tools to support indicators uniform representation, their
management and related data sources discovery and exploration. As mentioned before, the SDL has
still room for improvement concerning indicators’ analysis support. The idea is to enhance the data
analytics pipeline with functionalities to discover new relationships among indicators. In this regard,
SR algorithms can be employed to find mathematical expressions representing explicit relations among
a set of indicators. SR difers from traditional regression techniques for its ability to simultaneously find
the functional structure and its parameters’ values, thus returning the analytic formula which best fits
a given dataset. SR has been widely used in many contexts of application, from material sciences and
physics to environmental and healthcare dimensions [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ]. Some works focused on applications of SR in
the analysis of performance indicators. [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] used SR to generate country-specific confidence indicators
estimating the economic growth. [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] employed SR to model the relationships between large-scale
temperature and several climate factors, such as greenhouse gases emissions. [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] aimed to identify
correlations between outdoor and indoor particulate emissions by means of SR in order to discover
behavioural trends in air pollutants. Hence, SR methods can support the retrieval of interesting formulas
within indicators data and can be especially useful to identify complex non-linear relationships [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ].
Concerning the construction of composite indices, to the best of our knowledge, no work has employed
SR algorithms to support robustness analysis.
      </p>
    </sec>
    <sec id="sec-3">
      <title>3. Current state of the framework</title>
      <sec id="sec-3-1">
        <title>3.1. The Semantic Data Lake</title>
        <p>
          The SDL [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] manages heterogeneous and distributed data sources by ingesting them without any
pre-processing step, as typically in the case of DLs. Its main components are the Knowledge Layer
and the Metadata Layer. The first aims to describe the semantics of information stored in the data
sources. A set of ontologies provides the vocabularies to represent these information, and comprehends
a KPIOnto ontology2 specifically tailored to define performance indicators and their formulas. A
KG built on the ontologies represents in a graph-based structure the concepts stored in data sources.
Additionally, the Knowledge Layer includes a set of logic programming rules, which allows to reason
over indicators’ formulas. The main functionalities are: formula manipulation, including formula
rewriting and equation solving; consistency check, allowing to evaluate whether a new formula is
coherent, i.e. does not contradict any already stored formulas, or is equivalent to the stored ones [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ].
The Metadata Layer manages metadata of data sources by storing them in a Metadata Graph (MG). A
mapping mechanism involving the KG and the MG links data sources metadata to KG concepts, thus
enabling data integration. Mappings are built automatically with the LSH Ensemble [
          <xref ref-type="bibr" rid="ref16">16</xref>
          ] algorithm
and only works with categorical attributes. At present, the data sources we are working with are
multidimensional cubes, which include indicators measured on a set of dimensions, i.e. perspectives of
analysis, such as temporal or geographical. Dimensions are structured on hierarchies of levels, allowing
data to be aggregated at diferent granularities (e.g., year, month or day for the temporal dimension).
On the top of the SDL architecture, a query-driven discovery module allows the end user to query
the SDL according to KG concepts. Thus, one can search for data sources containing indicators data
measured on certain dimensional levels, e.g. aggregate GDP at country and year level. As already
mentioned, the semantics of the indicators are stored in the KG, and their formulas are structured as
formula graphs. This, together with the reasoning engine, supports a query rewriting mechanism which
aims to generate additional queries for a given query based on the presence of formulas for specified
indicators. In fact, when a formula has other indicators as arguments, it is necessary to search for other
data sources containing these indicators. When query rewriting is activated, or there is simply more
than one indicator in the query, it may happen that the information on the requested indicators is
contained in diferent sources, thus requiring joins. Performing joins between diferent data sources
can be time-consuming and computationally expensive, especially when multiple joins are required
and the resulting data sources are many and huge. For this reason, the query answering system of
the SDL is provided with a joinability index (JI) which estimates join cardinality, also when the join is
multi-attribute, by means of the LSH Ensemble algorithm.
        </p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Architecture improvements</title>
        <p>
          Data integration is performed via automatic mappings computed with LSH Ensemble applied
over Minhash signatures of data sources attributes and KG concepts. Despite the eficiency of the
algorithm, the process can be improved both in terms of efectiveness and the possibility of mapping
non-categorical attributes. One of my on-going work focuses on employing a neuro-symbolic
architecture, namely Logic Tensor Network (LTN) [
          <xref ref-type="bibr" rid="ref17">17</xref>
          ], to perform data integration. The training
process is enhanced by the ingestion of a prior knowledge, which in our case is represented by the KG,
and the prediction output is the degree of satisfaction of a certain attribute mapped to a KG concept.
The SDL has also been recently extended with the enrichment of the query-driven discovery module.
Since the JI allows to rank the returned combinations based on their estimated cardinalities, the user
has the possibility of choosing the best solutions in terms of the amount of information contained.
2https://kdmg.dii.univpm.it/kpionto/specification
In order to enhance the understanding of results, we proposed to employ data sources profiles (i.e.,
metadata on data distribution) in order to estimate the informative content of the joins. This, together
with the JI-based classification, provides the user with a comprehensive overview of the discovered
data sources, thus enhancing the exploratory power of the query answering system.
        </p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Indicator analysis and building</title>
      <sec id="sec-4-1">
        <title>4.1. Formula discovery</title>
        <p>The query-driven discovery system of SDL allows to find the best data sources containing the indicators
of interest. The reasoning engine enables the retrieval of formulas of the indicators in the query, thus
discovering additional data sources. This allows to further support the discovery of interesting relations
in data, enriching the analysis of indicators. Anyway, the availability of formulas for an indicator
is not always straightforward. In this regard, we propose the use of SR in the context of formulas
discovery, as it allows to retrieve explicit and interpretable analytical relationships between multiple
indicators without forcing a pre-defined functional structure. Furthermore, unlike traditional regression
techniques, SR allows the discovery of hidden (non-linear) relationships within the data without the
need for prior knowledge. For example, suppose that there is a user who is interested in analyzing air
pollution in Italian cities and wants to look for possible relationships with the trafic of vehicles. The
SDL returns data sources containing values of the air quality index (AQI) averaged by city and day.
The KG does not contain formulas related to AQI, but the user suspects some relations with a stored
indicator, namely the average number of vehicles (ANV). With another query, the user retrieves a data
source containing ANV data aggregated by Italian cities and day. The user presumes that as average
trafic increases, so does the level of pollution in the air, but does not know the mathematical model
relating the two indicators. Using ANV as the independent variable and AQI as the dependent variable
(target), the SR algorithm returns the best expression representing the relations between ANV and AQI,
without the need of assuming any functional structure.</p>
        <p>When the SR algorithm finds a new expression relating a given indicator to other indicators, this formula
can be stored in the KG. The reasoning engine can support this process by evaluating the goodness of
the expression in terms of coherence and consistency with existing formulas.</p>
      </sec>
      <sec id="sec-4-2">
        <title>4.2. Robustness Analysis</title>
        <p>
          CIs are synthetic indices obtained through the aggregation of multiple single indicators, with the
aim of measuring some complex multidimensional phenomenon. Among the ten main steps of CIs
building, we are currently focusing on robustness analysis. When a developer constructs a composite
index, the choices made (e.g., weighting method) can heavily influence the soundness of the result.
Unsuitable choices can lead to meaningless CIs values and thus to misleading conclusions by
decisionand policy-makers [
          <xref ref-type="bibr" rid="ref18">18</xref>
          ]. Robustness analysis evaluates the quality of the resulting index by assessing
how the inputs of the process (e.g., methodological choices) impact on the output. The main idea
consists of employing SR to perform robustness analysis by computing the CI formula and identifying
the contributions of the input variables, i.e. the single indicators. Target values are obtained through
well-established formulas for CIs, such as the Adjusted Mazziotta Pareto Index (AMPI) [
          <xref ref-type="bibr" rid="ref19">19</xref>
          ], and the SR
tries to retrieve the corresponding expressions. Here, robustness analysis can be performed in two main
steps. The first concerns the analysis of formulas retrieved by the SR algorithm, in order to identify the
selected variables and their functional form in the formula, as well as their polarity (whether they
contribute positively or negatively to the output). The idea is that variables with simpler functional
forms (e.g., linear contributions) and returned with the right polarity might be the most influential. The
second step consists of using a machine learning model, more likely a neural network (NN), to evaluate
whether the selected subset of indicators represents the most important variables for the CI. If the
prediction error of the NN is smaller when the predictors are the selected indicators compared to the
error obtained with the remaining ones, then it means that the chosen subset of indicators has a high
probability of being the ones with the highest predictive power, thus the most influential.
        </p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Conclusions and future directions</title>
      <p>The main goal of my work is to build a (composite) indicator framework with symbolic and sub-symbolic
approaches, starting from an existing SDL. A KG built on ontologies makes it possible to formally
define and represent indicators and their formulas and to facilitate the process of data integration
and exploration. The automatic mappings between the KG and MG, designed for data integration
and originally performed with the LSH Ensemble algorithm, can be improved with the use of
neurosymbolic architectures. The query-driven discovery system enables the discovery of data sources
containing indicators of interest, also communicating with the reasoner engine to retrieve additional
data if formulas are available. The results can be interpreted in terms of both cardinality and information
content estimation, thus improving user understanding. In order to support the analysis of indicators,
the SR can be employed for a twofold purpose. First, it enables formula discovery, which consists in
retrieving new relationships among indicators stored in the KG. Then, it is used in robustness analysis to
construct CIs, finding out which variables are most important. Future directions of my research relates
diferent parts of the indicators management framework. It is intended to further test, and if possible
improve, the application of LTNs for data integration. The other objective is to better integrate the
reasoning engine with the application of SR, for example by using logic programming rules to validate,
simplify and compare resulting expressions from SR algorithms. Concerning indicators’ analysis, the
proposal of SR-based formula discovery has to be implemented and tested, while the SR-based robustness
analysis still has to be structured at the methodological and experimental level.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Acknowledgments</title>
      <p>Cristina Rossetti has received funding from the MUR – DM 118/2023 as part of the project PNRR-NGEU.</p>
    </sec>
    <sec id="sec-7">
      <title>Declaration on Generative AI</title>
      <p>The author(s) have not employed any Generative AI tools.
Participating in this Doctoral Consortium represents a unique opportunity for me to enrich my research
career. First of all, it would allow me to introduce myself to a national community of experts and
colleagues, who could give me valuable feedback and advice for my field of study, the work done so far
and my possible future directions. I also believe that confronting an audience of experts can improve
my communication and presentation skills. Participating in this event also gives me the opportunity to
expand my network of contacts and is an ideal environment to establish collaborations.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>C.</given-names>
            <surname>Diamantini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Potena</surname>
          </string-name>
          , E. Storti,
          <article-title>Semantic models of performance indicators: A systematic survey</article-title>
          ,
          <source>ACM Computing Surveys</source>
          (
          <year>2025</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Centre</surname>
          </string-name>
          ,
          <article-title>Handbook on constructing composite indicators: Methodology and user guide</article-title>
          ,
          <source>OECD publishing</source>
          ,
          <year>2008</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>C.</given-names>
            <surname>Diamantini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Potena</surname>
          </string-name>
          , E. Storti,
          <article-title>Analytic processing in data lakes: A semantic query-driven discovery approach</article-title>
          ,
          <source>Information Systems Frontiers</source>
          (
          <year>2024</year>
          ).
          <source>doi:10.1007/s10796-024-10471-4.</source>
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Maté</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Trujillo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Mylopoulos</surname>
          </string-name>
          ,
          <article-title>Specification and derivation of key performance indicators for business analytics: A semantic approach</article-title>
          ,
          <source>Data &amp; Knowledge Engineering</source>
          <volume>108</volume>
          (
          <year>2017</year>
          )
          <fpage>30</fpage>
          -
          <lpage>49</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>A.</given-names>
            <surname>Nasiri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Ahmed</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Wrembel</surname>
          </string-name>
          , E. Zimányi,
          <article-title>Requirements engineering for data warehouses (re4dw): From strategic goals to multidimensional model</article-title>
          , in: Advances in Conceptual Modeling: ER 2017
          <string-name>
            <surname>Workshops</surname>
            <given-names>AHA</given-names>
          </string-name>
          , MoBiD, MREBA, OntoCom, and
          <string-name>
            <surname>QMMQ</surname>
          </string-name>
          , Valencia, Spain, November 6-
          <issue>9</issue>
          ,
          <year>2017</year>
          , Proceedings 36, Springer,
          <year>2017</year>
          , pp.
          <fpage>133</fpage>
          -
          <lpage>143</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kulkarni</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bassin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. S.</given-names>
            <surname>Parasa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V. E.</given-names>
            <surname>Venugopal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Srinivasa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Ramanathan</surname>
          </string-name>
          ,
          <article-title>Ontology augmented data lake system for policy support</article-title>
          ,
          <source>in: International Conference on Big Data Analytics</source>
          , Springer,
          <year>2022</year>
          , pp.
          <fpage>3</fpage>
          -
          <lpage>16</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Joshi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. G.</given-names>
            <surname>Morales</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Klarman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Stellato</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Helton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lovell</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Haczek</surname>
          </string-name>
          ,
          <article-title>A knowledge organization system for the united nations sustainable development goals</article-title>
          ,
          <source>in: The Semantic Web: 18th International Conference, ESWC</source>
          <year>2021</year>
          ,
          <string-name>
            <given-names>Virtual</given-names>
            <surname>Event</surname>
          </string-name>
          , June 6-10,
          <year>2021</year>
          , Proceedings 18, Springer,
          <year>2021</year>
          , pp.
          <fpage>548</fpage>
          -
          <lpage>564</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>G.</given-names>
            <surname>Xie</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Qiu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Pan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <article-title>Eiaw: towards a business-friendly data warehouse using semantic web technologies</article-title>
          , in: International Semantic Web Conference, Springer,
          <year>2007</year>
          , pp.
          <fpage>857</fpage>
          -
          <lpage>870</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bagozi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Bianchini</surname>
          </string-name>
          , V. De Antonellis,
          <string-name>
            <given-names>M.</given-names>
            <surname>Garda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Melchiori</surname>
          </string-name>
          ,
          <article-title>Personalised exploration graphs on semantic data lakes, in: On the Move to Meaningful Internet Systems: OTM 2019 Conferences: Confederated International Conferences: CoopIS</article-title>
          , ODBASE,
          <string-name>
            <surname>C</surname>
          </string-name>
          &amp;
          <article-title>TC 2019, Rhodes</article-title>
          , Greece,
          <source>October 21-25</source>
          ,
          <year>2019</year>
          , Proceedings, Springer,
          <year>2019</year>
          , pp.
          <fpage>22</fpage>
          -
          <lpage>39</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D.</given-names>
            <surname>Angelis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Sofos</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. E.</given-names>
            <surname>Karakasidis</surname>
          </string-name>
          ,
          <article-title>Artificial intelligence in physical sciences: Symbolic regression trends and perspectives</article-title>
          ,
          <source>Archives of Computational Methods in Engineering</source>
          <volume>30</volume>
          (
          <year>2023</year>
          )
          <fpage>3845</fpage>
          -
          <lpage>3865</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>O.</given-names>
            <surname>Claveria</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Monte</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Torra</surname>
          </string-name>
          ,
          <article-title>Economic forecasting with evolved confidence indicators</article-title>
          ,
          <source>Economic Modelling</source>
          <volume>93</volume>
          (
          <year>2020</year>
          )
          <fpage>576</fpage>
          -
          <lpage>585</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>K.</given-names>
            <surname>Stanislawska</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Krawiec</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z. W.</given-names>
            <surname>Kundzewicz</surname>
          </string-name>
          ,
          <article-title>Modeling global temperature changes with genetic programming</article-title>
          ,
          <source>Computers &amp; Mathematics with Applications</source>
          <volume>64</volume>
          (
          <year>2012</year>
          )
          <fpage>3717</fpage>
          -
          <lpage>3728</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>R. R.</given-names>
            <surname>Karri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Heibati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Yusup</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Rafatullah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Mohammadyan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Sahu</surname>
          </string-name>
          ,
          <article-title>Modeling airborne indoor and outdoor particulate matter using genetic programming</article-title>
          ,
          <source>Sustainable Cities and Society</source>
          <volume>43</volume>
          (
          <year>2018</year>
          )
          <fpage>395</fpage>
          -
          <lpage>405</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>L.</given-names>
            <surname>Kammerer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Kronberger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Burlacu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. M.</given-names>
            <surname>Winkler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kommenda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Afenzeller</surname>
          </string-name>
          ,
          <article-title>Symbolic regression by exhaustive search: Reducing the search space using syntactical constraints and eficient semantic structure deduplication, Genetic programming theory and practice XVII (</article-title>
          <year>2020</year>
          )
          <fpage>79</fpage>
          -
          <lpage>99</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>C.</given-names>
            <surname>Diamantini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Potena</surname>
          </string-name>
          , E. Storti,
          <article-title>Sempi: A semantic framework for the collaborative construction and maintenance of a shared dictionary of performance indicators</article-title>
          ,
          <source>Future Generation Computer Systems</source>
          <volume>54</volume>
          (
          <year>2016</year>
          )
          <fpage>352</fpage>
          -
          <lpage>365</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>E.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Nargesian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. Q.</given-names>
            <surname>Pu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. J.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <article-title>Lsh ensemble: Internet-scale domain search</article-title>
          ,
          <source>arXiv preprint arXiv:1603.07410</source>
          (
          <year>2016</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>S.</given-names>
            <surname>Badreddine</surname>
          </string-name>
          , A. d. Garcez,
          <string-name>
            <given-names>L.</given-names>
            <surname>Serafini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Spranger</surname>
          </string-name>
          ,
          <article-title>Logic tensor networks</article-title>
          ,
          <source>Artificial Intelligence</source>
          <volume>303</volume>
          (
          <year>2022</year>
          )
          <fpage>103649</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>S.</given-names>
            <surname>Greco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ishizaka</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Tasiou</surname>
          </string-name>
          , G. Torrisi,
          <article-title>On the methodological framework of composite indices: A review of the issues of weighting, aggregation, and robustness</article-title>
          ,
          <source>Social indicators research 141</source>
          (
          <year>2019</year>
          )
          <fpage>61</fpage>
          -
          <lpage>94</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          [19]
          <string-name>
            <given-names>M.</given-names>
            <surname>Mazziotta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pareto</surname>
          </string-name>
          ,
          <article-title>On a generalized non-compensatory composite index for measuring socio-economic phenomena</article-title>
          ,
          <source>Social indicators research 127</source>
          (
          <year>2016</year>
          )
          <fpage>983</fpage>
          -
          <lpage>1003</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>