<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Exploiting Outlier Explanation to Unveil Key-aspects of High Green Comparative Advantage Nations</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Fabrizio</forename><surname>Angiulli</surname></persName>
							<email>f.angiulli@unical.it</email>
							<affiliation key="aff0">
								<orgName type="department">DIMES</orgName>
								<orgName type="institution">University of Calabria</orgName>
								<address>
									<addrLine>via Pietro Bucci, Rende (CS)</addrLine>
									<postCode>87036</postCode>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Fabio</forename><surname>Fassetti</surname></persName>
							<email>f.fassetti@unical.it</email>
							<affiliation key="aff0">
								<orgName type="department">DIMES</orgName>
								<orgName type="institution">University of Calabria</orgName>
								<address>
									<addrLine>via Pietro Bucci, Rende (CS)</addrLine>
									<postCode>87036</postCode>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Simona</forename><surname>Nisticò</surname></persName>
							<email>s.nistico@unical.it</email>
							<affiliation key="aff0">
								<orgName type="department">DIMES</orgName>
								<orgName type="institution">University of Calabria</orgName>
								<address>
									<addrLine>via Pietro Bucci, Rende (CS)</addrLine>
									<postCode>87036</postCode>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Luigi</forename><surname>Palopoli</surname></persName>
							<email>l.palopoli@unical.it</email>
							<affiliation key="aff0">
								<orgName type="department">DIMES</orgName>
								<orgName type="institution">University of Calabria</orgName>
								<address>
									<addrLine>via Pietro Bucci, Rende (CS)</addrLine>
									<postCode>87036</postCode>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Exploiting Outlier Explanation to Unveil Key-aspects of High Green Comparative Advantage Nations</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">782BC96CC7C1D61C71F356ED0FA6665F</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:38+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Outlier Explanation, Green Economy, Low Carbon Technologies Comparative Advantage Conceptualization, Investigation, Methodology, Validation, Writing -Review &amp; Editing</term>
					<term>S.N.: Conceptualization, Investigation, Methodology, Software, Validation, Data curation, Writing -Original Draft Preparation</term>
					<term>L.P.: Validation, Writing -Review &amp; Editing Orcid 0000-0002-9860-7569 (F. Angiulli)</term>
					<term>0000-0002-8416-906X (F. Fassetti)</term>
					<term>0000-0002-7386-2512 (S. Nisticò)</term>
					<term>0000-0003-4915-5137 (L. Palopoli)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Climate change is observable in the drastic modification of world ecosystems and weather patterns. The potential effects of this phenomenon make the research of successful strategies to delimit the problem an absolute priority. Objectives 7, 12 and 13 of the United Nations' Agenda 2030 <ref type="bibr" target="#b0">[1]</ref> are only some examples of the importance of this problem on a worldwide scale. The development and diffusion of low-carbon technologies are among the key points in politics against climate change due to the massive impact human activities have on carbon emissions.</p><p>Deep Learning techniques, currently widely used in many aspects of everyday life, can also help in this field. This work, in particular, aims to demonstrate the effectiveness of M 2 OE, a transformation-based outlier explanation technique, in extracting actionable explanations in the green economy context. Specifically, we analyze the Low Carbon Technologies Comparative Advantage, an index measuring the relative economic advantage in developing low carbon technologies, by looking at the nations exhibiting a high comparative advantage to qualitatively evaluate the insights the method provides to the user.</p><p>To this aim, we have gathered data concerning 7 indicators related to the comparative advantage of low-carbon technologies in the 2019-2021 time period. This data extraction work has resulted in the Green Comparative Advantage (GreenCA) tabular data set, in which the information retrieved for the reference time horizon is organized and summarized. By a set of experiments exploiting this data collection together with the M 2 OE method, we catch a glimpse to gain interesting insights about which politics are successful in promoting a change in favour of green energies.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>The worrying frequency of extraordinary natural events is making evident the climate change problem, which is dramatically marking the planet's equilibrium and our ecosystems and is affecting human life <ref type="bibr" target="#b1">[2]</ref>. Timely actions and a change in lifestyle and environmental politics are required to mitigate the effect of a problem primarily caused by human activities. Thanks to its power to push technology and economics forward to more sustainable models, politics has a main role in attenuating the climate change issue <ref type="bibr" target="#b2">[3]</ref>. Fortunately, the undeniable evidence of the above-introduced problem has inducted world countries to commit their efforts to discuss effective strategies to promote policy, technologies and behaviours tailored to reduce the CO 2 emissions, that are causing this phenomenon. The annual United Nations Climate Change Conference and the Agenda 2030 7, 12 and 13 goals focused respectively on affordable green energy, responsible consumption and production and climate changes <ref type="bibr" target="#b0">[1]</ref>, testify a spread in the interest of world politics in discussing environmental subjects and in the green economy, thus, considering both the economical and the sustainability aspects.</p><p>As a consequence of socioeconomic, geography and morphology differences, each county's government implements different policies to deal with the exigence of a more sustainable lifestyle and economy. Unfortunately, not all the strategies adopted are equally effective in reaching the goal of promoting the reduction of CO 2 emissions. In this regard, always looking at the green economy matter, decision-makers could be facilitated by having techniques providing insights about the aspect characterizing the policies of countries attaining sustainability goals, since it potentially provides them with the instruments to make more informed explanations.</p><p>In this paper, we aim to witness the effectiveness of M 2 OE <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>, a transformation-based Outlier Explanation technique, in gaining actionable explanations in the green economy context. In particular, we qualitatively analyze the actionability of explanations related to the Low Carbon Technologies Comparative Advantage (hereafter referred to as Green Comparative Advantage), an index measuring the relative advantage of a country in producing low-carbon technologies. We have chosen this index for its potential to promote investments in technologies reducing the environmental impact and thus to test the effectiveness of the M 2 OE explanations in a relevant real-world scenario.</p><p>To this aim, we have collected and arranged the data shared by the International Monetary Fund containing information relating to environmental taxes, investment in environmental protection, fossil fuel subsidies, energy (both renewable and non-renewable), forests and trade in low-carbon technologies products. Our efforts resulted in the Green Comparative Advantage (GreenCA) dataset providing in a tabular shape, a rich and, hopefully, complete overview of the policies applied and peculiarities of a set of countries. More specifically we have collected information for 54 countries, more details are going to be provided in section 2. The number of samples skewed toward countries with a low green comparative advantage makes it suitable to be analyzed in an outlier explanation setting by considering the less numerous class of countries with high comparative advantage as outliers.</p><p>Given reference data considered as "normal" and one or more outlying samples, the goal of the previously referred Outlier Explanation task is figuring out the aspects characterizing point outlierness and, thus what makes the analyzed sample or groups behave differently from the rest of the data. Two are the most diffused ways to approach this problem. The first of them shapes the considered problem as the search of the set of features characterizing the outlying samples or associating a score to each feature by, for example, using separability as a quality criterion <ref type="bibr" target="#b5">[6]</ref>, finding invertible projections that make the outlying sample better recognisable and then obtaining the features contributing to this mapping <ref type="bibr" target="#b6">[7]</ref>, or by leveraging features selection methods equipping them with properly-designed sampling methods to deal with extremely unbalanced data <ref type="bibr" target="#b7">[8]</ref>. Other approaches perform the selection through outlierness metrics based on the relative frequency of value combinations, applied to single outliers in datasets having categorical <ref type="bibr" target="#b8">[9]</ref> or continuous features <ref type="bibr" target="#b9">[10]</ref>, or groups of anomalies <ref type="bibr" target="#b10">[11]</ref>. Alternatively, find exceptional values by estimating the distribution of the value frequencies <ref type="bibr" target="#b11">[12]</ref>. In the second instead, the features are ranked and, for each of them a score is provided, by using scoring criteria ranging from the distance between the studied sample and its k-nearest neighbours <ref type="bibr" target="#b12">[13]</ref>, measures extracted through kernel density estimation <ref type="bibr" target="#b13">[14]</ref> or other criteria studied to be dimensional unbiased, thus not dependent from the number of features of the sample <ref type="bibr" target="#b14">[15]</ref>.</p><p>Beyond this information, the here-considered M 2 OE technique <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref> is tailored to extract richer insights by considering transformation-based explanations <ref type="bibr" target="#b15">[16]</ref>, whose goal is to find a group of features to change and for each of them a value describing how to change the value of that feature. It follows that it potentially gives actionable explanations guiding decision-makers to change the observed status making appropriate changes to the features available, which are particularly useful in real-world contexts like that considered in this paper.</p><p>The rest of the paper is structured as follows. Section 2 presents the GreenCA dataset by describing the data collection, the building process and the information included, Section 3 presents the M 2 OE Outlier Explanation method, Section 4 qualitatively evaluates the actionability of the collected explanations and, finally, Section 5 draw the conclusion of this work.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">The Green Comparative Advantage dataset</head><p>As already stated in this paper's introduction, designing effective strategies to promote green and sustainable technologies is crucial to follow the right path to have a less impacting society and lifestyle and to try to remedy the negative effects observed as a consequence of global pollution. To gain insights into the most effective way to make the development and adoption of low-carbon technologies advantageously from an economic perspective, thus, hopefully, to incentive this category of technologies, we want to study which are the differences characterizing countries having a high comparative advantage from green technologies in comparison with the others.</p><p>To look out at the insights given by M 2 OE on what characterizes counties exhibiting a high comparative advantage with the purpose of checking their usefulness, data for 6 mitigation indicator groups has been collected from the International Monetary Fund Climate Dashboard, which includes information about national policies to contain and reduce carbon emissions, shaped as time series. Deeper into details, the indicator groups considered are the following:</p><p>• Environmental Taxes (ET) <ref type="bibr" target="#b16">[17]</ref>: Charges levied by measuring, through a physical unity or some proxy criterion, something that is proven to harm the environment. • Environmental Protection Expenditures (EP) <ref type="bibr" target="#b17">[18]</ref>: Money amount invested in environmental protection activities like, for example, waste management, pollution abatement and biodiversity and landscape protection. • Forest and Carbon (FC) <ref type="bibr" target="#b18">[19]</ref>: Data about Forest Extends and Carbon stored by forests providing a high-level summary of the forest state in each country. • Fossil Fuel Subsidies (FF) <ref type="bibr" target="#b19">[20]</ref>: Estimated values of explicit and implicit government subsidies.</p><p>• Renewable Energy (RE) <ref type="bibr" target="#b20">[21]</ref>: Information about electricity generation and electricity installed capacity, where energy is classified as renewable or non-renewable. • Trade in Low-carbon Technology Products (TT) <ref type="bibr" target="#b21">[22]</ref>: Data about trade in low carbon technology product.</p><p>The original data sources for each indicator group consist of sets of time series, each of them taking care of reporting the features for one nation and one kind of measure. In the following, we describe the process performed to extract analyzed data. Table <ref type="table" target="#tab_0">1</ref> reports an overview of the features included in the here presented data collection.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Data collection methodology</head><p>The GreenCA dataset presented in this work comes as the upshot of a data collection and summarization process. Indeed, the International Monetary Fund Climate Dashboard provides users with varied, and, sometimes, redundant data, to make them usable for different kinds of analytics. The objective is to rationalize data through reshaping and filtering operations to obtain a two-class tabular dataset. Data is divided using the Green Comparative Advantage as a discriminant feature. This value represents the economic advantage over the other nations in exporting low-carbon technologies, which consists of all the technological products tailored to reduce the impact of human activities on the environment. Surveyed nations are distinguished between those exhibiting a high Green Comparative Advantage, thus that have a value greater than 1 for this index, to which we assign the target label 1, and those instead having a value lower than 1, to which the label 0 is assigned.</p><p>To avoid scale problems, when more than one unit is available for the considered indicator and when applicable, we consider only information from records measuring the analyzed indicator as a percentage.</p><p>We selected the more recent three-year period satisfying data availability, so, in the presented dataset, we chose the 2019-2021 years as the target period for our analysis. However, the described data processing procedure can be applied to updated data by considering a different time horizon to obtain an up-to-date version of this dataset. To summarize the information relating to the considered period we apply, for each indicator, the mean operation to the values for the considered years. To prevent working with null data, we drop from the 104 registered nations those with at least one unspecified value. The above-described data extraction procedure resulted in a dataset containing information from 54 countries, among which 16 have a high comparative advantage and the remaining a low comparative advantage. Table <ref type="table" target="#tab_0">1</ref> lists the set of features available, where a description, the unity of the measurement and the connected thematic area are reported for each feature.</p><p>Since there are only a few outstanding countries in which low-carbon technologies are economically advantageous, the dataset presented in this work can be looked at as an outlier detection dataset, in which the minority class can be seen as anomalous. The GreenCA dataset presented in this section can be found at the following link https://www.kaggle.com/datasets/simonanistico/greenca.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">M 2 OE</head><p>Masking Models for Outlier Explanation, shortly M 2 OE, tackles the Outlier Explanation problem by providing the user with transformation-based explanations, describing the outlier peculiarities by suggesting alterations that, applied to the analyzed outlier, makes it behaving similar to normal samples. More in detail, given an object 𝑜 ∈ 𝐷𝑆, the explanation consists of a set 𝑒 of feature-value pairs {(𝑓 𝑖 1 , 𝑣 𝑖 1 ), … , (𝑓 𝑖 𝑘 , 𝑣 𝑖 𝑘 )} codifying a transformation 𝑡 𝑒 (𝑜) of 𝑜 resulting in a new object 𝑜 ′ such that 𝑜 ′ [𝑓 𝑗 ] = 𝑜[𝑓 𝑗 ] + 𝑣 𝑗 for 𝑗 ∈ {𝑖 1 , … , 𝑖 𝑘 } and 𝑜 ′ [𝑓 𝑗 ] = 𝑜[𝑓 𝑗 ] otherwise. The just-described transformation potentially represents an actionable explanation providing users with insights on how to change the features of the outlier to make it act as a normal sample.</p><p>The pipeline proposed to find the previously introduced explanation form is depicted by Figure <ref type="figure" target="#fig_0">1</ref>. As To compute this set of explanations, we collect the set 𝐶 consisting of all the choices associated with objects of the reference set 𝑅𝑆 and find the frequent itemsets 𝐹 𝐶 (in our context each feature represent an item). Then, for each of the frequent itemsets found (representing a frequent choice) 𝑓 in 𝐹 𝐶 we apply a clustering algorithm to the reference set samples whose corresponding choice contains 𝑓, in particular, in this work, we have leveraged DBSCAN <ref type="bibr" target="#b22">[23]</ref>. After this clustering step, we take for each cluster its medoid as a representative point used to find a mask related to that set of samples, which, together with 𝑓, is one of the explanations provided to the user.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Explanation computation</head><p>Both the previously referred Choice Generator and Mask Applier networks consist of feed-forward dense neural networks having 𝑙 𝑔 ≥ 3 layers having a number 𝑛 𝑔1 ⋅ 𝑑 (𝑛 𝑔1 ≥ 3) of neurons. The layers of the latter of the two modules have linear activation functions, while, for the first neural network, the hidden layers are equipped with a ReLU activation function and the output layer with a sigmoid. This results in returning the 𝑑-dimensional real-valued choice vector c , having values c 𝑖 ∈ [0, 1] which are eventually converted into a binary format 𝑐 𝑖 ∈ {0, 1} through a thresholding operation.</p><p>To carry out the training, M 2 OE has to compute a statistic vector 𝑠 on 𝑅𝑆, whose 𝑖-th feature (1 ≤ 𝑖 ≤ 𝑑) is the mean feature-wise squared differences between normal points:</p><formula xml:id="formula_0">𝑠 𝑖 = 2 𝑘(𝑘 − 1) ∑ 𝑟,𝑟 ′ ∈𝑅𝑆 (𝑟 𝑖 − 𝑟 ′ 𝑖 ) 2 .</formula><p>Given this vector, the outlier 𝑜 and the reference sample 𝑟, the loss function leading the neural networks training is the following:</p><formula xml:id="formula_1">ℒ (𝑜, 𝑟) = 𝛼 1 ⋅ ∑ 𝑑 𝑖=1 𝑠 𝑖 ⋅ c 𝑖 [∑ 𝑑 𝑖=1 (𝑜 𝑖 − 𝑟 𝑖 ) 2 ⋅ c 𝑖 ] + 𝜖 + 𝛼 2 ⋅ 𝑑 ∑ 𝑖=1 [( õ ′ 𝑖 − 𝑟 𝑖 ) 2 ⋅ c 𝑖 ] + 𝛼 3 ⋅ || c || 2<label>(1)</label></formula><p>in which 𝛼 1 , 𝛼 2 and 𝛼 3 are values used to weigh the contributions of the three terms appearing in the loss function, 𝜖 is a small constant to avoid division by zero and õ ′ is the sample resulting from the transformation application. The three-fold objective of this loss is to find the subspaces in which the outlier deviates most from the normal samples (first term) while making the transformed outlier õ ′ as similar as possible to the normal samples (second term) and keeping the number of features included in the explanation low (last term).</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Case of study</head><p>In our previous works <ref type="bibr" target="#b3">[4,</ref><ref type="bibr" target="#b4">5]</ref>, the performances of M 2 OE have been thoroughly discussed. In particular, it has been shown that, despite not being specifically tailored to search for subspace-based explanations, the quality of the set of features included in the transformation is in almost all cases better than those of the competitors involved in our experiments, namely ATOM and COIN, or at least comparable. Furthermore, we have shown that the transformations provided as an explanation, a novelty in the outlier explanation field, can get the outlier closer to behaving as a normal sample. In this section, our efforts are focused on observing the explanations provided by M 2 OE to check the insights they offer and their actionability. To carry out this experiment we have used the dataset described in Section 2, which is presented for the first time in this work. In our Outlier Explanation setting, countries showing a high Green Comparative Advantage are the outlier samples whose behaviour is under study. So, to summarize, we consider 16 countries exhibiting a high relative advantage in exporting low-carbon technologies in which an economic boost potentially pushes forward the development of these low-impacting technologies. For this analysis, the M 2 OE method is set up as follows. Due to the small number of samples available, we consider as a reference dataset all the samples not belonging to the studied group, so 38 countries. The neural modules being part of the Generative Neural Module are trained for 30 epochs, with loss weights equal to 1.0, 1.2 and 0.3 respectively, and a learning rate equal to 0.001.</p><p>The results of the explanation for the considered countries are depicted in Figure <ref type="figure">2</ref>, where, for each of them, the characterizing features are listed by showing the alteration suggested to make the comparative advantage of that nation low. To improve the delivery of the explanations, we present the transformation values as percentages of the original values of the features. In the following, we summarize the findings extracted by M 2 OE, however, the explanations reported in the figure supply detailed explanations showing also how to change pointed-out features expressed as a percentage of the original feature value.</p><p>• According to the explanations provided by M 2 OE, Austria's comparative advantage is due to its taxes on transport (Figure <ref type="figure">2a</ref>), indeed decreasing them by about 30% causes a loss of this behoof. Taxes bear a high level for this index also for the Republic of Croatia (Figure <ref type="figure">2b</ref>), Hungary (Figure <ref type="figure">2g</ref>) and Slovak Republic (Figure <ref type="figure">2n</ref>). Indeed, according to the results of the methodology considered in this work, a decrease in Taxes on Energy and Environmental taxes for the first, and taxes on pollution for the last two would make them have a low value for the considered index. • Another group of countries stand out in terms of comparative advantage due to investments related to the environment. More in detail, the Czech Republic (Figure <ref type="figure">2c</ref>) is characterized by expenditure on biodiversity and landscape protection, Estonia (Figure <ref type="figure">2e</ref>) on environmental protection R&amp;D, Italy (Figure <ref type="figure">2i</ref>) on waste management and environmental protection R&amp;D, North Macedonia (Figure <ref type="figure">2k</ref>) on environmental protection for unclassified aspects and, finally, United Kingdom (Figure <ref type="figure">2p</ref>) on waste management. • Another frequent pattern is that in which the policies are supposed to combine expenditures and targeted taxes. According to the information we have retrieved, it happens in Denmark (Figure <ref type="figure">2d</ref>) whose policy is characterized by a mix of taxes on transport and investment in biodiversity and landscape protection and in Israel (Figure <ref type="figure">2h</ref>) where investments on waste management are combined with taxes on transport. • In other cases, the pivotal feature for comparative advantage relates to geographical aspects like the share of forest area for Finland and Sweden (Figures 2f and 2o respectively) and the index of carbon stocks in forests for Romania (Figure <ref type="figure">2m</ref>). In both cases, a reduction is supposed to cause a low comparative advantage. • Finally, the remaining two nations, namely Japan (Figure <ref type="figure">2j</ref>) and Panama (Figure <ref type="figure">2l</ref>), exhibit an explanation differing from the previously described patterns. As for the first of them, M 2 OE says that to reduce its comparative advantage a decrease in non-renewable electricity generation is needed, while, as for the second, the proposed method's suggestion is to reduce the forest area a little and increase environmental taxes heavily.</p><p>Since the object of our analysis is also to assess the ability of M 2 OE to effectively unveil the key aspects of countries of high comparative advantage, it is useful to recall that explanations need to be read in such a way: the features included in the explanation are the important aspects and features positively impact the considered index if the transformation suggests lowering their value and negatively otherwise.</p><p>The previously performed analysis testifies that the suggestions from M 2 OE's explanations can be translated into actions that decision-makers can perform.</p><p>To further confirm the quality of the observed results, we have measured the outlierness score of the analyzed samples in the space given by the features included in the explanation. More in detail, we have computed the mean value of the outlierness score on the original outliers and the samples resulting from the transformation. Moreover, we have measured the fraction of correctly patched samples, that is to say, the portion of outliers for which the returned transformation has lowered their anomaly score of at least 5%. The outlierness score involved in our analysis is the iForest score <ref type="bibr" target="#b23">[24]</ref>, based on the Isolation Forest anomaly detection method, whose underlying idea is that the anomalies are few and isolated from the normal samples. This score has been chosen for its dimension unbiasedness, which makes comparable even explanations with different numbers of features. The outliers in the set of features provided by the explanations show a mean outlierness of 0.65, which is consistently higher than that shown by the full feature space equal to 0.42. Instead, the samples resulting from the transformation exhibit an outlierness score of 0.45, which is substantially lower than that of the outliers, indeed, the 96% of the samples has been correctly transformed. This further confirms our conviction on the actionability of the explanations provided by M 2 OE in the considered context, also witnessed by the previous qualitative evaluation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Conclusion</head><p>In this paper, we have analyzed M 2 OE, a transformation-based outlier explanation method, in the green economy context to study its effectiveness in extracting actionable explanations.</p><p>To analyze this context, we have designed a tabular dataset, named the Green Comparative Advantage (GreenCA) dataset, representing one of the contributions of this paper. This data collection summarizes and reshapes information accessed from the International Monetary Fund about many indicators relative to the Green Comparative Advantage, which is an index measuring the relative benefit of exporting low-carbon technologies.</p><p>Inspecting the explanations provided for 16 nations exhibiting a high comparative advantage, we have seen how the transformations provided by M 2 OE as explanations can be considered actionable since they provide useful suggestions showing how to make that countries have a low comparative advantage by acting on the policies or natural aspects like the share of forests. This information is useful in a complementary analysis to unveil the aspects characterizing countries having a high comparative advantage. The quality of the observed explanations has also been validated through a numerical analysis. In future work, we plan to deepen our analysis by considering social, economic and environmental diversity to observe how they influence the Green Comparative Advantage.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: M 2 OE pipeline</figDesc><graphic coords="5,72.00,65.61,451.28,101.86" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>GreenCA dataset features overview</figDesc><table><row><cell>ID</cell><cell>Description</cell><cell>Unit</cell><cell>Indicator Group</cell></row><row><cell>ET_0</cell><cell>Environmental Taxes</cell><cell>Percent of GDP</cell><cell></cell></row><row><cell>ET_1</cell><cell>Taxes on Energy</cell><cell>Percent of GDP</cell><cell></cell></row><row><cell>ET_2</cell><cell>Taxes on Pollution</cell><cell>Percent of GDP</cell><cell>Enviromental Taxes</cell></row><row><cell>ET_3</cell><cell>Taxes on Resources</cell><cell>Percent of GDP</cell><cell></cell></row><row><cell>ET_4</cell><cell>Taxes on Transport</cell><cell>Percent of GDP</cell><cell></cell></row><row><cell cols="2">EP_0 Expenditure on biodiversity &amp; landscape protection</cell><cell>Percent of GDP</cell><cell></cell></row><row><cell>EP_1</cell><cell>Expenditure on environment protection</cell><cell>Percent of GDP</cell><cell></cell></row><row><cell>EP_2 EP_3 EP_4</cell><cell>Expenditure on environmental protection n.e.c. Expenditure on environmental protection R&amp;D Expenditure on pollution abatement</cell><cell>Percent of GDP Percent of GDP Percent of GDP</cell><cell>Environmental Protection Expenditures</cell></row><row><cell>EP_5</cell><cell>Expenditure on waste management</cell><cell>Percent of GDP</cell><cell></cell></row><row><cell>EP_6</cell><cell>Expenditure on waste of water management</cell><cell>Percent of GDP</cell><cell></cell></row><row><cell>FC_0</cell><cell>Carbon stocks in forests</cell><cell>Million tonnes</cell><cell></cell></row><row><cell>FC_1</cell><cell>Forest area</cell><cell>1000 HA</cell><cell></cell></row><row><cell>FC_2 FC_3</cell><cell>Index of carbon stocks in forests Index of forest extent</cell><cell>Index Index</cell><cell>Forest and Carbon</cell></row><row><cell>FC_4</cell><cell>Land area</cell><cell>1000 HA</cell><cell></cell></row><row><cell>FC_5</cell><cell>Share of forest area</cell><cell>Percent</cell><cell></cell></row><row><cell>FF_0</cell><cell>Implicit Fossil Fuel Subsidies</cell><cell>Percent of GDP</cell><cell></cell></row><row><cell>FF_1</cell><cell>Explicit Fossil Fuel Subsidies</cell><cell>Percent of GDP</cell><cell>Fossil Fuel Subsidies</cell></row><row><cell>FF_2</cell><cell>Total Fossil Fuel Subsidies</cell><cell>Percent of GDP</cell><cell></cell></row><row><cell>RE_0_0</cell><cell>Renewable Electricity Generation</cell><cell>Gigawatt-hours (GWh)</cell><cell></cell></row><row><cell>RE_0_1 RE_1_0</cell><cell>Non-Renewable Electricity Generation Renewable Electricity Installed Capacity</cell><cell>Gigawatt-hours (GWh) Megawatt (MW)</cell><cell>Renewable Energy</cell></row><row><cell>RE_1_1</cell><cell>Non-Renewable Electricity Installed Capacity</cell><cell>Megawatt (MW)</cell><cell></cell></row><row><cell cols="2">TT_0 Trade balance in low carbon technology products</cell><cell>Percent</cell><cell>Trade in Low-carbon</cell></row><row><cell>TT_1</cell><cell>Total trade in low carbon technology products</cell><cell>Percent</cell><cell>Technology Products</cell></row></table></figure>
		</body>
		<back>

			<div type="acknowledgement">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Acknowledgments</head><p>We acknowledge the support of the PNRR project FAIR -Future AI Research (PE00000013), Spoke 9 -Green-aware AI, under the NRRP MUR program funded by the NextGenerationEU.</p></div>
			</div>

			<div type="annex">
<div xmlns="http://www.tei-c.org/ns/1.0" />			</div>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<monogr>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">O E</forename><surname>Nations</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A S</forename><surname>Development</surname></persName>
		</author>
		<ptr target="https://sdgs.un.org/2030agenda" />
		<title level="m">Transforming our world: the 2030 agenda for sustainable development</title>
				<imprint>
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">A review of the global climate change impacts, adaptation, and sustainable mitigation measures</title>
		<author>
			<persName><forename type="first">K</forename><surname>Abbass</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">Z</forename><surname>Qasim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Song</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Murshed</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Mahmood</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Younis</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Environmental Science and Pollution Research</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="page" from="42539" to="42559" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Determinants of emissions pathways in the coupled climate-social system</title>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">C</forename><surname>Moore</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Lacasse</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">J</forename><surname>Mach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><forename type="middle">A</forename><surname>Shin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">J</forename><surname>Gross</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Beckage</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Nature</title>
		<imprint>
			<biblScope unit="volume">603</biblScope>
			<biblScope unit="page" from="103" to="111" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Counterfactuals explanations for outliers via subspaces density contrastive loss</title>
		<author>
			<persName><forename type="first">F</forename><surname>Angiulli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Fassetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Nisticó</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Palopoli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Discovery Science</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2023">2023</date>
			<biblScope unit="page" from="159" to="173" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Explaining outliers and anomalous groups via subspace density contrastive loss</title>
		<author>
			<persName><forename type="first">F</forename><surname>Angiulli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Fassetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Nisticò</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Palopoli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Machine Learning</title>
				<imprint>
			<date type="published" when="2024">2024</date>
			<biblScope unit="page" from="1" to="25" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Explaining outliers by subspace separability</title>
		<author>
			<persName><forename type="first">B</forename><surname>Micenková</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">T</forename><surname>Ng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X.-H</forename><surname>Dang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Assent</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">2013 IEEE 13th international conference on data mining</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2013">2013</date>
			<biblScope unit="page" from="518" to="527" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Discriminative features for identifying and interpreting outliers</title>
		<author>
			<persName><forename type="first">X</forename><forename type="middle">H</forename><surname>Dang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">I</forename><surname>Assent</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">T</forename><surname>Ng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Zimek</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><surname>Schubert</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE 30th international conference on data engineering</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2014">2014. 2014</date>
			<biblScope unit="page" from="88" to="99" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Why is this an anomaly? explaining anomalies using sequential explanations</title>
		<author>
			<persName><forename type="first">T</forename><surname>Mokoena</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Celik</surname></persName>
		</author>
		<author>
			<persName><forename type="first">V</forename><surname>Marivate</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Pattern Recognition</title>
		<imprint>
			<biblScope unit="volume">121</biblScope>
			<biblScope unit="page">108227</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Detecting outlying properties of exceptional objects</title>
		<author>
			<persName><forename type="first">F</forename><surname>Angiulli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Fassetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Palopoli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Acm transactions on database systems</title>
		<imprint>
			<biblScope unit="volume">34</biblScope>
			<biblScope unit="page" from="1" to="62" />
			<date type="published" when="2009">2009</date>
		</imprint>
	</monogr>
	<note>tods)</note>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Outlying property detection with numerical attributes</title>
		<author>
			<persName><forename type="first">F</forename><surname>Angiulli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Fassetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Manco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Palopoli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Data mining and knowledge discovery</title>
		<imprint>
			<biblScope unit="volume">31</biblScope>
			<biblScope unit="page" from="134" to="163" />
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Discovering characterizations of the behavior of anomalous subpopulations</title>
		<author>
			<persName><forename type="first">F</forename><surname>Angiulli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Fassetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Palopoli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">IEEE Transactions on knowledge and data engineering</title>
		<imprint>
			<biblScope unit="volume">25</biblScope>
			<biblScope unit="page" from="1280" to="1292" />
			<date type="published" when="2012">2012</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">A density estimation approach for detecting and explaining exceptional values in categorical data</title>
		<author>
			<persName><forename type="first">F</forename><surname>Angiulli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Fassetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Palopoli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Serrao</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Applied Intelligence</title>
		<imprint>
			<biblScope unit="volume">52</biblScope>
			<biblScope unit="page" from="17534" to="17556" />
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Hos-miner: A system for detecting outlying subspaces of high-dimensional data</title>
		<author>
			<persName><forename type="first">J</forename><surname>Zhang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Lou</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">W</forename><surname>Ling</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 30th International Conference on Very Large Data Bases (VLDB&apos;04)</title>
				<meeting>the 30th International Conference on Very Large Data Bases (VLDB&apos;04)</meeting>
		<imprint>
			<publisher>Morgan Kaufmann Publishers Inc</publisher>
			<date type="published" when="2004">2004</date>
			<biblScope unit="page" from="1265" to="1268" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Mining outlying aspects on numeric data</title>
		<author>
			<persName><forename type="first">L</forename><surname>Duan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Tang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pei</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bailey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Campbell</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Tang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Data Mining and Knowledge Discovery</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<biblScope unit="page" from="1116" to="1151" />
			<date type="published" when="2015">2015</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Discovering outlying aspects in large datasets</title>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">X</forename><surname>Vinh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Chan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Romano</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Bailey</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Leckie</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Ramamohanarao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Pei</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Data mining and knowledge discovery</title>
		<imprint>
			<biblScope unit="volume">30</biblScope>
			<biblScope unit="page" from="1520" to="1555" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Outlier explanation through masking models</title>
		<author>
			<persName><forename type="first">F</forename><surname>Angiulli</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Fassetti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Nisticò</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Palopoli</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">European Conference on Advances in Databases and Information Systems</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="392" to="406" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<monogr>
		<ptr target="https://climatedata.imf.org/pages/access-data" />
		<title level="m">International Monetary Fund, Climate change indicators dashboard. environmental taxes</title>
				<imprint>
			<date type="published" when="2022">2022. 2024-09-07</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<ptr target="https://climatedata.imf.org/pages/access-data" />
		<title level="m">International Monetary Fund, Climate change indicators dashboard. environmental protection expenditures</title>
				<imprint>
			<date type="published" when="2022">2022. 2024-09-07</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<ptr target="https://climatedata.imf.org/pages/access-data" />
		<title level="m">International Monetary Fund, Climate change indicators dashboard. forest and carbon</title>
				<imprint>
			<date type="published" when="2022">2022. 2024-09-07</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b19">
	<monogr>
		<ptr target="https://climatedata.imf.org/pages/access-data" />
		<title level="m">International Monetary Fund, Climate change indicators dashboard. fossil fuel subsidies</title>
				<imprint>
			<date type="published" when="2022">2022. 2024-09-07</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<monogr>
		<ptr target="https://climatedata.imf.org/pages/access-data" />
		<title level="m">International Monetary Fund, Climate change indicators dashboard. renewable energy</title>
				<imprint>
			<date type="published" when="2022">2022. 2024-09-07</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b21">
	<monogr>
		<ptr target="https://climatedata.imf.org/pages/access-data" />
		<title level="m">Climate change indicators dashboard. trade in low-carbon technology products</title>
				<imprint>
			<publisher>International Monetary Fund</publisher>
			<date type="published" when="2022">2022. 2024-09-07</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b22">
	<analytic>
		<title level="a" type="main">A density-based algorithm for discovering clusters in large spatial databases with noise</title>
		<author>
			<persName><forename type="first">M</forename><surname>Ester</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H.-P</forename><surname>Kriegel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sander</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Xu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">kdd</title>
		<imprint>
			<biblScope unit="volume">96</biblScope>
			<biblScope unit="page" from="226" to="231" />
			<date type="published" when="1996">1996</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b23">
	<analytic>
		<title level="a" type="main">Isolation forest</title>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">T</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">M</forename><surname>Ting</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z.-H</forename><surname>Zhou</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">in: 2008 eighth ieee international conference on data mining</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2008">2008</date>
			<biblScope unit="page" from="413" to="422" />
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
