<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Enhancing Cyber-threat detection coupling Deep Neural Ensemble Learning with XAI</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Malik</forename><surname>Al-Essa</surname></persName>
							<email>malik.alessa@uniba.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Bari Aldo Moro</orgName>
								<address>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Giuseppina</forename><surname>Andresini</surname></persName>
							<email>giuseppina.andresini@uniba.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Bari Aldo Moro</orgName>
								<address>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">Consorzio Interuniversitario Nazionale per l&apos;Informatica -CINI</orgName>
								<address>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Annalisa</forename><surname>Appice</surname></persName>
							<email>annalisa.appice@uniba.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Bari Aldo Moro</orgName>
								<address>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">Consorzio Interuniversitario Nazionale per l&apos;Informatica -CINI</orgName>
								<address>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Donato</forename><surname>Malerba</surname></persName>
							<email>donato.malerba@uniba.it</email>
							<affiliation key="aff0">
								<orgName type="institution">University of Bari Aldo Moro</orgName>
								<address>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
							<affiliation key="aff1">
								<orgName type="institution">Consorzio Interuniversitario Nazionale per l&apos;Informatica -CINI</orgName>
								<address>
									<settlement>Bari</settlement>
									<country key="IT">Italy</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Enhancing Cyber-threat detection coupling Deep Neural Ensemble Learning with XAI</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">EF15114BC1193BA37EBA93F85FC96405</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T16:56+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Ensemble Learning, Adversarial Training, eXplainable Artificial Intelligence, Cyber-threat Detection D. Malerba) 0000-0002-0892-975X (M. Al-Essa)</term>
					<term>0000-0002-5272-644X (G. Andresini)</term>
					<term>0000-0001-9840-844X (A. Appice)</term>
					<term>0000-0001-8432-4608 (D. Malerba)</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>In the digital age, the use of deep learning is one of the most powerful machine learning paradigms for cybersecurity. Despite the amazing results recently achieved with deep learning methods in securing the digital infrastructures of modern organizations, the security of neural models can easily be jeopardized by adversarial attacks. This article describes a recently published cyber-threat detection method, named PANACEA, that combines Adversarial Training and eXplainable Artificial Intelligence (XAI) to increase the diversity of multiple neural models fused together through a neural ensemble system. Experiments carried out on several benchmark cybersecurity datasets show the beneficial effects of the proposed combination of Adversarial Training, Ensemble Learning and XAI on the accuracy of multi-class classifications of cyber-data achieved by the neural method.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>During the last decade, the cybersecurity literature has conferred a high-level role in deep learning as a powerful learning paradigm to detect ever-evolving cyberthreats in modern security systems. In particular, recent cybersecurity studies have shown that deep learning performance can be further strengthened with ensemble learning systems <ref type="bibr" target="#b0">[1]</ref> that are able to obtain better generalization by reducing the dispersion of predictions of single models and gaining model accuracy. However, selecting the ensemble member models based on the local model accuracy may lead to the issue of excessive ensemble because the performance of the ensemble system may not be significantly improved by some of the selected models. Therefore, several scholars encourage the diversity among individual models of deep ensembles, in addition to the accuracy of individual models, to learn diverse aspects of training data <ref type="bibr" target="#b1">[2]</ref>.</p><p>In <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref>, we have recently proposed a new XAI-based method, named PANACEA, that is mainly founded on the idea that different sub-areas of the input feature space can be equally relevant to achieve a correct decision for multiple samples produced in the same situation. Hence, an accurate ensemble system may be produced through the fusion of base models that perform decisions which give more importance to different sub-areas of the input feature space. For this purpose, we use the XAI DALEX framework <ref type="bibr" target="#b4">[5]</ref> to explain the global feature importance in neural models. Specifically, we adopt a combination of XAI and clustering to select ensemble base models that achieve high explanation diversity. Finally, we use a multi-headed neural network architecture that fine-tunes simultaneously base neural models selected through DALEX-based clustering, by taking advantage of a back-propagation strategy to share knowledge among multiple base models incorporated as sub-network blocks in the ensemble system.</p><p>Motivations for adopting this neural ensemble method in cybersecurity problems can be mainly founded in the peculiarities of the network intrusion detection problems, where samples of different attack families commonly have signatures involving different features. For example, as illustrated by <ref type="bibr" target="#b5">[6]</ref>, "the time between the SYN ACK and the ACK response" is relevant for detecting shellcode intrusions, while it becomes less important when detecting other types of attacks. Shellcode, in fact, is an exploiting attack in which the attacker penetrates a piece of code from a shell to control a target machine using the standard TCP/IP socket connections.</p><p>Based upon these considerations, our point of view is that being able to fuse deep neural models that give relevance to different network traffic feature signatures (and, consequently, input feature sub-spaces) may help in improving the accuracy of a multi-class deep neural ensemble trained to recognize different cyber-attack patterns such as various categories of network traffic intru-Figure <ref type="figure">1</ref>: Schema of PANACEA sions. Our argument is mainly supported by experiments performed with three benchmark network intrusion detection datasets, namely NSL-KDD, UNSW-NB15 and CI-CIDS17, that comprise multiple real categories of network traffic intrusions (comprising rare attacks). In addition, to explore the adaptability of the proposed method to other cyber-threat detection problems, we also evaluated the effectiveness proposed method in a benchmark malware detection problem, namely CICMalDroid20, since we expect that, similarly to network traffic intrusions, different malware categories may have diverse feature signatures.</p><p>This paper summarises some of the main results published in <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref>. The PANACEA method is presented in Section 2. Section 3 illustrates the main results achieved in the evaluation of the proposed method. Finally, Section 4 draws conclusions and sketches future research directions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">PANACEA method</head><p>Let us consider a dataset 𝒟 = {(x𝑖, 𝑦𝑖)} 𝑁 𝑖=1 of 𝑁 training samples, where x ∈ R 𝑑 is a 𝑑-dimensional vector of input features that describe cyber-data samples, and 𝑦 ∈ {1, . . . , 𝐾} is the label variable with 𝐾 classes (benign class and several categories of cyber-threats), according to labels of samples historically collected.</p><p>The PANACEA method, illustrated in Figure <ref type="figure">1</ref>, is based on the following steps:</p><p>• The training of an initial neural model 𝑀 𝜃 : R 𝑑 ↦ → 𝑌 with parameter 𝜃 learned from 𝒟.</p><p>• The generation of an adversarial set 𝒜 produced by 𝒟 with data perturbation threshold 𝜖 by using 𝑀 𝜃 . The adversarial samples are produced using the FGSM algorithm. • The training of 𝜂 neural model candidates learned from 𝒟, augmented with subsets of 𝜎 adversarial samples randomly selected from 𝒜. • The use of a post-hoc global XAI technique, namely DALEX, to explain the decisions of neural model candidates and generate a feature-vector explanation of each neural model candidate. • A clustering stage (𝑘-medoids method) to group neural model candidates with similar feature explanation vectors in the same clusters, and neural model candidates with dissimilar feature explanation vectors in separate clusters. Since each cluster medoid is a neural model candidate that acts as the cluster's prototype, 𝑘 medoids (chosen using the Elbow method) are selected as the base neural models for the ensemble fusion. • A multi-headed neural network that fuses together base neural models selected through clustering.</p><p>Notice that the performance of PANACEA may depend on the input parameters:(1) 𝜖 that represents the amount of data perturbation considered to generate adversarial samples; (2) 𝜎 that defines the number of adversarial samples randomly selected for learning each neural model candidate with the adversarial training strategy; (3) 𝜂 that is the number of distinct neural model candidates learned with the adversarial training strategy. In general, the perturbation 𝜖 is selected as a small value in the range between 0 and 0.1 <ref type="bibr" target="#b6">[7]</ref>, to scale the noise and ensure that perturbations are small enough to remain undetected to the human eye, but large enough to fool the attacked neural model. In PANACEA the value of 𝜖 is automatically selected based on the characteristics of adversarial samples. This is based on the idea that the value at which a lower 𝜖 stops perturbing training samples, by diminishing the number of misclassified adversarial training samples, may correspond to an adequate value of 𝜖 for gaining accuracy with the adversarial training strategy. Based on this idea, for each 𝜖 in the range [0, 0.1], the adversarial set 𝒜𝜖, produced from the original training set with initial neural model 𝑀 𝜃 as target model, is considered. The Overall Accuracy (OA) of 𝑀 𝜃 is computed on each 𝒜𝜖 and the Elbow method is used to pick the knee of the OA(𝒜𝜖) curve as the value of 𝜖. Notably, this procedure for the automatic selection of 𝜖 is independent of both 𝜎 and 𝜂 that remain user-defined parameters</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Evaluation study</head><p>Four benchmark multi-class datasets, i.e., NSL-KDD, UNSW-NB15, CICIDS17 (network security datasets) and CICMalDroid20 (malware security dataset) were considered to evaluate the performance of PANACEA. Experiments were conducted by dividing each dataset into training set and testing set. The detailed description of the experimental set-up is reported in <ref type="bibr" target="#b3">[4]</ref>.</p><p>The most of experiments were conducted with 𝜎 = 5% and 10% of the training set size, considering the values of elbow 𝜖 automatically selected with the Elbow method and fixing 𝜂 = 100 for all datasets. However, further experiments exploring the sensitivity of the performance of PANACEA to the number of models 𝜂 are illustrated in <ref type="bibr" target="#b3">[4]</ref>.</p><p>Table <ref type="table" target="#tab_0">1</ref> reports the number of neural models (𝑘) that the clustering step of PANACEA selected for the ensemble fusion, as well as WeightedF1, MacroF1 and OA of PANACEA in the considered experimental setting. All the accuracy metrics were measured on the testing set of each dataset. As BASELINE, we considered the deep neural network that was trained in the first step of PANACEA as the initial neural model for the adversarial sample production. We recall that the number of clusters 𝑘 was automatically identified during the clustering step of PANACEA. The results show that PANACEA outperforms BASELINE, independently of the number 𝜎 of adversarial samples processed in UNSW-NB15, CICIDS17 and CICMalDroid20. In these three datasets, the gain in accuracy is commonly observed equally along WeightedF1, MacroF1 and OA. The only exception is the MacroF1 of PANACEA with 𝜎 = 5% in UNSW-NB15. However, both WeightedF1 and OA of PANACEA outperform WeightedF1 and OA of BASELINE also in this configuration. In addition, there is at least one tested configuration of PANACEA that outperforms BASELINE in NSL-KDD. Finally, also in NSL-KDD the gain in accuracy is observed along WeightedF1 and OA, but not along MacroF1. This is due to the presence of minority classes in both NSL-KDD and UNSW-NB15. In fact, in both datasets, the ensemble strategy allows us to gain accuracy by better classifying samples of majority classes, while we may lose accuracy by classifying samples of minority classes. This intuition is confirmed by the analysis of detailed F1 per class, reported <ref type="bibr" target="#b3">[4]</ref>. Notably <ref type="bibr" target="#b3">[4]</ref> also reports an extensive analysis of the accuracy performance of PANACEA compared to several, recent state-of-the-art competitors, as well as the analysis of the accuracy performance achieved by PANACEA by using PGD, DeepFool and LowProFool in place of FGSM.</p><p>To examine in-depth diversity, Figure <ref type="figure" target="#fig_0">2</ref> depicts the top-15 relevant features on the global decisions of the base neural models selected in NSL-KDD. Feature ranking maps show how diverse input features play prominent roles in explaining the decisions of the base neural models selected for the ensemble fusion in PANACEA. For example, the input feature "serror_rate", that is ranked in third place for the neural model medoids of clusters 2, 3 and 7 of NSL-KDD, is not even in the top-15 for the medoid of cluster 6. Notably, humans may inspect this explanation result to confirm the selection of neural model candidates automatically selected by PANACEA or perform a manual update of the automatic selection (with model deletions or additions) according to background knowledge.</p><p>We complete this article by illustrating an example that shows how the ensemble model of PANACEA gains accuracy in a cyber-threat detection task compared to the single model of BASELINE. For this purpose, we consider an R2L sample of the test set of NSL-KDD that was wrongly classified by BASELINE in the class Normal, while it was correctly recognised in the class R2L by PANACEA. We analyse this sample by using SHAP that is a local algorithm to measure the effect of an input feature on the assignment of a sample to a class with a neural model. Figure <ref type="figure" target="#fig_3">3</ref> shows the five most important input features identified by SHAP to see the sample in the class R2L with the models learned by both BASELINE and PANACEA. Let us consider that only PANACEA predicted this sample in the class R2L.</p><p>Both BASELINE and PANACEA share the same top-3 features, i.e., service_http, service_ftp_data and dst_host_srv_count. Notably, these three features are recognised as important to detect R2L attacks also in <ref type="bibr" target="#b7">[8]</ref>. The input feature in the fourth place of the feature ranking of PANACEA is protocol_type_tcp that does not appear in the feature ranking of BASELINE. The authors of <ref type="bibr" target="#b8">[9]</ref> report that the simultaneous use of the TCP protocol and the FTP service is to be considered a symptom of Master is a subcategory of R2L attacks, where attackers exploit a system bug associated with FTP to send packets of illegal software to a target host <ref type="bibr" target="#b8">[9]</ref>. We note that FTP is a service based on the TCP protocol. Therefore, this example shows how the ensemble model of PANACEA manages to bring out the existence of feature patterns useful for the recognition of attack classes that are often ignored by the single model of BASELINE. These conclusions are also supported by the study of <ref type="bibr" target="#b7">[8]</ref>, that identifies both service_ftp_data and protocol_type_tcp features as the most important features to detect R2L attacks. In addition, BASELINE, differently from PANACEA, identifies serror_rate as one of the most relevant features for recognizing the sample as an R2L attack. However, neither <ref type="bibr" target="#b7">[8]</ref> nor <ref type="bibr" target="#b8">[9]</ref> identify this feature as one of the most prominent features for this type of attack.</p><p>In short, the emergence of protocol_type_tcp can be considered as an important input feature instead of ser-ror_rate motivates the ability of PANACEA in correctly recognising the considered R2L sample and, in general, the ability of outperforming BASELINE in the recognition of R2L attacks (that passes from F1(R2L)=0.55 for BASELINE to F1(R2L)=0.64 for PANACEA.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusion</head><p>In this paper, we have summarized the main results of our newest research illustrated in <ref type="bibr" target="#b3">[4]</ref>, where we have described a deep learning method for multi-class classification of cyber-data. <ref type="foot" target="#foot_0">1</ref> The proposed method trains an ensemble of base neural models, whose weights are initialised with an adversarial training strategy. We use an XAI-based approach to increase the diversity of the neural models selected to be fused together through the ensemble system.</p><p>Notably, this article delves into one of the current research directions carried out by Laboratory KDDE (https://kdde.di.uniba.it/) at the University of Bari "Aldo Moro", which aims at exploring a Symbiotic AI approach to Cybersecurity. The team has recently published several papers in this field (e.g., <ref type="bibr" target="#b9">[10,</ref><ref type="bibr" target="#b10">11,</ref><ref type="bibr" target="#b5">6,</ref><ref type="bibr" target="#b11">12,</ref><ref type="bibr" target="#b12">13]</ref>). In particular, the newest studies <ref type="bibr" target="#b2">[3,</ref><ref type="bibr" target="#b3">4]</ref> stay under the umbrella of Symbiotic AI, as they explore how Explainability of AI systems can be leveraged as a valuable means to allow deep neural models to gain accuracy under critical conditions commonly occurring in cybersecurity problems, e.g., class imbalance, attack signature diversity. They provide a mechanism that can explain to humans how the candidate models are selected for ensemble systems. On the other side, these studies stay under the umbrella of Cybersecurity, as XAI is used to improve the performance of a cyber-threat detection ensemble model on multiple attack categories by allowing us to identify and use the multiple input sub-space that can help in detecting attacks with diverse signature. In addition, the use of XAI tools allows us to perform a step forward to gain the trust of stakeholders in AI decisions. In fact, it allows us to disclose cyber-data patterns that are hidden in how the AI models achieve a decision and explain why a black box model can actually achieve higher performance than another one in cyber-threat detection.</p><p>By continuing along this research direction, the team is working on the use of XAI to examine and explain the evasion ability of state-of-the-art attack methods formulated for Windows PE malware detection problems. In addition, the team is investigating emerging learning frameworks (such as distillation) to leverage explanations disclosed through attention layers to improve the performance of deep neural models trained for cyber-threat detection.    </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="5.">Acknowledgments</head></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: Top-15 feature ranking map of the base neural models selected through the clustering step of PANACEA in NSL-KDD</figDesc><graphic coords="4,118.05,84.19,145.85,219.78" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>Malik AL-Essa is supported by PON RI 2014-2020 -Machine Learning per l'Investigazione di Cyber-minacce e la Cyber-difesa -CUP H98B20000970007. Giuseppina Andresini is supported by the project FAIR -Future AI Research (PE00000013), Spoke 6 -Symbiotic AI, under the NRRP MUR program funded by the NextGenerationEU.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: Top-5 input features considered by both BASELINE (3a) and PANACEA (3b) to recognize an R2L attack in the class R2L</figDesc><graphic coords="5,97.63,94.15,200.01,98.95" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0"><head></head><label></label><figDesc></figDesc><graphic coords="2,130.96,84.19,333.36,202.26" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>WeightedF1, MacroF1 and OA of PANACEA with 𝜎 = 5% and 10% of the training set size and BASELINE. 𝑘 denotes the number of distinct neural models automatically selected in the clustering step of PANACEA over 𝜂 = 100 neural model candidates. The best results are in bold. Annalisa Appice and Donato Malerba are partially supported by project SERICS (PE00000014) under the NRRP MUR National Recovery and Resilience Plan funded by the European Union -NextGenerationEU.</figDesc><table><row><cell>dataset</cell><cell>method</cell><cell>𝑘</cell><cell>WeightedF1</cell><cell>MacroF1</cell><cell>OA</cell></row><row><cell></cell><cell>BASELINE</cell><cell>-</cell><cell>0.80</cell><cell>0.64</cell><cell>0.80</cell></row><row><cell>NSL-KDD</cell><cell>PANACEA (𝜎 = 5%)</cell><cell>8</cell><cell>0.79</cell><cell>0.60</cell><cell>0.80</cell></row><row><cell></cell><cell>PANACEA (𝜎 = 10%)</cell><cell>7</cell><cell>0.83</cell><cell>0.64</cell><cell>0.84</cell></row><row><cell></cell><cell>BASELINE</cell><cell>-</cell><cell>0.74</cell><cell>0.42</cell><cell>0.74</cell></row><row><cell>UNSW-NB15</cell><cell>PANACEA (𝜎 = 5%)</cell><cell>12</cell><cell>0.77</cell><cell>0.41</cell><cell>0.77</cell></row><row><cell></cell><cell>PANACEA (𝜎 = 10%)</cell><cell>11</cell><cell>0.78</cell><cell>0.44</cell><cell>0.77</cell></row><row><cell></cell><cell>BASELINE</cell><cell>-</cell><cell>0.92</cell><cell>0.64</cell><cell>0.91</cell></row><row><cell>CICIDS17</cell><cell>PANACEA (𝜎 = 5%)</cell><cell>9</cell><cell>0.98</cell><cell>0.73</cell><cell>0.97</cell></row><row><cell></cell><cell>PANACEA (𝜎 = 10%)</cell><cell>9</cell><cell>0.99</cell><cell>0.94</cell><cell>0.99</cell></row><row><cell></cell><cell>BASELINE</cell><cell>-</cell><cell>0.83</cell><cell>0.80</cell><cell>0.83</cell></row><row><cell>CICMalDroid20</cell><cell>PANACEA (𝜎 = 5%)</cell><cell>13</cell><cell>0.90</cell><cell>0.88</cell><cell>0.89</cell></row><row><cell></cell><cell>PANACEA (𝜎 = 10%)</cell><cell>18</cell><cell>0.85</cell><cell>0.83</cell><cell>0.86</cell></row></table></figure>
			<note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0">The original research illustrated in<ref type="bibr" target="#b3">[4]</ref> was published under Creative Commons License Attribution 4.0 (CC BY 4.0) https: //creativecommons.org/licenses/by/4.0/</note>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Ensemble learning for intrusion detection systems: A systematic mapping study and cross-benchmark evaluation</title>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">A</forename><surname>Tama</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Lim</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.cosrev.2020.100357</idno>
	</analytic>
	<monogr>
		<title level="j">Computer Science Review</title>
		<imprint>
			<biblScope unit="volume">39</biblScope>
			<biblScope unit="page" from="1" to="27" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">A survey on ensemble learning</title>
		<author>
			<persName><forename type="first">X</forename><surname>Dong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Cao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Shi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Ma</surname></persName>
		</author>
		<idno type="DOI">10.1007/s11704-019-8208-z</idno>
	</analytic>
	<monogr>
		<title level="j">Frontiers of Computer Science</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="page" from="241" to="258" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<monogr>
		<title level="m" type="main">Panacea: A neural model ensemble for cyber-threat detection</title>
		<author>
			<persName><forename type="first">M</forename><surname>Al-Essa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Andresini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Appice</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Malerba</surname></persName>
		</author>
		<idno type="DOI">10.1109/DSAA60987.2023.10302518</idno>
		<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Panacea: a neural model ensemble for cyber-threat detection</title>
		<author>
			<persName><forename type="first">M</forename><surname>Al-Essa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Andresini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Appice</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Malerba</surname></persName>
		</author>
		<idno type="DOI">10.1007/s10994-023-06470-2</idno>
	</analytic>
	<monogr>
		<title level="m">Machine Learning</title>
				<imprint>
			<date type="published" when="2024">2024</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">DALEX: Explainers for complex predictive models in R</title>
		<author>
			<persName><forename type="first">P</forename><surname>Biecek</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Machine Learning Research</title>
		<imprint>
			<biblScope unit="volume">19</biblScope>
			<biblScope unit="page" from="1" to="5" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">ROULETTE: A neural attention multioutput model for explainable network intrusion detection</title>
		<author>
			<persName><forename type="first">G</forename><surname>Andresini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Appice</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">P</forename><surname>Caforio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Malerba</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Vessio</surname></persName>
		</author>
		<idno type="DOI">10.1016/j.eswa.2022.117144</idno>
	</analytic>
	<monogr>
		<title level="j">Expert Systems with Applications</title>
		<imprint>
			<biblScope unit="page">117144</biblScope>
			<date type="published" when="2022">2022</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Recent advances in adversarial training for adversarial robustness</title>
		<author>
			<persName><forename type="first">T</forename><surname>Bai</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Luo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Zhao</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Wen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Wang</surname></persName>
		</author>
		<idno type="DOI">10.24963/ijcai.2021/591</idno>
	</analytic>
	<monogr>
		<title level="m">30th International Joint Conference on Artificial Intelligence, IJCAI 2021, IJCAI.ORG</title>
				<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="4312" to="4321" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">KDD feature set complaint heuristic rules for R2L attack detection</title>
		<author>
			<persName><forename type="first">M</forename><surname>Sabhnani</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Serpen</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International Conference on Security and Management, SAM 2003</title>
				<imprint>
			<publisher>CSREA Press</publisher>
			<date type="published" when="2003">2003</date>
			<biblScope unit="page" from="310" to="316" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">An explainable machine learning framework for intrusion detection systems</title>
		<author>
			<persName><forename type="first">M</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Zheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Yang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">X</forename><surname>Wang</surname></persName>
		</author>
		<idno type="DOI">10.1109/ACCESS.2020.2988359</idno>
	</analytic>
	<monogr>
		<title level="j">IEEE Access</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="page" from="73127" to="73141" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">Leveraging grad-cam to improve the accuracy of network intrusion detection systems</title>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">P</forename><surname>Caforio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Andresini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Vessio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Appice</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Malerba</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">24th Conference on Discovery Science</title>
		<title level="s">Lecture Notes in Computer Science</title>
		<meeting><address><addrLine>DS</addrLine></address></meeting>
		<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2021">2021. 2021</date>
			<biblScope unit="volume">12986</biblScope>
			<biblScope unit="page" from="385" to="400" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">INSOMNIA: towards concept-drift robustness in network intrusion detection</title>
		<author>
			<persName><forename type="first">G</forename><surname>Andresini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Pendlebury</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Pierazzi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Loglisci</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Appice</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Cavallaro</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">14th ACM Workshop on Artificial Intelligence and Security</title>
				<imprint>
			<publisher>ACM</publisher>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="111" to="122" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">XAI to explore robustness of features in adversarial training for cybersecurity</title>
		<author>
			<persName><forename type="first">M</forename><surname>Al-Essa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Andresini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Appice</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Malerba</surname></persName>
		</author>
		<idno type="DOI">10.1007/978-3-031-16564-1_12</idno>
	</analytic>
	<monogr>
		<title level="m">Foundations of Intelligent Systems</title>
				<imprint>
			<publisher>Springer International Publishing</publisher>
			<date type="published" when="2022">2022</date>
			<biblScope unit="page" from="117" to="126" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">An XAI-based adversarial training approach for cyber-threat detection</title>
		<author>
			<persName><forename type="first">M</forename><surname>Al-Essa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Andresini</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Appice</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Malerba</surname></persName>
		</author>
		<idno type="DOI">10.1109/DASC/PiCom/CBDCom/Cy55231.2022.9927842</idno>
	</analytic>
	<monogr>
		<title level="m">IEEE International Conference on Cyber Science and Technology Congress, CyberSciTech 2023</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2022">2022. 2022</date>
			<biblScope unit="page" from="1" to="8" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<author>
			<persName><forename type="first">A</forename></persName>
		</author>
		<ptr target="https://github.com/malikalessa/PANACEA" />
		<title level="m">Online Resources The source code of PANACEA implementation is available</title>
				<imprint/>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
