=Paper=
{{Paper
|id=Vol-3762/567
|storemode=property
|title=AI-Driven Innovations in Healthcare: Bridging Imaging and Genomics for Advanced Disease Insights
|pdfUrl=https://ceur-ws.org/Vol-3762/567.pdf
|volume=Vol-3762
|authors=Carlo Adornetto,Pierangela Bruno,Francesco Calimeri,Edoardo De Rose,Gianluigi Greco,Alessandro Quarta
|dblpUrl=https://dblp.org/rec/conf/ital-ia/AdornettoBCRGQ24
}}
==AI-Driven Innovations in Healthcare: Bridging Imaging and Genomics for Advanced Disease Insights==
AI-Driven Innovations in Healthcare: Bridging Imaging and
Genomics for Advanced Disease Insights
Carlo Adornetto1,** , Pierangela Bruno1 , Francesco Calimeri1 , Edoardo De Rose1 ,
Gianluigi Greco1 and Alessandro Quarta1,2
1
Department of Mathematics and Computer Science, University of Calabria, Via Pietro Bucci, Rende, Italy
2
Department of Computer, Control and Management Engineering "Antonio Ruberti", Sapienza University of Rome, Via Ariosto 25, Rome, Italy
Abstract
The application of Artificial Intelligence (AI) techniques for analyzing medical images and omics data is revolutionizing the
healthcare industry by offering profound insights into various diseases. Achieving precise diagnoses and formulating effective
treatment plans, however, demands intricate and multimodal analysis of complex, sensitive, and diverse medical datasets.
Recent advancements in Machine Learning and Deep Learning have proven to be formidable in identifying and classifying
specific diseases. This paper outlines the current projects undertaken by our research group in this innovative domain.
Keywords
Artificial Intelligence, Medical Imaging, Genomics, Deep Learning
1. Introduction and less resource-demanding systems [1, 2, 3].
In omics analysis, DL has excelled by exploring the vast
The rapid advancement of technology and increased data arrays of biological molecules, aiding in disease under-
availability have positioned Artificial Intelligence (AI) as standing and treatment customization across fields like
a cornerstone in healthcare. AI significantly enhances genomics, transcriptomics, proteomics, and metabolomics.
patient care, refines treatment protocols, and accelerates Advancements in high-throughput and next-generation
the diagnosis of diverse health conditions. Notably, AI sequencing technologies have fueled significant progress
has advanced medical imaging and omics analysis, re- in functional genomics, especially in understanding
fining diagnostic accuracy and personalizing treatment cancer-related genomic factors [4].
strategies. Despite the potential, DL models often suffer from a lack
Deep Learning (DL), a subset of AI, excels in analyzing of interpretability, a critical challenge in bioinformat-
medical images. Its ability to autonomously identify crit- ics. The rise of Explainable Artificial Intelligence (XAI)
ical features and yield accurate interpretations has made aims to enhance model transparency and improve fea-
it essential for analyzing complex visual data in medical ture selection. Techniques like Shapely Additive exPla-
imaging modalities such as X-rays, MRI, CT scans, PET, nations (SHAP) and Gradient-weighted Class Activation
and ultrasound. These capabilities are crucial for diagnos- Mapping (Grad-CAM) have become pivotal in demystify-
ing complex conditions like cancers, and cardiovascular ing the decisions of Neural Networks (NNs), providing
and neurological disorders. clearer insights into their predictive mechanisms [5, 6].
However, the assembly of extensive datasets poses sig- This paper, following our previous work [7], outlines
nificant challenges. To address this, Continual Learning our recent advancements in medical imaging and omics
(CL) has emerged as a solution, enabling models to adapt data analysis, paving the way for an in-depth exploration
through ongoing data streams, thus enhancing scalability of AI’s evolving role in healthcare. The forthcoming
and application efficiency resulting in more sustainable sections discuss medical imaging in Section 2, and omics-
scale data analysis in Section 3.1, concluding with a com-
Ital-IA 2024: 4th National Conference on Artificial Intelligence, orga-
nized by CINI, May 29-30, 2024, Naples, Italy
prehensive overview in Section 5.
*
Corresponding author.
$ carlo.adornetto@unical.it (C. Adornetto);
pierangela.bruno@unical.it (P. Bruno); francesco.calimeri@unical.it 2. Medical Imaging and AI
(F. Calimeri); edoardo.derose@unical.it (E. D. Rose);
gianluigi.greco@unical.it (G. Greco); 2.1. Vessel segmentation of
alessandro.quarta@uniroma1.it (A. Quarta)
0000-0002-9734-1017 (C. Adornetto); 0000-0002-0832-0151
cine-angiography
(P. Bruno); 0000-0002-0866-0834 (F. Calimeri); 0000-0002-0032-9434
The methodology adopted in this study systematically
(E. D. Rose); 0000-0002-5799-6828 (G. Greco); 0000-0001-6319-2466
(A. Quarta) enhances the evaluation of vascular complexity in Pe-
© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License ripheral Arterial Occlusive Disease (PAOD) patients
Attribution 4.0 International (CC BY 4.0).
CEUR
Workshop
Proceedings
http://ceur-ws.org
ISSN 1613-0073
CEUR Workshop Proceedings (CEUR-WS.org) through the integration of advanced imaging segmenta-
CEUR
ceur-ws.org
Workshop ISSN 1613-0073
Proceedings
Figure 1: Outcomes from automatic segmentation on two distinct patients are depicted. The figure on the left, Fig. a,
represents the patient with the lowest AUC value across the entire study group, whereas Fig. b on the right displays the
patient with the highest AUC value. Each figure progresses from top to bottom, beginning with the stitched image (i.e., the
grayscale input image), followed by the ground truth segmentation, and concluding with the automatically segmented image.
tion and computational analysis. Utilizing deep learning classifying different tissue types, proving to be an invalu-
techniques, this research successfully transforms cine- able tool in enhancing diagnostic accuracy and patient
angiography videos into detailed static images, markedly care [9].
enhancing the clarity and reliability of vascular assess-
ments. Furthermore, the adoption of fractal dimension as 2.2.1. Laryngeal Endoscopic Images
a quantitative metric for vascular complexity introduces a
novel, objective criterion to the field. This dual approach In this work, we present a novel approach using deep
not only promises to mitigate the subjectivity inherent learning (DL) for performing semantic segmentation on
in current diagnostic practices but also establishes a ro- laryngeal endoscopy images, building upon the foun-
bust correlation with conventional clinical evaluations, dations laid by previous research [10, 11]. The dataset
potentially revolutionizing PAOD management strate- utilized in this study includes 536 color images manually
gies [8]. Incorporating advanced imaging segmentation segmented from in vivo laryngeal examinations, all at a
and computational analysis, our method significantly re- resolution of 512×512 pixels, originating from two sepa-
fines the assessment of vascular complexity in PAOD pa- rate surgical procedures. These images are categorized
tients. Figure 1 vividly illustrates the segmented vascular into seven distinct groups: void, vocal folds, other tis-
trees from cine-angiography, alongside their correspond- sue, glottal space, pathology, surgical tool, and intubation.
ing fractal dimension analysis, showcasing the clarity Our model’s predictive capabilities were significantly
and precision of our deep learning-based approach. The enhanced by leveraging the capabilities of rule-based
study achieved significant findings, demonstrating that languages, especially Answer Set Programming (ASP).
the deep learning-based segmentation method resulted Incorporating ASP allowed us to navigate the neural
in an Area Under the Curve mean value of 0.77 ± 0.07, network’s (NN) decision-making with greater precision,
with a range from 0.57 to 0.87. This method significantly applying penalties for inaccuracies grounded in well-
improved the reliability of visual assessments of vascular established knowledge. Moreover, rule-based methods
complexity, achieving an Inter-Class Correlation Coeffi- were applied to refine our model’s output, successfully
cient (ICC) of 0.96 for segmented images, compared to rectifying minor mistakes, such as single pixels misla-
0.76 for video assessments. Additionally, the Fractal Di- beled, and adjusting misclassified categories that were
mension (FD) analysis correlated well with clinical scores, inconsistent with medical guidelines.
showing coefficients of 0.85 for manually segmented im- In summary, our approach has shown substantial ef-
ages and 0.75 for automatically segmented images. fectiveness, attaining an average Intersection over Union
(IoU) score above 0.7, a figure significantly improved by
subsequent post-processing strategies.
2.2. Segmentation
Semantic segmentation, a process that entails labeling
each pixel of an image with a specific class, represents a
major leap forward within the realm of medical imaging.
This method has been widely adopted for its critical role
in identifying tumors, recognizing various organs, and
Figure 2: The proposed algorithm for selecting a subset of genes relevant to classify CLL patients. The input data is used
to compute the genes pairwise correlation matrix (step 1), and the correlation matrix is clustered (step 2) to group similarly
correlated genes. The clusters are then mapped to the original input data and transposed. AEs are trained for each cluster to
select the most representative gene, reducing dimensionality (step 3). The genes are ranked with F-test, selecting a subset with
the highest F-value (step 4). A neural network is trained with a selected set of genes to perform binary classification of the
CLL patients (step 5). The best NNs architecture is determined through model selection, and the SHAP XAI method explains
each gene’s importance in the predictions (step 6).
3. Engineered Data Encoding for tailored treatments, yet their analysis is complex due
to three main reasons: (1) course of dimensionality: a
Medical Advancements genomics dataset typically consists of a very large num-
In this section, we delve into the innovative intersection ber of genes (features) for a small number of patients
of feature engineering and medicine, focusing on manipu- (samples); (2) imbalanced classes: there is often a signifi-
lating latent spaces to enable new AI-based solutions. We cant difference between the number of instances in each
explore a series of our works in which we exploit suitably group of interest; (3) Noise sequencing data are typically
defined latent spaces to design new gene selection algo- collected from multiple sources, different laboratories,
rithms and Generative AI approaches. In the following, and sequencing tools resulting in noisy datasets difficult
we will discuss a new algorithm for gene selection and its to analyze.
application to Chronic Lymphocytic Leukemia (CLL), and We proposed a new algorithm for genomic-scale analysis,
two new generative AI approaches used for automatic based on DL and XAI, whose aim is threefold: first, select
report generation and inverse design of materials and the most meaningful genes for a regression/classifica-
molecules. Our works not only showcase the potential tion problem; second, provide a more accurate prediction
of latent spaces in enhancing precision and efficiency in model; third, quantify and evaluate the feature’s contri-
medical research but also highlight their role in fostering bution to the predictions through XAI [12]. The proposed
the development of novel therapeutic strategies, mark- algorithm is based on two main ideas: (1) recognize simi-
ing a significant stride toward the future of personalized larly correlated features using clustered correlation ma-
medicine. trix and then filter the redundant information for each
group by using Autoencoders (AEs). In contrast with
previous works, where AEs are used for dimensional-
3.1. AI for Omics Data Analysis ity reduction [13], we implemented a mechanism to still
Functional genomics data, particularly GEP datasets, are work at the level of the original features. We hence pro-
crucial in medical science for diagnosis, prevention, and vide a more treatable dataset in terms of dimensionality,
without affecting interpretability; (2) we train NNs and fering from the non-uniqueness of the solution where,
we iteratively select the most meaningful features using moreover, very different devices can share identical prop-
a new ad-hoc defined XAI score. We eventually use the erties. Furthermore, the design spaces are likely high-
set of selected features (from all the iterations) to train dimensional and subjected to feasibility constraints.
and explain a final model. Most of the state-of-the-art DL methods for inverse de-
We used a preliminary version of this algorithm (depicted sign share the idea of looking for the design solution by
in Figure 2) for the GEP analysis of CLL patients. In our directly working at the level of the design space; indeed,
work [14] we introduced the DeepSHAP Autoencoder they have been mainly conceived to deal with applica-
Filter for Genes Selection (DSAF-GS), a deep learning tions where such a space is a low-dimensional space. By
and explainable AI-based method for gene selection in departing from these approaches, a few works in the
genomics-scale data analysis. Through the SHAP explain- literature have already advocated the benefits of map-
able AI techniques, we identified key genes influencing ping the input space into a continuous latent space. This
CLL prognosis with high accuracy. Our findings pave perspective influenced our work which proposes a neu-
the way for more targeted bio-molecular research in CLL, ral network architecture, named GIDnet (Generative
suggesting novel paths for investigating disease mecha- Inverse Design Network), where the suitable solutions
nisms and therapy timing. are additionally constrained to the only feasible region of
the latent design space, and an exploration algorithm is
3.2. Building and Exploring Meaningful used to end up with more accurate solutions [16]. A thor-
ough experimental activity over several state-of-the-art
Latent Spaces for Generative AI in benchmark datasets evidenced the superior performance
Medicine of GIDnet for inverse design problems.
Automatic Medical Report Generation via Latent In a promising future scenario, our approach can be
Space Conditioning and Transformers built using GNNs to generate specific social networks,
In this work, we explore the integration of artificial molecules, and topological representations starting from
intelligence within healthcare, focusing on automatic the prior desired properties. Our generative approach, in-
medical report generation. We introduce the VAE-GPT deed, demonstrated breakthrough performances in such
architecture, combining Variational Autoencoder (VAE) scenarios where the design space is large, discrete, and
and Generative Pre-trained Transformer (GPT) for constrained, taking into account such feasibility con-
generating medical reports from images. The VAE learns straints during the design process itself.
a latent representation of images, capturing underlying
patterns, while the GPT uses this representation to 4. Other Research Activities
generate coherent text. For the purpose the VAE is
jointly trained with a pre-trained text generator (GPT) This research group has also engaged in a variety of
and a tags predictor such that images belonging to the studies including the impact of a Nutrition Education
same context (e.g. diseases) are placed in the same Program combined with physical activity on the Mediter-
region of the latent space. Furthermore, we propose a ranean Diet adherence and inflammatory biomarkers in
novel metric, Medical Embeddings Attention Distance adolescents, showing significant improvements [17]. Ad-
(MEAD), to measure the semantic similarity between ditionally, they have examined the dynamics of opinion
generated and reference reports. Our experiments diffusion within social networks, identifying effective
demonstrate state-of-the-art performance in creating strategies based on centrality measures for influencing
informative medical reports, highlighting the potential opinion adoption [18]. Furthermore, [19] have proposed
of AI in enhancing diagnostic processes [15]. a neuro-symbolic AI approach for the compliance verifi-
cation of electrical control panels in Industry 4.0, utilizing
GIDnets: Generative Neural Networks for Solving a combination of deep learning and Answer Set Program-
Inverse Design Problems via Latent Space Explo- ming to detect anomalies with limited data. In [20] de-
ration veloped a Graph Neural Network model to assess lateral
In fields such as Engineering, Molecular Biology, and spreading displacement in New Zealand, aiming to en-
Physics, the design of technological tools and device hance earthquake impact predictions. In [21] is presented
structures is progressively supported by Inverse Design a statistical framework to learn more effectively from al-
methods, providing suggestions on crucial architectural gorithm validation challenges, specifically for medical
choices based on the properties that these tools and de- image analysis in laparoscopic videos, identifying under-
vices should exhibit. The inverse design problem aims exposure and motion as significant sources of errors. [22]
at designing proper devices according to a set of desired introduced a deep learning framework using heatmaps
properties and it is typically an ill-posed problem suf- for disease classification based on gene expression data,
demonstrating its effectiveness in tumor classification. [3] E. De Rose, Continual learning: an approach via
In [23] detailed a method for reducing and visualizing feature maps extrapolation, in: DC@AIxIA23 Doc-
data for automatic diagnosis using gene expression and toral Consortium of AIxIA 2023 conference, CEUR
clinical data, achieving high recall rates in diagnoses. Proceedings, volume 3537, Italian Association for
Lastly, we also developed a system to improve the in- Artificial Intelligence, 2023, p. To be assigned.
terpretability of automatic diagnosis by analyzing the [4] E. Alhenawi, R. Al-Sayyed, A. Hudaib, S. Mirjalili,
internal decision-making processes of neural networks Feature selection methods on gene expression mi-
[24]. croarray data for cancer classification: A systematic
review, Computers in Biology and Medicine 140
(2022) 105051.
5. Conclusion [5] S. M. Lundberg, S.-I. Lee, A unified approach to
interpreting model predictions, Advances in neural
This work advances the application of Artificial Intelli-
information processing systems 30 (2017).
gence (AI) and Deep Learning (DL) in medical diagnostics
[6] R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam,
and genomics, demonstrating their transformative po-
D. Parikh, D. Batra, Grad-cam: Visual explanations
tential for enhancing diagnostic accuracy and enabling
from deep networks via gradient-based localization,
personalized medicine. By employing advanced imaging
in: Proceedings of the IEEE international confer-
segmentation, computational analysis, and introducing
ence on computer vision, 2017, pp. 618–626.
fractal dimension as a novel metric for vascular com-
[7] C. Adornetto, P. Bruno, F. Calimeri, E. DE ROSE,
plexity, we offer innovative solutions to the challenges
G. Greco, et al., Artificial intelligence in medicine:
in medical imaging and omics data analysis. Our find-
From imaging to omics, in: CEUR WORKSHOP
ings highlight the effectiveness of these methods in im-
PROCEEDINGS, volume 3486, 2023, pp. 140–145.
proving the reliability of medical assessments and the
[8] P. Bruno, M. F. Spadea, S. Scaramuzzino, S. De Rosa,
interpretability of complex data through Explainable Ar-
C. Indolfi, G. Gargiulo, G. Giugliano, G. Esposito,
tificial Intelligence (XAI) techniques. The integration of
F. Calimeri, P. Zaffino, Assessing vascular complex-
AI in healthcare, as illustrated by our research, promises
ity of paod patients by deep learning-based segmen-
to refine diagnostic processes, optimize treatment plans,
tation and fractal dimension, Neural Computing
and contribute significantly to the future of personalized
and Applications 34 (2022) 22015–22022.
patient care.
[9] T. Dhamija, A. Gupta, S. Gupta, R. Katarya, G. Singh,
Semantic segmentation in medical images through
Acknowledgements transfused convolution and transformer networks,
Applied Intelligence 53 (2023) 1132–1148.
This work has been partially funded by PON “Ricerca e [10] M.-H. Laves, J. Bicker, L. A. Kahrs, T. Ortmaier,
Innovazione” 2014-2020, CUP: H25F21001230004, and has A dataset of laryngeal endoscopic images with
been carried out while Alessandro Quarta was enrolled comparative study on convolution neural network-
in the Italian National Doctorate on Artificial Intelligence based semantic segmentation, International jour-
run by Sapienza University of Rome with University of nal of computer assisted radiology and surgery 14
Calabria. (2019) 483–492.
[11] P. Bruno, F. Calimeri, C. Marte, M. Manna, Com-
bining deep learning and asp-based models for the
References semantic segmentation of medical images, in: Rules
and Reasoning: 5th International Joint Conference,
[1] C. S. Lee, A. Y. Lee, Clinical applications of contin-
RuleML+ RR 2021, Leuven, Belgium, September 13–
ual learning machine learning, The Lancet Digital
15, 2021, Proceedings 5, Springer, 2021, pp. 95–110.
Health 2 (2020) e279–e281.
[12] C. Adornetto, G. Greco, A new deep learning and
[2] A. Quarta, P. Bruno, F. Calimeri, Continual learn-
xai-based algorithm for features selection in ge-
ing for medical image classification, in: F. Calimeri,
nomics, 2023. arXiv:2303.16914.
M. Dragoni, F. Stella (Eds.), 1st AIxIA Workshop on
[13] P. Danaee, R. Ghaeini, D. A. Hendrix, A deep learn-
Artificial Intelligence For Healthcare (HC@AIxIA
ing approach for cancer detection and relevant gene
2022) co-located with the 21st International Con-
identification, in: Pacific symposium on biocom-
ference of the Italian Association for Artificial In-
puting 2017, World Scientific, 2017, pp. 219–229.
telligence (AIxIA 2022), Udine, Italy, November
[14] F. Morabito, C. Adornetto, P. Monti, A. Amaro,
30 2022, volume 3307 of CEUR Workshop Proceed-
F. Reggiani, M. Colombo, Y. Rodriguez-Aldana,
ings, CEUR-WS.org, 2022, pp. 67–76. URL: https:
G. Tripepi, G. D’Arrigo, C. Vener, et al., Genes
//ceur-ws.org/Vol-3307/paper7.pdf.
selection using deep learning and explainable artifi-
cial intelligence for chronic lymphocytic leukemia [24] P. Bruno, F. Calimeri, A. S. Kitanidis, E. De Momi,
predicting the need and time to therapy, Frontiers Understanding automatic diagnosis and classifica-
in Oncology 13 (2023). tion processes with data visualization, in: 2020
[15] C. Adornetto, A. Guzzo, A. Vasile, Automatic medi- IEEE international conference on human-machine
cal report generation via latent space condition- systems (ICHMS), IEEE, 2020, pp. 1–6.
ing and transformers, in: 2023 IEEE Intl Conf
on Dependable, Autonomic and Secure Comput-
ing, Intl Conf on Pervasive Intelligence and Com-
puting, Intl Conf on Cloud and Big Data Comput-
ing, Intl Conf on Cyber Science and Technology
Congress (DASC/PiCom/CBDCom/CyberSciTech),
IEEE, 2023, pp. 0428–0435.
[16] C. Adornetto, G. Greco, Gidnets: generative neural
networks for solving inverse design problems via
latent space exploration, in: Proceedings of the
Thirty-Second International Joint Conference on
Artificial Intelligence, 2023, pp. 3404–3413.
[17] C. Morelli, E. Avolio, A. Galluccio, G. Caparello,
E. Manes, S. Ferraro, A. Caruso, D. De Rose,
I. Barone, C. Adornetto, et al., Nutrition educa-
tion program and physical activity improve the
adherence to the mediterranean diet: impact on
inflammatory biomarker levels in healthy adoles-
cents from the dimenu longitudinal study, Frontiers
in Nutrition 8 (2021) 685247.
[18] C. Adornetto, V. Fionda, G. Greco, On the effec-
tiveness of compact strategies for opinion diffusion
in social environments, in: ECAI 2023, IOS Press,
2023, pp. 11–18.
[19] V. Barbara, M. Guarascio, N. Leone, G. Manco,
A. Quarta, F. Ricca, E. Ritacco, Neuro-symbolic
ai for compliance checking of electrical control pan-
els, Theory and Practice of Logic Programming 23
(2023) 748–764.
[20] M. Durante, G. Terremoto, C. Adornetto, G. Greco,
E. Rathje, A new graph neural network (gnn)
based model for the evaluation of lateral spread-
ing displacement in new zealand, in: Proceed-
ings of the 8th International Conference on Earth-
quake Geotechnical Engineering (VIII ICEGE), Os-
aka, Japan, 2024, pp. 7–10.
[21] T. Roß, P. Bruno, A. Reinke, M. Wiesenfarth,
L. Koeppel, P. M. Full, B. Pekdemir, P. Godau,
D. Trofimova, F. Isensee, et al., Beyond rankings:
Learning (more) from algorithm validation, Medical
image analysis 86 (2023) 102765.
[22] P. Bruno, F. Calimeri, Using heatmaps for deep
learning based disease classification, in: 2019 IEEE
conference on computational intelligence in bioin-
formatics and computational biology (CIBCB), IEEE,
2019, pp. 1–7.
[23] P. Bruno, F. Calimeri, A. S. Kitanidis, E. De Momi,
Data reduction and data visualization for automatic
diagnosis using gene expression and clinical data,
Artificial Intelligence in Medicine 107 (2020) 101884.