<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>with XAI and transfer learning to augment neural models⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Aparna S. Varde</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Benjamin S. B. Rasmussen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Radiology and Faculty of Health Sciences, Radiologyl Research and Innovation Unit (UNIFY), Centre for Clinical Artificial Intelligence (CAI-X), Odense university hospital and University of Southern Denmark Odense</institution>
          ,
          <country country="DK">Denmark</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>SDU Center for Software Technology, Maersk McKinney Moller Institute (MMMI), Faculty of Engineering (TEK), University of Southern Denmark (SDU)</institution>
          ,
          <addr-line>Vejle</addr-line>
          ,
          <country country="DK">Denmark</country>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2026</year>
      </pub-date>
      <abstract>
        <p>This paper presents a vision to propel approaches for sustainability and thus equity in healthcare applications with XAI (explainable AI) and transfer learning to augment deep learning neural models, e.g. multimodal ones (combining text, images and heterogeneous data). Transfer learning can promote scalability, i.e. hypotheses learned on one illness can be transferred to related settings. XAI solutions build trust and transparency in clinical decision-making (trace causes of errors, and map empirical observations to theories with factual reasoning etc.) As stated by real-life clinical practitioners: "Explainability is crucial to aid doctors to ensure that techniques work, and traceability helps improve solutions". It can thus benefit human-AI collaboration in medicine to enhance accessibility. The vision in this paper aims to make positive impacts on United Nations Sustainable Development Goals, e.g. Goal 3 on Good Health and Well-Being. It can play a key role in data analytics for real-life applications, especially by helping to advance global health and equity.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;AI in Healthcare</kwd>
        <kwd>Commonsense Knowledge (CSK)</kwd>
        <kwd>Data Analytics for Social Good</kwd>
        <kwd>Explainable AI (XAI)</kwd>
        <kwd>Knowledge Bases (KBs)</kwd>
        <kwd>Knowledge-Guided Machine Learning (KGML)</kwd>
        <kwd>Sustainable AI</kwd>
        <kwd>Transfer Learning</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Background: Deep learning neural models in healthcare can be beneficial for a myriad of tasks
such as assistance in disease diagnosis, interactive patient communication, and semantically-aware
summarization of complex medical documents. Healthcare data can often be multimodal: plain text
including doctor notes and research abstracts, medical images such as X-rays and MRI, audio recordings
e.g. cough sounds, and video clips e.g. surgical procedures. Thus, to conduct data analytics in healthcare,
neural models can possibly be used, especially multimodal ones, yet they pose issues. A few models
and their issues are stated next.</p>
      <p>Related Work: Multimodal neural models such as CLIP [1], Google [2], and GPT-4o [3] can process
and generate text as well as images. CLIP (Contrastive Language–Image Pre-training) is a model that
"learns how to connect images and text by learning simultaneous embeddings for both". Gemini can
(among other things) fathom a photo of a nutritionist-recommended food item and provide its recipe
or vice versa. A healthcare-specific multimodal model is Med-Gemini [ 4] that "builds upon Google’s
Gemini models by fine-tuning on de-identified medical data while inheriting Gemini’s native reasoning,
multimodal, and long-context abilities". While such multimodal models thriving on deep learning
(DL) can be useful in healthcare, they pose issues e.g. lack of accuracy, errors without traceability,
and conveying misinformation (which can be unintentional) [5], [6]. Moreover, DL models need
colossal amounts of training data, and consume enormous amounts of energy. Training a neural model
needs as much energy as 5 hybrid cars in their lifetime [7]. Thus, solely relying on DL can make</p>
      <p>AI-based solutions less afordable and accessible, posing equity issues. Furthermore, pure DL models
lack explainability, hence adversely impacting interpretability, trust, evidence-based decisions as well
as human-AI collaboration.</p>
      <p>Motivation: Various issues with DL models motivate the need for greater sustainability and
explainability. Sustainable healthcare is vital to make proposed solutions more accessible and hence
equitable, aligning with the mission of UN SDGs (United Nations Sustainable Development Goals). The
SDGs address healthcare, equity and sustainability (see Fig. 1). For instance, automated low-complexity
disease diagnosis is imperative to promote global health initiatives. Likewise, adding explainability to
AI-based healthcare solutions is highly desirable to aid meaningful communication (with reasoning)
among medical staf and patients. It fosters more trustworthiness, consistency and evidence-based
decision support. It also improves human-AI collaboration due to enhanced interpretation, trust and
communication.</p>
      <p>Proposed Vision: This vision paper advocates sustainable, equitable healthcare solutions. It aims to
make positive impacts on UN SDGs (Goal 3: Good Health and Well-Being, Goal 10: Reduced Inequalities,
Goal 11: Sustainable Cities and Communities). The crux of our vision is illustrated in Fig. 2 which
synopsizes a few aspects such as trust in AI. In addition to exemplifying classical XAI methods, e.g.
decision trees and counterfactuals, we propose the following main contributions.</p>
      <p>1. KGML (knowledge-guided machine learning) to add context and enhance accuracy
2. KB (knowledge base) design and commonsense knowledge (CSK) to supplement neural models
3. Exploration of transfer learning and low-complexity theories with eficiency</p>
      <p>An important aim is to ensure that healthcare is afordable and accessible to various regions across
the globe. It promotes equity and contributes positively to the theme of data analytics in real-life
applications.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Exploring the KGML Paradigm</title>
      <p>A useful paradigm in recent years is knowledge-guided machine learning (KGML) which infuses domain
knowledge in machine learning methods to add specific context, enhancing performance [ 8], [9]. Fig. 3 is
an overview of the successful deployment of this paradigm in environmental science [10]. As seen here,
contextual information is extracted from ArcGIS and fed into an integrated framework of ViT (vision
transformers) and CNN (convolutional neural networks) to enhance power plant classification. More
specifically, this framework adds domain knowledge from GIS (geographic information system) data into
ViT and CNN architectures to guide the learning with domain knowledge. This knowledge is integrated
through spatial masks (SM) for adding context to power plant classification. It raises classification
accuracy by approximately 10% across all power plant types, while also adding explainability, and
saving training resources.</p>
      <p>Likewise, our vision is to propose KGML in the healthcare domain as follows.
1. Work alongside medical experts to extract relevant domain knowledge.
2. Program using XAI methods to infuse the knowledge with multimodal DL models, e.g. CLIP.
3. Adapt the notion of RAG (Retrieval-Augmented Generation) in LLMs (large language models):
medical documents to infuse RAG-context.
4. Test the performance and enhance the framework using human-in-the-loop for further
optimization from a practical standpoint.</p>
      <p>We have the following example inputs garnered by interacting with real-life physicians [11],
[12]: "For the purpose of integrating clinical context in deep learning based pneumonia classification,
conventional deep learning models for pneumonia classification often rely solely on raw radiographic data.
However, diagnostic accuracy can be significantly enhanced by augmenting these models with structured or
textual clinical context, mirroring the integrative reasoning employed by us physicians". Thus, in clinical
context integration, consider the scenario next.</p>
      <p>Scenario 1: KGML for clinical context in pyrexia analysis</p>
      <p>Inputs: Case histories of patients e.g. "72-year-old patient presents with pyrexia (39∘ C), an elevated C-reactive protein
(CRP) level, and a productive cough persisting for three days. Physical examination reveals bilateral basal crackles."</p>
      <p>
        KGML model as (DL + RAG): (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Infuse variables with weights &amp; markers given by doctors (as RAG data) into DL (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) This
KGML model moves beyond simple pattern recognition (with pure DL) to interpret findings in a broader diagnostic framework
(with RAG). (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) It can diferentiate visually similar pathologies, e.g. bacterial pneumonia vs. cardiogenic pulmonary edema, by
weighing patient age, inflammatory markers, and symptomatic presentation.
      </p>
      <p>
        Benefits: (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Synergy of visual data &amp; clinical knowledge, best demonstrated by annotated radiographic evidence (e.g. model
identifies focal consolidation in lower lung lobe if paired with metadata such as "consolidation + fever + elevated inflammatory
markers"). (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) More accuracy in assigning high probability to an infectious etiology. (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) Holistic approach, lowers diagnostic
ambiguity. (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) AI-driven analysis by knowledge synthesis, i.e. evidence-based medicine.
      </p>
      <p>Recent successes with RAG-based LLMs [13], [14], [15] inspire its deployment in KGML for healthcare.
A vital aspect is amalgamation of contextual domain knowledge in DL. It can be achieved by knowledge
base (KB) design to curate relevant domain knowledge with adequate use of commonsense knowledge
(CSK).</p>
    </sec>
    <sec id="sec-3">
      <title>3. KB Design and Commonsense</title>
      <p>The design of knowledge bases is a one-time process; each designed KB can be used recurrently (to add
context in a DL model) without training it on millions of data samples in each scenario [16], [17], [18]
It can be highly useful in medicine. Consider the following scenario.</p>
      <p>Scenario 2: Automated diabetes diagnosis (XAI through KBs)</p>
      <p>Inputs: Documents of patients’ conditions, family history, medical symptoms, e.g. "41-year old female patient has sugar
level slightly above normal, she conceived at age 37, has a diabetic mother, no one else in the family is diabetic".</p>
      <p>
        KB Usage to Augment DL: Extract domain-knowledge e.g. (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Diabetes is usually inherited from paternal sides; (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) Hereditary
diabetes often skips a generation; (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) Pregnancy with age&gt;35 can cause gestation diabetes.
      </p>
      <p>
        Benefits: (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Adds context &amp; explainability to diagnosis; (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) Saves resources by infusing KB info one-time (vs. making a
DL model learn repeatedly from a vast number of cases), (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) Fosters automated sustainable diagnosis for widespread use,
promoting equity.
      </p>
      <p>Such benefits are highly desirable as cited in the literature [ 19]. Similar claims with XAI can be made
in various fields (radiology, cardiology, psychiatry) for decision support in diagnosis, as elaborated by
doctors [20], [21].</p>
      <p>Another important aspect is quantitative information. Numeric values can be mapped to
domainspecific terms to facilitate reasoning, e.g. "normal body temperature" instead of "values around 36.9
degrees Celsius, or 98.4 degrees Fahrenheit". It can be helpful to reason from inputs provided (in patient
data) along with corresponding medical knowledge. The main claim is that we can learn substantially by
traceability from KBs, and use that knowledge for domain-specific reasoning, augmenting classification
ofered by DL models in decision support.</p>
      <p>Our vision for KB design is synopsized in Fig. 4. Through medical literature study and domain expert
consultation, we propose to annotate domain-specific assertions with contextual information. The
resulting KB can be aggregated by multiple methods as listed next.</p>
      <p>• Context-centric: Fix ranges on a (sub-)set of dimensions, then obtain contextualized values, e.g.
consider "Oral (Mouth): 35.8–37.3∘ C (or 96.4–99.1∘ F)" as "normal" for the dimension temperature,
then obtain the subset of all assertions that appeared with matching context.
• Concept-centric: Center on a concept as subject, and look at most frequent contexts, as well as
statements for these contexts.</p>
      <p>• Statement-centric: For an individual statement, focus on distributions over its contexts.
Hence, we envision KB construction as follows.</p>
      <p>1. Define a set of dimensions e.g. temperature, blood pressure. Schema can be (S/SP/SPO): S, P, O
are subject, predicate, object respectively.
2. Collect positive examples of sentences on these dimensions e.g. "Stage1 Hypertension: Systolic
(130–139 mmHg), Diastolic (80–89 mmHg)".
3. Use these examples, along with random negatives, to train one sentence classifier per dimension,
which predicts the dimension-expression of that sentence: 1 if the dimension is expressed, 0 if
not (e.g. 1 if hypertension occurs, 0 if not).
4. At KB construction time, run each classifier on each sentence in the corpus, and whenever the
dimension-expression-prediction is above some threshold (e.g. &gt; 0.75) assume that the dimension
is expressed. Then use a prompting approach to obtain the qualitative expression, respectively,
grab the number in the sentence and consider it as quantitative expression.</p>
      <p>The analysis of the intended output can then be assisted by two diferent approaches, stated below.
• (A) Start from an S or SP or SPO, observing the most frequent values along each dimension.
• (B) Starting with a dimension range e.g. 35.8–37.3∘ C, observe the most frequent S/SP/SPO.</p>
      <p>Both these approaches can be used for reasoning, and this contextual information in KBs can
help to augment DL models to enhance learning.</p>
      <p>In addition to KBs with domain knowledge, it is important to harness everyday facts comprising
commonsense knowledge (CSK), too obvious for humans but highly dificult for machines. CSK can be
imperative in machine intelligence [23]. Hence, we propose to utilize CSK which can be discovered
from online sources and compiled into CSKBs (commonsense knowledge bases). Methods for CSK
storage and its usage to complement DL have been presented in an interesting tutorial [22]. Its crux is
synopsized in Fig. 5 which is self-explanatory.</p>
      <p>A recent study [24] leveraging the CSKB ConceptNet [25] proved highly efective in improving
multipurpose robot perception, providing better results than LLM-based guidance with ChatGPT [26].
Vital aspects here included consistency and traceability, both guaranteed by CSK (not by the LLM). In
line with such research, we propose that the training of medical AI systems for decision support can
occur better when systems are CSK-infused.</p>
      <p>Consider an example of a tool called CSK-Detector (commonsense knowledge based object detector)
[27] (see Fig. 6) developed using XAI with CSK extracted from a CSKB (commonsense knowledge
base) called DICE [28] entailing commonsense clauses of plausibility, typicality, remarkability and
salience. CSK-Detector has been successfully applied to domestic robotics and can be extended to
medical robotics as well. In a medical context, such a tool can be useful as indicated in the scenario
next.</p>
      <p>Scenario 3: Object detection in medical robots with CSK</p>
      <p>Inputs: Images from medical domains that need task-relevant classification, e.g. photo P1 from a hospital where jewelry is
on object in the room, photo P2 with a surgical table in a room.</p>
      <p>
        CSK-based analysis: Clauses in CSKB e.g. (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Plausibility: Object=Jewelry ⇒ Image ̸= OperationTheater; (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) Salience:
Object=SurgicalTable ⇒ Image=OperationTheater; (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) Classification: P1 is surely not an operation theater; P2 is highly likely
to be an operation theater; (
        <xref ref-type="bibr" rid="ref4">4</xref>
        ) Task-Relevance: Medical robot can be fed with task-specific context to recognize relevant images.
      </p>
      <p>
        Benefits: (
        <xref ref-type="bibr" rid="ref1">1</xref>
        ) Adds interpretability good in robot training, esp. human-robot collaboration; (
        <xref ref-type="bibr" rid="ref2">2</xref>
        ) Reduces complexity by at
least 103 orders of magnitude (#objects: 100s vs. #images: millions) good in sustainable healthcare; (
        <xref ref-type="bibr" rid="ref3">3</xref>
        ) Reduces ambiguity,
enhances trust.
      </p>
      <p>Scenarios such as these exemplify the importance of commonsense-based classification, contributing to
XAI for enhanced comprehension, as well as sustainable AI for enhanced afordability, energy savings
and accessibility. Thus, we advocate the use of CSK in various aspects of sustainable healthcare including
medical robot training and human-robot collaboration.</p>
    </sec>
    <sec id="sec-4">
      <title>4. Transfer Learning and Simplicity</title>
      <p>The paradigm of transfer learning entails discovering hypotheses on smaller datasets and transferring
the learned hypotheses to much larger data. In adapting this paradigm to healthcare, there are many
challenges, some of which are listed as follows.</p>
      <p>• Adequate selection of reasonably-sized data spanning the gamut of the enormous sample space
• Suitable choice of models for accurate learning with low complexity
• Testing the learned hypotheses in real-life scenarios with active involvement of domain experts
In recent work [29], chest X-rays are subjected to transfer learning (with computer vision models) to
classify them as COVID, pneumonia or healthy (See Fig. 7). The approach yields promising results
on unseen test data from real-life datasets. Motivated by these and other success stories [30], [31], we
propose to harness transfer learning by deploying multimodal models such as Gemini, CLIP, GPT-4o on
heteregeneous data comprising medical literature, doctor notes, images, audio and video. This process
is harder than learning from imagery alone, hence in addition to the challenges above, we face the
following additional issues.</p>
      <p>• Suficient and holistic heterogeneous data procurement for multimodal learning
• Data transformation and data augmentation, preferring simplistic approaches
• Procurement of minimal well-grounded data samples to aid learning with maximal accuracy
• Extrinsic evaluation on multiple real-world datasets comprising various data types (with and
without human-in-the-loop)
These and other issues provide the scope for further research that we propose to address as a part of
the proposed work in this vision paper.</p>
      <p>A related paradigm is exploring low-complexity solutions. In recent work [32], a 1D-CNN
(convolutional neural network) interestingly yielded higher accuracy than more complex models in automated
non-invasive tuberculosis detection. The study comprised audio-visual mapping of cough samples to
Mel Spectrograms and numerical mapping to MFCC (Mel Frequency Cepstral Coeficients) followed by
analysis with multiple DL models. A vital aspect of the methodology for low-complexity TB detection
is summarized in Algorithm 1. Using MFCC (which captured domain knowledge), the complexity
of the analysis was significantly reduced (as opposed to training directly on the audio-visual data).
Furthermore, a simple 1D CNN surprisingly gave the best results. It can be reasoned in layperson’s
terms by drawing analogy with intuitive human reasoning to get the right answer (vs. making things
more complex by overthinking). In computational terms, this logic matches the Occam’s Razor principle
of preferring simpler theories to complex ones if both ofer good results, as noticed in the literature
[33], [34], [35], [36], [37] in research spanning computational estimation and other works.</p>
      <p>Likewise, Mel Spectrograms have been useful in other studies [38], [39] involving natural language
data. Thriving on such research, we propose to investigate simple theories with several multimodal
models to analyze heterogeneous data in healthcare. We further propose to deploy the least complex
models that yield the greatest accuracy for the purpose of transfer learning.</p>
      <p>Our proposed vision for low-complexity transfer learning entails the following steps.
1. Explore several models to discover the low complex ones yielding the high accuracy
2. Deploy transfer learning on suitable heterogeneous low-sized datasets (small data), starting with
the least complex model yielding accuracy above thresholds, proceed in ascending order
3. Harness the best transfer learning model, i.e. the best combination of low complexity and
high-accuracy, for transferring the learned hypotheses to larger datasets (big data)
4. Use hypotheses learned on big data for healthcare decision support e.g. disease diagnostics
This process can help to draw good inferences while also saving resources and ofering low-cost
solutions. It promotes widespread automated medical diagnoses, aiding UN SDGs missions in Goals 3,
10 and 11 with positive impacts on healthcare, equity and sustainability.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Experiments and Discussion</title>
      <p>Initial experimentation has been conducted with datasets from healthcare and environmental domains
spanning paradigms such as KGML and transfer learning in our own relevant prior work [29], [10],
[32] that motivated this vision paper. We tabulate a synopsis of the experimental results in Tables 1, 2,
and 3 here.</p>
      <p>Note that KGML adds explanability with RAG-based context capturing domain knowledge (in this
case, spatial masks to add context to the power plants analogous to the manner in which a domain
expert would guide the classification). Similarly, medical domain experts can add context to guide
classification by providing use cases that help to pinpoint a solution. Additionally, KB design along with
CSK can further augment DL models by supplementing them with domain knowledge with semantics
as well as more intuitive everyday facts with worldly knowledge and pragmatics. This can help make
better decisions with explainability.
WND (Wind)
SUN (Solar)
BIT (Biomass / Coal)
NG (Natural Gas)
WAT (Hydroelectric)
Model for TL
VGG16
VGG16
VGG19
VGG19
VGG19
ResNet101
ResNet101</p>
      <p>Regarding transfer learning and low complexity diagnosis, further studies can be motivated thriving
upon the success of our prior work. These include investigating multimodal models with transfer
learning, as well as conducting studies using simplistic models to explore their efectiveness in learning
hypotheses that can be used to analyze medical data. Our proposed approach of investigating the least
complex models and deploying them in transfer learning (as envisioned in Section 4) is worthy of
further experimentation, and consitutes part of our ongoing work. Such work can make good impacts
on sustainable healthcare.</p>
      <p>Specific inputs including use cases, proofs of concept, and example-based reasoning constitute areas
of further study in this proposed work. The design of such inputs is a non-trivial task which motivates
further research. It entails solid work involving active interactions with clinical pracitioners, medical
researchers and other healthcare professionals. Once such inputs are formulated, they can provide
RAG-based context to enhance KGML. Furthermore, automated discovery of domain knowledge can
be conducted to curate KBs which can then be used for further analysis and reasoning. All of these
constitute contributions to the XAI work envisioned in this paper.</p>
      <p>As relevant discussion, we include another vital comment from our interaction with real-life clinical
practitioners [11], [12]: "Recent advancements in machine learning allow for the prediction of five-year
breast cancer risk through the integration of mammographic imaging and clinical variables. Given that
these estimates directly influence subsequent clinical interventions, providing clear rationales for such
predictions is vital. Transparent models enable doctors to evaluate the clinical validity of high-risk outputs,
ensuring that AI serves as an interpretable aid in the decision-making process".</p>
      <p>Both explainability and sustainability are highly desirable in next-generation AI applications [40],
[41], [42], [43], [44], [45]. Our vision in this paper is to promote XAI and sustainable AI in the healthcare
domain. This will assist broader missions of accessibility, afordability, and human-AI collaboration in
healthcare, striving more towards achieving the much-needed global equity.</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions and Impacts</title>
      <p>Our proposed research in this vision paper advocates sustainable AI in healthcare, mainly by proposing
solutions with XAI and transfer learning. Scientific contributions in this vision entail designing methods
to balance accuracy (crucial to healthcare) versus complexity (for afordability and accessibility). Further
implementation of the proposed work in our vision can pave the way to build AI tools (e.g. decision
support systems and mobile apps) for highly accurate, minimally-invasive, evidence-based,
energyeficient, low-cost diagnostics.</p>
      <p>The proposed work in our envisioned research can be particularly useful in areas where doctor-patient
ratios are low, as it can help in greater afordability and accessibility. Hence, it can make significantly
positive impacts on equity in healthcare. Consequently, it can contribute positively towards the United
Nations missions in their Sustainability Development Goals, especially Goals 3, 10 and 11 related to Good
Health and Well-Being, Reduced Inequalities, and Sustainable Cities and Communities, respectively.
This work can be interesting to data scientists, AI professionals, domain experts in healthcare, as well as
stakeholders who can be the potential users of the proposed methods and the software tools resulting
from them. The multidisciplinary appeal of this vision paper is well-suited to fit the theme of data
analytics in real-life applications.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>The work presented in this article is supported by Novo Nordisk Foundation grant NNF25OC0110035. The authors acknowledge
the help of Sune Lundo Sorensen from SDU during the compilation of the camera-ready paper in the required format on
Overleaf. Some of the commonsense knowledge and KB design work in this paper is related to earlier research visits made by
Aparna Varde at Max Planck Institute for Informatics at Saarbrucken, Germany.</p>
    </sec>
    <sec id="sec-8">
      <title>Declaration on Generative AI</title>
      <p>The author(s) have not employed any Generative AI tools.
[10] B. Austin-Gabriel, A. S. Varde, H. Liu, Geoinformatics-guided machine learning for power plant classification, AAAI</p>
      <p>Conference Bridge Program arXiv preprint arXiv:2502.01039 (2025).
[11] OUH, Odense university hospial, https://en.ouh.dk/, 2026.
[12] CAI-X, Odense university hospial, https://cai-x.com/, 2026.
[13] P. Ardimento, M. L. Bernardi, M. Cimitile, Teaching uml using a rag-based llm, in: 2024 International Joint Conference
on Neural Networks (IJCNN), IEEE, 2024, pp. 1–8.
[14] Y. Gao, Y. Xiong, X. Gao, K. Jia, J. Pan, Y. Bi, Y. Dai, J. Sun, H. Wang, H. Wang, Retrieval-augmented generation for large
language models: A survey, arXiv preprint arXiv:2312.10997 2 (2023).
[15] V. Zolfaghari, N. Petrovic, F. Pan, K. Lebioda, A. Knoll, Adopting rag for llm-aided future vehicle design, in: 2024 2nd</p>
      <p>International Conference on Foundation and Large Language Models (FLLM), IEEE, 2024, pp. 437–442.
[16] A. Varde, N. Tandon, S. Nag-Chowdury, G. Weikum, Common sense knowledge in domain-specific knowledge bases,</p>
      <p>Technical Report, Max Planck Institute for Informatics, 2015.
[17] A. Garg, N. Tandon, A. S. Varde, I am guessing you can’t recognize this: Generating adversarial images for object
detection using spatial commonsense, in: AAAI conference, 2020, pp. 13789–13790.
[18] A. Varde, S. Razniewski, T.-P. Nguyen, G. Weikum, Data Mining on Environmental Quantities with Commonsense</p>
      <p>Reasoning, Technical Report, Max Planck Institute for Informatics, 2021.
[19] I. Fasterholdt, M. Gerstrøm, B. S. B. Rasmussen, K. B. Yderstraede, K. Kidholm, K. M. Pedersen, Cost-efectiveness of
telemonitoring of diabetic foot ulcer patients, Health Informatics Journal 24 (2018) 245–258.
[20] M. Mottaqi, P. Zhang, L. Xie, Integrating interpretable machine learning and multi-omics systems biology for personalized
biomarker discovery and drug repurposing in alzheimer’s disease, bioRxiv (2025) 2025–03.
[21] J. A. Bojsen, M. T. Elhakim, O. Graumann, D. Gaist, M. Nielsen, F. S. G. Harbo, C. H. Krag, M. V. Sagar, C. Kruuse, M. P.</p>
      <p>Boesen, et al., Artificial intelligence for mri stroke detection: a systematic review and meta-analysis, Insights into
Imaging 15 (2024) 160.
[22] S. Razniewski, N. Tandon, A. S. Varde, Information to wisdom: Commonsense knowledge extraction and compilation,
in: Proceedings of the 14th ACM international conference on web search and data mining, 2021, pp. 1143–1146.
[23] N. Tandon, A. S. Varde, G. de Melo, Commonsense knowledge in machine intelligence, ACM SIGMOD Record 46 (2018)
49–52.
[24] R. Hidalgo, A. S. Varde, J. Parron, W. Wang, Incorporating commonsense knowledge to enhance robot perception, IEEE
Transactions on Automation Science and Engineering 22 (2025) 15488–15501. URL: https://doi.org/10.1109/TASE.2025.
3565191.
[25] R. Speer, J. Chin, C. Havasi, Conceptnet 5.5: An open multilingual graph of general knowledge, in: Proceedings of the</p>
      <p>AAAI conference on artificial intelligence, volume 31, 2017.
[26] OpenAI, Chatgpt, https://chatgpt.com/, 2022.
[27] I. Chernyavsky, A. S. Varde, S. Razniewski, Csk-detector: Commonsense in object detection, in: 2022 IEEE International</p>
      <p>Conference on Big Data (Big Data), IEEE, 2022, pp. 6609–6612.
[28] Y. Chalier, S. Razniewski, G. Weikum, Dice: A joint reasoning framework for multi-faceted commonsense knowledge.,
in: ISWC (Demos/Industry), 2020, pp. 16–20.
[29] A. S. Varde, D. Karthikeyan, W. Wang, Facilitating covid recognition from x-rays with computer vision models and
transfer learning, Multimedia Tools and Applications 83 (2024) 807–838.
[30] F. Zhuang, Z. Qi, K. Duan, D. Xi, Y. Zhu, H. Zhu, H. Xiong, Q. He, A comprehensive survey on transfer learning,</p>
      <p>Proceedings of the IEEE 109 (2020) 43–76.
[31] K. Weiss, T. M. Khoshgoftaar, D. Wang, A survey of transfer learning, Journal of Big data 3 (2016) 9.
[32] J. Yadav, A. S. Varde, H. Liu, G. Antoniou, L. Xie, Audiovisual multimodal cough data analysis for tuberculosis detection,
in: 2024 15th International Conference on Information, Intelligence, Systems &amp; Applications (IISA), IEEE, 2024, pp. 1–8.
[33] A. S. Varde, Computational estimation by scientific data mining with classical methods to automate learning strategies
of scientists, ACM Transactions on Knowledge Discovery from Data (TKDD) 16 (2022) 1–52.
[34] J. McFadden, Razor sharp: The role of occam’s razor in science, Annals of the New York Academy of Sciences 1530
(2023) 8–17.
[35] A. S. Varde, E. A. Rundensteiner, C. Ruiz, D. C. Brown, M. Maniruzzaman, R. D. Sisson, Designing semantics-preserving
cluster representatives for scientific input conditions, in: Proceedings of the 15th ACM international conference on
Information and knowledge management, 2006, pp. 708–717.
[36] A. S. Varde, E. A. Rundensteiner, C. Ruiz, D. C. Brown, M. Maniruzzaman, Efectiveness of domain-specific cluster
representatives for graphical plots, in: ACM SIGMOD (IQIS workshop), 2006.
[37] T. Blanchard, T. Lombrozo, S. Nichols, Bayesian occam’s razor is a razor of the people, Cognitive science 42 (2018)
1345–1359.
[38] N. Shao, R. Zhou, P. Wang, X. Li, Y. Fang, Y. Yang, X. Li, Cleanmel: Mel-spectrogram enhancement for improving both
speech quality and asr, arXiv preprint arXiv:2502.20040 (2025).
[39] C. Fan, S. Zhang, J. Zhang, E. Liu, X. Li, G. Zhao, Z. Lv, Dmf2mel: A dynamic multiscale fusion network for eeg-driven
mel spectrogram reconstruction, in: Proceedings of the 33rd ACM International Conference on Multimedia, 2025, pp.
6977–6985.
[40] P. Basavaraju, A. S. Varde, Supervised learning techniques in mobile device apps for androids, ACM SIGKDD Explorations
18 (2017) 18–29.
[41] L. Paulino, C. Hannum, A. S. Varde, C. J. Conti, Search methods in motion planning for mobile robots, in: Proceedings
of SAI Intelligent Systems Conference, Springer, 2021, pp. 802–822.
[42] K. Gandhe, A. S. Varde, X. Du, Sentiment analysis of twitter data with hybrid learning for recommender applications, in:
2018 9th IEEE Annual Ubiquitous Computing, Electronics &amp; Mobile Communication Conference (UEMCON), IEEE, 2018,
pp. 57–63.
[43] R. Dwivedi, D. Dave, H. Naik, S. Singhal, R. Omer, P. Patel, B. Qian, Z. Wen, T. Shah, G. Morgan, et al., Explainable ai
(xai): Core ideas, techniques, and solutions, ACM computing surveys 55 (2023) 1–33.
[44] D. Lakshmi, R. S. Tiwari, R. K. Dhanaraj, S. Kadry, Explainable AI (XAI) for sustainable development: Trends and
applications, CRC Press, 2024.
[45] A. Van Wynsberghe, Sustainable ai: Ai for sustainability and the sustainability of ai, AI and Ethics 1 (2021) 213–218.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>[1] OpenAI, Clip: Connecting text and images</article-title>
          , https://openai.com/index/clip/,
          <year>2021</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Team</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Anil</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Borgeaud</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.-B. Alayrac</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Yu</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Soricut</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Schalkwyk</surname>
            ,
            <given-names>A. M.</given-names>
          </string-name>
          <string-name>
            <surname>Dai</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Hauth</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          <string-name>
            <surname>Millican</surname>
          </string-name>
          , et al.,
          <article-title>Gemini: a family of highly capable multimodal models</article-title>
          ,
          <source>arXiv preprint arXiv:2312.11805</source>
          (
          <year>2023</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Hurst</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lerer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. P.</given-names>
            <surname>Goucher</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Perelman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ramesh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Clark</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Ostrow</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Welihinda</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Hayes</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Radford</surname>
          </string-name>
          , et al.,
          <article-title>Gpt-4o system card</article-title>
          ,
          <source>arXiv preprint arXiv:2410.21276</source>
          (
          <year>2024</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>G.</given-names>
            <surname>Corrado</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Barral</surname>
          </string-name>
          ,
          <article-title>Advancing medical ai with med-gemini</article-title>
          , https://openai.com/index/clip/,
          <year>2024</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>F.</given-names>
            <surname>Demrozi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Farmanbar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Engan</surname>
          </string-name>
          ,
          <article-title>Multimodal ai (mmai) for next-generation healthcare: Data domains, algorithms, challenges, and future perspectives, Current Opinion in Biomedical Engineering (</article-title>
          <year>2025</year>
          )
          <fpage>100632</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>A.</given-names>
            <surname>Kline</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Dennis</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Hutch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Wang</surname>
          </string-name>
          , F. Cheng,
          <string-name>
            <surname>Y. Luo,</surname>
          </string-name>
          <article-title>Multimodal machine learning in precision health: A scoping review</article-title>
          ,
          <source>NPJ digital medicine 5</source>
          (
          <year>2022</year>
          )
          <fpage>171</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>K.</given-names>
            <surname>Hao</surname>
          </string-name>
          ,
          <article-title>Training a single ai model can emit as much carbon as five cars in their lifetimes</article-title>
          , MIT Technology Review June (
          <year>2019</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>A.</given-names>
            <surname>Karpatne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Kannan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Kumar</surname>
          </string-name>
          ,
          <article-title>Knowledge guided machine learning: Accelerating discovery using scientific knowledge and data</article-title>
          , CRC Press,
          <year>2022</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>L.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Guan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Peng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Xu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Q.</given-names>
            <surname>Zhu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Till</surname>
          </string-name>
          ,
          <string-name>
            <given-names>X.</given-names>
            <surname>Jia</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Jiang</surname>
          </string-name>
          , et al.,
          <article-title>Knowledge-guided machine learning can improve carbon cycle quantification in agroecosystems</article-title>
          ,
          <source>Nature communications 15</source>
          (
          <year>2024</year>
          )
          <fpage>357</fpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>