<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Multi-Level Explainability in Radiomic-based Classification of Multiple Sclerosis and Ischemic Lesions⋆</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Nighat Bibi</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Kathleen M. Curran</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ronan P. Killeen</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Jane Courtney</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Radiology, St Vincent's University Hospital</institution>
          ,
          <addr-line>Dublin, ireland</addr-line>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>School of Electrical and Electronic Engineering, Technological University Dublin</institution>
          ,
          <addr-line>Dublin</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>School of Medicine, University College Dublin, UCD Belfield</institution>
          ,
          <addr-line>Dublin</addr-line>
          ,
          <country country="IE">Ireland</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Diferentiating multiple sclerosis (MS) from ischemic stroke lesions on MRI remains a clinical challenge due to their similar appearances as white matter hyperintensities. We propose a radiomics-based machine learning framework that integrates multi-level explainable AI (XAI) techniques to support transparent and clinically meaningful lesion classification. Radiomic features are extracted from standardized MRI scans and used to train multiple classifiers, with Random Forest achieving the best performance (accuracy: 91.24%, F1: 86.54%). The framework incorporates four complementary explanation layers: global insights using SHAP, local interpretability via LIME, counterfactual reasoning with DiCE, and clinical narrative generation using GPT-based language models. This layered approach enhances interpretability at both dataset and lesion levels, enabling clinicians to understand, trust, and act upon model outputs. A radiologist who reviewed the results found the explanations helpful and confirmed that the overall analysis was clinically meaningful. Our results demonstrate the value of combining radiomics and advanced XAI techniques for diferential diagnosis of brain lesions.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Explainable AI</kwd>
        <kwd>Radiomics</kwd>
        <kwd>MRI</kwd>
        <kwd>Brain Lesions</kwd>
        <kwd>Multiple Sclerosis</kwd>
        <kwd>Ischemic Stroke</kwd>
        <kwd>SHAP</kwd>
        <kwd>LIME</kwd>
        <kwd>Counterfactual Explanations</kwd>
        <kwd>GPT Narratives</kwd>
        <kwd>Clinical Decision Support</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction and Related Work</title>
      <p>
        Diferentiating multiple sclerosis (MS) from ischemic stroke lesions on magnetic resonance imaging
(MRI) is a complex diagnostic task due to their overlapping appearance as white matter hyperintensities
(WMHs). MS is a chronic inflammatory disease marked by demyelination, while ischemic lesions arise
from vascular occlusion and subsequent tissue damage. Despite their distinct pathologies, both appear
similar on common MRI sequences like FLAIR, complicating manual diagnosis and often requiring
expert interpretation [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        Radiomics ofers a quantitative approach to analyze lesion characteristics by extracting texture,
shape, and intensity-based features from medical images [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. These features, when used with machine
learning models, have shown promise in identifying subtle diferences between MS and ischemic lesions.
However, such models often behave like "black boxes," with limited transparency in how they make
decisions, which restricts their adoption in clinical workflows.
      </p>
      <p>
        Explainable AI (XAI) techniques have developed to tackle this issue by making model decisions
interpretable. SHapley Additive exPlanations (SHAP) [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ] and Local Interpretable Model-Agnostic
Explanations (LIME) [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] are widely used to understand feature contributions at both global and local levels. In
neuroimaging, these methods have been applied to improve the transparency of disease classifiers. For
instance, Eitel et al. [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] used relevance propagation to explain CNN-based MS classification. Basu et al.
[
        <xref ref-type="bibr" rid="ref6">6</xref>
        ] and Lopatina et al. [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ] have explored similar approaches using clinical and imaging data. Leite et al.
[
        <xref ref-type="bibr" rid="ref8">8</xref>
        ] demonstrated the use of texture features and SVM to distinguish between MS and ischemic lesions,
achieving notable accuracy on a small private dataset. Castillo et al. [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] used wavelet-transformed
radiomics and machine learning to diferentiate lesion types, but lacked interpretability mechanisms.
Vuong et al. [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] proposed Radiomics Feature Activation Maps to enhance the interpretability of
radiomic signatures by spatially localizing the regions contributing most to model predictions. Their
method enables visual attribution of radiomic features at the voxel level, improving transparency and
clinical trust in radiomics-based models.
      </p>
      <p>In contrast to previous work, our study proposes a multi-level XAI framework that integrates four
layers of interpretability: (1) SHAP for global feature importance, (2) LIME for local explanations,
(3) DiCE for counterfactual reasoning, and (4) GPT-generated clinical narratives for human-aligned
interpretation. This layered approach supports both technical transparency and clinical relevance. We
evaluate the method using lesion-wise radiomic features extracted from two public datasets—MSSEG
(for MS) and ISLES (for stroke)—and compare multiple classifiers, identifying Random Forest as the
best-performing model. To the best of our knowledge, this is the first radiomic framework to combine
SHAP, LIME, counterfactual explanations, and language-based narratives for transparent brain lesion
classification.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Materials and Methods</title>
      <sec id="sec-2-1">
        <title>2.1. Dataset Description</title>
        <p>
          This study utilizes two publicly available MRI datasets for lesion segmentation and classification. The
ifrst is the ISLES 2022 dataset [
          <xref ref-type="bibr" rid="ref11">11</xref>
          ], which includes difusion-weighted and FLAIR images from 250
ischemic stroke cases collected across multiple centers in Europe. The second is the MSSEG 2016 dataset
[
          <xref ref-type="bibr" rid="ref12">12</xref>
          ], containing T2-FLAIR MRI scans from 53 patients diagnosed with multiple sclerosis. Lesions in
both datasets were manually segmented by clinical experts, providing high-quality annotations for
radiomic analysis. In total, 13489 2D images were extracted: 6281 from multiple sclerosis patients and
7208 from ischemic stroke patients. We randomly split the cases into 70% training, 15% validation, and
15% testing sets, ensuring that images from the same patient appear in only one set.
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Preprocessing Pipeline</title>
        <p>
          All MRI volumes were converted into 2D slices along axial, coronal, and sagittal planes. The following
preprocessing steps were applied to each slice:
• Bias Field Correction: N4 bias correction was applied to reduce intensity non-uniformities.
• Intensity Normalization: Pixel intensities were scaled to the [0, 255] range using min–max
normalization.
• Denoising: Non-local means filtering [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ] was used to suppress noise while preserving texture.
• Mask Alignment: Lesion masks were resampled to match MRI dimensions where needed.
• Brain Region Cropping: Slices with low brain content were excluded; valid slices were cropped
to brain region with margin.
• Resizing and Padding: All slices and masks were resized to 224× 224 pixels with padding where
necessary.
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Radiomic Feature Extraction and Selection</title>
        <p>
          Radiomic features were extracted on a per-lesion basis using the PyRadiomics library [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] with default
parameters. Each lesion was treated as a separate connected component, resulting in the extraction
of 90 quantitative features covering intensity, texture, and structural characteristics. Diagnostic and
non-informative metadata were excluded, and missing values were imputed using a mean-based strategy.
Feature selection was performed exclusively on the training set to prevent information leakage. A
univariate ANOVA F-test was applied using SelectKBest (method from scikit-learn python library),
selecting the top 20 features based on their statistical significance with respect to the lesion class (MS
vs. Ischemic). The selected features were then applied to the validation and test sets. These features
span multiple radiomic families, including first-order intensity features, gray-level co-occurrence
matrix (GLCM), gray-level run-length matrix (GLRLM), gray-level size zone matrix (GLSZM), gray-level
dependence matrix (GLDM), and neighborhood gray-tone diference matrix (NGTDM).
        </p>
      </sec>
      <sec id="sec-2-4">
        <title>2.4. Classification Models</title>
        <p>We evaluated three classifiers: Random Forest (RF), Logistic Regression (LR), and Support Vector
Machine (SVM). Each model was optimized using RandomizedSearchCV (method from scikit-learn
python library) on a combined training and validation set with a predefined split. Hyperparameters
were selected based on F1-score. The final models were retrained on the full training+validation set and
evaluated on the held-out test set.</p>
      </sec>
      <sec id="sec-2-5">
        <title>2.5. Multi-Level Explainability Framework</title>
        <p>
          To ensure transparency and clinical relevance, the proposed framework integrates four complementary
XAI strategies:
• Global Explanations (SHAP): SHAP values [
          <xref ref-type="bibr" rid="ref3">3</xref>
          ] were computed for the Random Forest model to
rank feature importance across the test set.
• Local Explanations (LIME): For individual lesion predictions, LIME [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ] provided local feature
attribution.
• Counterfactuals (DiCE): We employed the DiCE framework [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ] to generate counterfactual
examples that would alter the model’s prediction, identifying minimal changes required to flip
class.
• Clinical Narratives (GPT): LIME outputs (feature names, values, and contribution direction)
were passed into a structured GPT-4o prompt to generate clinician-friendly narratives. The prompt
instructed GPT to summarize the predicted lesion type (MS or ischemic stroke), explain the most
influential features supporting the prediction, discuss features contradicting the alternative
diagnosis, and conclude with the primary reason for the prediction. The explanations avoided
technical jargon, used real-world MRI interpretations, and followed a concise, bolded structure
for readability.
        </p>
        <p>This layered approach supports both technical and human-aligned interpretability, enhancing
transparency and trust in the AI-assisted diagnosis process.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Results and Discussion</title>
      <p>We evaluated three classifiers on the test set and analyzed the interpretability of the best-performing
model using SHAP, LIME, DiCE, and GPT-based narratives.</p>
      <sec id="sec-3-1">
        <title>Classification Performance</title>
      </sec>
      <sec id="sec-3-2">
        <title>Global Interpretability (SHAP)</title>
        <p>SHAP values were computed to explain the contribution of each radiomic feature across the dataset.
Figure 1 shows that texture features like glcm_Idmn and firstorder_Skewness were consistently
important.</p>
      </sec>
      <sec id="sec-3-3">
        <title>Local Interpretability (LIME + GPT Narratives)</title>
        <p>LIME provided per-lesion explanations, and GPT transformed these into clinical narratives. Figure 2
illustrates a case predicted as MS. High uniformity and low contrast were influential, supporting the
MS diagnosis. In contrast, Figure 3 shows a predicted ischemic lesion. High skewness and entropy
supported the prediction, indicating structural heterogeneity.</p>
      </sec>
      <sec id="sec-3-4">
        <title>Counterfactual Reasoning with DiCE</title>
        <p>To simulate alternative diagnostic scenarios and examine model robustness, we used DiCE to generate
counterfactual examples. These identify minimal, actionable changes to radiomic features that would
result in a diferent prediction outcome. This approach enhances transparency by answering the
question: "What would need to change for the lesion to be classified diferently?"</p>
        <p>Figure 4 shows a counterfactual explanation for a lesion originally classified as MS. DiCE suggests that
reducing the mean intensity (original_firstorder_Mean) from 97.45 to 31.60, and increasing the
Predicted Diagnosis: MS (Multiple Sclerosis) – Lesion 3
Prediction Probability: 97%
Key Features Supporting MS:
•High Gray Level Run Emphasis (1.70): Indicates concentrated areas with high signal intensity runs, commonly seen in MS plaques
due to demyelination and chronic inflammation.
•High Gray Level Zone Emphasis (2.50): Reflects large, homogeneously intense regions, consistent with typical MS lesion appearance
in FLAIR MRI.
•Gray Level Variance (0.18): Low heterogeneity in gray levels suggests uniform lesion intensity, a hallmark of MS plaques.
•Low Gray Level Zone Emphasis (0.62): Reflects reduced presence of darker, ischemia-associated zones, favoring MS.
•Short Run Low Gray Level Emphasis (0.36): Indicates less fragmented low-signal areas, again pointing toward uniform MS lesions.
•First-order Median Intensity (116.00): Falls within the expected range for MS lesions, indicating moderately hyperintense signal.
Why Ischemic Stroke is Unlikely:
•Autocorrelation (1.67): Below typical ischemic levels; ischemic lesions often show higher internal texture correlation.
•Joint Average (1.27) &amp; Sum Average (2.55): These GLCM features are lower than typical ischemic values, indicating less internal
gray-level co-occurrence and spread.
•NGTDM Complexity (0.25): Much lower than expected in ischemic strokes, which usually present with more complex and
heterogeneous textures.</p>
        <p>Conclusion:
Lesion 3 is classified as MS primarily due to its homogeneous intensity, lack of low-gray regions, and texture simplicity, which are
characteristic of chronic demyelinating lesions rather than ischemic damage.
original_glszm_SmallAreaHighGrayLevelEmphasis from 0.14 to 33.46, would be suficient to
lfip the prediction to ischemic. These feature adjustments reflect plausible variations in lesion brightness
and structural homogeneity that align with known imaging patterns of ischemic pathology. This layer
of "what-if " analysis enables clinicians to explore how small radiomic shifts could afect classification
outcomes, making the model more interpretable and clinically interactive. Although DiCE successfully
generated plausible counterfactual examples, interpreting the proposed feature modifications requires
clinical expertise, as some radiomic changes may not correspond directly to observable anatomical
changes.</p>
        <p>Predicted Diagnosis: Ischemic Stroke – Lesion 1
Prediction Probability: 87%
Key Features Supporting Ischemic Stroke:
•High Gray Level Zone Emphasis (7.80): Indicates large, bright uniform regions—typical of acute/subacute infarcts with
welldemarcated necrotic tissue.
•Short Run Low Gray Level Emphasis (0.12): Suggests a lack of short, low-intensity textures—supporting ischemia, where lesions tend
to be sharply defined and not mottled.
•Low Gray Level Zone Emphasis (0.31): Minimal presence of dark areas aligns with the uniform brightness of ischemic lesions.
•First-order Median (163.00), Root Mean Squared (170.14), and Mean Intensity (169.25): All indicate high signal intensity,
characteristic of ischemic tissue with edema or gliosis in FLAIR MRI.</p>
        <p>Why MS is Unlikely:
•Joint Average (2.43): While elevated, it's less distinctive for MS compared to more specific texture patterns.
•Autocorrelation (6.15): Although relatively high, it does not offset the dominant ischemic characteristics.
•NGTDM Complexity (1.55): Moderate complexity, but not high enough to suggest the varied texture seen in MS plaques.
•Gray Level Variance (1.04): Suggests some heterogeneity but insufficient to match the typical diverse texture of MS lesions.
Conclusion:
Lesion 1 is classified as Ischemic Stroke due to its high uniform brightness, low texture fragmentation, and elevated mean
intensities, all of which are classic features of infarcts rather than demyelinating MS lesions.
The combination of radiomics and multi-level XAI revealed consistent and interpretable patterns in
lesion classification. SHAP highlighted globally dominant features, while LIME showed per-lesion
factors contributing to predictions. DiCE further provided hypothetical scenarios for counter-diagnosis,
and GPT-based narratives completed the pipeline by ofering concise, clinician-aligned explanations.
Together, these methods support not only technical validation but also clinical usability, making
the system suitable for diagnostic support in real-world settings. Each explanation modality brings
distinct strengths and limitations. SHAP provides both global and local feature attributions but can
be computationally intensive. LIME generates interpretable local explanations but may sufer from
instability across small perturbations. DiCE enables actionable counterfactuals but sometimes proposes
feature changes that may not correspond directly to plausible anatomical variations. GPT narratives
ofer intuitive, clinician-friendly summaries, but they are prone to occasional hallucinations, especially
in ambiguous cases. Understanding these trade-ofs is crucial for practical clinical adoption.</p>
        <p>To assess the clinical relevance of the generated explanations, we shared the model outputs and
narratives with a practicing radiologist. The radiologist noted that the explainability provided was very
helpful and that the overall analysis made sense. However, broader evaluation is needed. Future work
will involve multi-expert validation with inter-rater agreement scoring and quantitative assessment of
explanation fidelity to strengthen the clinical robustness of the framework. Additionally, integrating
counterfactual outputs from DiCE into GPT-based narrative generation presents an exciting opportunity
to create "what-if" clinical explanations that can further enhance the usability of the system.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusion</title>
      <p>This study presents a radiomics-based classification framework for diferentiating multiple sclerosis
and ischemic stroke lesions on MRI, enhanced with a multi-level explainable AI pipeline. Among the
evaluated classifiers, Random Forest achieved the best overall performance, with an accuracy of 91.2%
and an F1-score of 86.5%. Beyond predictive performance, the framework integrates global (SHAP), local
(LIME), counterfactual (DiCE), and natural language (GPT) explanations to provide transparent and
clinically meaningful insights. SHAP identified texture-based radiomic features as globally influential,
while LIME and GPT enabled per-lesion interpretability in clinician-friendly language. DiCE ofered
hypothetical reasoning to explore how small changes in feature values could lead to diferent diagnoses.
Together, these methods create a robust and interpretable decision support system that bridges technical
and clinical domains. Future work will focus on experimenting with diferent radiomic extraction
parameters, such as varying angles, distances, and bin widths, to explore their impact on classification
and explainability. We also plan to expand this framework to include multi-modal MRI inputs, increase
sample diversity for better generalization, and integrate visual explanation methods, such as
attentionbased heatmaps or Grad-CAM visualizations, to enhance clinical trustworthiness. Finally, fine-tuning
the narrative generation process using domain-specific language models will be explored. By combining
radiomics with layered explainability, this approach supports the safer, more transparent deployment
of AI tools in neuroimaging diagnostics.</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>This research is funded by the SFI Center for Research Training in Machine Learning (18/CRT/6183).</p>
    </sec>
    <sec id="sec-6">
      <title>Declaration on Generative AI</title>
      <p>During the preparation of this work, the author(s) used GPT-4o and Grammarly for grammar and spelling
checks. The author(s) reviewed and edited the output as necessary and assume full responsibility for
the content of the publication.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>P.</given-names>
            <surname>Wildner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Stasiołek</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Matysiak</surname>
          </string-name>
          ,
          <article-title>Diferential diagnosis of multiple sclerosis and other inflammatory cns diseases</article-title>
          ,
          <source>Multiple sclerosis and related disorders 37</source>
          (
          <year>2020</year>
          )
          <fpage>101452</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>P.</given-names>
            <surname>Lambin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Rios-Velazquez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Leijenaar</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Carvalho</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. G.</given-names>
            <surname>Van Stiphout</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Granton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Zegers</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Gillies</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Boellard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Dekker</surname>
          </string-name>
          , et al.,
          <article-title>Radiomics: extracting more information from medical images using advanced feature analysis</article-title>
          ,
          <source>European journal of cancer 48</source>
          (
          <year>2012</year>
          )
          <fpage>441</fpage>
          -
          <lpage>446</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Lundberg</surname>
          </string-name>
          ,
          <article-title>A unified approach to interpreting model predictions</article-title>
          ,
          <source>arXiv preprint arXiv:1705.07874</source>
          (
          <year>2017</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M. T.</given-names>
            <surname>Ribeiro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Singh</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Guestrin</surname>
          </string-name>
          , '
          <article-title>why should i trust you?' explaining the predictions of any classifier</article-title>
          ,
          <source>in: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>1135</fpage>
          -
          <lpage>1144</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>F.</given-names>
            <surname>Eitel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E.</given-names>
            <surname>Soehler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Bellmann-Strobl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. U.</given-names>
            <surname>Brandt</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Ruprecht</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. M.</given-names>
            <surname>Giess</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Kuchling</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Asseyer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Weygandt</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.-D. Haynes</surname>
          </string-name>
          , et al.,
          <article-title>Uncovering convolutional neural network decisions for diagnosing multiple sclerosis on conventional mri using layer-wise relevance propagation</article-title>
          ,
          <source>NeuroImage: Clinical</source>
          <volume>24</volume>
          (
          <year>2019</year>
          )
          <fpage>102003</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>S.</given-names>
            <surname>Basu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Munafo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.-F.</given-names>
            <surname>Ben-Amor</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Roy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Girard</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Terranova</surname>
          </string-name>
          ,
          <article-title>Predicting disease activity in patients with multiple sclerosis: An explainable machine-learning approach in the mavenclad trials</article-title>
          ,
          <source>CPT: Pharmacometrics &amp; Systems Pharmacology</source>
          <volume>11</volume>
          (
          <year>2022</year>
          )
          <fpage>843</fpage>
          -
          <lpage>853</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Lopatina</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Ropele</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Sibgatulin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. R.</given-names>
            <surname>Reichenbach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Güllmar</surname>
          </string-name>
          ,
          <article-title>Investigation of deep-learningdriven identification of multiple sclerosis patients based on susceptibility-weighted images using relevance analysis</article-title>
          ,
          <source>Frontiers in neuroscience 14</source>
          (
          <year>2020</year>
          )
          <fpage>609468</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Leite</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Rittner</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Appenzeller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. H.</given-names>
            <surname>Ruocco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Lotufo</surname>
          </string-name>
          ,
          <article-title>Etiology-based classification of brain white matter hyperintensity on magnetic resonance imaging</article-title>
          ,
          <source>Journal of Medical Imaging</source>
          <volume>2</volume>
          (
          <year>2015</year>
          )
          <fpage>014002</fpage>
          -
          <lpage>014002</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>D. P.</given-names>
            <surname>Castillo</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. J.</given-names>
            <surname>Samaniego</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Jiménez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            <surname>Cuenca</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O. A.</given-names>
            <surname>Vivanco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. M.</given-names>
            <surname>Álvarez-Gómez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. J.</given-names>
            <surname>Rodriguez-Alvarez</surname>
          </string-name>
          ,
          <article-title>Identifying demyelinating and ischemia brain diseases through magnetic resonance images processing</article-title>
          ,
          <source>in: 2019 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC)</source>
          , IEEE,
          <year>2019</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>3</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>D.</given-names>
            <surname>Vuong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Tanadini-Lang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Wu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Marks</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Unkelbach</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hillinger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. I.</given-names>
            <surname>Eboulet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Thierstein</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Peters</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Pless</surname>
          </string-name>
          , et al.,
          <article-title>Radiomics feature activation maps as a new tool for signature interpretability</article-title>
          ,
          <source>Frontiers in oncology 10</source>
          (
          <year>2020</year>
          )
          <fpage>578895</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <surname>M. R. Hernandez Petzsche</surname>
            , E. de la Rosa, U. Hanning,
            <given-names>R.</given-names>
          </string-name>
          <string-name>
            <surname>Wiest</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          <string-name>
            <surname>Valenzuela</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Reyes</surname>
            , M. Meyer, S.-L. Liew,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Kofler</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Ezhov</surname>
          </string-name>
          , et al.,
          <year>Isles 2022</year>
          :
          <article-title>A multi-center magnetic resonance imaging stroke lesion segmentation dataset</article-title>
          ,
          <source>Scientific data 9</source>
          (
          <year>2022</year>
          )
          <fpage>762</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>O.</given-names>
            <surname>Commowick</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Kain</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Casey</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ameli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.-C.</given-names>
            <surname>Ferré</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Kerbrat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Tourdias</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Cervenansky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Camarasu-Pop</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Glatard</surname>
          </string-name>
          , et al.,
          <article-title>Multiple sclerosis lesions segmentation from multiple experts: The miccai 2016 challenge dataset</article-title>
          ,
          <source>Neuroimage</source>
          <volume>244</volume>
          (
          <year>2021</year>
          )
          <fpage>118589</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Buades</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Coll</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.-M. Morel</surname>
          </string-name>
          ,
          <article-title>Non-local means denoising</article-title>
          ,
          <source>Image Processing On Line</source>
          <volume>1</volume>
          (
          <year>2011</year>
          )
          <fpage>208</fpage>
          -
          <lpage>212</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <surname>J. J. Van Griethuysen</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Fedorov</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Parmar</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Hosny</surname>
            ,
            <given-names>N.</given-names>
          </string-name>
          <string-name>
            <surname>Aucoin</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          <string-name>
            <surname>Narayan</surname>
            ,
            <given-names>R. G.</given-names>
          </string-name>
          <string-name>
            <surname>Beets-Tan</surname>
            ,
            <given-names>J.-C.</given-names>
          </string-name>
          <string-name>
            <surname>Fillion-Robin</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <string-name>
            <surname>Pieper</surname>
            ,
            <given-names>H. J.</given-names>
          </string-name>
          <string-name>
            <surname>Aerts</surname>
          </string-name>
          ,
          <article-title>Computational radiomics system to decode the radiographic phenotype</article-title>
          ,
          <source>Cancer research 77</source>
          (
          <year>2017</year>
          )
          <fpage>e104</fpage>
          -
          <lpage>e107</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>R. K.</given-names>
            <surname>Mothilal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Sharma</surname>
          </string-name>
          ,
          <string-name>
            <surname>C. Tan,</surname>
          </string-name>
          <article-title>Explaining machine learning classifiers through diverse counterfactual explanations</article-title>
          ,
          <source>in: Proceedings of the 2020 conference on fairness, accountability, and transparency</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>607</fpage>
          -
          <lpage>617</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>