<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>On Counterfactual Explanations of Cardiovascular Risk in Adolescent and Young Adult Breast Cancer Survivors</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Alice Bernasconi</string-name>
          <email>a.bernasconi6@campus.unimib.it</email>
          <email>alice.bernasconi@istitutotumori.mi.it</email>
          <xref ref-type="aff" rid="aff2">2</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Antonio Balordi</string-name>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Alessio Zanga</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff3">3</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Rafael Cabañas</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Data Science and Advanced Analytics, F. Hofmann - La Roche Ltd</institution>
          ,
          <addr-line>Basel</addr-line>
          ,
          <country country="CH">Switzerland</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Mathematics and CDTIME, University of Almería</institution>
          ,
          <addr-line>Almería</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>Evaluative Epidemiology Unit, Department of Epidemiology and Data Science, Fondazione IRCCS Istituto Nazionale dei tumori</institution>
          ,
          <addr-line>Milan</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
        <aff id="aff3">
          <label>3</label>
          <institution>Models and Algorithms for Data and Text Mining Laboratory (MADLab), University of Milano-Bicocca</institution>
          ,
          <addr-line>Milan</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>In the last decades, the growing population of cancer survivors has shifted researchers' focus from primary toward tertiary prevention. Particularly, adolescents and young adults (AYAs) breast cancer (BC) survivors may face long-term outcomes as a result of their treatments, among which cardiovascular diseases (CVDs) are the most life-threatening ones. To plan efective follow-up guidelines for preventing and treating these events, it is essential to disentangle the causal role of cancer treatments in these patients. In this work, we aim to extend the current state of BC treatment guidelines by leveraging on the estimate of the risk of CVDs in AYAs who underwent BC treatments, as provided by a causal Bayesian network. In these regards, we provide counterfactual explanations of a causal query, using real-world data, algorithms and methods from the causal inference domain. We show that while ovarian suppression combined with tamoxifen may be a necessary cause for ischemic heart disease, it is not a suficient one, i.e., this treatment alone is not enough to cause the disease, other factors must also be present. These findings can contribute to support clinicians in the treatment choice and help in refining treatment strategies and follow-up protocols for AYAs, advancing personalised healthcare in oncology.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Causal networks</kwd>
        <kwd>Counterfactual explanations</kwd>
        <kwd>Breast cancer survivors</kwd>
        <kwd>Treatment guidelines</kwd>
        <kwd>Cardiovascular diseases</kwd>
        <kwd>Adolescents and young adults</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        Chronic diseases are by far the leading causes of mortality world-wide. In the last decades, the prevalence
of chronic diseases is rising due to changes in life-style and the population aging, especially in Italy. Most
of the times, these disorders present co-concurrence of multiple other chronic diseases (co-morbidities)
that require the involvement of several caregivers for proper patient care. However, hospital care,
ambulatory specialist care and primary care are subdivided into numerous entities, based mainly
on medical specialty. Hence, to provide to these patients the optimal and integrated care is a major
challenge for the health care system [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ].
      </p>
      <p>
        Every year in Italy about 400 thousands new cancer diagnosis are registered, with the highest
incidence in older adults. On the other hand, cancer survival is continuously improving thanks to
the innovations in patient care, thus making the proportion of cured patients growing. Oncologists
are in charge of cancer diagnosis and treatment. These tasks require a massive number of visits and
examinations especially during the first year since cancer identification. Once cancer treatment is
concluded, follow-up visits are scheduled to prevent cancer relapse, with the scheduling decreasing
with time and depending on the major cancer prognostic factors. Although oncologists have detailed
information about the treatments their patients received, they do not have the ability to monitor all
of their efects, especially in the long-term. Nevertheless, there is strong evidence in the scientific
literature on the wide variety of long-term efects that cancer therapies can cause, including diseases of
the cardiovascular or endocrine system, reproductive disorders, infections and so on [
        <xref ref-type="bibr" rid="ref2">2</xref>
        ]. Moreover,
cancer is a complex disease treated with combined treatments: the mixed efect of diferent treatments
makes it even more dificult to predict the possible late outcomes.
      </p>
      <p>General practitioners (GPs) are the only physicians able to monitor the patients throughout their
lifetime. However, they might not have complete access to hospital charts nor the knowledge needed in
all the medical specialties. Moreover, data about the efect of the most innovative cancer treatments
are scarce. Clearly identify who and how should be involved in patients follow-up is essential to help
oncologists and GPs to plan efective follow-up guidelines to prevent and treat long-term outcomes in
cancer survivors. This aspect is also important from a public health prospective and for policy-makers
interested in better organizing resources. Artificial intelligence (AI) and causal inference (CI) can help
in enriching the knowledge of the medical stakeholders building efective tools to be used to quantify
and causally explain the burden of late outcomes in cancer survivors.</p>
      <p>Adolescents and young adults (AYAs, patients aged 15 to 39 at first cancer diagnosis) are an
heterogeneous and peculiar group of cancer patients who deserve special attention. This age-group share
tumours’ case-mix both with the younger and older counterpart. Breast cancer (BC) is the most frequent
cancer in AYA females, as in older women. Nevertheless, there is a survival gap attributable to a more
aggressive biology of BC in AYA than in older patients.</p>
      <p>Most patients with BC receive surgery as main treatment. The major surgical procedure can be
preceded or followed by other treatments both systemic (chemotherapy, target therapy or hormones
therapy) or local (radiotherapy). Treatment guidelines are the same both for AYAs and older women and
depend on several factors related to the cancer (e.g., extent of the disease) and the host (e.g., hormonal
status). Oncologists choose the best combination of treatments with two major objective: i) to maximise
the patient chance of survival and ii) to minimise the risk of cancer relapse.</p>
      <p>
        Despite the large knowledge of late efects of cancer treatments in older women, little is known about
the magnitude of the impact of cancer treatments in younger patients, like AYAs. The accumulation of
stress induced by cancer and its treatment may contribute to accelerate aging in young cancer survivors,
inducing premature mortality, frailty and other age-related diseases, like cardiovascular diseases (CVDs)
[
        <xref ref-type="bibr" rid="ref3">3</xref>
        ].
      </p>
      <p>
        The main contributions of this manuscript, made by leveraging on the first AI model [
        <xref ref-type="bibr" rid="ref4">4</xref>
        ] developed
for estimating the risk of CVDs in AYAs that survived after BC treatments, are the following:
• To enrich BC treatment guidelines with knowledge on CVDs risk in young women;
• To contribute to disentangling uncertainty in treatment choice using counterfactual explanations
on the most relevant late outcome in these patients;
• To help clinicians in tailoring personalised follow-up guidelines for high-risk patients.
      </p>
      <p>The rest of the manuscript is organised as follows. Section 2 introduces the notation and gives the
main definitions to make the paper as much as possible self-contained. The main contributions on
the case study of adolescent and young adults breast cancer survivors are presented in Section 3. We
close the manuscript with the description of the experimental results (Section 3.4) and the discussion of
the achievements (Section 4), with some proposals on how to develop further along the same research
direction for answering more ambitious and complex counterfactual queries.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Methods</title>
      <p>
        In this section we introduce the notation, together with the main concepts and the mathematical models
needed to follow the rest of the paper. In particular, we give the definitions of Bayesian network, causal
network and structural causal model, while also describing the three rungs of the ladder of causation
[
        <xref ref-type="bibr" rid="ref5">5</xref>
        ] which are fundamental to understand our contributions.
      </p>
      <sec id="sec-2-1">
        <title>2.1. Bayesian Networks</title>
        <p>
          Bayesian networks (BNs) [
          <xref ref-type="bibr" rid="ref6 ref7 ref8">6, 7, 8</xref>
          ] are a type of probabilistic graphical model (PGM) used for reasoning
under uncertainty. BNs are made of a qualitative component in the form of direct acyclic graph (DAG)
encoding the independence relations between the variables in the problem, while the quantitative
component is a set probability distributions measuring such relations. More formally, BNs can be
defined as follows.
        </p>
        <sec id="sec-2-1-1">
          <title>Definition 1 (Bayesian Network (BN)).</title>
          <p>A Bayesian network is a pair ⟨, ⟩, where:
•  = ⟨V, E⟩ is a DAG, with V a set of vertices and E ⊂
•  is a probability distribution over the random vector X.</p>
          <p>V ×</p>
          <p>V a set of directed edges,
Each vertex  ∈ V is mapped to a variable  ∈ X, so that the global probability distribution  is
factorised over  into local probability terms  (| ()), with  () the parents1 of .</p>
          <p>For each variable  ∈ V, we define the ancestors of  to be the set of variables  ∈ V ∖ {} such
that there exists a directed path2. Similarly, we define the descendants of  to be the set of variables
 ∈ V ∖ {} such that there exists a directed path from  to  . Henceforth, we will refer to a vertex
 and its corresponding variable  interchangeably.</p>
          <p>Definition 2 (Causal Network (CN)). A Causal Network is a BN in which any edge from parents to
children represents a cause-efect relationship.</p>
        </sec>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Observational and Interventional Rungs</title>
        <p>Standard probabilistic inference involves computing the posterior probability distribution for variables
of interest given evidence about other variables, commonly referred to as observational queries (e.g.,
“what if I see this?”). For instance, given states  and  of random variables  and  , respectively,
an observational query might involve computing the conditional probability  (|). Here,  and 
represent the presence of  and  , while ′ and ′ denote their absence. In this context, the average
treatment efect (ATE) is defined as</p>
        <p>ATE(,  ) =  (|) −  (|′).</p>
        <p>Conversely, causal reasoning focuses on hypothetical scenarios where we calculate the probability of
a variable given that we intervene on another. For example, the query  ( = |( = )) represents
the probability that  equals  when  is intervened to take the value . The notation do( = )
explicitly denotes an intervention, distinguishing it from mere observation. The diference between
two such interventional queries, known as the causal efect diference or average causal efect (ACE), is
defined as</p>
        <p>ACE(,  ) =  ( =  | do( = )) −  ( =  | do( = ′)).</p>
        <p>To calculate an interventional query, a process often referred to as "surgery" is employed. This
graphical operation involves removing the incoming arcs to the intervened variable  and setting the
node to a specific value  = . The model that results from this surgical intervention is known as the
"post-intervention" model. This process is performed to restrict the natural tendency of the variable to
change in response to other variables in the environment.</p>
        <p>Performing graph surgery is the initial step required to distinguish the associative efect from the purely
causal efect. However, a causal estimand cannot be directly estimated using a statistical estimator; it
must first be translated into a statistical estimand by removing the intervention. This process is known
as the identification of the causal efect .</p>
        <p>
          If there exists a set of covariates Z that satisfies the back-door criterion [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] in the model, then there
exists a consistent estimator for the causal efect of  on  :
 ( =  | do( = )) = ∑︁  ( =  |  = , Z = z) (Z = z).
        </p>
        <p>z
1A vertex  is said to be a parent of  if there exists a directed edge from  to .
2A directed path from  to  is sequence of directed edges starting from  and ending in 
(1)
(2)
(3)</p>
        <p>
          Under the condition of exogeneity (also known as no-confounding) [
          <xref ref-type="bibr" rid="ref9">9</xref>
          ], the way  would potentially
respond to experimental conditions  or ′ is independent of the actual value of . This implies that
 ( = |do( = )) =  ( = | = ) and  ( = |do( = ′)) =  ( = | = ′), thus
making (,  ) =  (,  ). A graphical criterion to identify the condition of exogeneity is
the absence of a common ancestor of  and  connected to  through a directed path that does not
include .
        </p>
      </sec>
      <sec id="sec-2-3">
        <title>2.3. Counterfactual Rung</title>
        <p>
          Counterfactual queries [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ] explore hypothetical scenarios, such as, "What would the outcome have
been if the variable had taken a diferent value?" For example,  (| = ′) represents the probability
of  if  had taken the value  instead of ′. Here,  relates to the hypothetical scenario, while  is
in the real scenario.
        </p>
        <p>A key concept in this context is the probability of necessity (PN), which measures the extent to which
one event is a necessary condition for another. The PN is defined as:</p>
        <p>PN(,  ) =  (′ = ′| = ,  = ).</p>
        <p>Here,  is considered a necessary cause for  if  would not have occurred without , given that both
 and  actually occurred. Therefore, PN represents our certainty about  being a necessary cause of
 .</p>
        <p>Similarly, we may also be interested in determining whether an event is a suficient condition. To
address this, we define the probability of suficiency (PS) as:</p>
        <p>PS(,  ) =  ( = | = ′,  = ′).</p>
        <p>is considered a suficient cause for  if  occurs whenever  occurs. Thus, PS represents the
probability that  is a suficient cause of  . In other words, it is the probability that setting  would
lead to  in a scenario where both  and  are currently absent.</p>
        <p>
          Counterfactual queries cannot be directly computed from a CN. Instead, structural causal models
(SCMs) [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ], which can be viewed as an extension of CNs, are required. SCMs consist of endogenous
variables, which represent internal elements of the model, and exogenous variables, which often lack a
clear semantic interpretation. SCMs can be formally defined as follows [
          <xref ref-type="bibr" rid="ref10">10</xref>
          ].
(4)
(5)
        </p>
        <sec id="sec-2-3-1">
          <title>Definition 3 (Structural Causal Model (SCM)).</title>
          <p>⟨U, V, ℱ , ⟩, where:
A structural causal model is defined as a 4-tuple
• U is the set of exogenous variables;
• V is the set of endogenous variables;
• ℱ = { : U ∪  () → , ∀ ∈ V} is the set of structural equations;
•  is the set containing the exogenous probability distributions  () for each  ∈ U.</p>
          <p>Note that the structural equations ℱ actually define a DAG over the variables in
edge from each variable in U ∪  () to .</p>
          <p>U ∪ V, with an</p>
        </sec>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Answering a Challenging Causal Query</title>
      <p>In this section we first formulate the causal query representing the subject of this paper, then we show
how such a query translates to the language of SCMs. Furthermore, we show how the obtained SCM
can be simplified to eficiently answer the causal query. The section closes by answering the causal
query.</p>
      <sec id="sec-3-1">
        <title>3.1. The Causal Query</title>
        <p>
          The starting point of this work is the causal network described in [
          <xref ref-type="bibr" rid="ref4">4</xref>
          ]. This causal network is the first
one developed for estimating the risk of cardiovascular diseases (CVDs) in adolescent and young adults
(AYAs) that have been treated and survived breast cancer (BC). It has been developed by combining
clinical knowledge with two diferent patients cohorts, namely a population cohort and a clinical cohort.
        </p>
        <p>The causal network is depicted in Figure 1. The cancer prognostic factors (coloured in Yellow ) and
the major CVDs risk factors (coloured in Blue ) are non modifiable risk factors. To reduce and prevent
the risk of developing a CVD or its sub-forms (i.e., ischemic heart diseases or cardiotoxicity, coloured
in Orange ), clinicians can intervene on the treatments only (those coloured in Green ). Thus, in our
work we will interpret the risk of CVDs according to treatment recommendations included within the
treatment guidelines as queries.</p>
        <p>Breast cancer treatment is regulated in Italy by Italian national guidelines, discussed every year by a
panel of experts. The aims of these guidelines are:
• To improve and standardise the clinical practice;
• To ofer all patient throughout the country the possibility of best care;
• To ensure an evidence-based reference for national and regional institutions.</p>
        <p>In this paper, the major clinical recommendations are presented in the form of clinical queries
accompanied by the quality of their supporting evidence together with the strength of the associated
recommendation. In particular, we are interested to answer the following causal query:
CAUSAL QUERY: In pre-menopausal women, with a surgically treated breast cancer, positive to
hormonal receptors, HER2 negative, low risk for recurrences, is it recommendable to add ovarian
suppression to tamoxifen treatment?</p>
        <p>The clinical recommendation about this casual query is STRONG IN FAVOUR to the addition of
the ovarian suppression. This recommendation was voted by the panelists in light of the significant
improvement in both overall and progression-free survival. Among the side efects of this combination
of treatments the more relevant listed were: mood alterations, sexual dysfunction and osteoporosis.
Despite the toxicity profile highlighted, the benefit-to-damage ratio was considered in favour of the
addition of ovarian suppression to tamoxifen treatment.</p>
        <p>
          Nevertheless, no evidence is provided about the potential role of this treatment combination to the
CVD risk. When it is unethical to conduct a randomised clinical trial, observational data and causal
inference are the only way to integrate the clinical recommendation with knowledge on CVD risk.
Hence, in this work we are answering to three queries, formulated according to the three rungs of the
ladder of causation proposed by Judea Pearl [
          <xref ref-type="bibr" rid="ref5">5</xref>
          ]:
        </p>
        <p>In pre-menopausal women, with a surgically treated breast cancer, positive to hormonal receptors, HER2
negative, low risk for recurrences...</p>
        <p>ASSOCIATION ...which is the observed risk of CVDs in those patients that received ovarian
suppression in combination with tamoxifen treatment?
INTERVENTION ...which is the risk of CVDs if we administer ovarian suppression
in combination with tamoxifen treatment?
COUNTERFACTUALS ...which would have been the risk of CVDs if we did not administered
ovarian suppression?</p>
      </sec>
      <sec id="sec-3-2">
        <title>3.2. Translating the Causal Query into a Structural Causal Model</title>
        <p>
          The causal network depicted in Figure 1 is used to translate the causal query of interest (CAUSAL
QUERY) into the corresponding structural causal model. This is achieved by first oberving that the
ovarian suppression is administered in young patients as neo-adjuvant (i.e., pre-surgical) treatment
to blocks the body’s ability to produce hormones, particularly estrogen, to preserve ovarian fertility
that could be compromised by the toxicity of cancer treatments (like chemotherapy). Nevertheless,
estrogens have a cardioprotective efect in young females. Thus, the loss of estrogen during menopause is
associated with increased risk of ischemic heart disease [
          <xref ref-type="bibr" rid="ref11 ref12">11, 12</xref>
          ]. Hence, the causal query investigates the
relationship: [hormons_neo]→[ischemic_heart_diseases] with the aim to determine whether
the former variable is a necessary, a suficient or both a necessary and suficient cause of the latter.
        </p>
        <p>Before starting the experiments, we had to select the group of patients described in the causal query
(Section 3.1). All patients included in the dataset were surgically treated, thus, no restriction was done
in this regard. To select patients positive to hormonal receptors and HER2 negative only, we set the
variable [Receptors] to "Luminal" OR "Luminal A" OR "Luminal B". Moreover, to select patients
receiving tamoxifen, we set the variable [Hormons_adiu] to "Yes"; thus, tamoxifen is the elective
hormonal adjuvant (i.e., post-surgery) treatment, administered for a minimum of 5-10 years.</p>
        <p>
          The model depicted in Figure 1 was developed using data coming from two diferent retrospective
cohorts of AYA BC patients: a population-based cohort [
          <xref ref-type="bibr" rid="ref13">13</xref>
          ], identified in population-based cancer
registries, and a single-institution clinical cohort. Population-based cancer registries have the unique
opportunity to collect information on all cancer cases diagnosed in a given area with poor clinical
details. The clinical cohort, on the opposite, has more detailed information on cancer prognostic
factors but sufers from selection bias. For this work, we decided to focus on the population-based data
(setting the node [Cohort] to "Population-based") because we wanted our results to be valid for all
AYAs with BC. By consequence, a group of patients was selected based on the values of the variables
[receptors],[hormons_adiu] and [cohort]. From a graphical perspective, in CNs this translates
to removing the outgoing edges from such variables, which brings us to obtain the DAG shown in
Figure 2.
        </p>
        <p>Moreover, while in the original model (Figure 1), the variables [hypertension], [t2db] and
[dyslipidemia] (that represent hypertension, dyslipidemia and type 2 diabestes, respectively), were
coded as "pre" (if the diseases was diagnosed before BC diagnosis), "post" (if the diseases was diagnosed
after BC diagnosis) or "no"; in this work, they were binarised by grouping together "pre" and "post" into
the same label named ”yes”.</p>
        <p>age35
grade
histology</p>
        <p>cohort
ki67
pt
receptors
vascular</p>
        <p>pn
death_in_5y
radio_neo
chemo_neo
target_neo</p>
        <p>hormons_neo
surgery
radio_adiu
chemo_adiu
hormons_adiu</p>
        <p>t2db
target_adiu</p>
      </sec>
      <sec id="sec-3-3">
        <title>3.3. Simplifying the Structural Causal Model</title>
        <p>Counterfactual reasoning with probabilistic graphical models typically requires significant
computational power, making it challenging to manage large problems. To address this issue, we further
simplified the model depicted in Figure 2 and the dataset described in Section 1.</p>
        <p>In causal and counterfactual queries, a variable is often referred to as either the causes or the efect.
In our case study, the node [hormons_neo]represents the cause, while the node [ischemic_heart_
disease] is the efect .</p>
        <p>
          A barren variable (or node) [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ] with respect to a causal query is a variable that does not influence the
probability distribution of any variables of interest for that causal query. Consequently, such variables
can be pruned from the model without afecting the outcome of the inference. In a CN, a variable is
considered barren if it is not included in the causal query and either has no descendants or only has
descendants that are themselves barren.
        </p>
        <p>In our case study, barren variables are those that are not ancestors of the efect variable [ischemic_
heart_disease]. The result of pruning barren variables is shown in Figure 3 (left).</p>
        <p>However, the model depicted in Figure 3 (left) can be further simplified using d-separation 3.
Specifically, all ancestors of the cause variable [hormons_neo]are conditionally independent of [ischemic_
heart_disease] given [hormons_neo]. Consequently, these variables can be removed, resulting in
the simplified model shown in Figure 3 (right). For simplicity, variable names were abbreviated as
follows:  represents [hormons_neo],  represents [ischemic_heart_disease],  represents
[hypertension],  represents [t2db] and  represents [dyslipidemia].</p>
        <p>The original dataset was also simplified, thus, only the rows related to the selected patients were
considered, all the columns for the irrelevant variables were removed, i.e., the variables except those
shown in Figure 3 (right).</p>
        <p>
          According to the the queries of interest, we investigate the causal efects of the variable [hormons_
neo] on the outcome variable [ischemic_heart_disease]in the model shown in Figure 3 (left).
3For more details on d-separation, readers can refer to [
          <xref ref-type="bibr" rid="ref7">7</xref>
          ].
        </p>
        <p>
          =
=
=
=
=










=
=
=
=
=
These two variables satisfy the exogeneity condition; hence, the (, ) was calculated as the
variation on the conditional probability  ( = | = ) −  ( = | = ) using
inference on junction trees [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ].
        </p>
        <p>In our use case, the endogenous variables were those remaining after the removal of barren and
d-separated variables. Then, an exogenous variable was added as a parent to each endogenous variable.
The resulting DAG is shown in Figure 4 (left). In contrast, exogenous variables are denoted by the letter
 followed by the name of the corresponding child variable.</p>
        <p>
          Equations were automatically inferred from the causal graph, without any loss of generality, via a
canonical specification [
          <xref ref-type="bibr" rid="ref15">15</xref>
          ]. Each equation is then a deterministic function, in which the states of an
exogenous variable will then represent all possible function mappings between its children domains
from their respective endogenous parents domains. In practice, the equations can be represented as a
degenerated conditional probability table containing just ones and zeros.
        </p>
        <p>
          In contrast, the distributions associated with the exogenous variables are initially unknown. To
address this, we propose utilizing the innovative technique known as EMCC (Expectation Maximization
for Causal Computation), as detailed in [
          <xref ref-type="bibr" rid="ref16 ref17">16, 17</xref>
          ]. This method treats an SCM as a BN with exogenous
variables considered latent. The core idea is to repeatedly apply a learning algorithm designed for BNs
with latent variables.
        </p>
        <p>While other methods for computing counterfactual queries exist — such as solving structural equations
manually or using potential outcomes frameworks — we focus on the EMCC approach due to its eficiency
and scalability in handling complex models with latent variables. This method allows for automatic
estimation of exogenous distributions directly from data, which is particularly advantageous when
dealing with large-scale or high-dimensional problems.</p>
        <p>After each run, the specification for the exogenous distributions are available and counterfactual
queries can be computed using an extended model known as the counterfactual model (or twin model).</p>
        <p>The twin model is a SCM that includes endogenous variables from both the real and hypothetical
scenarios, achieved by duplicating the sub-graph composed of the endogenous nodes for the real
scenario and then applying the intervention. In our case study,   (, ) was calculated as the
causal query  (= = | = ,  = ) in the model shown in the Figure 4 (center), for
which any inference algorithm can be used.</p>
        <p>This can be interpreted as the probability that a patient would not have sufered the disease if they
had not received the hormonal treatment, given that they actually did receive the treatment and sufered
the disease.</p>
        <p>Similarly for  (, ) was calculated as the causal query  (= = | = ,  =
) in the model shown in the Figure 4 (right). Analogously, this can be viewed as the probability that
a patient would have sufered the disease if they had received the ovarian suppression treatment, given
that they actually did not receive the treatment and did not sufered the disease.</p>
      </sec>
      <sec id="sec-3-4">
        <title>3.4. Answering the Causal Query</title>
        <p>For the causal analysis, our objective was to calculate the average causal efect (ACE) of the variable
[hormons_neo]on [ischemic_heart_disease]. This can be directly computed as the average
30
t
n
u
o
C20
10
0.990</p>
        <p>PN(HN, D)
treatment efect (ATE). We found that  ( = | = ) = 1.45% and  ( = | =
) = 0.22%. Consequently, we obtain (, ) = 1.23%, that means that the probability of
developing a CVD increase only about 1% if the patient receives ovarian suppression together with
tamoxifen treatment. This result indicate that this treatment does not significantly influence the outcome
of the disease.</p>
        <p>In the context of counterfactual analysis4, we investigated the likelihood that the variable [hormons_
neo] is a necessary and suficient cause for [ischemic_heart_disease]. To achieve this, we
conducted 100 runs of the previously described EMCC algorithm. For each run, we obtained a value for
the causal query   (, ) and another one for  (, ). The distribution of these values is
depicted in Figure 5. In the case of   , all values exceeded 97.83%. Conversely, the values for  
remained below 1.97%.</p>
        <p>These results suggest that, with a high probability, the ovarian suppression treatment is a necessary
cause but not a suficient cause for the ischemic heart disease. That means that the latter factor is
essential for the disease to occur, i.e., the disease cannot happen without the presence of this hormonal
treatment. However, this treatment alone is not enough to cause the disease; other factors must also be
present.</p>
      </sec>
    </sec>
    <sec id="sec-4">
      <title>4. Conclusions and Future work</title>
      <p>In this work we showed how causal networks can be efectively used to disentangle uncertainty in
treatment choices, while helping clinicians in better tailoring personalised follow-up guidelines for
chronic patients, like cancer survivors. Moreover, the evidence derived from the experimental results
can integrate the actual state of the treatment guidelines, enriching them with knowledge on the CVD
risk of AYA BC patients. Starting from a clinical question on the recommendability of the ovarian
suppression addition to tamoxifen treatment, the counterfactual explanations made it evident that
while [hormons_neo]is necessary to induce [ischemic_heart_disease]it is not a suficient cause.
Thus, while it is true that no patient not receiving ovarian suppression will develop a CVD, receiving
ovarian suppression is not suficient to explain an increase in the risk of CVDs.</p>
      <p>
        This work is an important use case which concretely shows how efective observational real-world
data can be to answer clinical questions. In these regards, the overlapping of association and intervention
results support the idea that, despite confounding is present, clinicians are able to deal with it in the
everyday clinical practice, even without ad hoc strict guidelines. This result makes even more important
and relevant the evidence driven by the third ladder of causation (counterfactuals) that mimics the
4Code available at: https://github.com/AlessioZanga/cardiovascular-counterfactuals.git
results of a randomised controlled trial, the gold standard in medicine. This approach, applied to
observational data, can be particularly important especially when evidence from trials is not available
nor ethic to be obtained [
        <xref ref-type="bibr" rid="ref18">18</xref>
        ].
      </p>
      <p>However, despite its relevance this work has some limitations too. First of all, while   and 
are calculated using a statistical estimand, the computation of   and   is more complex and requires
a SCM. As described in Section 3.3, in our proposed approach an exogenous variable was added as a
parent to each endogenous variable. Given the unknown distributions of the exogenous variable we
learn it by repeatedly applying a learning algorithm. This approach makes all the process extremely
computationally expensive which allows to handle only queries in which the efect node can have
maximum of three parents. Methodological work is needed to extend this approach to be able to answer
to more complex queries. Moreover, attention should be paid when interpreting the results of ,
  and   with regards to the rarity of the events. Thus, CVDs are really rare in AYA surviving
BC [19], so even though  and   are very low they may be relevant for this specific population.
Furthermore, a discussion with clinicians (oncologists and cardio-oncologists) will be needed to validate
the clinical plausibility of the presented results.</p>
      <p>Finally, as illustrated in the experimental results, the ovarian suppression is not a suficient cause
of the ischemic events, under the assumption of causal suficiency (all the causes needed to explain
the causal mechanisms are included in the model). Nevertheless, this assumption is dificult to be
valid especially considering the vast literature that describes the role of lifestyle factors (like smoking,
physical inactivity, obesity and poor diet) on the development of cardiovascular diseases both directly and
indirectly through type 2 diabetes, hypertension and dyslipidemia [20, 21, 22]. Planning interventions
on lifestyle modifiable factors would be more efective than treating their efects only, hence, the
addition of these variables to the model would be essential to better stratify the CVD risk and develop
more personalised follow-up strategies. To conclude, to achieve transportability of results, the model
needs external validation. The external validation is already ongoing in similar cohorts of AYA BC
survivors in 6 diferent areas of Italy (Veneto, Friuli-Venezia-Giulia, Tuscany, Apulia regions and two
Sicilian provinces) and in 4 European Countries (Estonia, Norway, Denmark and Belgium) thanks to
pilot studies nested in international Joint Actions, namely Innovative Partnership for Action Against
Cancer (iPAAC) and Prevent Non-Communicable Diseases and Cancer (Prevent NCD).</p>
    </sec>
    <sec id="sec-5">
      <title>Acknowledgments</title>
      <p>Alice Bernasconi is funded by an AIRC 2020 project, grant number 24864, titled “pRedicting
cardiOvascular diSeAses iN adolescent and young breast caNcer pAtients (ROSANNA)”.</p>
      <p>Alessio Zanga is funded by F. Hofmann-La Roche Ltd.</p>
      <p>This work was partially supported by the MUR under the grant “Dipartimenti di Eccellenza 2023-2027"
of the Department of Informatics, Systems and Communication of the University of Milano-Bicocca,
Italy.</p>
      <p>Grant PID2022-139293NB-C31 funded by MCIN/AEI/10.13039/501100011033 and by ERDF A way of
making Europe partially supported this work. Rafael Cabñas was also supported by “Plan Propio de
Investigación y Transferencia 2024-2025” from University of Almería under the project P_LANZ_2024003.</p>
      <p>This work was possible thanks to the data collected in the Ada project ("Adolescents and young adults
with cancer in Italy. How to ensure access to the best care and quality of survival"). We thank the project
working group: Alessandra Andreotti, Carlotta Buzzoni, Sabrina Fabiano, Adele Caldarella, Stefania
Giorgetti, Salvatore Sciacca, Rosa Angela Filiberti, Cinzia Gasparotti, Giuliano Carrozzi, Rosalba Amodio,
Lucia Mangone, Silvia Iacovacci, Mario Fusco, Fabrizio Stracci, William Mantovani, Giuseppe Cascone,
Sante Minerba, Giuseppe Sampietro, Anna Melcarne, Paolo Ricci, Federica Manzoni, Maria Letizia
Gambino, Elisabetta Merlo, Rossella Bruni, Alessandra Sessa, Giancarlo D’Orsi, Anna Clara Fanetti,
Lucia De Lorentis, Tiziana Scuderi, Fabio Savoia, Iolanda Grappasonni, Salvatore Bongiorno, Antonio
Romanelli.
of evidence, Psychological Medicine 51 (2021) 563–578. doi:10.1017/S0033291720005127.
[19] A. M. Berkman, C. R. Andersen, M. E. Roth, S. C. Gilchrist, Cardiovascular disease in adolescent
and young adult cancer survivors: Impact of sociodemographic and modifiable risk factors, Cancer
129 (2023) 450–460. doi:10.1002/cncr.34505.
[20] D. O. Ondimu, G. M. Kikuvi, W. N. Otieno, Risk factors for hypertension among young adults
(18-35) years attending in tenwek mission hospital, bomet county, kenya in 2018, Pan African
Medical Journal 33 (2019). doi:10.11604/pamj.2019.33.210.18407.
[21] A. Saavedra, E. Rodrigues, D. Carvalho, Dislipidemia secundária a hipotiroidismo e colestase, Acta</p>
      <p>Médica Portuguesa 33 (2020) 204–207. doi:10.20344/amp.9944.
[22] J. Muhandiramge, J. R. Zalcberg, G. J. van Londen, E. T. Warner, P. R. Carr, A. Haydon, S. G.</p>
      <p>Orchard, Cardiovascular disease in adult cancer survivors: a review of current evidence, strategies
for prevention and management, and future directions for cardio-oncology, Current Oncology
Reports 24 (2022) 1579–1592. doi:10.1007/s11912-022-01309-w.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <surname>H.-P. B.-L. Rocca</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          <string-name>
            <surname>Fleischhacker</surname>
            ,
            <given-names>O.</given-names>
          </string-name>
          <string-name>
            <surname>Golubnitschaja</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Heemskerk</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Helms</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Hoedemakers</surname>
            ,
            <given-names>S. H.</given-names>
          </string-name>
          <string-name>
            <surname>Allianses</surname>
            ,
            <given-names>T.</given-names>
          </string-name>
          <string-name>
            <surname>Jaarsma</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Kinkorova</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Ramaekers</surname>
            ,
            <given-names>P.</given-names>
          </string-name>
          <string-name>
            <surname>Ruf</surname>
            ,
            <given-names>I.</given-names>
          </string-name>
          <string-name>
            <surname>Schnur</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <string-name>
            <surname>Vanoli</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <string-name>
            <surname>Verdu</surname>
            ,
            <given-names>B.</given-names>
          </string-name>
          <string-name>
            <surname>Zippel-Schultz</surname>
          </string-name>
          ,
          <article-title>Challenges in personalised management of chronic diseases-heart failure as prominent example to advance the care process</article-title>
          ,
          <source>EPMA Journal 7</source>
          (
          <year>2015</year>
          )
          <article-title>2</article-title>
          . doi:
          <volume>10</volume>
          .1186/ s13167-016-0051-9.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>K. D.</given-names>
            <surname>Miller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Nogueira</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. B.</given-names>
            <surname>Mariotto</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. H.</given-names>
            <surname>Rowland</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K. R.</given-names>
            <surname>Yabrof</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C. M.</given-names>
            <surname>Alfano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Jemal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. L.</given-names>
            <surname>Kramer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. L.</given-names>
            <surname>Siegel</surname>
          </string-name>
          ,
          <article-title>Cancer treatment</article-title>
          and survivorship statistics,
          <year>2019</year>
          , CA: A
          <source>Cancer Journal for Clinicians</source>
          <volume>69</volume>
          (
          <year>2019</year>
          )
          <fpage>363</fpage>
          -
          <lpage>385</lpage>
          . doi:
          <volume>10</volume>
          .3322/caac.21565.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>S.</given-names>
            <surname>Wang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N. E.</given-names>
            <surname>Jurdi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>B.</given-names>
            <surname>Thyagarajan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Prizment</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. H.</given-names>
            <surname>Blaes</surname>
          </string-name>
          ,
          <article-title>Accelerated aging in cancer survivors: Cellular senescence, frailty, and possible opportunities for interventions</article-title>
          ,
          <source>International Journal of Molecular Sciences</source>
          <volume>25</volume>
          (
          <year>2024</year>
          )
          <article-title>3319</article-title>
          . doi:
          <volume>10</volume>
          .3390/ijms25063319.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bernasconi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Zanga</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. J.</given-names>
            <surname>Lucas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Scutari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Stella</surname>
          </string-name>
          ,
          <article-title>Towards a transportable causal network model based on observational healthcare data</article-title>
          , volume
          <volume>3578</volume>
          ,
          <year>2023</year>
          , p.
          <fpage>122</fpage>
          -
          <lpage>129</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pearl</surname>
          </string-name>
          ,
          <source>Causality: Models, Reasoning and Inference</source>
          , Cambridge University Press, New York,
          <year>2000</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>J.</given-names>
            <surname>Pearl</surname>
          </string-name>
          ,
          <article-title>Probabilistic Reasoning in Intelligent Systems</article-title>
          , Elsevier,
          <year>1988</year>
          . URL: https://linkinghub. elsevier.com/retrieve/pii/C20090276094. doi:
          <volume>10</volume>
          .1016/C2009-0-27609-4.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>F. V.</given-names>
            <surname>Jensen</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T. D.</given-names>
            <surname>Nielsen</surname>
          </string-name>
          ,
          <article-title>Bayesian networks and decision graphs</article-title>
          , volume
          <volume>2</volume>
          , Springer, New York, NY,
          <year>2007</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>D.</given-names>
            <surname>Koller</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Friedman</surname>
          </string-name>
          ,
          <article-title>Probabilistic Graphical Models: Principles and Techniques</article-title>
          , The MIT Press,
          <year>2009</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>J.</given-names>
            <surname>Tian</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Pearl</surname>
          </string-name>
          ,
          <article-title>Probabilities of causation: Bounds and identification</article-title>
          ,
          <source>Annals of Mathematics and Artificial Intelligence</source>
          <volume>28</volume>
          (
          <year>2000</year>
          )
          <fpage>287</fpage>
          -
          <lpage>313</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>E.</given-names>
            <surname>Bareinboim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J. D.</given-names>
            <surname>Correa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Ibeling</surname>
          </string-name>
          , T. Icard,
          <article-title>On Pearl's hierarchy and the foundations of causal inference, in: Probabilistic and causal inference: the works of Judea Pearl</article-title>
          ,
          <string-name>
            <surname>ACM</surname>
          </string-name>
          ,
          <year>2022</year>
          , pp.
          <fpage>507</fpage>
          -
          <lpage>556</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>A.</given-names>
            <surname>Christinat</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. D.</given-names>
            <surname>Lascio</surname>
          </string-name>
          ,
          <string-name>
            <given-names>O.</given-names>
            <surname>Pagani</surname>
          </string-name>
          ,
          <article-title>Hormonal therapies in young breast cancer patients: when, what and for how long?</article-title>
          ,
          <source>Journal of thoracic disease 5 Suppl</source>
          <volume>1</volume>
          (
          <year>2013</year>
          )
          <fpage>S36</fpage>
          -
          <lpage>46</lpage>
          . doi:
          <volume>10</volume>
          .3978/j. issn.2072-
          <fpage>1439</fpage>
          .
          <year>2013</year>
          .
          <volume>05</volume>
          .25.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>I.</given-names>
            <surname>Alfaris</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Asselah</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Aziz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Bouganim</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Mousavi</surname>
          </string-name>
          ,
          <article-title>The cardiovascular risks associated with aromatase inhibitors, tamoxifen, and gnrh agonists in women with breast cancer</article-title>
          ,
          <source>Current Atherosclerosis Reports</source>
          <volume>25</volume>
          (
          <year>2023</year>
          )
          <fpage>145</fpage>
          -
          <lpage>154</lpage>
          . doi:
          <volume>10</volume>
          .1007/s11883-023-01085-3.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>A.</given-names>
            <surname>Bernasconi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Barigelletti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Tittarelli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Botta</surname>
          </string-name>
          , G. Gatta, G. Tagliabue,
          <string-name>
            <given-names>P.</given-names>
            <surname>Contiero</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Guzzinati</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Andreano</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Manneschi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Falcini</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Castaing</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. A.</given-names>
            <surname>Filiberti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Gasparotti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Cirilli</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Mazzucco</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Mangone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Iacovacci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. F.</given-names>
            <surname>Vitale</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Stracci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pifer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Tumino</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Carone</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Sampietro</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Melcarne</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Ballotari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Boschetti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Pisani</surname>
          </string-name>
          , L.
          <string-name>
            <surname>C. D'Oro</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Cuccaro</surname>
          </string-name>
          ,
          <string-name>
            <surname>A. D'Argenzio</surname>
          </string-name>
          ,
          <string-name>
            <surname>G. D'Orsi</surname>
            ,
            <given-names>A. C.</given-names>
          </string-name>
          <string-name>
            <surname>Fanetti</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Ardizzone</surname>
            ,
            <given-names>G.</given-names>
          </string-name>
          <string-name>
            <surname>Candela</surname>
            ,
            <given-names>F.</given-names>
          </string-name>
          <string-name>
            <surname>Savoia</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Pascucci</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          <string-name>
            <surname>Castelli</surname>
            ,
            <given-names>C.</given-names>
          </string-name>
          <string-name>
            <surname>Storchi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <string-name>
            <surname>Trama</surname>
          </string-name>
          ,
          <article-title>Adolescent and young adult cancer survivors: Design and characteristics of the first nationwide population-based cohort in italy</article-title>
          ,
          <source>Journal of Adolescent and Young Adult Oncology</source>
          <volume>9</volume>
          (
          <year>2020</year>
          )
          <fpage>586</fpage>
          -
          <lpage>593</lpage>
          . doi:
          <volume>10</volume>
          .1089/jayao.
          <year>2019</year>
          .
          <volume>0170</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>R.</given-names>
            <surname>Nagarajan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Scutari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Lèbre</surname>
          </string-name>
          ,
          <source>Bayesian Network Inference Algorithms</source>
          , Springer New York, New York, NY,
          <year>2013</year>
          , pp.
          <fpage>85</fpage>
          -
          <lpage>101</lpage>
          . URL: https://doi.org/10.1007/978-1-
          <fpage>4614</fpage>
          -6446-
          <issue>4</issue>
          _4. doi:
          <volume>10</volume>
          .1007/ 978-1-
          <fpage>4614</fpage>
          -6446-
          <issue>4</issue>
          _
          <fpage>4</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>J.</given-names>
            <surname>Zhang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Tian</surname>
          </string-name>
          , E. Bareinboim,
          <article-title>Partial counterfactual identification from observational and experimental data</article-title>
          ,
          <source>in: International Conference on Machine Learning, PMLR</source>
          ,
          <year>2022</year>
          , pp.
          <fpage>26548</fpage>
          -
          <lpage>26558</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zafalon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Antonucci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cabañas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Huber</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Azzimonti</surname>
          </string-name>
          , Eficient computation of counterfactual bounds,
          <source>International Journal of Approximate Reasoning</source>
          (
          <year>2024</year>
          )
          <fpage>109111</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          [17]
          <string-name>
            <given-names>M.</given-names>
            <surname>Zafalon</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Antonucci</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Cabañas</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Huber</surname>
          </string-name>
          ,
          <article-title>Approximating counterfactual bounds while fusing observational, biased and randomised data sources</article-title>
          ,
          <source>International Journal of Approximate Reasoning</source>
          <volume>162</volume>
          (
          <year>2023</year>
          )
          <fpage>109023</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          [18]
          <string-name>
            <given-names>G.</given-names>
            <surname>Hammerton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. R.</given-names>
            <surname>Munafò</surname>
          </string-name>
          ,
          <article-title>Causal inference with observational data: the need for triangulation</article-title>
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>