<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Interpretable Vital Sign Forecasting with Model Agnostic Attention Maps</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<persName><forename type="first">Yuwei</forename><surname>Liu</surname></persName>
							<email>yuwei.liu@spassmed.ca</email>
							<affiliation key="aff0">
								<orgName type="institution">SpassMed Inc</orgName>
								<address>
									<settlement>Toronto</settlement>
									<region>Ontario</region>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Chen</forename><surname>Dan</surname></persName>
							<email>chen.dan@spassmed.ca</email>
							<affiliation key="aff0">
								<orgName type="institution">SpassMed Inc</orgName>
								<address>
									<settlement>Toronto</settlement>
									<region>Ontario</region>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Anubhav</forename><surname>Bhatti</surname></persName>
							<email>anubhav.bhatti@spassmed.ca</email>
							<affiliation key="aff0">
								<orgName type="institution">SpassMed Inc</orgName>
								<address>
									<settlement>Toronto</settlement>
									<region>Ontario</region>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Bingjie</forename><surname>Shen</surname></persName>
							<email>bingjie.shen@spassmed.ca</email>
							<affiliation key="aff0">
								<orgName type="institution">SpassMed Inc</orgName>
								<address>
									<settlement>Toronto</settlement>
									<region>Ontario</region>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Divij</forename><surname>Gupta</surname></persName>
							<email>divij.gupta@spassmed.ca</email>
							<affiliation key="aff0">
								<orgName type="institution">SpassMed Inc</orgName>
								<address>
									<settlement>Toronto</settlement>
									<region>Ontario</region>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">Suraj</forename><surname>Parmar</surname></persName>
							<email>suraj.parmar@spassmed.ca</email>
							<affiliation key="aff0">
								<orgName type="institution">SpassMed Inc</orgName>
								<address>
									<settlement>Toronto</settlement>
									<region>Ontario</region>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<author>
							<persName><forename type="first">San</forename><surname>Lee</surname></persName>
							<email>sanlee@spassmed.ca</email>
							<affiliation key="aff0">
								<orgName type="institution">SpassMed Inc</orgName>
								<address>
									<settlement>Toronto</settlement>
									<region>Ontario</region>
									<country key="CA">Canada</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Interpretable Vital Sign Forecasting with Model Agnostic Attention Maps</title>
					</analytic>
					<monogr>
						<idno type="ISSN">1613-0073</idno>
					</monogr>
					<idno type="MD5">3B8D0C2A5BC2B3D4C90547A90635234A</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2025-04-23T19:37+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Time Series Forecasting</term>
					<term>Deep Learning</term>
					<term>Interpretable Machine Learning</term>
					<term>Attention Map</term>
					<term>Vital Signs</term>
					<term>Sepsis</term>
					<term>Explainable AI</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>Sepsis is a leading cause of mortality in intensive care units (ICUs), representing a substantial medical challenge. The complexity of analyzing diverse vital signs to predict sepsis further aggravates this issue. While deep learning techniques have been advanced for early sepsis prediction, their 'black-box' nature obscures the internal logic, impairing interpretability in critical settings like ICUs. This paper introduces a framework that combines a deep learning model with an attention mechanism that highlights the critical time steps in the forecasting process, thus improving model interpretability and supporting clinical decision-making. We show that the attention mechanism could be adapted to various black box time series forecasting models such as N-HiTS and N-BEATS. Our method preserves the accuracy of conventional deep learning models while enhancing interpretability through attention-weight-generated heatmaps. We evaluated our model on the eICU-CRD dataset, focusing on forecasting vital signs for sepsis patients. We assessed its performance using mean squared error (MSE) and dynamic time warping (DTW) metrics. We explored the attention maps of N-HiTS and N-BEATS, examining the differences in their performance and identifying crucial factors influencing vital sign forecasting.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.">Introduction</head><p>Sepsis is a life-threatening condition that occurs when the immune system of the body responds incorrectly to an infection and causes rapid organ dysfunction and failure <ref type="bibr" target="#b0">[1]</ref>. A meta-analysis conducted on articles published in PubMed and the Cochrane Database revealed that the average 30-day mortality rate for sepsis was 24.4%, and the average 90-day mortality rate was 32.2% between 2009 and 2019 <ref type="bibr" target="#b1">[2]</ref>. While sepsis has been acknowledged for a long time, its clinical definition did not emerge until the late 20 𝑡ℎ century <ref type="bibr" target="#b2">[3]</ref>. In 1991, a consensus conference posited that sepsis arises from the individual's inflammatory response to infection, marked by systemic inflammatory response syndrome (SIRS), emphasizing the human response to invading organisms. This syndrome is characterized by variations in temperature, heart rate (HR), respiratory rate (RR), blood pressure (BP), and white blood cell (WBC) count <ref type="bibr" target="#b3">[4]</ref>. In 2016, the definition of sepsis was revised to multiple organ dysfunction syndrome (MODS) <ref type="bibr" target="#b4">[5]</ref>. Systolic blood pressure (SBP) and RR abnormalities indicate organ dysfunction <ref type="bibr" target="#b5">[6]</ref>. Thus, creating precise models for forecasting vital signs becomes essential in predicting sepsis <ref type="bibr" target="#b6">[7]</ref>. Accurate vital sign predictions can promptly aid clinicians in identifying and intervening in sepsis cases, potentially saving lives and improving the intensive care unit (ICU) patient outcomes.</p><p>The growth in explainable artificial intelligence (XAI) research is mainly attributed to the rapid growth in the popularity of deep learning with widespread healthcare applications. However, most models developed using these technologies are considered 'black-boxes' by experts due to their intricate, non-linear structures that are challenging for non-experts to understand <ref type="bibr" target="#b7">[8]</ref>. The proposed research contributes to the following two aspects: (1) Adding an attention mechanism to show the relationship between input time steps and forecasted results; (2) Providing analysis and interpretation of the findings derived from the attention map.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1.1.">Literature Review</head><p>In recent years, the significance of model explainability has been widely recognized, leading to the integration of an increasing number of explainable methods into data-driven models <ref type="bibr" target="#b8">[9]</ref>. Prior research has demonstrated the development of deep learning neural networks incorporating attention mechanisms, resulting in interpretable models with strong performance within the medical field. Kaji et al. demonstrated that integrating an attention mechanism into the LSTM network, trained with Electronic Health Record (EHR) data, not only improves the daily sepsis onset prediction's Area Under the Receiver Operating Characteristic Curve (AUROC) score to 0.876 but also highlights critical time points for prediction <ref type="bibr" target="#b9">[10]</ref>. An attention-based gated recurrent unit (GRU) was developed by Shickel et al. Self-attention was applied to focus on significant time steps when predicting in-hospital mortality <ref type="bibr" target="#b10">[11]</ref>. Choi et al. proposed reverse time attention (RETAIN), processing EHR data in reverse order, achieving an Area Under the ROC Curve (AUC) of 0.87 in heart failure prediction. It adds interpretability using a two-level neural attention model <ref type="bibr" target="#b11">[12]</ref>.</p><p>While previous XAI research integrating deep learning models with interpretable modules has excelled in time series classification, attention mechanisms in interpretable time series forecasting remain underexplored. Our approach aims to explore attention mechanism interpretability in time series forecasting.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.">Method</head><p>In this section, we begin by detailing the information of the eICU Collaborative Research Database (eICU-CRD) <ref type="bibr" target="#b12">[13]</ref>, followed by an outline of the composition of our input data. Subsequently, we dive into the specifics of the attention mechanism and the frameworks of our forecasting models.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.1.">Dataset Description and Data Preprocessing</head><p>The eICU-CRD data is a publicly accessible repository containing data from over 200,000 ICU admissions across 208 hospitals in the United States between 2014 and 2015 <ref type="bibr" target="#b12">[13]</ref>. This comprehensive dataset comprises diverse patient information, including demographics, diagnoses, medications, and laboratory results. Our research focuses on the 'diagnosis' and 'vitalAperiodic' tables, from which we extract dynamic physiological data such as temperature, HR, and BP at 5-minute intervals. The core of our study revolves around forecasting two crucial dynamic variables: HR and mean blood pressure (MBP), derived from SBP and diastolic blood pressure (DBP) measurements. Following the works of <ref type="bibr" target="#b13">[14,</ref><ref type="bibr" target="#b14">15]</ref>, we create one or more groups within a 9-hour time window for each patient to predict vital signs for the subsequent 3 hours based on the preceding 6 hours of data. Data preprocessing involves imputing missing values, filtering outliers, and scaling using domain-specific knowledge. Clinically reasonable boundaries for each critical vital sign were set using this specialized knowledge: HR ranged from 0 to 300 bpm, MBP from 0 to 190 mmHg, and RR from 0 to 100 bpm <ref type="bibr" target="#b15">[16]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.2.">Experiment Setup</head><p>The dataset is divided into training, validation, and test sets in an 80:10:10 ratio. Within these intervals, the initial 6 hours consist of 72 time steps, while the subsequent 3 hours encompass 36 time points. The forecasting model integrates either HR alone or HR combined with RR as covariates to forecast MBP or conversely. Training of the model occurs over the first 72 time steps, followed by predictions for the remaining 36 time steps. Ultimately, model performance is assessed through Mean Squared Error (MSE) and Dynamic Time Warping (DTW) evaluations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.3.">Deep Learning Forecasting Model</head><p>Based on the forecasting performance of the N-HiTS and N-BEATS model <ref type="bibr" target="#b16">[17,</ref><ref type="bibr" target="#b17">18,</ref><ref type="bibr" target="#b18">19]</ref>, as well as the idea proposed by Pantiskas et al. <ref type="bibr" target="#b19">[20]</ref> we aim to address their inherent lack of interpretability and understand why the model has different performances. To achieve this, we implemented an attention mechanism that can be applied to the N-HiTS and N-BEATS architecture, which may also be applied to other black-box deep learning models. The N-HiTS and N-BEATS model consists of a series of stacks, each responsible for learning residual values from the preceding stack.</p><p>Within each stack are blocks comprising several fully connected layers, which generate backward (𝜃 𝑏 𝑙 ) and forward (𝜃 𝑓 𝑙 ) expansion coefficients according to Equation <ref type="formula" target="#formula_0">1</ref>, where ℎ 𝑙,4 represents the output of the fourth fully connected layer in the basic block, and 𝐿𝑖𝑛𝑒𝑎𝑟 denotes a linear projection layer <ref type="bibr" target="#b16">[17]</ref>:</p><formula xml:id="formula_0">𝜃 𝑏 𝑙 = Linear 𝑏 𝑙 (ℎ 𝑙,4 ), 𝜃 𝑓 𝑙 = Linear 𝑓 𝑙 (ℎ 𝑙,4 ),<label>(1)</label></formula><p>Additionally, each block includes backward (𝑔 𝑏 𝑙 ) and forward (𝑔 𝑓 𝑙 ) basis layers that produce backcast and forecast outputs as per Equation <ref type="formula" target="#formula_1">2</ref>, where ̂︀ 𝑦 𝑙 and ̂︀ 𝑥 𝑙 denote forecast and backcast outputs, respectively:</p><formula xml:id="formula_1">̂︀ 𝑦 𝑙 = dim(𝜃 𝑓 𝑙 ) ∑︁ 𝑖=1 𝜃 𝑓 𝑙,𝑖 v 𝑓 𝑙,𝑖 , ̂︀ 𝑥 𝑙 = dim(𝜃 𝑏 𝑙 ) ∑︁ 𝑖=1 𝜃 𝑏 𝑙,𝑖 v 𝑏 𝑙,𝑖 .<label>(2)</label></formula><p>Here, 𝑣 𝑓 𝑙,𝑖 and 𝑣 𝑏 𝑙,𝑖 represent forecast and backcast basis vectors. Notably, for N-HiTS, it has a max-pooling layer (Equation <ref type="formula" target="#formula_2">3</ref>) before passing the values to the fully connected layer, which is applied to enable multi-rate signal sampling for the 𝑙 𝑡ℎ basic block <ref type="bibr" target="#b16">[17]</ref>:</p><formula xml:id="formula_2">𝑦 (𝑝) 𝑡−𝐿:𝑡,𝑙 = MaxPool (𝑦 𝑡−𝐿:𝑡,𝑙 , 𝑘 𝑙 ) ,<label>(3)</label></formula><p>where 𝑘 𝑙 is the kernel size of the MaxPool layer. Subsequently, inspired by Pantiskas et al. <ref type="bibr" target="#b19">[20]</ref> idea, we introduced an attention mechanism to explore the relationship between learned information and original inputs after obtaining the residuals from the final stack. The forecasted result is utilized to construct the Query (Q), while the original input forms the basis for the Value (V) and Key (K) <ref type="bibr" target="#b19">[20]</ref>. The resulting output is computed as follows:</p><formula xml:id="formula_3">𝑂 𝑁 *𝐻 = 𝐷 • 𝑉 = 𝑠𝑜𝑓 𝑡𝑚𝑎𝑥( 𝑄𝐾 𝑇 √ 𝐿 )𝑉<label>(4)</label></formula><formula xml:id="formula_4">𝐾 𝑁 *1*𝐿 = 𝐼 𝑁 *1*𝐿 • 𝑊 𝑁 *𝐿*𝐿 𝐾 + 𝑏 𝑁 *1*𝐿 𝐾 (<label>5</label></formula><formula xml:id="formula_5">)</formula><formula xml:id="formula_6">𝑉 𝑁 *1*𝐿 = 𝐼 𝑁 *1*𝐿 • 𝑊 𝑁 *𝐿*𝐿 𝑉 (<label>6</label></formula><formula xml:id="formula_7">)</formula><p>and 𝑁 is the number of input multi time seires, 𝐻 is the forecasting horizon length, and 𝐿 is the history input horizon. As shown in Figure <ref type="figure" target="#fig_0">1</ref>, after the attention layer, a normalizer is applied, and skip connections are employed to mitigate the vanishing gradient issue. Finally, a fully connected layer is utilized to generate the forecasted results.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2.4.">Interpretable Attention Map</head><p>To illustrate the attention map for a specific item, we computed <ref type="bibr" target="#b19">[20]</ref>:</p><formula xml:id="formula_8">𝐴 𝐻*𝐿*𝑁 = 𝐷 𝐻*𝐿 • 𝑎𝑏𝑠(𝑊 𝑁 *𝐿*𝐿 𝑣 ) 𝑇<label>(7)</label></formula><p>Here, 𝐴 𝐻*𝐿*𝑁 denotes the attention map, where 𝐴 𝐻*𝐿 𝑖 represents the 𝑖 𝑡ℎ series in the multivariate time series. Each row 𝑗 in 𝐴 𝐻*𝐿 𝑖 signifies the relationship between the 𝑗 𝑡ℎ forecasted data point and the historical input of length 𝐿.</p><p>This computation enables the visualization of how the model attends to different historical inputs when forecasting specific data points across the multivariate time series. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.">Results and Discussion</head></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.1.">Forecasting Benchmarks</head><p>Here, table <ref type="table" target="#tab_0">1</ref> shows the results using different deep learning time series forecasting models. We compared N-HiTS <ref type="bibr" target="#b16">[17]</ref>, N-BEATS <ref type="bibr" target="#b17">[18]</ref>, Temporal Fusion Transformer (TFT) <ref type="bibr" target="#b20">[21]</ref>, which are computed by Bhatti et al. <ref type="bibr" target="#b18">[19]</ref> using MSE and DTW as the evaluation metrics.</p><p>The results indicate that the N-HiTS model, both with and without an attention mechanism, consistently outperforms other models across MBP and HR predictions when considering MSE. Similarly, the N-BEATS model also performs well both with and without attention mechanisms.</p><p>Furthermore, the TFT model demonstrates competitive performance, especially when considering MSE. But in the previous paper by Bhatti et al. <ref type="bibr" target="#b18">[19]</ref>, the forecasting result of TFT is relatively smooth and doesn't show fluctuations.</p><p>In conclusion, the N-HiTS model, when augmented with an attention mechanism, emerges as a robust choice for forecasting MBP and HR, showcasing its efficacy in capturing complex temporal patterns. However, further exploration and experimentation are warranted to optimize model performance, particularly regarding temporal alignment and covariate incorporation. Notably, significant changes or peaks at certain points (like the 35 𝑡ℎ , 54 𝑡ℎ , and 63 𝑟𝑑 points) increasingly darken, highlighting their crucial role in shaping the prediction. This pattern suggests that N-HiTS places a stronger emphasis on data after the 20 𝑡ℎ points, effectively capturing  both data fluctuations and overall trends. As a result, the predictions closely align with the actual data and accurately reflect downward trends.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3.2.">Interpretability Analysis</head><p>On the other hand, the predictions from N-BEATS do not closely follow the downward trend of the actual data and display considerable fluctuation. This model's attention map reveals that N-BEATS in Fig 2b assigns larger weights to almost every rise and fall (such as at the 3 𝑟𝑑 , 10 𝑡ℎ , and 29 𝑡ℎ points), but without considering if it's worth to focus on the trend, which contributes to less effective information capture. Moreover, it appears that N-BEATS prioritizes data from the initial 1-2 hours more than N-BEATS, contributing to less stable prediction outcomes.</p><p>Both models indicate that the initial 1-3 hours are crucial for prediction, suggesting that medical staff should focus on interventions during this period. Significant changes occurring up to three hours prior also substantially impact the predictions. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4.">Conclusion</head><p>In this paper, we presented an interpretable time series forecasting algorithm that combines black-box deep learning models (N-HiTS &amp; NBEATS) with a general attention mechanism. This approach allows us to observe how the deep learning algorithm assigns importance to inputs while transparently generating each step of its output. Upon applying this advanced architecture to the eICU-CRD dataset, our findings demonstrate that the attention mechanism can enhance interpretability in deep learning time series forecasting models with minimal reduction or even no change in accuracy. By visualizing attention distributions, clinicians can identify which vital signs and historical data points are most influential in predicting sepsis. Furthermore, our model-agnostic attention mechanism is applicable to various deep learning forecasting models.</p></div><figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_0"><head>Figure 1 :</head><label>1</label><figDesc>Figure 1: Structure of our implementation. Adding the attention layer at the top of stacks, getting the results from the output.</figDesc><graphic coords="3,92.68,532.12,416.70,100.38" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_1"><head></head><label></label><figDesc>In the heatmap provided (Fig 2a, Fig 2b), darker colors indicate higher attention weights at specific time points, which correspondingly have a greater influence on prediction outcomes. Conversely, lighter colors suggest a lesser impact. The "N-HiTS + Attention" in Fig 2a demonstrates that areas after the 20 𝑡ℎ time point exhibit darker shades compared to earlier sections.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_2"><head></head><label></label><figDesc>(a) N-HiTS Attention distribution (b) N-BEATS Attention distribution.</figDesc></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_3"><head>Figure 2 :</head><label>2</label><figDesc>Figure 2: N-HiTS &amp; N-BEATS with attention using covariates to forecast MBP after minmax filter.</figDesc><graphic coords="6,130.96,274.75,333.36,171.64" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" xml:id="fig_4"><head>Figure 3 :</head><label>3</label><figDesc>Figure 3: N-HiTS forecasting results with attention using covariates after minmax filter</figDesc><graphic coords="7,147.63,84.18,300.03,170.45" type="bitmap" /></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1</head><label>1</label><figDesc>Performance of forecasting models on forecasting MBP and HR. Here, covariates (W C) for MBP are HR &amp; RR, and covariates for HR are MBP &amp; RR. * The MSE values are scaled by 1𝑒 −4 for better representation. † The DTW values are scaled by 1𝑒 −3 for better representation.</figDesc><table><row><cell>Models</cell><cell>Cov.</cell><cell cols="4">MBP (MSE*) MBP (DTW  † ) HR (MSE*) HR (DTW  † )</cell></row><row><cell>Persistence [19]</cell><cell>-</cell><cell>24.55</cell><cell>34.50</cell><cell>7.35</cell><cell>17.52</cell></row><row><cell>N-HiTS [19]</cell><cell>W C</cell><cell>18.46</cell><cell>18.70</cell><cell>7.37</cell><cell>13.12</cell></row><row><cell>N-HiTS [19]</cell><cell>W/o C</cell><cell>18.02</cell><cell>20.46</cell><cell>7.22</cell><cell>13.97</cell></row><row><cell>N-BEATS [19]</cell><cell>W C</cell><cell>19.79</cell><cell>19.37</cell><cell>8.73</cell><cell>14.36</cell></row><row><cell>N-BEATS [19]</cell><cell>W/o C</cell><cell>18.52</cell><cell>17.63</cell><cell>7.48</cell><cell>10.71</cell></row><row><cell>TFT [19]</cell><cell>W C</cell><cell>18.89</cell><cell>25.93</cell><cell>7.71</cell><cell>16.12</cell></row><row><cell>TFT [19]</cell><cell>W/o C</cell><cell>19.45</cell><cell>25.65</cell><cell>8.12</cell><cell>16.65</cell></row><row><cell>N-BEATS with Attention</cell><cell>W C</cell><cell>21.86</cell><cell>21.07</cell><cell>8.04</cell><cell>14.32</cell></row><row><cell cols="2">N-BEATS with Attention W/o C</cell><cell>18.71</cell><cell>18.03</cell><cell>8.40</cell><cell>11.33</cell></row><row><cell>N-HiTS with Attention</cell><cell>W C</cell><cell>18.78</cell><cell>20.44</cell><cell>7.24</cell><cell>13.32</cell></row><row><cell>N-HiTS with Attention</cell><cell>W/o C</cell><cell>19.73</cell><cell>20.42</cell><cell>6.97</cell><cell>12.24</cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Changing definitions of sepsis</title>
		<author>
			<persName><forename type="first">F</forename><surname>Gül</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">K</forename><surname>Arslantaş</surname></persName>
		</author>
		<author>
			<persName><forename type="first">İ</forename><surname>Cinel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Kumar</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Turkish journal of anaesthesiology and reanimation</title>
		<imprint>
			<biblScope unit="volume">45</biblScope>
			<biblScope unit="page">129</biblScope>
			<date type="published" when="2017">2017</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Mortality in sepsis and septic shock in europe, north america and australia between 2009 and 2019-results from a systematic review and meta-analysis</title>
		<author>
			<persName><forename type="first">M</forename><surname>Bauer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Gerlach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Vogelmann</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Preissing</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Stiefel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Adam</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Critical Care</title>
		<imprint>
			<biblScope unit="volume">24</biblScope>
			<biblScope unit="page" from="1" to="9" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">Sepsis: pathophysiology and clinical management</title>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">E</forename><surname>Gotts</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">A</forename><surname>Matthay</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Bmj</title>
		<imprint>
			<biblScope unit="volume">353</biblScope>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Sepsis definitions: time for change</title>
		<author>
			<persName><forename type="first">J.-L</forename><surname>Vincent</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M</forename><surname>Opal</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">C</forename><surname>Marshall</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">J</forename><surname>Tracey</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">The Lancet</title>
		<imprint>
			<biblScope unit="volume">381</biblScope>
			<biblScope unit="page" from="774" to="775" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">The critical roles and mechanisms of immune cell death in sepsis</title>
		<author>
			<persName><forename type="first">Z</forename><surname>Cheng</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">T</forename><surname>Abrams</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Toh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">S</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Z</forename><surname>Wang</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Q</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Yu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C.-H</forename><surname>Toh</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><surname>Wang</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Frontiers in immunology</title>
		<imprint>
			<biblScope unit="volume">11</biblScope>
			<biblScope unit="page">1918</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">The third international consensus definitions for sepsis and septic shock (sepsis-3)</title>
		<author>
			<persName><forename type="first">M</forename><surname>Singer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">S</forename><surname>Deutschman</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">W</forename><surname>Seymour</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Shankar-Hari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Annane</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Bauer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Bellomo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">G</forename><forename type="middle">R</forename><surname>Bernard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J.-D</forename><surname>Chiche</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><forename type="middle">M</forename><surname>Coopersmith</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Jama</title>
		<imprint>
			<biblScope unit="volume">315</biblScope>
			<biblScope unit="page" from="801" to="810" />
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">A transformer architecture for stress detection from ecg</title>
		<author>
			<persName><forename type="first">B</forename><surname>Behinaein</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Bhatti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Rodenburg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Hungler</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Etemad</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the 2021 ACM International Symposium on Wearable Computers</title>
				<meeting>the 2021 ACM International Symposium on Wearable Computers</meeting>
		<imprint>
			<date type="published" when="2021">2021</date>
			<biblScope unit="page" from="132" to="134" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Notions of explainability and evaluation approaches for explainable artificial intelligence</title>
		<author>
			<persName><forename type="first">G</forename><surname>Vilone</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Longo</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information Fusion</title>
		<imprint>
			<biblScope unit="volume">76</biblScope>
			<biblScope unit="page" from="89" to="106" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Explainable artificial intelligence: Concepts, applications, research challenges and visions</title>
		<author>
			<persName><forename type="first">L</forename><surname>Longo</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><surname>Goebel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><surname>Lecue</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Kieseberg</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Holzinger</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">International cross-domain conference for machine learning and knowledge extraction</title>
				<imprint>
			<publisher>Springer</publisher>
			<date type="published" when="2020">2020</date>
			<biblScope unit="page" from="1" to="16" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">An attention based deep learning model of clinical events in the intensive care unit</title>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">A</forename><surname>Kaji</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">R</forename><surname>Zech</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">S</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">K</forename><surname>Cho</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><forename type="middle">S</forename><surname>Dangayach</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">B</forename><surname>Costa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">E</forename><forename type="middle">K</forename><surname>Oermann</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">PloS one</title>
		<imprint>
			<biblScope unit="volume">14</biblScope>
			<biblScope unit="page">e0211057</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Deepsofa: a continuous acuity score for critically ill patients using clinically interpretable deep learning</title>
		<author>
			<persName><forename type="first">B</forename><surname>Shickel</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">J</forename><surname>Loftus</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><surname>Adhikari</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Ozrazgat-Baslanti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Bihorac</surname></persName>
		</author>
		<author>
			<persName><forename type="first">P</forename><surname>Rashidi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Scientific reports</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="page">1879</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Retain: An interpretable predictive model for healthcare using reverse time attention mechanism</title>
		<author>
			<persName><forename type="first">E</forename><surname>Choi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">T</forename><surname>Bahadori</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Sun</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><surname>Kulas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Schuetz</surname></persName>
		</author>
		<author>
			<persName><forename type="first">W</forename><surname>Stewart</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Advances in neural information processing systems</title>
		<imprint>
			<biblScope unit="volume">29</biblScope>
			<date type="published" when="2016">2016</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">The eicu collaborative research database, a freely available multi-center database for critical care research</title>
		<author>
			<persName><forename type="first">T</forename><forename type="middle">J</forename><surname>Pollard</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">E</forename><surname>Johnson</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">D</forename><surname>Raffa</surname></persName>
		</author>
		<author>
			<persName><forename type="first">L</forename><forename type="middle">A</forename><surname>Celi</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">G</forename><surname>Mark</surname></persName>
		</author>
		<author>
			<persName><forename type="first">O</forename><surname>Badawi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Scientific data</title>
		<imprint>
			<biblScope unit="volume">5</biblScope>
			<biblScope unit="page" from="1" to="13" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Bhatti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Thangavelu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><surname>Hassan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">Y</forename><surname>Kim</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2306.14016</idno>
		<title level="m">Interpreting forecasted vital signs using n-beats in sepsis patients</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b14">
	<analytic>
		<title level="a" type="main">Characterizing the patients, hospitals, and data quality of the eicu collaborative research database</title>
		<author>
			<persName><forename type="first">H</forename><forename type="middle">M</forename><surname>O'halloran</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Kwong</surname></persName>
		</author>
		<author>
			<persName><forename type="first">R</forename><forename type="middle">A</forename><surname>Veldhoen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><forename type="middle">M</forename><surname>Maslove</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Critical Care Medicine</title>
		<imprint>
			<biblScope unit="volume">48</biblScope>
			<biblScope unit="page" from="1737" to="1743" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b15">
	<analytic>
		<title level="a" type="main">Extending machine learning-based early sepsis detection to different demographics</title>
		<author>
			<persName><forename type="first">S</forename><surname>Parmar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Shan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">Y</forename><surname>Kim</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE First International Conference on Artificial Intelligence for Medicine, Health and Care (AIMHC)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2024">2024. 2024</date>
			<biblScope unit="page" from="70" to="71" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b16">
	<analytic>
		<title level="a" type="main">Nhits: Neural hierarchical interpolation for time series forecasting</title>
		<author>
			<persName><forename type="first">C</forename><surname>Challu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><forename type="middle">G</forename><surname>Olivares</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">N</forename><surname>Oreshkin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">F</forename><forename type="middle">G</forename><surname>Ramirez</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">M</forename><surname>Canseco</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Dubrawski</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Proceedings of the AAAI Conference on Artificial Intelligence</title>
				<meeting>the AAAI Conference on Artificial Intelligence</meeting>
		<imprint>
			<date type="published" when="2023">2023</date>
			<biblScope unit="volume">37</biblScope>
			<biblScope unit="page" from="6989" to="6997" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b17">
	<monogr>
		<author>
			<persName><forename type="first">B</forename><forename type="middle">N</forename><surname>Oreshkin</surname></persName>
		</author>
		<author>
			<persName><forename type="first">D</forename><surname>Carpov</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Chapados</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Bengio</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N-Beats</forename></persName>
		</author>
		<idno type="arXiv">arXiv:1905.10437</idno>
		<title level="m">Neural basis expansion analysis for interpretable time series forecasting</title>
				<imprint>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b18">
	<monogr>
		<author>
			<persName><forename type="first">A</forename><surname>Bhatti</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Liu</surname></persName>
		</author>
		<author>
			<persName><forename type="first">C</forename><surname>Dan</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Shen</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Lee</surname></persName>
		</author>
		<author>
			<persName><forename type="first">Y</forename><surname>Kim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">J</forename><forename type="middle">Y</forename><surname>Kim</surname></persName>
		</author>
		<idno type="arXiv">arXiv:2311.04770</idno>
		<title level="m">Vital sign forecasting for sepsis patients in icus</title>
				<imprint>
			<date type="published" when="2023">2023</date>
		</imprint>
	</monogr>
	<note type="report_type">arXiv preprint</note>
</biblStruct>

<biblStruct xml:id="b19">
	<analytic>
		<title level="a" type="main">Interpretable multivariate time series forecasting with temporal attention convolutional neural networks</title>
		<author>
			<persName><forename type="first">L</forename><surname>Pantiskas</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Verstoep</surname></persName>
		</author>
		<author>
			<persName><forename type="first">H</forename><surname>Bal</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">IEEE Symposium Series on Computational Intelligence (SSCI)</title>
				<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2020">2020. 2020</date>
			<biblScope unit="page" from="1687" to="1694" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b20">
	<analytic>
		<title level="a" type="main">Temporal fusion transformers for interpretable multi-horizon time series forecasting</title>
		<author>
			<persName><forename type="first">B</forename><surname>Lim</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">Ö</forename><surname>Arık</surname></persName>
		</author>
		<author>
			<persName><forename type="first">N</forename><surname>Loeff</surname></persName>
		</author>
		<author>
			<persName><forename type="first">T</forename><surname>Pfister</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">International Journal of Forecasting</title>
		<imprint>
			<biblScope unit="volume">37</biblScope>
			<biblScope unit="page" from="1748" to="1764" />
			<date type="published" when="2021">2021</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
