=Paper=
{{Paper
|id=Vol-3823/7_Marbukh_position_8
|storemode=property
|title=Towards Recommender System Supported Contact Tracing for Cost-Efficient and Risk Aware Infection Suppression
|pdfUrl=https://ceur-ws.org/Vol-3823/7_Marbukh_position_8.pdf
|volume=Vol-3823
|authors= Vladimir Marbukh
|dblpUrl=https://dblp.org/rec/conf/healthrecsys/Marbukh24
}}
==Towards Recommender System Supported Contact Tracing for Cost-Efficient and Risk Aware Infection Suppression==
Towards Recommender System Supported Contact Tracing
for Cost-Efficient and Risk Aware Infection Suppression⋆
Vladimir Marbukh1,∗
1
National Institute of Standards & Technology, Information Technology Laboratory, 100 Bureau Dr., Gaithersburg, Maryland, USA
Abstract
In public health, contact tracing is the process of identifying people who may have been exposed to an infected
person. Contact tracing performance criteria, which include infection suppression, protection of high-risk indi-
viduals, and cost-efficiency, are not necessarily aligned with each other. Pareto optimization of the corresponding
inherent trade-offs, especially at the early stages of infection, is typically unrealistic due to insufficient information
on infection propagation, risk factors, prevention and treatment options, etc. We suggest that contact tracing
performance can be significantly improved with the support of a specialized Recommender System (RS). Based
on the combination of up-to-date contact tracing and medical data, RS can identify and test through Exposure
Notification System (ENS) not only high-risk individuals but also potential superspreaders to suppress infection
propagation. Due to incomplete information, the dynamic nature of the problem, and a large state and action
spaces, the RS should be supported by Deep Reinforcement Learning (DRL) for solving the corresponding Partially
Observable Markov Decision Process (POMDP).
Keywords
contact tracing, exposure notifications, recommender system, deep reinforcement learning, partially observable
Markov decision process
1. Introduction & Motivation
In public health, contact tracing is the process of identifying people who may have been exposed to an
infected person, subsequent testing them for infection, and isolating or treating the infected [1]. Contact
tracing performance criteria include infection suppression, protection of high-risk individuals, and
cost-efficiency. These criteria are not necessarily aligned with each other, e.g., given testing capacity,
infection suppression requires high priority testing for the potential super spreaders, while protection of
high-risk individuals requires testing them with higher priority. Given the testing priorities, the existing
Google/Apple Exposure Notification (GAEN) technology [2] can support an Exposure Notification
System (ENS) by allowing public health authorities to quickly notify people for subsequent testing.
GAEN is a framework and protocol specification developed by Apple Inc. and Google to facilitate digital
contact tracing during the COVID-19 pandemic to augment more traditional contact tracing techniques
using Android or iOS smartphones.
Extensive research on COVID-19 has revealed that while risk-aware, multi-criteria optimization of
contact tracing has significant potential, realization of this potential requires deep knowledge of the
infection propagation mechanisms, medical prognoses and treatment options for infected individuals
with different risk profiles [3]. Even though COVID-19 originated almost five years ago, such knowledge
is still lacking [4, 5], which suggests that a contact tracing system should have the ability to collect
and make sense of all up-to-date available information on infection. This can be achieved with the
HealthRecSys’24: The 6th Workshop on Health Recommender Systems co-located with ACM RecSys 2024
⋆
Official contribution of the National Institute of Standards and Technology; not subject to copyright in the United States.
Certain equipment, instruments, software, or materials are identified in this paper in order to specify the experimental
procedure adequately. Such identification is not intended to imply recommendation or endorsement of any product or
service by NIST, nor is it intended to imply that the materials or equipment identified are necessarily the best available for
the purpose.
Envelope-Open marbukh@nist.gov (V. Marbukh)
© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
CEUR
ceur-ws.org
Workshop ISSN 1613-0073
Proceedings
Figure 1: Recommender System supported Contact Tracing.
support of a specialized Recommender System (RS). Given testing capacity, the RS should utilize the up-
to-date contact tracing, medical, and all other available relevant data to identify and through Exposure
Notification System (ENS) notify individuals to be tested [2, 6]. Due to incomplete information, the
dynamic nature of the problem, and a large state and action space the RS should be supported by Deep
Reinforcement Learning (DRL) for solving the corresponding Partially Observable Markov Decision
Process (POMDP) [7, 8]. POMDP describes the evolution of health status of each participating individual,
where infectious status may not be observable and testing decisions are constrained by available testing
capacity. Since a positive test result for some individual may reveal increased accumulated exposure for
other individuals due to proximity to the newly discovered infection spreaders, the problem cannot be
decoupled. These interdependencies significantly complicate the problem. The paper is organized as
follows. Section 2 outlines operations and flow of information in the proposed RS supported contact
tracing, and section 3 provides some technical details on accumulated exposure evaluation.
2. RS Supported Contact Tracing
Figure 2 presents a highly aggregated view of a Recommender System supported Contact Tracing
System.
The Exposure Monitoring System (EMS) monitors “accumulated exposure to infection” for each
[0,𝑡]
participating individual 𝑖, ℰ𝑖 (defined in the next section) in near real time 𝑡, and feeds this informa-
tion into the RS. RS also gets available information on demographic and risk factors of participating
(1)
individuals 𝑥𝑖 as well as health status of both participating and not participating individuals who
(2)
went through Health Care System 𝑥𝑖 . Note that participating individuals are likely to consent to
revealing their health information since they would benefit from accounting for their risk factors, e.g.,
advanced age, preexisting conditions, etc. For not participating individuals, some relevant information,
which does not require revealing individual identity, can be obtained without violating their privacy.
RS is also fed the estimate of the infection reproduction number 𝑅(𝑡), i.e., the average number of new
infections produced by one infected individual during his/her lifetime: 𝑅(𝑡) ̃ ≈ 𝑅(𝑡). Estimate 𝑅(𝑡) ̃ may
combine information from EMS, the Health Care System, and possibly from other tracing mechanisms
not shown in Figure 1, e.g., from manual tracing. Infection suppression requires keeping the infection
reproduction number less than one: 𝑅(𝑡) < 1. Due to numerous uncertainties in the 𝑅(𝑡) estimation:
̃ ≈ 𝑅(𝑡), the infection suppression condition is 𝑅(𝑡)
𝑅(𝑡) ̃ ≤ 1 − 𝜀, where “safety margin” 𝜀 < 1 depends on
the confidence level of the corresponding estimate. The reward of the RS supported Contact Tracing
is quantified be the negative loss −𝐿(𝑡), where 𝐿(𝑡) = 𝐿𝑒𝑐 (𝑡) + 𝐿𝑠𝑐 (𝑡). Here economic loss due to lost
productivity and cost of testing/treatment is 𝐿𝑒𝑐 (𝑡), and “social cost” quantifying suffering and, most
importantly, deaths due to the infection 𝐿𝑠𝑐 (𝑡). The cost estimates are provided to RS by the Health Care
System and Agencies collecting and processing economic data.
System evolution is described by POMDP 𝛿(𝑡) = (𝛿𝑖 (𝑡)), where component 𝛿𝑖 (𝑡) describes the health
status of participating individual 𝑖 , i.e., “non-infected,” “infected,” “deceased.” “Non-infected” and
“infected” states may not be observable which makes process 𝛿(𝑡) partially observable. The decision
to test a participating individual reveals his/her infected or not-infected status at a certain cost due
[0,𝑡] (1) (2)
to limited testing capacity. RL employs DRL to make testing decisions on the basis of ℰ𝑖 , 𝑥𝑖 , 𝑥𝑖 .
Constraints on the infection reproduction number can be incorporated through penalty function ℎ(𝑅(𝑡)) ̃
̃ ̃
which is flat for 𝑅(𝑡) ≤ 1 − 𝜀 and sharply increases for 𝑅(𝑡) > 1 − 𝜀.
Our conjecture is that (near) optimal notification strategy is threshold-based: individual 𝑖 should be
notified at the first moment 𝑡 = 𝜃𝑖 > 0 when this individual’s accumulated exposure reaches threshold
ℰ𝑖̂ :
[0,𝑡]
𝜃𝑖 = inf{𝑡 ∶ ℰ𝑖 ≥ ℰ𝑖̂ }, (1)
𝑡≥0
where threshold ℰ𝑖̂ = Δ(ℰ , 𝑥) depends on the history of former testing decisions/results combined
with medical and demographic data of these individuals. The function Δ(ℰ , 𝑥) can be evaluated by
employing a Deep Supervised Learning (DSL) algorithm. Note that in practice, notification strategy may
operate on the basis of a small number of risk groups [3], which may be defined and then redefined by
an on-line clustering algorithm. Assumptions of homogeneity and large number of individuals within
each group, simplifies optimization of group-specific thresholds in (1).
3. Accumulated Exposure
For each participating individual 𝑖, the contact tracing system identifies “accumulated exposure” to
another participating individual 𝑗 during time interval [0, 𝑡] as follows:
𝑇
[0,𝑡] ̃ 𝑖𝑗 (𝜏 ))𝛼 ] 𝑑𝜏 ,
ℰ𝑖𝑗 = ∫ 𝜙 [(𝑑/𝑑 (2)
0
where the corresponding instantaneous exposure rate 𝜙(𝑧) is an increasing function of 𝑧 > 0, the
distance between individuals 𝑖 and 𝑗 at moment 𝜏 is 𝑑𝑖𝑗 (𝜏 ), and 𝑑̃ > 0, 𝛼 ≥ 1 are some parameters.
Individual 𝑖 accumulated exposure to infection during time interval [0, 𝑡] is defined as the aggregated
exposure to all known spreaders during this time interval:
𝑇
[0,𝑡] ̃ 𝑖𝑗 (𝜏 ))𝛼 ] 𝑑𝜏 ,
ℰ𝑖 = ∑ ∫ 𝜋𝑗 (𝜏 )𝜙 [(𝑑/𝑑 (3)
𝑗 0
where 𝜋𝑗 (𝜏 ) = 1 if individual 𝑗 is infected at moment 𝜏 and 𝜋𝑗 (𝜏 ) = 0 otherwise.
Consider some examples. As currently defined by the CDC [1], a high-risk COVID-19 exposure is a
contact with a person who tests/tested positive for SARS-CoV-2 which takes place at a distance of less
than two meters for a total of 15 minutes or more over a 24-hour period. In this case, 𝑑̃ = 2 m, 𝛼 → ∞,
24
𝜙(𝑥) ≡ min(𝑥, 1), and thus an individual is assumed exposed if ℰ [0,𝑇 ] = ∫0 𝟙(𝑑(𝑡) − 2 m)𝑑𝑡 > 15 min,
where 𝟙(𝑥) = 0 if 𝑥 ≤ 0 and 𝟙(𝑥) = 1 otherwise. In another example [9], 𝑑̃ = 2 m, 𝜙(𝑥) ≡ 𝑥, and thus an
24
individual is assumed exposed if ℰ [0,𝑇 ] = ∫0 (2/𝑑(𝑡))𝛼 𝑑𝑡 > 15 min.
Finally note that available information on accumulated exposure to specific individuals can be used to
identify “infection superspreaders” who otherwise could be unidentified, e.g., due to being asymptomatic
or for any other reason. This can be done with known algorithms [10] on undirected exposure graph 𝐺
where nodes 𝑖 and 𝑗 are connected if ℰ𝑖𝑗[0,𝑡] ≥ ℰ ̆[0,𝑡] , and ℰ ̆[0,𝑡] > 0 is a properly defined threshold.
References
[1] CDC, Contact tracing, 2024. URL: https://www.cdc.gov/museum/pdf/
cdcm-pha-stem-lesson-contact-tracing-lesson.pdf.
[2] C. Nebeker, D. Kareem, A. Yong, R. Kunowski, M. Malekinejad, E. Aronoff-Spencer, Digital
exposure notification tools: A global landscape analysis, PLOS Digital Health 2 (2023) 1–16.
doi:10.1371/journal.pdig.0000287 .
[3] D. Acemoglu, V. Chernozhukov, I. Werning, M. D. Whinston, Optimal Targeted Lockdowns in
a Multigroup SIR Model, American Economic Review: Insights 3 (2021) 487–502. doi:10.1257/
aeri.20200590 .
[4] C. F. Manski, F. Molinari, Estimating the COVID-19 infection rate: Anatomy of an inference
problem, Journal of Econometrics 220 (2021) 181–192. doi:10.1016/j.jeconom.2020.04.041 .
[5] W. Rauch, H. Schenk, N. Rauch, M. Harders, H. Oberacher, H. Insam, R. Markt, N. Kreuzinger,
Estimating actual SARS-CoV-2 infections from secondary data, Scientific Reports 14 (2024) 6732.
doi:10.1038/s41598- 024- 57238- 0 .
[6] X. Chen, L. Yao, J. McAuley, G. Zhou, X. Wang, Deep reinforcement learning in recommender
systems: A survey and new perspectives, Knowledge-Based Systems 264 (2023) 110335. doi:10.
1016/j.knosys.2023.110335 .
[7] X. Xiang, S. Foo, Recent advances in deep reinforcement learning applications for solving partially
observable markov decision processes (POMDP) problems: Part 1—fundamentals and applications
in games, robotics and natural language processing, Machine Learning and Knowledge Extraction
3 (2021) 554–581. doi:10.3390/make3030029 .
[8] X. Xiang, S. Foo, H. Zang, Recent advances in deep reinforcement learning applications for
solving partially observable markov decision processes (POMDP) problems part 2—applications in
transportation, industries, communications and networking and more topics, Machine Learning
and Knowledge Extraction 3 (2021) 863–878. doi:10.3390/make3040043 .
[9] K. Sayrafian, B. Cloteaux, V. Marbukh, Impact of using soft exposure thresholds in automatic
contact tracing, in: 2022 IEEE International Conference on E-health Networking, Application &
Services (HealthCom), 2022, pp. 1–6. doi:10.1109/HealthCom54947.2022.9982790 .
[10] D. Zhang, Y. Wang, Z. Zhang, Identifying and quantifying potential super-spreaders in social
networks, Scientific Reports 9 (2019) 14811. doi:10.1038/s41598- 019- 51153- 5 .