=Paper=
{{Paper
|id=Vol-2884/paper_121
|storemode=property
|title=Uncertainty Aware Wildfire Management
|pdfUrl=https://ceur-ws.org/Vol-2884/paper_121.pdf
|volume=Vol-2884
|authors=Tina Diao,Samriddhi Singla,Ayan Mukhopadhyay,Ahmed Eldawy,Ross Shachter,Mykel Kochenderfer
}}
==Uncertainty Aware Wildfire Management==
Uncertainty Aware Wildfire Management Tina Diao,1 Samriddhi Singla,2 Ayan Mukhopadhyay,1 Ahmed Eldawy,2 Ross Shachter, 1 Mykel Kochenderfer 1 1 Stanford University, 2 University of California, Riverside 1 {tdiao, ayanmukh, shachter, mykel}@stanford.edu, 2 {ssing068, eldawy}@ucr.edu Abstract particularly susceptible to wildfires. Known to be notori- ously unpredictable, sudden changes in wind directions or Recent wildfires in the United States have caused many lives and billions of dollars, destroying countless structures and weather can change the way fires spread (Garza 2020). Fire- forests. Fighting wildfires is extremely complex. It is difficult fighters need to allocate limited resources in dynamic and to observe the true state of fires due to smoke and risk asso- uncertain environments to intervene and stop the spread of ciated with ground surveillance. There are limited resources fire. to be deployed over a massive area and the spread of the The problem of combating wildfires is an example of dy- fire is challenging to predict. This paper proposes a decision- namic resource allocation. Such an approach to combat natu- theoretic approach to combat wildfires. We model the re- ral calamities like wildfires, floods, and earthquakes are dif- source allocation problem as a partially observable Markov ficult for several reasons. The dynamics of the events are decision process. We also present a data-driven model that lets us simulate how fires spread as a function of relevant unknown and complicated to model in closed-form. There covariates. A major problem in using data-driven models to are multiple covariates that affect the spread of fire like combat wildfires is the lack of comprehensive data sources the vegetation, fuel, altitude, and wind, and the exact re- that relate fires with relevant covariates. We present an algo- lationship between each covariate and fire spread is uncer- rithmic approach based on large-scale raster and vector anal- tain. Resource allocation is extremely challenging because ysis that can be used to create such a dataset. Our data with of the difficulty of making predictions about the spread of over 2 million data points is the first open-source dataset that fire and expected damage. Designing principled approaches combines existing fire databases with covariates extracted to deploy resources is important to mitigate the effects of from satellite imagery. Through experiments using real-world wildfires and emergencies in general (Mukhopadhyay et al. wildfire data, we demonstrate that our forecasting model can 2020). accurately model the spread of wildfires. Finally, we use sim- ulations to demonstrate that our response strategy can signif- Events like fires affect large areas, and it is difficult to icantly reduce response times compared to baseline methods. correlate large-scale data from various sources to analyze and study fires. To the best of our knowledge, there exists no comprehensive data source that combines fire occurrence Introduction with geospatial features, fuel levels, and weather to allow the In 2018, a large wildfire (named Camp Fire) in California re- research community to develop approaches to combat wild- sulted in the loss of 88 lives, displaced countless more, and fires. Finally, the problem of deploying resources to manage destroyed more than 18,500 structures. The estimated cost wildfires is full of uncertainties. As fires spread, it becomes of the destruction was a staggering $15 billion. As we write increasingly difficult to observe the true state on the ground this manuscript at the start of the wildfire season in the state, due to the presence of smoke. As a result, first responders more than a million acres have already burned in California only have imperfect information to allocate their resources. this year alone due to more than 7,000 wildfires. Wildfires Contributions: We make the following contributions in have destroyed many towns and structures across the state. this paper. 1) We present a decision-theoretic approach to At one point in August 2020, the entire northern half of the dynamically allocate resources in uncertain environments to state had been instructed to prepare for evacuation (State of intervene against wildfires. Specifically, we model the re- California 2020). Crucially, the time of the year that is re- sponse problem as a partially observable Markov decision ferred to as the “wildfire season” in the state has only just process (POMDP) and present an approach to find opti- begun. mal actions for suppression for a given state of the prob- Fighting wildfires is difficult. Rapid urbanization and the lem. Our approach accommodates the constraint that the effects of climate change make urban and suburban areas true state of the fire is difficult to observe in practice. 2) AAAI Fall 2020 Symposium on AI for Social Good. Instead of focusing on physics-based models, we present a Copyright c 2020 for this paper by its authors. Use permitted un- data-driven approach that can simulate the spread of wild- der Creative Commons License Attribution 4.0 International (CC fires. We extract relevant covariates such as fuel levels, veg- BY 4.0). etation type, height of canopy, and elevation from satellite imagery to drive the simulation. 3) The calculation of accu- is divided into T time steps. We assume access to histor- rate zonal statistics is a bottleneck for large scale geospa- ical data of fire incidents D, which is a vector of tuples tial data analysis (Singla and Eldawy 2018). We demon- {(t1 , `1 , u1 , w1 ), (t2 , `2 , u2 , w2 ), . . . , (tn , `n , un , wn )}, strate how extremely large-scale geospatial data pertaining where each incident di ∈ D is identified by its time of to wildfires can be combined with other covariates through occurrence ti , location `i (mapping to a cell in G), intensity large-scale distributed raster and vector analysis. Crucially, R of fire observed ui ∈ + , and a vector of spatio-temporal we release the dataset open-source for the research com- R features wi ∈ m . The features w capture potential deter- munity. 4) Through experiments using real-world wildfire minants of fire such as weather and the type of vegetation in data from California, we demonstrate that our forecasting a cell. model can accurately model the spread of wildfires and our Observing the true state of the world is almost always response strategy results in significant improvement in sup- impossible when wildfires occur. Gathering real-time in- pression efforts compared to baseline methods that do not formation by visiting the affected areas by land is nat- consider potential fire spread. urally difficult. Therefore, information must be gathered through air surveillance obstructed by smoke. Consequently, Prior Work the true dynamics of wildfires are only partially observ- able. We model the fire suppression problem more real- The dynamics of fire spread are usually modeled us- istically as a partially observable Markov decision pro- ing physics-based models. Popular fire spread models in- cess (POMDP). A POMDP can be defined by the tuple clude BehavePlus (Andrews 1986) and Farsite (Finney {S, A, O, Z, T, R, γ} (Kochenderfer 2015). We define each 1998). They are based on mathematically modeled surface component of the POMDP formulation below: fire spread as a function of heat flux and fuel availabil- ity (Rothermel 1972). Such models are widely used by first 1. States: S is a finite set of states. The state at time t is de- responders and fire fighters to forecast the spread of fires. A noted by st = {Xt , Ft }, where Xt = {Xt1 , Xt2 , . . . , Xtk } relatively modern approach is to predict the rate of spread by and Ft = {Ft1 , Ft2 , . . . , Ftk } denote the status of the fire integrating real-time information about weather from sen- and fuel level in each of the cells in G, respectively. We sors (Altintas et al. 2015). Data-driven modeling has also consider the status of the fire X as a binary variable such been used to model fire spread. Supervised machine learn- that ing techniques have been applied to uncover strong associ- 1 if uit ≥ i ations of factors to wildfire sizes and frequency using dif- xt = ferent data sources. For example, Joseph et al. (2019) in- 0 otherwise vestigated weather conditions and geographic characteris- tics of extreme fire patterns in the contiguous United States R where uit ∈ + denotes the measured intensity of the fire in cell gi ∈ G at time step t. We consider that the fuel level and Ghorbanzadeh et al. (2019) examined wildfire suscepti- F is a discrete variable, such that F ∈ {0, 1, . . . , m}. The bility using geographic data in northern Iran. exogenous parameters and m can be estimated from data Response to wildfires traditionally uses simulation-based or through domain knowledge. approaches to select locations of intervention that maximize the expected utility of suppression efforts. Petrovic, Alder- 2. Actions: A is a finite set of actions. The actions denote the son, and Carlson (2012) model wildfire dynamics and ex- different permutations of cell indices that fire suppression amine the trade-off between multiple competing suppres- efforts can be applied to, up to a maximum number of sion efforts to compute an optimal strategy for fire re- cells specified as a resource constraint. sponses. Stochastic simulation and multi-agent coordination 3. State Transitions: T defines conditional transition proba- has also been explored to combat wildfires (Fried, Gilless, bilities, with T (s0 | s, a) denoting the transition probabil- and Spero 2006; Martin-fernÁndez, Martı́nez-Falero, and ity from state s to s0 when action a is taken. The transition Pérez-González 2002). model T includes the following three components: Griffith et al. (2017) explore how suppression efforts can be optimized by solving a mixed-integer linear program and (a) Burning: If cell gi ∈ G is burning at time step t, we by using Monte Carlo approaches to find optimal actions in a assume that the fuel level decreases by one unit in the Markov decision process (MDP). Our approach to optimize next time step. Therefore, Ft+1i = Fti − 1 if xit ≥ . suppression improves upon prior work (Griffith et al. 2017) (b) Action effectiveness: At any time step t, if an action, to address the uncertainty in state information. We also inte- i.e. fire suppression effort(s) is applied to a cell gi ∈ G grate a data-driven generative model to simulate the spread such that xit ≥ , there is probability q that the effort of fire to aid decision-making under uncertainty. successfully puts out the fire. (c) Fire dynamics: We use a generative model to simu- Problem Description late the spread of fire. We represent spread dynamics We consider a spatial area divided into a set of spatial cells by the probability distribution f (Xt+1i | xjt , w), where G. Let gi ∈ G denote the ith cell. We represent the neigh- gi ∈ Nj . Therefore, given that a specific cell gj ∈ G bors of a cell gi by Ni , for some definition of neighborhood is on fire at time step t, f denotes the likelihood of its (for example, neighbors of a cell can be the set of its neighboring cells being on fire at the subsequent time adjacent cells). Consider that the total time in consideration step t + 1. We assume that in each time step, fire from a cell can spread only to its neighboring cells. This assumption aids computational tractability, but is without loss of gener- ality because the decision-maker can discretize time fine enough such that the assumption is realistic. 4. Reward function: R : S ×P A → R is the reward func- tion, such that R(st , a) = gi ∈G xit U (gi ), where U (gi ) denotes the utility for a cell gi ∈ G to be on fire. Nat- urally, U varies across the cells. A cell with human oc- cupants is presumably more valuable and more costly to burn than a cell composed of forested land. Cells can also carry unequal ecological utilities (Bradshaw and Lueck 2012). Without loss of generality, we consider three tiers of damage across cells, representing costs of residences Figure 1: An example of a spatial area discretized in a grid (red), valuable ecological resource (yellow), and wildland with different costs and the colors correspond to varying (green) in decreasing values. Figure 1 shows an example costs to burn: red, yellow, and green in decreasing cost. grid with different types of cells. 5. Observations and Observation Transitions: O represents Bt−1 Bt Bt+1 the set of observations with Z(o | s, a) denoting the prob- ability of receiving observation o at state s when action ot−1 ot a is taken. We denote specific observations by oit , which correspond to whether cell gi ∈ G is seen to be burning or not at time step t. Its transitions Z(o | s, a) are determin- St−1 St St+1 istic, i.e. oit,a = uit if an action has been applied to cell gi ∈ G. This is because we assume to have “eyes on lo- at−1 at cation” when an action is applied to a cell. Otherwise, for cells where action for suppression is not applied (denoted Rt−1 Rt by ā in the expression below), a generative representa- tion is used based on prior work (Julian and Kochenderfer 2019) such that Figure 2: POMDP (factored) representation of the dynamic 1 if P rt (Xti ) > η in state s decision problem. Shaded ovals reflect what is known to the i ot,ā = decision maker. 0 otherwise where η is an exogenous parameter. 6. Discount factor: γ ∈ [0, 1] denotes the discount factor. We use random forests (Liaw, Wiener et al. 2002) which involves constructing a large number of decision trees at In a POMDP, the decision-maker cannot directly observe training time and then aggregating the outputs of the trees. the state. Instead, they only have access to beliefs that are The aggregation technique is typically using the mode of generated probabilistically based on the actions taken. In- the outputs for classification and the mean of the outputs for formation about states can be inferred from the history (h) regression. The central idea behind using random forests is of observations and actions. It is common to maintain a dis- to average many noisy but (approximately) unbiased mod- tribution over states given the history; this distribution is els, thereby reducing the variance of the overall forecasting known as the belief B, such that B(s | h) denotes the prob- model. ability of being in state s given history h. The goal for the decision-maker is to find a mapping from belief states to ac- Resource Allocation tions that maximizes the expected future discounted reward. The general dynamic decision framework following the Approach POMDP formulation is shown in Figure 2. Recall that the sources of uncertainty are the current state of the fire (S) Modeling Fire Spread and the dynamics of the spread (driven by the set of co- In order to accurately simulate the spread of fire, we model variates w). At each time step, the available set of actions the fire dynamics f using a data-driven model. Recall that f (A) is a combination of the fire location(s) to suppress. The is a probability distribution over a cell being on fire in time decision-maker acts based on a belief distribution of the true step t + 1, conditional on its neighbors being on fire at the state of the world and receives an observation. Utility (U) re- previous time step t. The goal of modeling the function f is flects a measure of the expected damage of having a fire in a to understand the effect of various covariates like wind, veg- specific cell (e.g. residence versus wildland). etation type, canopy height, altitude, etc. on fire spread. Typ- Due to the large state space and action space (the ac- ically, covariates in geospatial analysis are heterogeneous. tion space is combinatorial), we use the sampling-based on- line Monte Carlo tree search (MCTS) (Kochenderfer 2015). Data Processing To address the computational complexity of POMDPs, ap- Data needed to model the spread of wildfires comes from proaches based on MCTS typically use a particle filter to varied sources. The temporal and spatial resolutions of such represent beliefs in the search tree. Specifically, an approach data sources are typically different. The data sources can that is of relevance to our problem is the partially observable also be of different forms (vector or raster). The vector Monte Carlo planning algorithm with observation widen- model uses points and line segments to identify spatial lo- ing (POMCPOW) (Sunberg and Kochenderfer 2017). POM- cations while the raster model uses a set of cells for the CPOW differs from other online MCTS algorithms in that in same purpose. Combining large-scale vector and raster data the simulation step, given a state (s), history (h), and depth is known to be a difficult problem (Singla and Eldawy 2018). (for tree exploration), it weights the belief nodes and ex- We collected fire occurrence data in vector form from the pands the belief updates gradually as more simulations are Visible Infrared Imaging Radiometer Suite (VIIRS) thermal added. At each step, a single simulated new state is added to anomalies/active fire database (Schroeder et al. 2014). The the particle collection, weighted to approximate the belief in spatial resolution of VIIRS data is in the form of pixels rep- every tree node, which is then used to sample the next new resenting 375 × 375 meter square cells (Schroeder et al. state. 2014). The latitude and longitude values correspond to the Although POMCPOW is shown to outperform other algo- center. Evidence of fire was read from the daily fire radia- rithms (Sunberg and Kochenderfer 2017), an issue with di- tive power (FRP) levels in the VIIRS dataset. The data used rectly using POMCPOW on our problem is that the observa- to build the feature space was collected in raster form from tion space is large and complex. This leads to a severe spar- the LANDFIRE project (Ryan and Opperman 2013). The sity of particles, i.e., the probability of sampling a relevant foundation of the LANDFIRE project is based on satellite observation is very small. To alleviate this, we modify the imagery. The raster files had a spatial resolution of 30 × 30 routine used to update belief in POMCPOW. Specifically, meter square cells. This included features like canopy base we replace the weighted particle filter with the standard par- density, canopy cover, and vegetation type. We list features ticle filter without rejection (Kochenderfer 2015). As shown used and the years from which the data was collected in Ta- in Algorithm 1, given a current belief (b), action (a), and ble 1. observation o, |b| samples are generated from the simulator, weighted, and subsequently resampled by its weights. Be- Name Year(s) fore the resampling step, all weights of an observation o may Canopy Base Density 2012, 2014, 2016 be 0 due to the large observation space. If all the weights of Canopy Base Height 2012, 2014, 2016 an observation are 0, the probabilities of the sampled states Canopy Cover 2012, 2014, 2016 are normalized to be proportional to the number of states al- Canopy Height 2012, 2014, 2016 ready sampled. This reweighting step makes an approxima- Existing Vegetation Cover 2012, 2014, 2016 tion to importance resampling that seeks to estimate prop- Existing Vegetation Height 2012, 2014, 2016 erties of a target distribution by sampling from a different Existing Vegetation Type 2012, 2014, 2016 distribution. Elevation 2016 Slope 2016 Algorithm 1: UpdateBelief (b, a, o) Table 1: LANDFIRE Raster Data Input : Belief b, action a, observation o Output: Updated belief b0 To reconcile the different spatial resolutions, we divide 1 b0 ← ∅ the state of California into a grid G of 375 × 375 meter 2 for i ← {1, . . . , | b |} do cells. The center of each fire pixel from the vector data can 3 si ← random state in b therefore overlap with exactly one cell in G. To compute 4 s0i ∼ G(si , a) the feature vector associated with each data point, we com- 5 wi ← O(o | s0i , a) pute zonal statistics for the vector data using the raster data. 6 end The method of zonal statistics refers to calculating summary P|b| statistics using a raster dataset within zones defined by an- 7 if i=1 wi = 0 then 8 wi = len(w) 1 ; // reweight states other dataset (typically in vector form). Traditional systems to compute zonal statistics require the 9 end data to be converted into the same format, either raster or 10 for i ← {1, . . . , | b |} do vector. Automated tools can then be used to either vectorize 11 Randomly select k with probability proportional to the raster dataset or rasterize the vector dataset. The first ap- wk proach converts each pixel in the raster to a point and then 12 Add s0k to b0 tests the point against each polygon in the vector data to find 13 end a match. This approach has a computational complexity of 14 return normalized b 0 O(np log np · c · r), where np is the number of polygons in the vector data, and c and r are the number of columns and rows in the raster data respectively. The second approach Cluster Vector Data Distribute Intersections Files Intersections Raster Data Computation Input Zonal Statistics Computation Figure 3: Architecture for calculating zonal statistics for large-scale raster and vector data rasterizes the vector data by converting each polygon to a neighboring cell of gi at time step t + 1. To calculate the raster (mask) layer with the same resolution as the input features for a specific cell gi ∈ G at a specific time step, raster layer. It then combines the two raster layers to com- we calculated summary statistics (maximum, minimum, me- pute the desired aggregate function. Most systems that use dian, sum, mode, count, and mean of the feature values) us- this approach keep the mask layer in memory. If the input ing all raster cells within gi at time step t. To the best of our raster layer has a very high resolution, the size of the mask knowledge, our data is the first comprehensive open-source layer can become too large to be kept in memory. This ap- dataset that combines fire occurrence with relevant covari- proach has a computational complexity of O(np · c · r). ates extracted from satellite imagery. The data is available at Neither of the two approaches mentioned above scale for https://wildfire-modeling.github.io. We used data from 2012 the high resolution raster and vector data due to the re- to 2017 as our training set and data from 2018 as our test set. quirement of converting between vector and raster formats. We set the time step for our experiments to a day, based on Specifically, in our problem, the vector data consist of over 3 the minimum time fidelity of the VIIRS dataset. All exper- million polygons and each of the raster data sources consists iments were run on an Intel Xeon 2.2 GHz processor with of over 1 billion entries. To deal with the large-scale geospa- 125 GB of memory. tial data, we create a fully decentralized approach to com- pute large-scale zonal statistics based on Singla and Eldawy Fire Spread (2020). Our approach does not require data to be converted We label a forecast as a true positive prediction when both from one form to another (vector or raster). Instead, it com- the predicted fire intensity and the recorded fire intensity are putes an intermediate data structure, called an intersections greater than the pre-specified threshold . We observe that file between the raster and vector data. The intersections file the random forest regression model is insensitive to the num- can be computed using only the vector data and the meta- ber of trees used (5, 50, 100, and 500). We also observed data of the raster data (coordinate reference, resolution, etc.). similar accuracy across training and test sets. We tested sev- Further, our approach can leverage parallel computation by eral realizations of to examine the robustness of our fore- using the intersection files. Our approach, with a computa- casting approach on different sizes of fire. Our results show tional complexity of O(np log np + c · r), is scalable and that while it is relatively difficult to predict spread from ex- efficient for large raster and vector datasets. Furthermore, as tremely small fires ( = 0.5), our forecasting model achieves a by-product, our approach makes it easier to find the neigh- high accuracy (> 90%) in predicting spreads from relatively borhoods of polygons by performing a spatial self-join op- larger fires. We summarize the results in Table 2. eration using the predicate intersects on all the cells in the vector data. We show the architecture of our approach in FRP Threshold () Accuracy on Test Set Figure 3. 0.5 77% 1 81% Experiments 5 92% We used fire data from California, USA spanning 2012– 2018 for the prediction modeling. We divided the state into Table 2: Accuracy with 5-tree Random Forest Regression a set of 375 × 375 meter cells. Our goal was to capture how fire spreads given an initial occurrence of fire. As a result, we only considered cells and days that exhibited the possibility Fire Response of fire spreading from an existing fire. Specifically, each cell Our goal is to create a pipeline for modeling response to in our data has a fire in its neighborhood. Our data has a total wildfires by utilizing the data-driven model of fire spread of 2,367,209 data points. Each row in our data represents a to optimize response decisions. However, in this paper, cell gi at a specific time t, a set of spatial-temporal features we present experimental results based on a simple spread wit , and the status of fire uti as well as the spatial-temporal model. Specifically, we simulate the spread of fire using features wjt+1 and the status of fire ut+1 j of gj ∈ Ni , a fixed environmental conditions (for example, we use a fixed Baseline UAFR 1,800 Baseline UAFR Baseline UAFR 700 Negative Utility Negative Utility Negative Utility 3,000 1,600 600 2,500 1,400 q=1 q=0.9 q=0.8 q=1 q=0.9 q=0.8 q=1 q=0.9 q=0.8 (a) (b) (c) ·104 6,000 8,000 Baseline UAFR Baseline UAFR Baseline UAFR Negative Utility Negative Utility Negative Utility 1.4 5,500 7,000 5,000 1.2 4,500 6,000 q=1 q=0.9 q=0.8 q=1 q=0.9 q=0.8 q=1 q=0.9 q=0.8 (d) (e) (f) Figure 4: Effectiveness measured by negative utility on test data (lower is better) on varying grid sizes: (a) 4 × 4 (b) 6 × 6 (c) 8 × 8 (d) 10 × 10 (e) 12 × 12 and (f) 16 × 16. wind direction and rate of spread in each simulation). We combat due to difficulties in surveillance and scarcity of calculate the probability of fire spread following a simpli- resources. In this paper, we built a data-driven forecast- fied version of the deterministic fire dynamics model in prior ing model by extracting relevant determinants of fire spread work (Griffith et al. 2017). through satellite imagery. Then, we developed an approach We conducted experiments on different sized square grids to wildfire suppression that explicitly takes state uncertainty (42 , 62 , 82 , 102 , 122 and 162 ). We varied the starting states into account. To the best of our knowledge, we were the first by random initialization of fire maps and cell types. We to create a comprehensive dataset on wildfires that combines experimented with 6 different initial states. For each ini- historical fire data with relevant covariates. Our dataset, with tial state, we created 256 spread scenarios by varying wind over two million data points, and our codebase are open- and rate of spread. For each initialization, a fixed percent- source for the research community to use. age (10%) of cells was considered to be on fire. The pro- There are some limitations to our current work. While our portions of red, yellow, and green cells were set to be 20%, forecasting model shows high accuracy, we observe that the 30%, and 50% respectively, consistent with the distribution random forest regression model is insensitive to the number of land use in California (United States Department of Agri- of decision trees in the ensemble. As a result, simpler meth- culture 2016). In all experiments, we set the fuel level in ods like classification and regression trees (CART) (Breiman each cell to 5, i.e., each cell takes 5 days to completely burn et al. 1984) might result in better generalization to unseen out and deplete its fuel. We use a baseline model consistent data. Second, our approach to suppression needs to be inte- with fire suppression strategies used in practice. Specifically, grated with the data-driven fire spread model. Finally, while our baseline fire suppression targets the cell with the max- we simulated wind for our experiments, our open-source imum utility that shows evidence of fire (through observa- dataset does not contain information about wind. We are tions). Our open-source code is built using the POMDPs.jl currently incorporating hourly wind data from the National framework (Egorov et al. 2017) and is available at https: Oceanic and Atmospheric Administration (NOAA)1 with //github.com/wildfire-modeling/response model. our fire data to develop a more comprehensive dataset. Figure 4 shows the mean negative utility averaged across start states and spread scenarios. We also vary the action ef- fectiveness q since in practice, suppression efforts do not Acknowledgements always completely puts out wildfires. We see that our ap- proach (referred to as “Uncertainty Aware Fire Response” We would like to acknowledge the Department of Manage- or UAFR) consistently outperforms the baseline method in ment Science & Engineering at Stanford University, USDA all scenarios with more significant improvements in larger NIFA AFRI (Grant 2019-67022-29696), and the Center of grids and increased action effectiveness. Automotive Research at Stanford (CARS) for funding this research. Discussion Wildfires have caused massive damage in the last decade. 1 https://www.climate.gov/maps-data/dataset/wind-roses- They are particularly challenging for first responders to charts-and-tabular-data References Mukhopadhyay, A.; Pettet, G.; Vazirizade, S.; Lu, D.; Altintas, I.; Block, J.; De Callafon, R.; Crawl, D.; Cowart, Baroud, H.; Jaimes, A.; Vorobeychik, Y.; Kochenderfer, M.; C.; Gupta, A.; Nguyen, M.; Braun, H.-W.; Schulze, J.; Goll- and Dubey, A. 2020. A Review of Emergency Incident Pre- ner, M.; et al. 2015. Towards an integrated cyberinfrastruc- diction, Resource Allocation and Dispatch Models. ture for scalable data-driven monitoring, dynamic prediction Petrovic, N.; Alderson, D. L.; and Carlson, J. M. 2012. Dy- and resilience of wildfires. Procedia Computer Science 51: namic resource allocation in disaster response: Tradeoffs in 1633–1642. wildfire suppression. PloS one 7(4): e33285. Andrews, P. L. 1986. BEHAVE: fire behavior prediction and Rothermel, R. C. 1972. A mathematical model for predicting fuel modeling system: BURN subsystem, Part 1, volume 194. fire spread in wildland fuels, volume 115. Intermountain US Department of Agriculture, Forest Service, Intermoun- Forest & Range Experiment Station, Forest Service. tain Research Station. Ryan, K. C.; and Opperman, T. S. 2013. LANDFIRE–A Bradshaw, K.; and Lueck, D. 2012. Wildfire policy: Law and national vegetation/fuels data base for use in fuels treatment, Economics Perspectives. Routledge. restoration, and suppression planning. Forest Ecology and Breiman, L.; Friedman, J.; Stone, C. J.; and Olshen, R. A. Management 294: 208–216. 1984. Classification and regression trees. CRC Press. Schroeder, W.; Oliva, P.; Giglio, L.; and Csiszar, I. A. 2014. Egorov, M.; Sunberg, Z. N.; Balaban, E.; Wheeler, T. A.; The New VIIRS 375 m active fire detection data product: Al- Gupta, J. K.; and Kochenderfer, M. J. 2017. POMDPs.jl: gorithm description and initial assessment. Remote Sensing A Framework for Sequential Decision Making under Uncer- of Environment 143: 85–96. tainty. Journal of Machine Learning Research 18(26): 1–5. Singla, S.; and Eldawy, A. 2018. Distributed zonal statistics URL http://jmlr.org/papers/v18/16-300.html. of big raster and vector data. In Proceedings of the 26th Finney, M. A. 1998. FARSITE, Fire Area Simulator–model ACM SIGSPATIAL International Conference on Advances development and evaluation. 4. US Department of Agricul- in Geographic Information Systems, 536–539. ture, Forest Service, Rocky Mountain Research Station. Singla, S.; and Eldawy, A. 2020. Raptor Zonal Statistics Fried, J. S.; Gilless, J. K.; and Spero, J. 2006. Analysing : Fully Distributed Zonal Statistics of Big Raster + Vector initial attack on wildland fires using stochastic simulation. Data. In Proceedings of 2020 IEEE International Confer- International Journal of Wildland Fire 15(1): 137–146. ence on Big Data (To Appear). IEEE. Garza, A. 2020. AI Is Helping Fight Wildfires Before State of California. 2020. 2020 Incident Archive. https: They Start. https://time.com/5497251/wildfires-artificial- //www.fire.ca.gov/incidents/2020/. intelligence/. Sunberg, Z.; and Kochenderfer, M. 2017. Online algorithms Ghorbanzadeh, O.; Valizadeh Kamran, K.; Blaschke, T.; for POMDPs with continuous state, action, and observation Aryal, J.; Naboureh, A.; Einali, J.; and Bian, J. 2019. Spa- spaces. arXiv preprint arXiv:1709.06196 . tial prediction of wildfire susceptibility using field survey United States Department of Agriculture. 2016. FIA State gps data and machine learning approaches. Fire 2(3): 43. Stats. https://www.fs.fed.us/pnw/rma/fia-topics/state-stats/ Griffith, J. D.; Kochenderfer, M. J.; Moss, R. J.; Mišić, V. V.; California/index.php. Gupta, V.; and Bertsimas, D. 2017. Automated dynamic re- source allocation for wildfire suppression. Lincoln Labora- tory Journal 22(2). Joseph, M. B.; Rossi, M. W.; Mietkiewicz, N. P.; Mahood, A. L.; Cattau, M. E.; St. Denis, L. A.; Nagy, R. C.; Iglesias, V.; Abatzoglou, J. T.; and Balch, J. K. 2019. Spatiotempo- ral prediction of wildfire size extremes with Bayesian finite sample maxima. Ecological Applications 29(6). Julian, K. D.; and Kochenderfer, M. J. 2019. Distributed wildfire surveillance with autonomous aircraft using deep reinforcement learning. Journal of Guidance, Control, and Dynamics 42(8): 1768–1778. Kochenderfer, M. J. 2015. Decision Making Under Uncer- tainty: Theory and Application. MIT Press. Liaw, A.; Wiener, M.; et al. 2002. Classification and regres- sion by randomForest. R news 2(3): 18–22. Martin-fernÁndez, S.; Martı́nez-Falero, E.; and Pérez- González, J. M. 2002. Optimization of the resources man- agement in fighting wildfires. Environmental Management 30(3): 352–364.