<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>A Gray Box Model for Characterizing Driver Behavior</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Soyeon Jung</string-name>
          <email>soyeonj@stanford.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Ransalu Senanayake</string-name>
          <email>ransalu@stanford.edu</email>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Mykel J. Kochenderfer</string-name>
          <email>mykel@stanford.edu</email>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Department of Aeronautics and Astronautics, Stanford University</institution>
          ,
          <addr-line>Stanford, CA 94305</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Department of Computer Science, Stanford University</institution>
          ,
          <addr-line>Stanford, CA 94305</addr-line>
          ,
          <country country="US">USA</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>Understanding the behavior of human drivers is critical for the successful deployment of autonomous vehicles. Driver modeling is challenging due to the uncertainty inherent in human behavior and the complex interactions among drivers. Recent efforts have focused on black-box models that are highly expressive but lack interpretability of underlying dynamics. White-box models, on the other hand, are more interpretable as they are typically defined by relatively simple rules. They can prevent undesirable outcomes but cannot model the variability of human behavior. This paper presents a gray-box driver model that combines rule-based models with data-driven learning. The parameters of rulebased driver models are learned from real-world data using expectation-maximization. We perform experiments on interactive driving scenarios with lane changes and evaluate our model based on prediction accuracy, data efficiency, and safety. Results show that our model can accurately replicate human driver behavior with less data.</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>Introduction</title>
      <p>
        Developing a realistic driver model that can capture human
driving behavior is essential for the successful integration of
autonomous vehicles into existing roadways. Autonomous
vehicles, being safety-critical and complex systems, are
required to comply with automotive safety standards.
Traditionally, ISO 26262 has been the standard for functional
safety of road vehicles
        <xref ref-type="bibr" rid="ref11">(International Organization for
Standardization 2011)</xref>
        . It follows the principle that safety cannot
be absolute, giving rise to the concept of tolerable risk. More
recently, ISO/PAS 21448 was devised as the safety
standard for driver assistance systems
        <xref ref-type="bibr" rid="ref12">(International
Organization for Standardization 2019)</xref>
        . It accounts for any potential
hazards despite the absence of system failures, and provides
guidance to achieve the safety of the intended
functionality (SOTIF). For validating safety-critical systems such as
autonomous vehicles, simulations can be useful as they
allow for testing of various scenarios without risking injury
or death. However, the simulations need to employ realistic
driver models in order to accurately evaluate the autonomous
vehicles.
      </p>
      <p>
        Various approaches have been explored to model human
drivers. Recent studies have focused on black-box
models including recurrent neural networks
        <xref ref-type="bibr" rid="ref1 ref21 ref27">(Morton, Wheeler,
and Kochenderfer 2016; Alahi et al. 2016; Zyner, Worrall,
and Nebot 2019)</xref>
        , generative adversarial networks
        <xref ref-type="bibr" rid="ref10 ref18">(Gupta
et al. 2018; Kosaraju et al. 2019)</xref>
        , variational autoencoders
(Ivanovic et al. 2020), and generative adversarial imitation
learning
        <xref ref-type="bibr" rid="ref18 ref27 ref3 ref4 ref5">(Kuefler et al. 2017; Bhattacharyya et al. 2019,
2020a)</xref>
        . Although these expressive models can learn
complex behaviors from data, they not only lack interpretability
of the underlying dynamics but also can result in undesirable
behaviors such as collision
        <xref ref-type="bibr" rid="ref2">(Bhattacharyya et al. 2021)</xref>
        .
      </p>
      <p>
        On the other end of the spectrum, researchers have
developed white-box models, or rule-based models, where the
driver response is governed by a small set of predefined
rules. White-box models, such as IDM
        <xref ref-type="bibr" rid="ref24">(Treiber, Hennecke,
and Helbing 2000)</xref>
        for car following and MOBIL
        <xref ref-type="bibr" rid="ref15 ref20 ref6">(Kesting, Treiber, and Helbing 2007)</xref>
        for lane changing, are
interpretable and thus able to prevent unacceptable behavior.
However, they cannot model the inherent stochasticity of
human behavior. To mitigate the limitations of both black-box
and white-box models, Bhattacharyya et al.
        <xref ref-type="bibr" rid="ref3">(2020b, 2021)</xref>
        developed a gray-box model where the parameters of an
interpretable rule-based model are learned as probability
distributions from data. For one-dimensional driving scenarios
with no lane change, they estimate the parameters of the
stochastic IDM
        <xref ref-type="bibr" rid="ref25">(Treiber and Kesting 2017)</xref>
        using particle
ifltering. Because this model learns the underlying
distributions by repeatedly sampling a set of particles, its
computational complexity increases with the number of samples.
      </p>
      <p>
        In this paper, we first extend the work of
        <xref ref-type="bibr" rid="ref2">Bhattacharyya
et al. (2021)</xref>
        to a two-dimensional driver model. The driver
behavior is governed by the combination of a car-following
model and a lane-changing model. Incorporating the
lanechanging model adds more complexity and uncertainty to
the interactions among multiple vehicles. We also propose a
methodology to learn the parameters of the two-dimensional
driver model from data using expectation-maximization
(EM). One advantage of using EM is that it allows us to use
expert knowledge to define the distributions over the
parameters and measurements. Additionally, it is guaranteed that
the likelihood monotonically increases with each iteration
        <xref ref-type="bibr" rid="ref20 ref6">(McLachlan and Krishnan 2007)</xref>
        . In this paper, we validate
the hypothesis that our model can improve the accuracy and
data efficiency by performing experiments using a real world
vehicle trajectory dataset.
      </p>
    </sec>
    <sec id="sec-2">
      <title>Rule-based Driver Models</title>
      <p>
        The driver behavior can be decomposed into two
components: the longitudinal motion and the lane change
behavior, which governs the lateral motion. This paper adopts
a stochastic extension of Intelligent Driver Model (IDM)
        <xref ref-type="bibr" rid="ref25">(Treiber and Kesting 2017)</xref>
        to model the longitudinal
motion, and the Minimizing Overall Braking Induced by Lane
change (MOBIL) model
        <xref ref-type="bibr" rid="ref15 ref20 ref6">(Kesting, Treiber, and Helbing
2007)</xref>
        for the lane change behavior.
      </p>
      <sec id="sec-2-1">
        <title>Stochastic IDM</title>
        <p>
          The IDM
          <xref ref-type="bibr" rid="ref24">(Treiber, Hennecke, and Helbing 2000)</xref>
          determines
the longitudinal acceleration so the vehicle drives at the
desired speed while maintaining safe separation from the
vehicle in front. The input variables into the model are the
absolute speed of ego vehicle x˙ (t), the relative speed with respect
to its preceding vehicle ∆ ˙ x(t), and the distance gap between
two vehicles d(t), at the current timestep t. The acceleration
x¨IDM is
        </p>
        <p>"
x¨IDM = amax 1 −
x˙ (t) 4
vdes
−
ddes
d(t)
2#
,
where ddes is the desired distance gap given by
ddes = dmin + τ x˙ (t) −
x˙ (t)∆ ˙ x(t)
2√amaxb
.</p>
        <p>The output variables ddes and x¨IDM are governed by
several parameters. The desired speed is vdes. The minimum
allowable distance gap is dmin and the desired time gap to the
preceding vehicle is τ . The limits on the acceleration and
deceleration are amax and bsafe, respectively.</p>
        <p>
          In order to encode the inherent stochasticity in human
driving behavior, we use the stochastic IDM (sIDM)
          <xref ref-type="bibr" rid="ref25">(Treiber
and Kesting 2017)</xref>
          , which introduces an additional variance
term σ IDM. In this model, the acceleration output for each
vehicle is sampled from the following Gaussian distribution
with the mean x¨IDM given in (1):
x¨sIDM ∼ N
        </p>
        <p>(x¨IDM, σ I2DM)</p>
      </sec>
      <sec id="sec-2-2">
        <title>MOBIL</title>
        <p>
          The MOBIL
          <xref ref-type="bibr" rid="ref15 ref20 ref6">(Kesting, Treiber, and Helbing 2007)</xref>
          model
determines whether to make a lane change in order to
maximize the longitudinal acceleration of the ego vehicle and its
neighbors. It initiates a lane change maneuver if the
following conditions are met:
x¨˜ego − x¨ego + p x¨˜new − x¨new + x¨˜old − x¨old &gt; ∆ ath (4)
(1)
(2)
(3)
− x¨˜new ≤ bsafe
(5)
        </p>
        <p>In (4), the quantities with tildes are calculated assuming
a lane change. Subscripts · ego, · old, and · new are associated
with the ego vehicle, the follower before changing lane, and
the follower after changing lane, respectively. The
parameter p ∈ [0, 1] is the politeness factor, which represents how
(6)
(7)
(8)
much the ego vehicle values the acceleration increase of its
neighbors. The threshold of acceleration increase is ∆ ath.
The model decides to change lane only if a weighted sum
of the acceleration increase of the ego vehicle and that of
the neighbors exceeds this threshold. Equation (5) is a safety
criterion to ensure that, if the lane change is made, the
deceleration of the following car does not exceed the safe braking
limit bsafe.</p>
      </sec>
      <sec id="sec-2-3">
        <title>Vehicle Dynamics</title>
        <p>Given the acceleration output from (3) and the
lanechanging action from (4) and (5), the velocity and position of
each vehicle can be updated according to the following
dynamics. The longitudinal velocity and position for the next
time step are
Note that a sigmoid function is used to map the real-valued
C to the range of 0 and 1. This replaces the hard constraint in
(4) with a soft constraint, allowing us to incorporate
stochasiticity of the lane changing behavior into our model. We
also introduce a new parameter λ which governs the degree
of preference for switching lanes.</p>
        <p>
          x˙ t+1 = x˙ t + x¨sIDM∆ t
xt+1 = xt + x˙ t∆ t + 21 x¨sIDM∆ t2.
where x¨sIDM is the sampled acceleration output. Unless a
lane change is initiated, all vehicles maintain their lateral
position along their path. Once a vehicle starts changing lane,
the lateral movement is controlled by a PD controller
          <xref ref-type="bibr" rid="ref15 ref20 ref6">(Kesting, Treiber, and Helbing 2007)</xref>
          until it reaches the center of
the destination lane.
        </p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>Driver Modeling using</title>
    </sec>
    <sec id="sec-4">
      <title>Expectation-Maximization</title>
      <p>Throughout this section, x = {(x(i), y(i))}iN=1 denotes a set
of known variables, that is, the longitudinal and lateral
position measurements for each vehicle. We use z to denote a
vector of latent variables, in particular, the set of unknown
IDM and MOBIL parameters for each vehicle. Our objective
is to infer these latent parameters z by observing x. In this
paper, we assume a discrete distribution over z
parameterized by θ .</p>
      <sec id="sec-4-1">
        <title>Problem Formulation</title>
        <p>Assuming z is given, the probability of the ith vehicle
changing lanes at each timestep is defined in a manner
similar to MOBIL with
fl(ain)e(x, z) =
(
0,
1+e− 1λ (i)C , if (5) is met
otherwise
where
C = x¨˜e(gi)o − x¨e(gi)o + p(i) x¨˜(nie)w − x¨(nie)w + x¨˜(oild) − x¨(oild) − ∆ at(hi).</p>
        <p>The likelihood of the next position x′(i) = (x′(i), y′(i))
is obtained by the weighted sum of two cases, the
lanechanging case and the car-following case, as follows:
p(x′(i) | x, z) = pchange(x′(i) | x, z)fl(ain)e +
pfollow(x′(i) | x, z) 1 −
fl(ain)e</p>
      </sec>
      <sec id="sec-4-2">
        <title>Parameter Estimation using EM Algorithm</title>
        <p>Our goal is to find θ that maximizes the log-likelihood of the
observations x. For the ith vehicle, the log-likelihood can be
written as</p>
        <p>Ti
l(θ (i)) = log Y p(xt(i); θ )</p>
        <p>t=1
=</p>
        <p>Ti
X log p(xt(i); θ )
t=1</p>
        <p>Ti
= X log X p(xt(i), zt(i); θ )
t=1
t=0
z(i)
t
z(i)
t</p>
        <p>
          Ti− 1
= X log X p(x′t(i) | zt(i), xt(i))p(zt(i); θ ). (13)
This holds from the properties of logarithms (11), the law of
total probability (12), and the conditional probability (13).
However, due to the summation over the latent variable zt(i)
inside the logarithm, θ (i) cannot be solved analytically.
Accordingly, the parameters θ (i) are estimated based on EM
algorithm
          <xref ref-type="bibr" rid="ref8">(Dempster, Laird, and Rubin 1977)</xref>
          . It constructs
a tractable lower bound that contains a sum of logarithms as
follows:
l(θ (i)) =
        </p>
        <p>TXi−1 log X Qt(zt(i)) p(x′t(i) | zt(i), xt)p(zt(i); θ )</p>
        <p>Qt(zt(i))
t=0
z(i)
t
≥</p>
        <p>Ti− 1
X X Qt(zt(i)) log
t=0 z(i)
t
p(x′t(i) | zt(i), xt)p(zt(i); θ )</p>
        <p>Qt(zt(i))
where the inequality in (15) holds from Jensen’s inequality
and log-concavity. This holds with equality if and only if
Qt(zt(i)) = p(zt(i) | x′t(i), xt; θ )</p>
        <p>The EM maximizes l(θ (i)) by iterating E-step and M-step
until convergence. In the E-step, we compute Qi(zt(i)), the
posterior probabilities of zt(i)’s given xt and θ . For each
timestep t,</p>
        <p>Qt(zt(i)) := p(zt(i) | x′t(i), xt; θ )
=</p>
        <p>p(x′t(i) | zt(i), xt)p(zt(i); θ )
Pz(i) p(x′t(i) | zt(i), xt)p(zt(i); θ )
t
(9)
(10)
(11)
(12)
(14)
(15)
(16)</p>
        <p>Then in the M-step, we find the maximum likelihood
estimates for θ based on Qi(zt(i)) from the E-step:
n Ti− 1
θ ∗ := arg max X X X
θ ′
i=1 t=0 z(i)</p>
        <p>t
Qi(zt(i); θ ) log
p(x′t(i) | zt(i), xt)p(zt(i); θ ′)</p>
        <p>Qi(zt(i); θ )
(17)</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>Experiments</title>
      <p>In this section, we evaluate the performance of our trained
model on a real-world dataset. Using our EM approach, we
learn a distribution over four parameters: the desired
longitudinal speed vdes, the stochasticity parameter in sIDM (σ IDM),
the politeness (p), and the lane-changing parameter (λ ). We
define each model parameter to follow a multinomial
distribution, which allows us to work with discrete latent variables
in the EM algorithm. For the other IDM/MOBIL parameters,
we use the values provided in Table 1.</p>
      <p>Once trained, we propagate simulated trajectories using
the estimated parameters. Given a scenario with the initial
position and velocity values of all vehicles, we iterate
sampling the model parameters from the distributions and
updating the position according to (7). Based on the simulated
trajectories, we evaluate our model in terms of prediction
accuracy, data efficiency, and safety.</p>
      <sec id="sec-5-1">
        <title>Data Preprocessing</title>
        <p>
          The proposed model is evaluated on a real-world dataset
called the INTERnational, Adversarial and Cooperative
moTION (INTERACTION) dataset
          <xref ref-type="bibr" rid="ref26">(Zhan et al. 2019)</xref>
          . The
dataset contains vehicle track data and roadway information
data from highly interactive driving scenarios with
cooperative and adversarial motions between drivers. In this work,
experiments are conducted on a highway scenario with lane
change and merging, shown in Fig. 1. We focus on the lane
changing behavior and the lane following behavior.
        </p>
        <p>Each entry of the vehicle track data consists of timestamp,
track ID, position, velocity, orientation, vehicle
length</p>
        <sec id="sec-5-1-1">
          <title>IDM parameter</title>
          <p>Desired speed (m/s)
Desired time gap (s)
Minimum acceptable gap (m)
Max acceleration (m/s2)
Desired deceleration (m/s2)</p>
        </sec>
        <sec id="sec-5-1-2">
          <title>MOBIL parameter</title>
          <p>Politeness
Safe braking (m/s2)
Acceleration threshold (m/s2)</p>
          <p>Symbol</p>
          <p>Value
Symbol</p>
          <p>Value
vdes</p>
          <p>
            τ
dmin
amax
b
p
bsafe
ath
33.3
1.5
2.0
1.4
2.0
0.5
2.0
0.1
(a) A traffic scenario recorded from traffic cameras and drones
            <xref ref-type="bibr" rid="ref26">(Zhan et al. 2019)</xref>
            .
          </p>
          <p>(b) A visualization of the processed vehicle tracks and roadway data.
/width, and other information. We decompose the driver
behavior into longitudinal and lateral motions with respect to
the lanes by transforming the variables from Cartesian
coordinates to Frenet coordinates. In Frenet coordinates, x
represents the vehicle’s position (i.e. longitudinal displacement)
along the reference path, and y represents side-to-side
position (i.e. lateral displacement) relative to the reference path.
We define the reference path as the centerline of the lane
occupied by the vehicle.</p>
        </sec>
      </sec>
      <sec id="sec-5-2">
        <title>Baselines</title>
        <p>
          We compare the performance of our work against two
baseline models. The first baseline is the IDM+MOBIL model
using the default parameter values listed in Table 1. These
values define the normal driver class as done in prior work
          <xref ref-type="bibr" rid="ref17 ref23 ref25">(Kesting, Treiber, and Helbing 2009; Sunberg, Ho, and
Kochenderfer 2017)</xref>
          . In addition, the stocasticity parameter
σ IDM is set to zero, and the lane-changing actions are made
deterministically based on (4) and (5). This model represents
a purely rule-based, deterministic model.
        </p>
        <p>
          The second model uses particle filtering (PF) to
estimate the distribution over the latent variables. We adopt the
method proposed by Bhattacharyya et al.
          <xref ref-type="bibr" rid="ref2">(Bhattacharyya
et al. 2021)</xref>
          to learn the MOBIL parameters as well as the
IDM parameters. As in the EM approach, we use the default
values in Table 1 for the parameters not being trained.
        </p>
      </sec>
      <sec id="sec-5-3">
        <title>Prediction Accuracy</title>
        <p>
          The prediction accuracy is measured using the discrepancy
between the simulated trajectories and the ground truth
trajectories. We use the following two metrics introduced in
          <xref ref-type="bibr" rid="ref10">(Gupta et al. 2018)</xref>
          :
1. Average Displacement Error (ADE): The average ℓ2
distance between the estimated points and the ground truth
points over all time steps in the prediction horizon T .
Then the value is averaged over n examples:
ADE =
1 Xn XT r
nT
xˆt(i) − xt(i) 2 + yˆt(i) − yt(i) 2
(18)
2. Final Displacement Error (FDE): The ℓ2 distance
between the estimated final position and the ground truth
ifnal position at the end of the prediction horizon T . Then
the value is averaged over n examples:
        </p>
        <p>n r
FDE = 1 X
n
i=1</p>
        <p>xˆ(Ti) − x(Ti) 2 + yˆT(i) − yT(i) 2 (19)</p>
        <p>Fig. 2 shows the ADE and FDE values for each model
over different prediction horizons. We observe that both the
PF and EM models are superior to the default model. For a
detailed comparison, we also report in Table 2 the ADE and
FDE of each model for 5-second duration and 10-second
duration. According to these metrics, the PF-based model
slightly outperforms the EM-based model, but their
performance are nearly equivalent. A possible reason for the
difference in their performance is that the EM model makes a
stronger assumption about the data than the PF model.
To evaluate the data efficiency, we train the models using
subsets of different sizes from our dataset and compare their
accuracy performance. Table 3 shows the ADE of the PF and
EM models using small, medium, and large datasets. The
small dataset contains only the first 10 data points of each
vehicle track. The medium dataset contains the first 50 data
points. The large dataset is the same as the original dataset
where each track has about 200 data points on average. This
experiment was not conducted on the default model because
it does not involve any learning. We observe that both PF
and EM models perform as poorly as the default model using
the small dataset. With the medium dataset, we see that the
performance of the EM model improved more than the PF
model, indicating it is more data efficient.
To evaluate the safety, we inspect the frequency of
undesirable behaviors in the simulated trajectories. These behaviors
include collisions and hard brakes. When a vehicle is
following its lane, collisions are counted when the headway
distance is less than the vehicle length. During lane-changing
period, collisions are counted when the ℓ2 distance to
another vehicle is less than 0.5 meters. Hard brakes are counted
when a vehicle decelerates faster than the safe braking limit
bsafe in (5).</p>
        <p>Table 4 shows the frequency of collisions and hard brakes
observed in the simulated trajectories for each model. The
default model does not produce any collisions or hard
brakes, as both IDM and MOBIL are designed to be
collision-free. However, it suffers from poor prediction
accuracy as seen in Fig. 2. The PF and EM based model also
produce no hard breaking. This is because in (8), we
deifned the probability to be positive only when (5) is
satisifed. Collisions are observed in both PF and EM based model
because they are probabilistic models. It turns out that EM
based model achieves almost half the collision rate of the PF
based model.</p>
      </sec>
    </sec>
    <sec id="sec-6">
      <title>Conclusion</title>
      <p>This paper presented a methodology for modeling human
driver behavior that can efficiently learn a driver model
without sacrificing safety. Our gray-box model can learn the
variability from large amounts of data available, and at the same
time, interpret the underlying dynamics of driving behavior.
We performed experiments on a real-world driving scenario
with lane changes and compared the performance our EM
based model with two baselines, the default IDM+MOBIL
model and the particle filtering based model. It was shown
that our model can generate trajectories that represent the
human driving driver in the real scenarios. The EM-based
model was able to achieve nearly equal prediction accuracy
to the PF based model with less data.</p>
      <p>
        There are several potential directions for future work.
While we assumed the latent variables of the EM algorithm
follow multinomial distributions, future work can use
different distributions with a weaker assumption to better
represent the actual distributions of the model parameters. In
addition, we can evaluate the algorithms on other datasets
such as Next-Generation Simulation (NGSIM)
        <xref ref-type="bibr" rid="ref20 ref6">(Colyar and
Halkias 2007)</xref>
        . In addition, we can analyze the
generalizability of the EM approach by applying it to other scenarios
including merging and roundabout.
      </p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <string-name>
            <surname>Alahi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Goel</surname>
            ,
            <given-names>K.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ramanathan</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Robicquet</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Fei-Fei</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Savarese</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          <year>2016</year>
          .
          <string-name>
            <surname>Social</surname>
            <given-names>LSTM</given-names>
          </string-name>
          :
          <article-title>Human trajectory prediction in crowded spaces</article-title>
          .
          <source>In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR)</source>
          ,
          <fpage>961</fpage>
          -
          <lpage>971</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <string-name>
            <surname>Bhattacharyya</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ; Jung,
          <string-name>
            <given-names>S.</given-names>
            ;
            <surname>Kruse</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L. A.</given-names>
            ;
            <surname>Senanayake</surname>
          </string-name>
          , R.; and Kochenderfer,
          <string-name>
            <surname>M. J.</surname>
          </string-name>
          <year>2021</year>
          .
          <article-title>A Hybrid Rule-Based and Data-Driven Approach to Driver Modeling Through Particle Filtering</article-title>
          .
          <source>IEEE Transactions on Intelligent Transportation Systems.</source>
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <surname>Bhattacharyya</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ; Wulfe,
          <string-name>
            <given-names>B.</given-names>
            ;
            <surname>Phillips</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            ;
            <surname>Kuefler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            ;
            <surname>Morton</surname>
          </string-name>
          ,
          <string-name>
            <surname>J.</surname>
          </string-name>
          ; Senanayake, R.; and Kochenderfer,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <year>2020a</year>
          .
          <article-title>Modeling human driving behavior through generative adversarial imitation learning</article-title>
          .
          <source>arXiv preprint arXiv:2006</source>
          .06412.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <string-name>
            <surname>Bhattacharyya</surname>
            ,
            <given-names>R. P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Phillips</surname>
            ,
            <given-names>D. J.</given-names>
          </string-name>
          ; Liu,
          <string-name>
            <given-names>C.</given-names>
            ;
            <surname>Gupta</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            ;
            <surname>Driggs-Campbell</surname>
          </string-name>
          ,
          <string-name>
            <surname>K.</surname>
          </string-name>
          ; and Kochenderfer,
          <string-name>
            <surname>M. J.</surname>
          </string-name>
          <year>2019</year>
          .
          <article-title>Simulating emergent properties of human driving behavior using reward augmented multi-agent imitation learning</article-title>
          .
          <source>In IEEE International Conference on Robotics and Automation (ICRA).</source>
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          <string-name>
            <surname>Bhattacharyya</surname>
            ,
            <given-names>R. P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Senanayake</surname>
            ,
            <given-names>R.</given-names>
          </string-name>
          ; Brown, K.; and Kochenderfer,
          <string-name>
            <surname>M. J.</surname>
          </string-name>
          <year>2020b</year>
          .
          <article-title>Online parameter estimation for human driver behavior prediction</article-title>
          .
          <source>In American Control Conference (ACC)</source>
          ,
          <fpage>301</fpage>
          -
          <lpage>306</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          <string-name>
            <surname>Colyar</surname>
          </string-name>
          , J.; and
          <string-name>
            <surname>Halkias</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          <year>2007</year>
          .
          <article-title>US highway 101 dataset</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          <string-name>
            <surname>Technical Report</surname>
          </string-name>
          FHWA-HRT-
          <volume>07</volume>
          -030,
          <string-name>
            <given-names>Federal</given-names>
            <surname>Highway Administration (FHWA).</surname>
          </string-name>
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          <string-name>
            <surname>Dempster</surname>
            ,
            <given-names>A. P.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Laird</surname>
            ,
            <given-names>N. M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Rubin</surname>
            ,
            <given-names>D. B.</given-names>
          </string-name>
          <year>1977</year>
          .
          <article-title>Maximum likelihood from incomplete data via the EM algorithm</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          <source>Journal of the Royal Statistical Society: Series B (Methodological)</source>
          ,
          <volume>39</volume>
          (
          <issue>1</issue>
          ):
          <fpage>1</fpage>
          -
          <lpage>22</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          <string-name>
            <surname>Gupta</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ; Johnson, J.;
          <string-name>
            <surname>Fei-Fei</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Savarese</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Alahi</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2018</year>
          .
          <string-name>
            <surname>Social</surname>
            <given-names>GAN</given-names>
          </string-name>
          :
          <article-title>Socially acceptable trajectories with generative adversarial networks</article-title>
          .
          <source>In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR)</source>
          ,
          <fpage>2255</fpage>
          -
          <lpage>2264</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          <source>International Organization for Standardization</source>
          .
          <year>2011</year>
          . ISO 26262:
          <article-title>Road vehicles - Functional safety</article-title>
          . Geneva, Switzerland: International Organization for Standardization.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          <source>International Organization for Standardization</source>
          .
          <year>2019</year>
          . ISO/- PAS 21448:
          <year>2019</year>
          :
          <article-title>Road vehicles - Safety of the intended functionality</article-title>
          . Geneva, Switzerland: International Organization for Standardization.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          2020.
          <article-title>Multimodal deep generative models for trajectory prediction: A conditional variational autoencoder approach</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          <source>IEEE Robotics and Automation Letters</source>
          ,
          <volume>6</volume>
          (
          <issue>2</issue>
          ):
          <fpage>295</fpage>
          -
          <lpage>302</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          <string-name>
            <surname>Kesting</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Treiber</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Helbing</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <year>2007</year>
          .
          <article-title>General lane-changing model MOBIL for car-following models</article-title>
          .
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          <source>Transportation Research Record</source>
          ,
          <year>1999</year>
          (1):
          <fpage>86</fpage>
          -
          <lpage>94</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref17">
        <mixed-citation>
          <string-name>
            <surname>Kesting</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Treiber</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Helbing</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <year>2009</year>
          .
          <article-title>Agents for traffic simulation</article-title>
          .
          <source>Multi-agent Systems: Simulation and Applications</source>
          ,
          <volume>325</volume>
          -
          <fpage>356</fpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref18">
        <mixed-citation>
          <string-name>
            <surname>Kosaraju</surname>
            ,
            <given-names>V.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Sadeghian</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <article-title>Mart´ın-Mart´ın</article-title>
          , R.; Reid,
          <string-name>
            <surname>I.</surname>
          </string-name>
          ; Rezatofighi,
          <string-name>
            <given-names>S. H.</given-names>
            ; and
            <surname>Savarese</surname>
          </string-name>
          ,
          <string-name>
            <surname>S.</surname>
          </string-name>
          <year>2019</year>
          .
          <article-title>Social-BiGAT: Multimodal trajectory forecasting using bicycle-gan and graph attention networks</article-title>
          .
          <source>arXiv preprint arXiv:1907</source>
          .03395.
        </mixed-citation>
      </ref>
      <ref id="ref19">
        <mixed-citation>
          2017.
          <article-title>Imitating driver behavior with generative adversarial networks</article-title>
          .
          <source>In IEEE Intelligent Vehicles Symposium (IV)</source>
          ,
          <fpage>204</fpage>
          -
          <lpage>211</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref20">
        <mixed-citation>
          <string-name>
            <surname>McLachlan</surname>
            ,
            <given-names>G. J.;</given-names>
          </string-name>
          and Krishnan,
          <string-name>
            <surname>T.</surname>
          </string-name>
          <year>2007</year>
          .
          <article-title>The EM algorithm and extensions</article-title>
          , volume
          <volume>382</volume>
          . John Wiley &amp; Sons.
        </mixed-citation>
      </ref>
      <ref id="ref21">
        <mixed-citation>
          <string-name>
            <surname>Morton</surname>
            ,
            <given-names>J.</given-names>
          </string-name>
          ; Wheeler,
          <string-name>
            <surname>T. A.</surname>
          </string-name>
          ; and Kochenderfer,
          <string-name>
            <surname>M. J.</surname>
          </string-name>
          <year>2016</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref22">
        <mixed-citation>
          <article-title>Analysis of recurrent neural networks for probabilistic modeling of driver behavior</article-title>
          .
          <source>IEEE Transactions on Intelligent Transportation Systems</source>
          ,
          <volume>18</volume>
          (
          <issue>5</issue>
          ):
          <fpage>1289</fpage>
          -
          <lpage>1298</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref23">
        <mixed-citation>
          <string-name>
            <surname>Sunberg</surname>
            ,
            <given-names>Z. N.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Ho</surname>
            ,
            <given-names>C. J.;</given-names>
          </string-name>
          and Kochenderfer,
          <string-name>
            <surname>M. J.</surname>
          </string-name>
          <year>2017</year>
          .
          <article-title>The value of inferring the internal state of traffic participants for autonomous freeway driving</article-title>
          .
          <source>In American Control Conference (ACC)</source>
          ,
          <fpage>3004</fpage>
          -
          <lpage>3010</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref24">
        <mixed-citation>
          <string-name>
            <surname>Treiber</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Hennecke</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Helbing</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          <year>2000</year>
          .
          <article-title>Congested traffic states in empirical observations and microscopic simulations</article-title>
          .
          <source>Physical Review E</source>
          ,
          <volume>62</volume>
          (
          <issue>2</issue>
          ):
          <year>1805</year>
          .
        </mixed-citation>
      </ref>
      <ref id="ref25">
        <mixed-citation>
          <string-name>
            <surname>Treiber</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Kesting</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          <year>2017</year>
          .
          <article-title>The intelligent driver model with stochasticity-new insights into traffic flow oscillations</article-title>
          .
          <source>Transportation Research Procedia</source>
          ,
          <volume>23</volume>
          :
          <fpage>174</fpage>
          -
          <lpage>187</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref26">
        <mixed-citation>
          <string-name>
            <surname>Zhan</surname>
            ,
            <given-names>W.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Sun</surname>
            ,
            <given-names>L.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Wang</surname>
            ,
            <given-names>D.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Shi</surname>
            ,
            <given-names>H.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Clausse</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Naumann</surname>
            ,
            <given-names>M.</given-names>
          </string-name>
          ; Ku¨mmerle, J.; Ko¨nigshof, H.; Stiller, C.;
          <string-name>
            <surname>de La Fortelle</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ; and Tomizuka,
          <string-name>
            <surname>M.</surname>
          </string-name>
          <year>2019</year>
          .
          <article-title>INTERACTION Dataset: An INTERnational, Adversarial and Cooperative moTION Dataset in Interactive Driving Scenarios with Semantic Maps</article-title>
          . arXiv:
          <year>1910</year>
          .03088 [cs, eess].
        </mixed-citation>
      </ref>
      <ref id="ref27">
        <mixed-citation>
          <string-name>
            <surname>Zyner</surname>
            ,
            <given-names>A.</given-names>
          </string-name>
          ;
          <string-name>
            <surname>Worrall</surname>
            ,
            <given-names>S.</given-names>
          </string-name>
          ; and
          <string-name>
            <surname>Nebot</surname>
            ,
            <given-names>E.</given-names>
          </string-name>
          <year>2019</year>
          .
          <article-title>Naturalistic driver intention and path prediction using recurrent neural networks</article-title>
          .
          <source>IEEE Transactions on Intelligent Transportation Systems</source>
          ,
          <volume>21</volume>
          (
          <issue>4</issue>
          ):
          <fpage>1584</fpage>
          -
          <lpage>1594</lpage>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>