<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>I Move U Move: V2X-Enabled Wireless Towing</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Constantine Ayimba</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vincenzo Sciancalepore</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Paolo Casari</string-name>
          <xref ref-type="aff" rid="aff2">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Vincenzo Mancuso</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>IMDEA Networks Institute, Avenida del Mar Mediterráneo 22</institution>
          ,
          <addr-line>28918 Leganés, Madrid</addr-line>
          ,
          <country country="ES">Spain</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>NEC Laboratories Europe</institution>
          ,
          <addr-line>Kurfürsten-Anlage 36, 69115 Heidelberg</addr-line>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff2">
          <label>2</label>
          <institution>University of Trento</institution>
          ,
          <addr-line>Via Sommarive, 9, 38123 Trento</addr-line>
          ,
          <country country="IT">Italy</country>
        </aff>
      </contrib-group>
      <abstract>
        <p>As smart connected vehicles become increasingly common, their ability to provide enhanced services has improved. One such service is the emergency transport of drivers in medical distress. In this paper, we show how such a service can be run from the network and discuss the importance of having a human in the loop in order to expedite driving. We present a Monte-Carlo-based driver assessment system that the network can use in selecting the most suitable candidate to tow an autonomous vehicle with an incapacitated driver. We show that this mechanism results in a selection policy that ensures suficiently short spacing between the autonomously driven tail/towed vehicle and the human driven lead/towing vehicle ensuring that no other vehicles get in the way to disrupt the service.</p>
      </abstract>
      <kwd-group>
        <kwd>eol&gt;Connected vehicles</kwd>
        <kwd>health</kwd>
        <kwd>emergency</kwd>
        <kwd>reinforcement learning</kwd>
        <kwd>wireless towing</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>
        latter, coupled with the rise of connected autonomous vehicles [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ], makes it possible to create
an emergency towing service that can help an incapacitated driver.
      </p>
      <p>
        In such cases, it is crucial to have a human in the loop. This is because the vehicle will be
required to operate like an ambulance without strictly adhering to trafic rules. Ordinary
autonomous driving and strict respect of trafic rules works for commonplace, everyday passenger
transportation, but may jeopardize the timely delivery of medical assistance to a driver in need.
Recent field studies also report public reluctance to fully trust self driving ambulances [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ]. Given
the sensitivity of the operation, the choice of the lead human driver becomes key, since their
driving behavior has a direct impact on the well-being of the incapacitated driver in the towed
vehicle. Thus, the agent that picks the suitable lead vehicle needs to know the driving behaviors.
This knowledge should be built up over time so as to make a robust decision of which available
driver should respond to the towing request.
      </p>
      <p>
        The closest analog to our proposal is platooning, which involves the coordinated autonomous
driving of connected vehicles at shortest but safest inter-vehicular distances [
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Platooning is
however largely aimed at exploiting driving on highways or other open road situations and as
such is ill-suited to towing in much more constrained road environments.
      </p>
      <p>
        In this paper, we specifically look into how to select the most suitable driver based on his
driving behavior collected over time. To build this knowledge-based evaluation system we use
the Monte-Carlo reinforcement learning approach. We make use of the open-source CARLA
platform [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] developed to test self-driving algorithms in realistic scenarios. Concretely, our
main contributions are as follows:
1. we implement a follower navigation agent in CARLA that tracks the motion of a lead
vehicle;
2. we design a driving behavior profiling service leveraging Monte Carlo Reinforcement
      </p>
      <p>Learning;
3. we extend CARLA to carry out driver profiling and selection using ancillary python
scripts running in parallel to the simulator;
4. we evaluate the efectiveness of the selection algorithm in choosing the most suitable
driving profile to tow a target tail vehicle.</p>
      <p>To the best of our knowledge, ours is the first approach to leverage autonomous driving and
connected cars to deliver an ad hoc emergency towing service with a human in the loop. The
latter element is crucial with regards to gaining public confidence, given the reticence we have
mentioned regarding fully autonomous ambulances.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Wireless towing scenario and service</title>
      <sec id="sec-2-1">
        <title>2.1. Wireless towing definition</title>
        <p>We define wireless towing as the autonomous movement of a vehicle in sympathy with the
movement of another vehicle that precedes it, based on directives it receives from the lead
vehicle itself, the network or both. While this is similar to platooning, the diference here is
that the spacing between the lead and tail vehicle is not maintained relatively constant, and in
the same vein the relative speed of the two vehicles is not zero. This fact is of great importance,
because the emergency towing service should be versatile enough to work in crowded urban
environments, with intervening trafic as well as stops due to trafic lights and pedestrians.</p>
        <p>
          We assume that there exists an on-board unit on the tail vehicle that can respond to driving
directives from the network prescribing when and by how much to accelerate or brake. Recent
works such as [
          <xref ref-type="bibr" rid="ref10 ref11">10, 11</xref>
          ] point to the practicality of this assumption. Furthermore, the ability of an
autonomous vehicle to drive itself solely based on received instructions has been demonstrated
in field trials [
          <xref ref-type="bibr" rid="ref12">12</xref>
          ].
        </p>
      </sec>
      <sec id="sec-2-2">
        <title>2.2. Scenario description</title>
        <p>
          In the scenario we envisage, the vehicle’s sensor signal that the driver is unresponsive as has
been described in, e.g., [
          <xref ref-type="bibr" rid="ref1 ref2">1, 2</xref>
          ]. Upon such a detection, the vehicle sends out an alert to the
network which reacts by running the emergency service. In the alert, the vehicle specifies its
geographic information, model and engine data. The network then polls available drivers in
the vicinity of the target vehicle. The candidate lead vehicles that respond to the poll start
moving towards the target. The network continuously monitors the motion of these vehicles.
After a short duration (in the order of 30 s) the vehicle evaluates the driving behavior of each
responding vehicle. The network agent then selects a candidate vehicle from the ones available
to carry out the task.
        </p>
        <p>When the chosen lead vehicle reaches the target tail vehicle, it overtakes it and stops slightly
ahead of it after confirmation from the network. The network then alerts the tail vehicle to
activate its tow mode so as to follow the lead vehicle ahead of it. The tail and lead vehicle
maintain constant communication with each other and the network. Both vehicles constantly
communicate their geographic positions and speed among others to the network.</p>
        <p>When the vehicles reach their intended destination, the network agent evaluates the driving
experience based on the data it received over the course of towing. The agent calculates
the reward based on the evolution of the spacing between the lead and tail vehicle. The
rationale behind this is that, for an efective service, the space between the vehicles needs to be
maintained small enough not to allow other vehicles to get in between, as this would result
in delays depending on how the preceding vehicle is being driven. Moreover, a conscious tail
vehicle passenger should not get the feeling that the wireless towing service is leading him or
her astray. Both events would negatively afect the efectiveness of the service.</p>
        <p>The network agent will then update the corresponding driving behavior profile with the
calculated reward in its reinforcement learning data base. As the service is increasingly used,
more of these experiences will be similarly evaluated and a selection policy progressively
developed. Following the policy, the selection of the most suitable lead vehicle will continuously
improve. In due course, the network agent will consistently select the candidate lead vehicle
with the driving behavior that exhibits the best spacing discipline, hence the best service quality.</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Design of driving behavior selector</title>
      <p>
        As stated earlier, we leverage Monte-Carlo reinforcement learning [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ] to create our driving
behavior evaluation and selection algorithm. In particular, we characterize the environment
∗η
∗ χ
↓
      </p>
      <p>↑
∗State Space
as having a driving behavior indicator,  , of the candidate lead driver and a ratio between the
engine power of the lead vehicle and tail vehicle,  . State and action spaces are shown in Fig. 1.</p>
      <p>When the alert is received, a candidate lead vehicle sets up a path to the position of the
target vehicle. The path is constituted by many waypoints at regular intervals, which dictate
the movement of the vehicle to the meeting point. If ˙ is the sample speed at waypoint ,  2˙
the empirical variance of the speeds across all waypoints, ¯˙ the mean speed between waypoints
and  is the number of waypoints until the meeting point then,  is given as:</p>
      <p>
        The first term of Eq. (2) is the fairness index [
        <xref ref-type="bibr" rid="ref14">14</xref>
        ] of speed between waypoints and the second
term is points to the standard deviation in speeds between waypoints normalised according to
the 3 empirical rule [
        <xref ref-type="bibr" rid="ref15">15</xref>
        ]. To obtain  , we use the ratio between the maximum revolutions per
minute (RPM) that can be reached by candidate lead vehicle and target vehicle to be towed, i.e.:
 2˙ =
1

∑︁ (˙ −
∈
 = ∑︀∑︀∈∈˙˙2 ×
¯˙)2 ;
√︁ 2˙
3
      </p>
      <p>.
rpm =</p>
      <p>MaxRPM candidate ;</p>
      <p>MaxRPM target
⎧⎪rpm,
 = ⎨</p>
      <p>1
⎪⎩ rpm</p>
      <p>if rpm &lt; 1.0;</p>
      <p>In order to keep the state space small, we quantize  to 10 levels 0, . . . , 9 using a uniform
quantizer. We similarly quantize  to 5 levels 0, . . . , 4. We update the state values using the
following modified update:
where Δ =  + ( − 1) (),  ⩽ 1.0 is a discount parameter,  ⩾ 1 is the number of times
the agent was in state  , and  is the reward at the end of the episode.  is calculated as
 () ←
1</p>
      <p>[Δ + ( − 1) ()] ,
1 ∑︁ √︀( − ideal)2 ,
 = −  ≤ 
where  is the spacing between the lead car and target tail vehicle taken at regular intervals,
 is the number of intervals until the end of the episode and ideal is the desired spacing that
should be maintained between the lead and tail vehicle.1 The formula in Eq. (5) is modified
because it does not consider the simple reward  as the update is done at the end of the episode
and the lead driver/vehicle selected in a previous episode have no bearing on the current episode.</p>
      <p>
        At the start of each episode, we use  -greedy selection to choose the lead car and driving
behavior option. We also employ the weighted fair exploration mechanism presented in our
previous work [
        <xref ref-type="bibr" rid="ref16">16</xref>
        ] to improve the trade-of between exploration and exploitation.
      </p>
    </sec>
    <sec id="sec-4">
      <title>4. Performance evaluation</title>
      <sec id="sec-4-1">
        <title>4.1. System setup using the CARLA simulator</title>
        <p>The CARLA simulator is a distributed system, where the server spawns the map and ancillary
infrastructure, i.e., roads, trafic lights, and buildings. The client runs python scripts which we
use to introduce the lead and tail vehicle in the simulation. Multiple clients can run on the same
server, and we exploit this feature to spawn other vehicles and pedestrians into the environment
to mirror real-world trafic.</p>
        <p>In order to solely test the performance of the selection agent against driving behaviors, we
spawn the same car model for both the lead and tail vehicle. For the lead vehicle, we implement
behavior agents which build on the basic agent provided with the simulator. This allows us
to emulate diferent driving styles for simulated human drivers. We supply the destination
coordinates to a method of the behavior agent, which leverages the CARLA world object with its
global view of the map in order to generate waypoints to the specified location. The lead vehicle
records its current position and bearing at regular intervals, thereby generating waypoints
which the tail vehicle uses to follow the lead. For each of the four candidate vehicles shown in
Fig. 2, labeled as A, B, C and D, we specify a behavior agent exhibiting a given driving behavior:
Extra-cautious (EC), Cautious (CS), Normal (NL) and Aggressive (AG). The characterisation of
each is given by the parameters in Table 1. We remark that these are base behaviors from which
a driving behavior indicator  can be derived.</p>
        <p>1We set ideal = 10 m.
(5)
(6)
B</p>
        <p>Destination +</p>
        <p>After a 30 s behavior evaluation period,  is determined for each candidate vehicle using Eq. (2)
and one of the cars/driving behavior is selected and proceeds to an agreed Meeting point (MP)
shown in Fig. 2. Given that the seed for each simulation is generated randomly the behaviors
are not completely deterministic, in each simulation round. Therefore, one behavior may map
into a diferent indicator in a subsequent simulation. This reflects the fact that human drivers
do not drive exactly the same way each time. We prefer to use the indicator  to cover this
spectrum of behaviors.</p>
        <p>In order to implement the towing behavior, we modify the basic agent provided with the
simulator so that its array of waypoints is not generated by the world object but is instead
supplied by the lead vehicle. The autonomous driving behavior of the tail vehicle is thus greatly
influenced by the behavior of the lead vehicle for the path between the meeting point and the
destination (cf. Fig. 2). Owing to this influence, the question of which vehicle to follow becomes
an important decision.
150 200</p>
        <p>Simulation Time [s]
250
1</p>
      </sec>
    </sec>
    <sec id="sec-5">
      <title>5. Results</title>
      <p>We first look at the towing performance overall. From Fig. 3, we see that the tail vehicle starts to
move after the lead vehicle gets to the meeting point and tries to keep up with it. Given that the
lead vehicle precedes the tail, it bears the inconvenience of waiting for longer at trafic lights,
so that by the time the tail vehicle has caught up it does not have to wait as long, or at all.</p>
      <p>In Fig. 4 we plot the probability of a lead car behavior being selected in every set of 50
simulations. In the initial sets, {1..50} and {51..100}, when the selection algorithm is still developing a
policy,  = 0.3 and  = 0.2 are selected most often. The spacing discipline between the tail
and lead vehicle progressively improves as can be seen by comparing Fig. 5a for  = 0.3 and
Fig. 5b for  = 0.2.</p>
      <p>As the policy is further developed, beyond 100 simulations, the selection leans towards driving
behaviors  = 0.6 and  = 0.7. The spacing maintained between the lead and tail vehicle also
further improves as can be seen from Fig. 5c. The eventual policy that is settled on, choosing
 = 0.7 results in the most favourable spacing between the tail and lead vehicle as depicted in
Fig. 5d.</p>
      <p>(a)  = 0.3</p>
      <p>(b)  = 0.2</p>
    </sec>
    <sec id="sec-6">
      <title>6. Conclusions</title>
      <p>We have presented a novel network service that leverages connected vehicles and their
selfdriving capabilities to deliver emergency towing. We have also implemented a selection
algorithm for the service that uses Monte-Carlo reinforcement learning to improve the system’s
capability to select the most appropriate driver for the towing service, based on driving behavior
assessment. We have shown that the algorithm converges to selecting the driving behavior that
maintains the best spacing discipline throughout the towing process, thereby ensuring that no
intervening vehicles get in between to disrupt the service.</p>
      <p>As part of future work, we intend to investigate more complex scenarios and the cases in
which a lead vehicle can only tow the tail vehicle for part of the way. In this case the network
will need to engage multiple candidate vehicles and will consequently require to plan ahead to
minimize delays by aligning the handover process between lead vehicles.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>The work was supported by the ECID project (grant PID2019-109805RB-I00 funded by MCIN/AEI/
10.13039/501100011033) and by the Italian Ministry for University and Research (MIUR) under
the initiative “Departments of Excellence” (Law 232/2016).</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>S. J.</given-names>
            <surname>Park</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Hong</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Kim</surname>
          </string-name>
          , I. Hussain,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Seo</surname>
          </string-name>
          ,
          <article-title>Intelligent in-car health monitoring system for elderly drivers in connected car</article-title>
          ,
          <source>in: Proc. 20th Congress of the International Ergonomics Association (IEA</source>
          <year>2018</year>
          ), Springer International Publishing, Cham,
          <year>2019</year>
          , pp.
          <fpage>40</fpage>
          -
          <lpage>44</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>G.</given-names>
            <surname>Sikander</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Anwar</surname>
          </string-name>
          ,
          <article-title>Driver fatigue detection systems: A review</article-title>
          ,
          <source>IEEE Transactions on Intelligent Transportation Systems</source>
          <volume>20</volume>
          (
          <year>2019</year>
          )
          <fpage>2339</fpage>
          -
          <lpage>2352</lpage>
          . doi:
          <volume>10</volume>
          .1109/TITS.
          <year>2018</year>
          .
          <volume>2868499</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>K.</given-names>
            <surname>Abboud</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H. A.</given-names>
            <surname>Omar</surname>
          </string-name>
          , W. Zhuang,
          <article-title>Interworking of dsrc and cellular network technologies for v2x communications: A survey</article-title>
          ,
          <source>IEEE Transactions on Vehicular Technology</source>
          <volume>65</volume>
          (
          <year>2016</year>
          )
          <fpage>9457</fpage>
          -
          <lpage>9470</lpage>
          . doi:
          <volume>10</volume>
          .1109/TVT.
          <year>2016</year>
          .
          <volume>2591558</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>Z.</given-names>
            <surname>MacHardy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Khan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Obana</surname>
          </string-name>
          , S. Iwashina,
          <article-title>V2x access technologies: Regulation, research, and remaining challenges</article-title>
          ,
          <source>IEEE Communications Surveys Tutorials</source>
          <volume>20</volume>
          (
          <year>2018</year>
          )
          <fpage>1858</fpage>
          -
          <lpage>1877</lpage>
          . doi:
          <volume>10</volume>
          .1109/COMST.
          <year>2018</year>
          .
          <volume>2808444</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5]
          <string-name>
            <given-names>X.</given-names>
            <surname>Cheng</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Huang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Chen</surname>
          </string-name>
          ,
          <article-title>Vehicular communication channel measurement, modelling, and application for beyond 5g and 6g</article-title>
          ,
          <source>IET Communications 14</source>
          (
          <year>2020</year>
          )
          <fpage>3303</fpage>
          -
          <lpage>3311</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R.</given-names>
            <surname>Coppola</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Morisio</surname>
          </string-name>
          ,
          <article-title>Connected car: Technologies, issues, future trends</article-title>
          ,
          <source>ACM Comput. Surv</source>
          .
          <volume>49</volume>
          (
          <year>2016</year>
          ). URL: https://doi.org/10.1145/2971482. doi:
          <volume>10</volume>
          .1145/2971482.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>A.</given-names>
            <surname>Zarkeshev</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Csiszár</surname>
          </string-name>
          ,
          <article-title>Are people ready to entrust their safety to an autonomous ambulance as an alternative and more sustainable transportation mode?</article-title>
          ,
          <source>Sustainability</source>
          <volume>11</volume>
          (
          <year>2019</year>
          ). URL: https://www.mdpi.com/2071-1050/11/20/5595. doi:
          <volume>10</volume>
          .3390/su11205595.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>F.</given-names>
            <surname>Dressler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Klingler</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Segata</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R. Lo</given-names>
            <surname>Cigno</surname>
          </string-name>
          ,
          <article-title>Cooperative driving and the tactile internet</article-title>
          ,
          <source>Proceedings of the IEEE</source>
          <volume>107</volume>
          (
          <year>2019</year>
          )
          <fpage>436</fpage>
          -
          <lpage>446</lpage>
          . doi:
          <volume>10</volume>
          .1109/JPROC.
          <year>2018</year>
          .
          <volume>2863026</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Dosovitskiy</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Ros</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Codevilla</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Lopez</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Koltun</surname>
          </string-name>
          ,
          <string-name>
            <surname>CARLA:</surname>
          </string-name>
          <article-title>An open urban driving simulator</article-title>
          ,
          <source>in: Proc. 1st Annual Conference on Robot Learning</source>
          , volume
          <volume>78</volume>
          <source>of Proceedings of Machine Learning Research, PMLR</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>16</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>C.</given-names>
            <surname>Quadri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Mancuso</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M. Ajmone</given-names>
            <surname>Marsan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G. P.</given-names>
            <surname>Rossi</surname>
          </string-name>
          ,
          <article-title>Platooning on the edge</article-title>
          ,
          <source>in: Proceedings of the 23rd ACM MSWiM</source>
          ,
          <source>MSWiM'20</source>
          ,
          <year>2020</year>
          , p.
          <fpage>1</fpage>
          -
          <lpage>10</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>C.</given-names>
            <surname>Ayimba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Segata</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Casari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Mancuso</surname>
          </string-name>
          ,
          <article-title>Closer than close: Mec-assisted platooning with intelligent controller migration</article-title>
          ,
          <source>in: Proceedings of the 24th ACM MSWiM</source>
          ,
          <source>MSWiM'21</source>
          ,
          <year>2021</year>
          , pp.
          <fpage>23</fpage>
          -
          <lpage>32</lpage>
          .
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Tsugawa</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Jeschke</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S. E.</given-names>
            <surname>Shladover</surname>
          </string-name>
          ,
          <article-title>A review of truck platooning projects for energy savings</article-title>
          ,
          <source>IEEE Transactions on Intelligent Vehicles</source>
          <volume>1</volume>
          (
          <year>2016</year>
          )
          <fpage>68</fpage>
          -
          <lpage>77</lpage>
          . doi:
          <volume>10</volume>
          .1109/TIV.
          <year>2016</year>
          .
          <volume>2577499</volume>
          .
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>R. S.</given-names>
            <surname>Sutton</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. G.</given-names>
            <surname>Barto</surname>
          </string-name>
          , Introduction to Reinforcement Learning, 2nd ed., MIT Press, Cambridge, MA, USA,
          <year>2020</year>
          . URL: http://incompleteideas.net/book/RLbook2020.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>R. K.</given-names>
            <surname>Jain</surname>
          </string-name>
          ,
          <string-name>
            <surname>D.-M. W. Chiu</surname>
            ,
            <given-names>W. R.</given-names>
          </string-name>
          <string-name>
            <surname>Hawe</surname>
          </string-name>
          , et al.,
          <article-title>A quantitative measure of fairness and discrimination</article-title>
          , Eastern Research Laboratory, Digital Equipment Corporation, Hudson, MA (
          <year>1984</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref15">
        <mixed-citation>
          [15]
          <string-name>
            <given-names>D.</given-names>
            <surname>Vysochanskij</surname>
          </string-name>
          ,
          <string-name>
            <surname>Y. I. Petunin</surname>
          </string-name>
          ,
          <article-title>Justification of the 3  rule for unimodal distributions</article-title>
          ,
          <source>Theory of Probability and Mathematical Statistics</source>
          <volume>21</volume>
          (
          <year>1980</year>
          ).
        </mixed-citation>
      </ref>
      <ref id="ref16">
        <mixed-citation>
          [16]
          <string-name>
            <given-names>C.</given-names>
            <surname>Ayimba</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Casari</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Mancuso</surname>
          </string-name>
          , SQLR:
          <article-title>Short-Term Memory Q-Learning for Elastic Provisioning</article-title>
          ,
          <source>IEEE Transactions on Network and Service Management</source>
          <volume>18</volume>
          (
          <year>2021</year>
          )
          <fpage>1850</fpage>
          -
          <lpage>1869</lpage>
          . doi:
          <volume>10</volume>
          .1109/TNSM.
          <year>2021</year>
          .
          <volume>3075619</volume>
          .
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>