<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta>
      <journal-title-group>
        <journal-title>Doctoral Consortium, September</journal-title>
      </journal-title-group>
    </journal-meta>
    <article-meta>
      <title-group>
        <article-title>Robust Traffic Rules and Knowledge Representation for Conflict Resolution in Autonomous Driving</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Kumar Manas</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Stefan Zwicklbauer</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>Adrian Paschke</string-name>
          <xref ref-type="aff" rid="aff1">1</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Continental AG</institution>
          ,
          <country country="DE">Germany</country>
        </aff>
        <aff id="aff1">
          <label>1</label>
          <institution>Freie Universität Berlin, Department of Computer Science and Mathematics</institution>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2022</year>
      </pub-date>
      <volume>2</volume>
      <fpage>6</fpage>
      <lpage>28</lpage>
      <abstract>
        <p>Rules and world knowledge are important priors for decision making in our daily lives, and autonomous driving (AD) systems can exploit them for decision making and conflict resolution. In this thesis, we investigate how to represent and formalize traffic rules and regulations given as natural language text in order to integrate them with a machine learning-based trajectory prediction module for conflict resolution among road users. As a result, the vehicle's long-term trajectory (3 to 5 second horizon) can be predicted using hybrid learning, which incorporates both rules and data into the ML models. Purely rule-based systems have difficulty capturing complicated interactions among multiple traffic scene participants. Learning-based techniques can represent such interactions. However, they require a large amount of data, and in many circumstances, generating corner case data (e.g., accidents and rule violations for exceptions) is not feasible. Combining rules with data (neurosymbolic models) is an exciting research direction that leverages the best of both worlds for reasoning over traffic scenes. Apart from investigating the impact of knowledge integration, we will work toward finding an optimal traffic rule representation for hybrid learning tasks and leverage large language models for the automated representation of the traffic rules and regulations needed for downstream AD tasks. The hybrid learning approach is intended to reduce the data dependency, generate traffic-rule-compliant vehicle trajectories, and, owing to its reasoning capability, make the model more generalizable even for corner cases or cases that are underrepresented in the datasets.</p>
      </abstract>
      <kwd-group>
        <kwd>Knowledge Representation</kwd>
        <kwd>Rule Formalization</kwd>
        <kwd>Reasoning</kwd>
        <kwd>Natural Language Processing</kwd>
        <kwd>Autonomous Driving</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>1. Introduction</title>
      <p>Ample data and powerful hardware have accelerated recent advances in Artificial Intelligence (AI).
Data-driven subsymbolic models produce state-of-the-art results but are data hungry and
susceptible to failure in unseen scenarios, whereas knowledge-driven symbolic models are
data efficient but fail to model complex interactions based on statistical relationships. Recent
work that combines these approaches offers a way to take advantage of the best of
both worlds. This combination is referred to as a hybrid AI model. The
idea is to combine the robustness and generalization capability of the symbolic model with the
statistical relationships captured by the subsymbolic model. We are interested in the hybrid AI approach
because it has the potential to make the model generalizable and able to operate in a dynamic, unseen,
and open-world environment. In the autonomous driving domain, capturing all dangerous or
common scenarios of traffic participants’ interactions is not feasible due to the large number
of combinations. Moreover, most scenarios in datasets are based on typical driving, with a
low representation of failure and accident scenarios. A hybrid AI model has the potential
to help us address this problem by supplementing data with traffic rules or another knowledge
base. Knowledge can be represented in various ways: text, mathematical equations, simulations,
images, and logic. How these forms of knowledge are integrated matters, because the
knowledge representation must be compatible with the subsymbolic model architecture. There
are various possible choices for integration, such as through the learning algorithm or the network
architecture. Currently, there is no clear research consensus regarding the optimal
knowledge representation and integration methods for a more accurate model.</p>
      <p>
        The goal of this thesis is to assist the motion planning of autonomous vehicles by ruling
out undrivable or legally infeasible trajectories based on traffic rules. For motion planning,
the trajectory of the vehicle needs to be predicted. A trajectory is the state of the vehicle with
respect to time along its path. It needs to take into account traffic signs, rules, and static
and dynamic obstacles along the route. Our goal is to develop a knowledge representation and
automated reasoning module so that trajectory prediction is more accurate and plausible and
conforms to traffic rules. There are two broad approaches to trajectory prediction: the traditional
model-based approach [
        <xref ref-type="bibr" rid="ref1">1</xref>
        ][
        <xref ref-type="bibr" rid="ref2">2</xref>
        ] and the data-driven learning-based approach [
        <xref ref-type="bibr" rid="ref3">3</xref>
        ][
        <xref ref-type="bibr" rid="ref4">4</xref>
        ]. Traditional
model-based approaches rely on heuristics and leverage rules as well as environmental and physical
features (e.g., velocity, direction). They are particularly limited in modeling complex interactions
among traffic participants in the traffic scene. Learning-based approaches can model these
complex interactions and context information given a sufficient amount of data and a suitable
algorithmic design. Possible design choices include expressing rules as a loss function
or integrating them into the network architecture. However, these models pay less attention to
robustness and need abundant training data, which is not available in many cases. A hybrid model
can inject both rules and data into the trajectory prediction module. We will inject
traffic rules that dictate vehicle priority into the model. We will model implicit and explicit
traffic rules as a logical and semantic knowledge base to perform automated reasoning over
traffic scenes.
      </p>
      <p>
        • Explicit or declarative rules are defined and standardized by authorities in the form of
rule books or regulations, such as the German traffic rule book (StVO) [
        <xref ref-type="bibr" rid="ref5">5</xref>
        ]. Therefore, they
are often clearer and more structured.
• Implicit or procedural rules are not defined in concrete terms, but they are necessary to
limit chaos in traffic scenes and keep traffic flowing smoothly. The decision is based on context and
surrounding information, for example when a non-aggressive driver signals another car to pass
through.
      </p>
      <p>Rules and knowledge need to be modeled so that they can later be used for reasoning
over a traffic scene. The reasoning task here is to determine the
priority of traffic participants in a given traffic situation. The following section formalizes the objective and goals of
this thesis work and gives an abstract plan to achieve them.</p>
    </sec>
    <sec id="sec-2">
      <title>2. Objective and Research Questions</title>
      <sec id="sec-2-1">
        <title>The main goal of this project is to research and answer the following questions:</title>
        <p>How can we semantically and logically formalize and represent implicit
and explicit traffic rules and regulations, and reason over them automatically, for more robustness in trajectory planning algorithms (for
autonomous vehicles)?</p>
        <p>It is hypothesized that integrating rules and knowledge into the learning-based trajectory
prediction module can improve robustness and generate more plausible trajectories.</p>
      </sec>
      <sec id="sec-2-2">
        <title>The research questions originating from the proposed objective are:</title>
        <p>RQ1: What are adequate semantic and logical knowledge representation methods for
traffic rules and knowledge in the AD domain?
Knowledge modeling is required for the rules that govern a downstream task, such as the interaction
among traffic participants at intersections or during overtaking. Rule representation is an essential step
of this modeling: rules will be represented in semantic and logical form.
Knowledge modeling allows us to use the knowledge as a base (referred to as a knowledge base)
to perform reasoning or to integrate it with other downstream tasks. An example is the knowledge
modeling of the right-before-left traffic rule in Germany.</p>
        <p>RQ2: Can we use language models to represent traffic rules and background knowledge
such that this representation yields results similar to or better than existing knowledge
representations, which require manual intervention?
Language models are trained on large volumes of text data to assign probabilities to
sequences of words. In this thesis, we use them to translate natural language
traffic rules automatically into a formal logical format, whereas in automated driving, formal rules
are currently handcrafted for each specific context.</p>
        <p>RQ3: How and at which stage should knowledge be infused into the hybrid model for reasoning
over traffic scenes?
Knowledge modeling helps us to inject knowledge into the models and perform reasoning.
Prolog and Answer Set Programming are examples of logic-based approaches to
automated reasoning. Reasoning can be performed as a separate component in the overall
model pipeline or combined with knowledge representation or an existing data-driven model.
The reasoning component will be used to resolve conflicts in traffic scenes, such as deciding which of
two cars has priority over the other. This work will investigate how to model
traffic rules and how to reason using them as priors or constraints.</p>
        <p>RQ4: What are the possible trade-offs and the scope for improvement when we use rules as a
precondition for a learning-based trajectory prediction algorithm versus when we use rules as a
conformity check on trajectories?</p>
      </sec>
    </sec>
    <sec id="sec-3">
      <title>3. Relevance</title>
      <p>
        This thesis aims to increase the robustness of autonomous driving systems by assisting the
long-term trajectory prediction module. Long-term trajectory prediction estimates the positions of traffic
participants over the next 3 to 5 seconds [
        <xref ref-type="bibr" rid="ref6">6</xref>
        ]. This long-term prediction helps to better plan the
trajectory taken by an autonomous vehicle. By validating and selecting generated trajectories
based on rules and commonsense knowledge, we can discard implausible trajectories. Removing
them increases the confidence score or probability of the remaining plausible
trajectories, so that the long-term trajectory can be planned robustly.
      </p>
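      <p>The effect of discarding implausible trajectories on the remaining confidence scores can be illustrated with a minimal sketch. The function name, the maneuver labels, and the compliance check are hypothetical, and simple renormalization is only one way in which the remaining probabilities could be updated.</p>
      <preformat>
```python
from typing import Callable


def filter_and_renormalize(
    candidates: list[tuple[str, float]],
    is_compliant: Callable[[str], bool],
) -> list[tuple[str, float]]:
    """Drop non-compliant candidates and rescale the remaining probabilities."""
    kept = [(traj, prob) for traj, prob in candidates if is_compliant(traj)]
    total = sum(prob for _, prob in kept)
    if total == 0:
        return []  # no compliant candidate carries probability mass
    return [(traj, prob / total) for traj, prob in kept]


if __name__ == "__main__":
    # Hypothetical candidate maneuvers with predicted probabilities.
    predicted = [("keep_lane", 0.5), ("cross_solid_line", 0.3), ("slow_down", 0.2)]
    # Hypothetical rule check: crossing a solid line is not allowed.
    print(filter_and_renormalize(predicted, lambda m: m != "cross_solid_line"))
```
</preformat>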
    </sec>
    <sec id="sec-4">
      <title>4. Related Work</title>
      <p>Traffic rules are often tied to specific situations and need a clear structure for models to
use them. For a rule such as keeping a safe distance, for example, the safe distance must be defined in meters.
Rules need to be defined in a structured way so that they can be integrated into downstream
trajectory prediction tasks.</p>
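      <p>To illustrate what a structured definition of such a rule could look like, the sketch below turns "keep a safe distance" into a check in meters. The kinematic formula (reaction distance plus the difference in braking distances) and all parameter names and default values are assumptions chosen for illustration; they are not taken from the cited works or from the StVO.</p>
      <preformat>
```python
def safe_distance(v_rear: float, v_front: float,
                  a_brake_rear: float = 8.0, a_brake_front: float = 8.0,
                  t_react: float = 1.0) -> float:
    """Gap in meters needed so the rear vehicle can stop behind the front one.

    Speeds are in m/s, decelerations in m/s^2 (positive), reaction time in s.
    """
    reaction = v_rear * t_react
    braking_rear = v_rear ** 2 / (2.0 * a_brake_rear)
    braking_front = v_front ** 2 / (2.0 * a_brake_front)
    return max(reaction + braking_rear - braking_front, 0.0)


def keeps_safe_distance(gap_m: float, v_rear: float, v_front: float) -> bool:
    """Predicate form of the rule: True if the current gap is large enough."""
    return gap_m >= safe_distance(v_rear, v_front)
```
</preformat>
      <p>Expressed this way, the rule becomes a Boolean predicate over quantities that a scene parser can provide, which is the kind of structure a downstream reasoner needs.</p>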
      <p>
        There are multiple ways to represent traffic rules as knowledge. Ontologies are used to represent
concepts and relations of traffic situations and metadata of sensor data [
        <xref ref-type="bibr" rid="ref7">7</xref>
        ][
        <xref ref-type="bibr" rid="ref8">8</xref>
        ]. Furthermore,
first-order [
        <xref ref-type="bibr" rid="ref9">9</xref>
        ] or higher-order logic [
        <xref ref-type="bibr" rid="ref10">10</xref>
        ] is also used to represent knowledge of traffic rules.
Conversion of textual rules to logic rules (e.g., using linear temporal logic) is an active research
area, as a logical language removes the semantic ambiguity of textual rules. However, such
conversion creates an additional burden for knowledge base creation, and many intricate details
of complex traffic participant behavior are challenging to capture. The work in [
        <xref ref-type="bibr" rid="ref11">11</xref>
        ] formalized
traffic rules in temporal logic and evaluated them on a public dataset. However, that work
is limited to simple settings such as a straight highway and ignores regulatory signs such
as lane markings and other informative signs, as well as the right-of-way rule. Nevertheless,
it can be extended to make it more generic and independent of scenario-specific use cases.
The authors of [
        <xref ref-type="bibr" rid="ref12">12</xref>
        ] compiled interstate traffic rules in temporal logic and showed how to create rules from
predicates, functions, and propositions, but they did not model rules for intersections and
assumed that driving lanes are separated.
      </p>
      <p>
        The use of large language models (LLMs) for text and language understanding is well
researched and has produced state-of-the-art results. Large pre-trained models like BERT [
        <xref ref-type="bibr" rid="ref13">13</xref>
        ], T5
[
        <xref ref-type="bibr" rid="ref14">14</xref>
        ], and GPT-3 [15] can be used for translation, text understanding, and visual question
answering. The authors of [16] worked toward converting natural language text to signal temporal logic using the
Transformer [17], and [18] used an LSTM-based architecture to translate English sentences into first-order
logic. However, to the best of the author’s knowledge, these models have not yet
been applied to safety-critical tasks such as the automated formalization of traffic rules from
natural language text.
      </p>
      <p>The approaches to traffic rule formalization discussed above are not robust because they do not
extend to unseen or new scenarios where conflicts need to be resolved among traffic
participants. The work of [19] attempts to integrate traffic rules and symbols into vehicle planning. Similarly,
the work in [20] handles static rules that can be easily mapped to constraints, but it does not extend
to more complex behavioral rules with multiple agents or participants in a scenario. These
and other works on knowledge integration achieve better performance, but they mostly consider
traffic signs and symbols rather than complex multi-agent interactions based on traffic rules.
Because multiple interactions involving multiple traffic signs and rules require fast resolution of conflicts,
rules cannot be explicitly defined for all such combinations. As a result, for unseen scenarios or
situations where the rule syntax cannot be directly mapped to traffic scenarios, we should combine
rules with data-driven models.</p>
      <p>Introducing rules in a logical format into neural networks has been proposed as a solution to
overcome some of the problems associated with limited data and to make models more robust against new,
unseen scenarios. Logic tensor networks [21] and logical neural networks [22] are examples of
work in this direction that introduce rules into the neural network. The work in [23]
introduces symbolic knowledge into deep learning models by adding
logical constraints to the loss function for classification problems.</p>
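      <p>As an illustration of how a logical constraint can enter a loss function, the sketch below computes a loss in the style of the semantic loss of [23]: the negative logarithm of the probability mass that independent Boolean outputs place on assignments satisfying the constraint. The brute-force enumeration, the function names, and the "exactly one maneuver" constraint are assumptions made for illustration, and the enumeration is only practical for a handful of variables.</p>
      <preformat>
```python
import itertools
import math
from typing import Callable, Sequence


def semantic_loss(probs: Sequence[float],
                  constraint: Callable[[tuple], bool]) -> float:
    """Negative log-probability that independent Boolean outputs satisfy `constraint`."""
    satisfied_mass = 0.0
    for assignment in itertools.product((False, True), repeat=len(probs)):
        if not constraint(assignment):
            continue
        # Probability of this assignment under independent outputs.
        weight = 1.0
        for prob, value in zip(probs, assignment):
            weight *= prob if value else 1.0 - prob
        satisfied_mass += weight
    if satisfied_mass == 0.0:
        return math.inf  # the outputs rule out every satisfying assignment
    return -math.log(satisfied_mass)


def exactly_one(assignment: tuple) -> bool:
    """Hypothetical rule: exactly one maneuver (e.g., left, straight, right) is chosen."""
    return sum(assignment) == 1
```
</preformat>
      <p>The loss is zero when the network output already satisfies the rule with certainty and grows as probability mass moves to violating assignments, so it can be added to a data-driven loss term as a penalty.</p>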
      <p>Recent work by [24] combined a learning-based model with logical rules of traffic regulations
expressed in signal temporal logic (STL) in the context of motion planning for autonomous vehicles. In this
work, symbolic knowledge is integrated as priors and is shown to be effective in generating
better trajectory predictions. This work is interesting but limited to specific rules related to
the center line and safe distance, and it can be extended to more rules, such as those
related to controlled and uncontrolled intersections. Furthermore, it needs to be
scaled and studied further to understand the complexity of the rules, for example, in situations
involving intersections in city road traffic. The authors of [25] injected knowledge into a
data-driven trajectory prediction module, but their knowledge is limited to physics-constrained
values, and perception information is used as knowledge instead of traffic rules. Nevertheless,
these two works are interesting to explore further in terms of scaling to complex rules or
architecture evaluation. This thesis will investigate how to overcome the scaling limitations of these baseline
works in terms of rule integration, such that the integrated model is able to
model complex interactions among traffic participants. The resulting models will be evaluated on use case
scenarios (e.g., overtaking and road intersections) available from open public datasets such as
CommonRoad [26] and on handcrafted scenarios simulating different ways of overtaking or
crossing road intersections.</p>
    </sec>
    <sec id="sec-5">
      <title>5. Proposed Work and Experiment Planning</title>
      <p>The thesis work will have two closely interacting modules. The first represents
knowledge, traffic rules, and regulations. The second uses this representation
to perform conflict resolution over traffic scenes. Traffic scene representation and parsing
are a prerequisite for both modules, because any limitation in the information extracted from the traffic
scene limits the scope of rule application over the traffic scene and of rule book creation. Fig.
1 shows the main ideas and components with their interconnections. The trajectory
prediction module shown in the illustration is not the focus of our work, but we will use it
to evaluate our work and see whether we can improve its performance. We will use an existing
state-of-the-art trajectory prediction module. Modules shown in the same color are
more closely integrated than others. The formalization of traffic rules from textual rules is part of
the knowledge representation module, which will leverage the information generated from the
traffic scene and the trajectory prediction module to perform reasoning over the traffic situation.
Reasoning in our case means first detecting conflicts among trajectories and then
resolving these conflicts among traffic participants. Among multiple interacting participants,
priority can be resolved at a specific instant based on rules and commonsense knowledge. More
information about these modules is provided below, along with a rough outline of the work and
planned experiments.</p>
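      <p>The first reasoning step, detecting a conflict between two predicted trajectories, could be sketched as follows. The representation of a trajectory as a list of (t, x, y) samples, the shared time grid, the distance threshold min_gap, and the function name are assumptions for illustration. Resolving a detected conflict would then be left to the rule-based reasoner.</p>
      <preformat>
```python
import math
from typing import Optional

Sample = tuple[float, float, float]  # (time in s, x in m, y in m)


def first_conflict(traj_a: list[Sample], traj_b: list[Sample],
                   min_gap: float = 2.0) -> Optional[float]:
    """Return the first time at which the two trajectories come closer than `min_gap`.

    Both trajectories are assumed to be sampled at the same time stamps.
    Returns None if no conflict is found.
    """
    for (t, xa, ya), (_, xb, yb) in zip(traj_a, traj_b):
        if min_gap > math.hypot(xa - xb, ya - yb):
            return t
    return None


if __name__ == "__main__":
    # Two hypothetical vehicles crossing the same intersection at 10 m/s.
    eastbound = [(float(t), -20.0 + 10.0 * t, 0.0) for t in range(5)]
    northbound = [(float(t), 0.0, -20.0 + 10.0 * t) for t in range(5)]
    print(first_conflict(eastbound, northbound))
```
</preformat>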
      <sec id="sec-5-1">
        <title>5.1. Knowledge Representation and Integration</title>
        <p>We model the rules required for traffic scenarios such as overtaking, lane merging, and intersections.
This stage will therefore consist of the knowledge representation
of traffic rules (e.g., StVO) for these scenarios.</p>
        <p>
          • Initial evaluation of methods for knowledge integration and their role in trajectory
prediction.
• Evaluation of data structures for knowledge representation that allow automated reasoning.
• Comparison of different architectures for knowledge representation and reasoning.
        </p>
        <p>
          Current research has used precisely defined logical rules, but we will attempt to use natural language
text directly. This component of the thesis will use large language models (LLMs) like T5 [
          <xref ref-type="bibr" rid="ref14">14</xref>
          ] and
GPT-3 [15], which can capture the meaning behind traffic rules and can help generate
their formal representation. Due to the safety requirements of autonomous driving,
human verification of the formal representation of rules will be required for testing purposes.
We performed some initial experiments with a language model to convert natural language text
into first-order logic. These are initial evaluations without fine-tuning these data-hungry
language models on traffic and driving rule texts. However, they give us hints about
the capability of these models to formalize traffic rule knowledge.
        </p>
        <p>Prompt: A two-lane expressway is a road with only one lane in each direction and usually
no metallic median barrier between.</p>
        <p>Output: P = two-lane expressway; Q = road with only one lane in each direction; R = no
metallic median barrier between; P(x) ↔ (Q(x) ∧ R(x))</p>
        <p>As seen in this example, the generated representation is technically correct. However, we also observed
failure cases in which the language model, used without fine-tuning, failed to translate the natural text into logic
because of missing quantifiers and variables or wrong assignments. Moreover,
there is scope for generating more abstract predicates from the natural
language text. We are in the process of fine-tuning the language models for the autonomous
driving domain, as domain-specific training should enable them to provide more accurate
representations.</p>
        <p>This segment of work is connected to RQ1 and RQ2 regarding representation types and the use
of language models to obtain them.</p>
      </sec>
      <sec id="sec-5-2">
        <title>5.2. Traffic Scene Parsing and Trajectory Prediction Module</title>
        <p>We first explain some terminology used in this paper. A path describes how to get from point
A to point B without considering speed or the interactions that occur along the route. A trajectory
has a notion of time and speed: it is the vehicle state with respect to time. In
trajectory prediction for traffic participants, multiple trajectories are usually predicted for a single vehicle
because of the uncertainty attached to the behavior of other
traffic participants. A traffic scene consists of the stationary and moving elements in the
scene, typically road segments, traffic participants such as vehicles and pedestrians,
and traffic control infrastructure. It also contains the relations between
these elements. A scenario is the temporal development of scenes in a sequence.
To apply our rules, we need to parse the traffic scene so that our logical system can relate to
it. Many AD datasets provide an API to access the map data
and the respective locations of traffic participants, which we can use to extract and parse
information about the scene. Another module shown in Fig. 1 is the trajectory prediction module,
which generates the trajectories of vehicles based on their past behavior, velocity, acceleration,
etc. The outputs of these two modules will be combined with the formalized knowledge described in Sec.
5.1 and processed by the reasoning module.</p>
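        <p>The terminology above maps naturally onto simple data structures. The following sketch is one possible encoding; all class and field names are assumptions for illustration and do not correspond to the API of any particular AD dataset.</p>
        <preformat>
```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class State:
    """Vehicle state at one instant."""
    t: float      # time in s
    x: float      # position in m
    y: float      # position in m
    speed: float  # in m/s


Path = list[tuple[float, float]]  # geometry only, no notion of time or speed
Trajectory = list[State]          # states ordered by time


@dataclass
class Scene:
    """Snapshot of stationary and moving elements and the relations between them."""
    time: float
    participants: dict[str, State] = field(default_factory=dict)
    road_elements: list[str] = field(default_factory=list)
    # Each relation is a (subject, relation, object) triple.
    relations: list[tuple[str, str, str]] = field(default_factory=list)


Scenario = list[Scene]  # temporal development of scenes


def path_of(trajectory: Trajectory) -> Path:
    """Drop time and speed from a trajectory, keeping only the visited positions."""
    return [(state.x, state.y) for state in trajectory]
```
</preformat>
        <p>Storing relations as triples keeps the parsed scene close to the facts a logic-based reasoner would consume.</p>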
      </sec>
      <sec id="sec-5-3">
        <title>5.3. Reasoning Over Traffic Scene</title>
        <p>The represented knowledge will be used for reasoning over traffic scenes. Reasoning needs to
be performed when conflicts are detected among the trajectories. Conflicts among the
trajectories generated by the trajectory prediction module are resolved against the traffic rule representation
from the earlier module. In this sense, the rules act as a prior for more robust trajectory
generation. The reasoning module could consist of Answer Set Programming (ASP), ProbLog, or
neurosymbolic approaches like logic tensor networks [21] or DeepProbLog [27]. Reasoners like
ASP and its variants are powerful and data efficient, but they face issues when uncertainty is
involved or when rules cannot be directly applied due to incompleteness. Therefore, we will evaluate
neurosymbolic models that perform reasoning based on rules and data. These evaluations
give us an opportunity to answer RQ3, regarding where we should infuse our
knowledge. Subsection 5.3.1 explains this more thoroughly. Fig. 2 shows the idea of the hybrid
model, in which the AD dataset and formalized knowledge in the form of rules are combined. This hybrid
or neurosymbolic model integrates the symbolic representation homogeneously into the neural
network, so that the model can learn from data and rules simultaneously. We hope that such
a combination gives us the best of both the symbolic world, which is data efficient and can
handle unseen situations, and the subsymbolic world, which captures complex relationships
and interactions. At present, rules in first-order logic (FOL) form are our initial choice, considering
the recent development of logic-based neural networks [21] [22], which are better suited
to FOL. Furthermore, FOL is more expressive than Horn or propositional logic.</p>
        <sec id="sec-5-3-1">
          <title>5.3.1. Scaling Representation and Reasoning</title>
          <p>We want to make knowledge representation and reasoning as generic as possible. This is
currently missing from most recent work in this domain. During this phase, we will adapt
our work for generalization capability and evaluate it on increasingly complex
scenarios that, beyond the use cases mentioned earlier, involve more traffic participants or multiple rules that apply at the same time.
This includes evaluation on new, unseen scenarios, where the
model needs to perform reasoning based on the rules. We believe that integrating a rule-based
module with a data-driven learning-based module is the way to achieve this robustness and
obtain plausible trajectories. Neurosymbolic approaches like logical neural networks [28] leverage first-order
rules to perform reasoning tasks, and it would be interesting to evaluate them, with modifications, on the use case
of trajectory prediction. These neurosymbolic models have not yet been
evaluated for logical languages with a temporal aspect, such as signal temporal logic (STL). A
neural network with STL can be evaluated on the basis of the initial results obtained with
FOL and the neural network. Additionally, a subsymbolic neural network is believed to
perform reasoning and decision space search faster than a reasoner based on a logical symbolic
system. In the learning-based approach, some attempts have been made to use rules as a loss
function to generate rule-compliant trajectories. How to design a loss function that covers a
broad range of traffic rules is an open question, and we will work toward answering it. This evaluation is
also connected to RQ4, where we evaluate trade-offs and performance in various scenarios.</p>
        </sec>
      </sec>
      <sec id="sec-5-4">
        <title>5.4. Evaluation of the Pipeline</title>
        <p>We will evaluate our work with public datasets and specific handmade scenarios. In this step,
we will make use of the trajectory prediction algorithm to evaluate the impact of our work.
Once our work is more advanced, we will refine and formalize this step, as we will then
have a better idea of the overall architecture. Currently, we envision scenario-based
testing, the safe distance from other vehicles, and the occurrence of rule violations as a starting
point, coupled with typical trajectory prediction evaluation criteria such as the final
displacement error (FDE) and the mean absolute error (MAE). These criteria measure the
deviation of the trajectory predicted by a model from the ground truth trajectory.
To assess automated knowledge modeling, we compare these trajectory prediction metrics obtained
with automatically formalized traffic rules against those obtained with
handcrafted traffic rules. The neurosymbolic model can
be evaluated by designing scenarios for which no data are available, so that correct decisions must come from the rules
alone, and, conversely, scenarios for which no exact rule is
available but similar interactions among traffic participants can be found in the dataset.
Argoverse [29], Interaction [30], and nuScenes [31] are examples of public autonomous driving
datasets that can be used for the evaluation of our work. To evaluate a specific use
case scenario, we can use a driving simulator like CARLA [32] to create the scenario and
evaluate the performance of our model.</p>
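        <p>For concreteness, the two displacement-based criteria mentioned above can be computed as in the following sketch for a single predicted trajectory. Representing trajectories as lists of (x, y) positions at matching time steps and taking the MAE over all coordinates are assumptions; benchmark suites may define these metrics slightly differently.</p>
        <preformat>
```python
import math

Point = tuple[float, float]  # (x, y) position in m


def final_displacement_error(pred: list[Point], truth: list[Point]) -> float:
    """Euclidean distance between the last predicted and the last true position."""
    (px, py), (tx, ty) = pred[-1], truth[-1]
    return math.hypot(px - tx, py - ty)


def mean_absolute_error(pred: list[Point], truth: list[Point]) -> float:
    """Mean absolute coordinate error over all time steps."""
    errors = [abs(p - t)
              for pred_point, true_point in zip(pred, truth)
              for p, t in zip(pred_point, true_point)]
    return sum(errors) / len(errors)
```
</preformat>
        <p>Rule-related criteria such as the number of rule violations would be reported alongside these values, since a trajectory can be close to the ground truth and still violate a rule.</p>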
      </sec>
    </sec>
    <sec id="sec-6">
      <title>6. Discussion and Open Questions</title>
      <p>Trajectory prediction and planning is a well-researched area. However, many
research questions related to long-term trajectory prediction and planning remain open because of the
uncertainty along the whole chain from sensor perception to maneuvering, as road traffic is dynamic and
constantly evolving. Recent advances in high-definition maps make it easier to pre-calculate routes and
plans based on road signs, traffic lights, and static obstacles, but resolving conflicts
with dynamic objects in real time remains a difficult task. We are in the first year of our work and would like to
pose some open questions for discussion.</p>
      <p>1. Since formal rule-based methods are deterministic and creating them is time consuming,
we want to discuss possible alternatives that are still deterministic but
do not create incompleteness issues. In our view, rules run into problems
when they become unsolvable because of a missing or uncertain predicate or parameter.
2. What are possible approaches to knowledge integration in a learning-based model?
Some possible ways are: as a loss function, as a conformity check, or in the
model architecture. It would be insightful to discuss them together with other possible
approaches.
3. When we use rules as constraints in neural network-based trajectory prediction, how do
we handle multiple rules in the same model? What about the priority among the rules in
the neural network? We want to discuss the feasibility of this with experts.</p>
      <p>We are highly interested in discussing these questions regarding our core research proposal
and ideas, as we are trying to combine both worlds of AI in a highly application-oriented
domain with real-time and safety requirements.</p>
    </sec>
    <sec id="sec-7">
      <title>Acknowledgments</title>
      <p>The authors acknowledge the funding provided by the German Federal Ministry for Economic
Afairs and Energy within the project "KI Wissen – Automotive AI powered by Knowledge" and
Continental AG for the project. Special thanks go to Daniel Bär for feedback, idea generation,
and also to Rafael Schön for discussions.
[15] T. B. Brown, B. Mann, N. Ryder, M. Subbiah, J. Kaplan, P. Dhariwal, A. Neelakantan,
P. Shyam, G. Sastry, A. Askell, S. Agarwal, A. Herbert-Voss, G. Krueger, T. Henighan,
R. Child, A. Ramesh, D. M. Ziegler, J. Wu, C. Winter, C. Hesse, M. Chen, E. Sigler, M. Litwin,
S. Gray, B. Chess, J. Clark, C. Berner, S. McCandlish, A. Radford, I. Sutskever, D. Amodei,
Language models are few-shot learners, CoRR abs/2005.14165 (2020). URL: https://arxiv.org/abs/2005.14165.
[16] J. He, E. Bartocci, D. Nickovic, H. Isakovic, R. Grosu, From English to signal temporal logic,
CoRR abs/2109.10294 (2021). URL: https://arxiv.org/abs/2109.10294. arXiv:2109.10294.
[17] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, I.
Polosukhin, Attention is all you need, Advances in neural information processing systems 30
(2017). URL: https://arxiv.org/abs/1706.03762.
[18] H. Singh, M. Aggarwal, B. Krishnamurthy, Exploring neural models for parsing natural
language into first-order logic, CoRR abs/2002.06544 (2020). URL: https://arxiv.org/abs/2002.06544.
arXiv:2002.06544.
[19] A. Best, S. Narang, D. Barber, D. Manocha, AutonoVi: Autonomous vehicle planning
with dynamic maneuvers and traffic constraints, CoRR abs/1703.08561 (2017).
URL: http://arxiv.org/abs/1703.08561. arXiv:1703.08561.
[20] Z. Ajanovic, B. Lacevic, B. Shyrokau, M. Stolz, M. Horn, Search-based optimal motion
planning for automated driving, in: 2018 IEEE/RSJ International Conference on Intelligent
Robots and Systems (IROS), 2018, pp. 4523–4530. doi:10.1109/IROS.2018.8593813.
[21] S. Badreddine, A. S. d’Avila Garcez, L. Serafini, M. Spranger, Logic tensor networks, Artif.
Intell. 303 (2022) 103649. URL: https://doi.org/10.1016/j.artint.2021.103649.
[22] R. Riegel, A. Gray, F. Luus, N. Khan, N. Makondo, I. Y. Akhalwaya, H. Qian, R. Fagin,
F. Barahona, U. Sharma, S. Ikbal, H. Karanam, S. Neelam, A. Likhyani, S. Srivastava,
Logical neural networks, 2020. URL: https://arxiv.org/abs/2006.13155.
doi:10.48550/ARXIV.2006.13155.
[23] J. Xu, Z. Zhang, T. Friedman, Y. Liang, G. V. d. Broeck, A semantic loss function for deep
learning with symbolic knowledge, 2017. URL: https://arxiv.org/abs/1711.11157.
doi:10.48550/ARXIV.1711.11157.
[24] X. Li, G. Rosman, I. Gilitschenski, J. A. DeCastro, C. I. Vasile, S. Karaman, D. Rus,
Differentiable logic layer for rule guided trajectory prediction, in: J. Kober, F. Ramos, C. J. Tomlin
(Eds.), 4th Conference on Robot Learning, CoRL 2020, 16-18 November 2020, Virtual Event
/ Cambridge, MA, USA, volume 155 of Proceedings of Machine Learning Research, PMLR,
2020, pp. 2178–2194. URL: https://proceedings.mlr.press/v155/li21b.html.
[25] M. Bahari, I. Nejjar, A. Alahi, Injecting knowledge in data-driven vehicle trajectory
predictors, Transportation research part C: emerging technologies 128 (2021) 103010. URL:
https://www.sciencedirect.com/science/article/pii/S0968090X21000425.
[26] M. Althoff, M. Koschi, S. Manzinger, CommonRoad: Composable benchmarks for motion
planning on roads, in: 2017 IEEE Intelligent Vehicles Symposium (IV), 2017, pp. 719–726.
doi:10.1109/IVS.2017.7995802.
[27] R. Manhaeve, S. Dumancic, A. Kimmig, T. Demeester, L. D. Raedt, DeepProbLog: Neural
probabilistic logic programming, CoRR abs/1805.10872 (2018). URL: http://arxiv.org/abs/1805.10872.
arXiv:1805.10872.
[28] R. Riegel, A. G. Gray, F. P. S. Luus, N. Khan, N. Makondo, I. Y. Akhalwaya, H. Qian, R. Fagin,
F. Barahona, U. Sharma, S. Ikbal, H. Karanam, S. Neelam, A. Likhyani, S. K. Srivastava,
Logical neural networks, CoRR abs/2006.13155 (2020). URL: https://arxiv.org/abs/2006.13155.
arXiv:2006.13155.
[29] M.-F. Chang, J. Lambert, P. Sangkloy, J. Singh, S. Bak, A. Hartnett, D. Wang, P. Carr,
S. Lucey, D. Ramanan, et al., Argoverse: 3d tracking and forecasting with rich maps, in:
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,
2019, pp. 8748–8757. doi:10.1109/CVPR.2019.00895.
[30] W. Zhan, L. Sun, D. Wang, H. Shi, A. Clausse, M. Naumann, J. Kummerle, H. Konigshof,
C. Stiller, A. de La Fortelle, M. Tomizuka, Interaction dataset: An international, adversarial
and cooperative motion dataset in interactive driving scenarios with semantic maps, arXiv
preprint arXiv:1910.03088 (2019). URL: https://arxiv.org/abs/1910.03088.
[31] H. Caesar, V. Bankiti, A. H. Lang, S. Vora, V. E. Liong, Q. Xu, A. Krishnan, Y. Pan, G. Baldan,
O. Beijbom, nuscenes: A multimodal dataset for autonomous driving, 2020 IEEE/CVF
Conference on Computer Vision and Pattern Recognition (CVPR) (2020) 11618–11628.
doi:10.1109/CVPR42600.2020.01164.
[32] A. Dosovitskiy, G. Ros, F. Codevilla, A. Lopez, V. Koltun, CARLA: An open urban driving
simulator, in: Proceedings of the 1st Annual Conference on Robot Learning, 2017, pp.
1–16. URL: https://arxiv.org/abs/1711.03938.</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          [1]
          <string-name>
            <given-names>A.</given-names>
            <surname>Houenou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P.</given-names>
            <surname>Bonnifait</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Cherfaoui</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Yao</surname>
          </string-name>
          ,
          <article-title>Vehicle trajectory prediction based on motion model and maneuver recognition</article-title>
          ,
          <source>in: 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems</source>
          ,
          <year>2013</year>
          , pp.
          <fpage>4363</fpage>
          -
          <lpage>4369</lpage>
          . doi:10.1109/IROS.2013.6696982.
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          [2]
          <string-name>
            <given-names>W.</given-names>
            <surname>Schwarting</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Pierson</surname>
          </string-name>
          ,
          <string-name>
            <given-names>J.</given-names>
            <surname>Alonso-Mora</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Karaman</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Rus</surname>
          </string-name>
          ,
          <article-title>Social behavior for autonomous vehicles</article-title>
          ,
          <source>Proceedings of the National Academy of Sciences</source>
          <volume>116</volume>
          (
          <year>2019</year>
          )
          <fpage>24972</fpage>
          -
          <lpage>24978</lpage>
          . doi:10.1073/pnas.1820676116.
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          [3]
          <string-name>
            <given-names>A.</given-names>
            <surname>Alahi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Goel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>V.</given-names>
            <surname>Ramanathan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Robicquet</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Fei-Fei</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Savarese</surname>
          </string-name>
          ,
          <article-title>Social lstm: Human trajectory prediction in crowded spaces</article-title>
          ,
          <source>in: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>961</fpage>
          -
          <lpage>971</lpage>
          . doi:10.1109/CVPR.2016.110.
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          [4]
          <string-name>
            <given-names>M.</given-names>
            <surname>Bansal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Krizhevsky</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A. S.</given-names>
            <surname>Ogale</surname>
          </string-name>
          ,
          <article-title>ChauffeurNet: Learning to drive by imitating the best and synthesizing the worst</article-title>
          , CoRR abs/1812.03079 (
          <year>2018</year>
          ). URL: http://arxiv.org/abs/1812.03079. arXiv:1812.03079.
        </mixed-citation>
      </ref>
      <ref id="ref5">
        <mixed-citation>
          [5] Bundesministerium der Justiz und für Verbraucherschutz,
          <article-title>Straßenverkehrs-Ordnung (StVO)</article-title>
          ,
          <year>2013</year>
          . URL: https://www.gesetze-im-internet.de/stvo_2013/StVO.pdf.
        </mixed-citation>
      </ref>
      <ref id="ref6">
        <mixed-citation>
          [6]
          <string-name>
            <given-names>R.</given-names>
            <surname>Chandra</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Guan</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Panuganti</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Mittal</surname>
          </string-name>
          ,
          <string-name>
            <given-names>U.</given-names>
            <surname>Bhattacharya</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Bera</surname>
          </string-name>
          ,
          <string-name>
            <given-names>D.</given-names>
            <surname>Manocha</surname>
          </string-name>
          ,
          <article-title>Forecasting trajectory and behavior of road-agents using spectral clustering in graph-LSTMs</article-title>
          , CoRR abs/1912.01118 (
          <year>2019</year>
          ). URL: http://arxiv.org/abs/1912.01118. arXiv:1912.01118.
        </mixed-citation>
      </ref>
      <ref id="ref7">
        <mixed-citation>
          [7]
          <string-name>
            <given-names>L.</given-names>
            <surname>Zhao</surname>
          </string-name>
          ,
          <string-name>
            <given-names>R.</given-names>
            <surname>Ichise</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Sasaki</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Z.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <string-name>
            <given-names>T.</given-names>
            <surname>Yoshikawa</surname>
          </string-name>
          ,
          <article-title>Fast decision making using ontology-based knowledge base</article-title>
          ,
          <source>in: 2016 IEEE Intelligent Vehicles Symposium (IV)</source>
          ,
          <year>2016</year>
          , pp.
          <fpage>173</fpage>
          -
          <lpage>178</lpage>
          . doi:10.1109/IVS.2016.7535382.
        </mixed-citation>
      </ref>
      <ref id="ref8">
        <mixed-citation>
          [8]
          <string-name>
            <given-names>M.</given-names>
            <surname>Buechel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>G.</given-names>
            <surname>Hinz</surname>
          </string-name>
          ,
          <string-name>
            <given-names>F.</given-names>
            <surname>Ruehl</surname>
          </string-name>
          ,
          <string-name>
            <given-names>H.</given-names>
            <surname>Schroth</surname>
          </string-name>
          ,
          <string-name>
            <given-names>C.</given-names>
            <surname>Gyoeri</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Knoll</surname>
          </string-name>
          ,
          <article-title>Ontology-based traffic scene modeling, traffic regulations dependent situational awareness and decision-making for automated vehicles</article-title>
          ,
          <source>in: 2017 IEEE Intelligent Vehicles Symposium (IV)</source>
          ,
          <year>2017</year>
          , pp.
          <fpage>1471</fpage>
          -
          <lpage>1476</lpage>
          . doi:10.1109/IVS.2017.7995917.
        </mixed-citation>
      </ref>
      <ref id="ref9">
        <mixed-citation>
          [9]
          <string-name>
            <given-names>A.</given-names>
            <surname>Karimi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. S.</given-names>
            <surname>Duggirala</surname>
          </string-name>
          ,
          <article-title>Formalizing traffic rules for uncontrolled intersections</article-title>
          ,
          <source>in: 2020 ACM/IEEE 11th International Conference on Cyber-Physical Systems (ICCPS)</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>41</fpage>
          -
          <lpage>50</lpage>
          . doi:10.1109/ICCPS48487.2020.00012.
        </mixed-citation>
      </ref>
      <ref id="ref10">
        <mixed-citation>
          [10]
          <string-name>
            <given-names>A.</given-names>
            <surname>Rizaldi</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Althoff</surname>
          </string-name>
          ,
          <article-title>Formalising traffic rules for accountability of autonomous vehicles</article-title>
          ,
          <source>in: 2015 IEEE 18th International Conference on Intelligent Transportation Systems</source>
          ,
          <year>2015</year>
          , pp.
          <fpage>1658</fpage>
          -
          <lpage>1665</lpage>
          . doi:10.1109/ITSC.2015.269.
        </mixed-citation>
      </ref>
      <ref id="ref11">
        <mixed-citation>
          [11]
          <string-name>
            <given-names>K.</given-names>
            <surname>Esterle</surname>
          </string-name>
          ,
          <string-name>
            <given-names>L.</given-names>
            <surname>Gressenbuch</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Knoll</surname>
          </string-name>
          ,
          <article-title>Formalizing traffic rules for machine interpretability</article-title>
          ,
          <source>in: 2020 IEEE 3rd Connected and Automated Vehicles Symposium (CAVS)</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>1</fpage>
          -
          <lpage>7</lpage>
          . doi:10.1109/CAVS51000.2020.9334599.
        </mixed-citation>
      </ref>
      <ref id="ref12">
        <mixed-citation>
          [12]
          <string-name>
            <given-names>S.</given-names>
            <surname>Maierhofer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.-K.</given-names>
            <surname>Rettinger</surname>
          </string-name>
          ,
          <string-name>
            <given-names>E. C.</given-names>
            <surname>Mayer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Althoff</surname>
          </string-name>
          ,
          <article-title>Formalization of interstate traffic rules in temporal logic</article-title>
          ,
          <source>in: 2020 IEEE Intelligent Vehicles Symposium (IV)</source>
          ,
          <year>2020</year>
          , pp.
          <fpage>752</fpage>
          -
          <lpage>759</lpage>
          . doi:10.1109/IV47402.2020.9304549.
        </mixed-citation>
      </ref>
      <ref id="ref13">
        <mixed-citation>
          [13]
          <string-name>
            <given-names>J.</given-names>
            <surname>Devlin</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Chang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Toutanova</surname>
          </string-name>
          ,
          <article-title>BERT: pre-training of deep bidirectional transformers for language understanding</article-title>
          , CoRR abs/1810.04805 (
          <year>2018</year>
          ). URL: https://arxiv.org/abs/1810.04805.
        </mixed-citation>
      </ref>
      <ref id="ref14">
        <mixed-citation>
          [14]
          <string-name>
            <given-names>C.</given-names>
            <surname>Raffel</surname>
          </string-name>
          ,
          <string-name>
            <given-names>N.</given-names>
            <surname>Shazeer</surname>
          </string-name>
          ,
          <string-name>
            <given-names>A.</given-names>
            <surname>Roberts</surname>
          </string-name>
          ,
          <string-name>
            <given-names>K.</given-names>
            <surname>Lee</surname>
          </string-name>
          ,
          <string-name>
            <given-names>S.</given-names>
            <surname>Narang</surname>
          </string-name>
          ,
          <string-name>
            <given-names>M.</given-names>
            <surname>Matena</surname>
          </string-name>
          ,
          <string-name>
            <given-names>Y.</given-names>
            <surname>Zhou</surname>
          </string-name>
          ,
          <string-name>
            <given-names>W.</given-names>
            <surname>Li</surname>
          </string-name>
          ,
          <string-name>
            <given-names>P. J.</given-names>
            <surname>Liu</surname>
          </string-name>
          ,
          <article-title>Exploring the limits of transfer learning with a unified text-to-text transformer</article-title>
          ,
          <source>Journal of Machine Learning Research</source>
          <volume>21</volume>
          (
          <year>2020</year>
          )
          <fpage>1</fpage>
          -
          <lpage>67</lpage>
          . URL: http://jmlr.org/papers/v21/20-074.html.
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>