<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.0 20120330//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink">
  <front>
    <journal-meta />
    <article-meta>
      <title-group>
        <article-title>Table of Contents</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <string-name>Session</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <contrib contrib-type="author">
          <string-name>: Safe Learning</string-name>
          <xref ref-type="aff" rid="aff0">0</xref>
        </contrib>
        <aff id="aff0">
          <label>0</label>
          <institution>Session 4: AI Value Alignment</institution>
          ,
          <addr-line>Ethics and Bias</addr-line>
        </aff>
      </contrib-group>
      <pub-date>
        <year>2019</year>
      </pub-date>
      <abstract>
        <p>Invited Talk to the AI Safety Landscape Session Towards a Framework for Safety Assurance of Autonomous Systems . . . . . . . . . . . . . . . . . . . . . . John McDermid, Yan Jia and Ibrahim Habli Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Hossein Aboutalebi, Doina Precup and Tibor Schuster Metric Learning for Value Alignment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 Andrea Loreggia, Nicholas Mattei, Francesca Rossi and Kristen Brent Venable Session 2: Reinforcement Learning Safety Penalizing side e ects using stepwise relative reachability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 Victoria Krakovna, Laurent Orseau, Miljan Martic and Shane Legg Conservative Agency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 Alexander Turner, Dylan Had eld-Menell and Prasad Tadepalli Detecting Spiky Corruption in Markov Decision Processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Jason Mancuso, Tomasz Kisielewski, David Lindner and Alok Singh Modeling AGI Safety Frameworks with Causal In uence Diagrams . . . . . . . . . . . . . . . . . . . . . . . 44 Tom Everitt, Ramana Kumar, Victoria Krakovna and Shane Legg Session 3: Safe Autonomous Vehicles On the Susceptibility of Deep Neural Networks to Natural Perturbations . . . . . . . . . . . . . . . . . 51 Mesut Ozdag, Sunny Raj, Steven L. Fernandes, Alvaro Velasquez, Laura Pullum and Sumit Kumar Jha Managing Uncertainty of AI-based Perception for Autonomous Systems . . . . . . . . . . . . . . . . . . 57 Maximilian Henne, Adrian Schwaiger and Gereon Weiss A Framework for Safety Violation Identi cation and Assessment in Autonomous Driving . 61 Lukas Heinzmann, Sina Shafaei, Mohd Hafeez Osman, Christoph Segler and Alois Knoll The Glass Box Approach: Verifying Contextual Adherence to Values . . . . . . . . . . . . . . . . . . . . . 68 Andrea Aler Tubella and Virginia Dignum Requisite Variety in Ethical Utility Functions for AI Value Alignment . . . . . . . . . . . . . . . . . . . . 75 Nadisha-Marie Aliman and Leon Kester Slam the Brakes: Perceptions of Moral Decisions in Driving Dilemmas . . . . . . . . . . . . . . . . . . . . 82 Holly Wilson and Andreas Theodorou</p>
      </abstract>
    </article-meta>
  </front>
  <body>
    <sec id="sec-1">
      <title>-</title>
      <p>8
Understanding Bias in Datasets using Topological Data Analysis . . . . . . . . . . . . . . . . . . . . . . . . . 91</p>
      <p>Ramya Srinivasan and Ajay Chander
Poster Papers
Computational Strategies for the Trustworthy Pursuit and the Safe Modeling of
Probabilistic Maintenance Commitments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98</p>
      <p>Qi Zhang, Edmund Durfee and Satinder Singh
Categorizing Wireheading in Partially Embedded Agents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105</p>
      <p>Arushi Majha, Sayan Sarkar and Davide Zagami
Adversarial Exploitation of Policy Imitation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112</p>
      <p>Vahid Behzadan and William Hsu</p>
    </sec>
  </body>
  <back>
    <ref-list>
      <ref id="ref1">
        <mixed-citation>
          <article-title>The Challenge of Imputation in Explainable Arti cial Intelligence</article-title>
          <string-name>
            <given-names>Models . . . . . . . . . . . . . . . . 119 Muhammad</given-names>
            <surname>Ahmad</surname>
          </string-name>
          , Carly Eckert and Ankur Teredesai
        </mixed-citation>
      </ref>
      <ref id="ref2">
        <mixed-citation>
          <article-title>On the importance of system testing for assuring safety of AI systems</article-title>
          <string-name>
            <surname>. . . . . . . . . . . . . . . . . . . .</surname>
          </string-name>
          123 Franz Wotawa
        </mixed-citation>
      </ref>
      <ref id="ref3">
        <mixed-citation>
          <string-name>
            <given-names>Towards</given-names>
            <surname>Empathic Deep Q-Learning</surname>
          </string-name>
          <string-name>
            <given-names>. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130 Bart</given-names>
            <surname>Bussmann</surname>
          </string-name>
          , Jacqueline Heinerman and Joel Lehman
        </mixed-citation>
      </ref>
      <ref id="ref4">
        <mixed-citation>
          <article-title>Watermarking of DRL Policies with Sequential</article-title>
          <string-name>
            <given-names>Triggers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137 Vahid</given-names>
            <surname>Behzadan</surname>
          </string-name>
          and William Hsu
        </mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>