=Paper=
{{Paper
|id=Vol-2419/toc
|storemode=property
|title=None
|pdfUrl=https://ceur-ws.org/Vol-2419/toc.pdf
|volume=Vol-2419
}}
==None==
AISafety 2019 Table of Contents
Table of Contents
Invited Talk to the AI Safety Landscape Session
Towards a Framework for Safety Assurance of Autonomous Systems . . . . . . . . . . . . . . . . . . . . . . 1
John McDermid, Yan Jia and Ibrahim Habli
Session 1: Safe Learning
Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive
Clinical Trials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
Hossein Aboutalebi, Doina Precup and Tibor Schuster
Metric Learning for Value Alignment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
Andrea Loreggia, Nicholas Mattei, Francesca Rossi and Kristen Brent Venable
Session 2: Reinforcement Learning Safety
Penalizing side effects using stepwise relative reachability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
Victoria Krakovna, Laurent Orseau, Miljan Martic and Shane Legg
Conservative Agency . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Alexander Turner, Dylan Hadfield-Menell and Prasad Tadepalli
Detecting Spiky Corruption in Markov Decision Processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Jason Mancuso, Tomasz Kisielewski, David Lindner and Alok Singh
Modeling AGI Safety Frameworks with Causal Influence Diagrams . . . . . . . . . . . . . . . . . . . . . . . 44
Tom Everitt, Ramana Kumar, Victoria Krakovna and Shane Legg
Session 3: Safe Autonomous Vehicles
On the Susceptibility of Deep Neural Networks to Natural Perturbations . . . . . . . . . . . . . . . . . 51
Mesut Ozdag, Sunny Raj, Steven L. Fernandes, Alvaro Velasquez, Laura Pullum and
Sumit Kumar Jha
Managing Uncertainty of AI-based Perception for Autonomous Systems . . . . . . . . . . . . . . . . . . 57
Maximilian Henne, Adrian Schwaiger and Gereon Weiss
A Framework for Safety Violation Identification and Assessment in Autonomous Driving . 61
Lukas Heinzmann, Sina Shafaei, Mohd Hafeez Osman, Christoph Segler and Alois Knoll
Session 4: AI Value Alignment, Ethics and Bias
The Glass Box Approach: Verifying Contextual Adherence to Values . . . . . . . . . . . . . . . . . . . . . 68
Andrea Aler Tubella and Virginia Dignum
Requisite Variety in Ethical Utility Functions for AI Value Alignment . . . . . . . . . . . . . . . . . . . . 75
Nadisha-Marie Aliman and Leon Kester
Slam the Brakes: Perceptions of Moral Decisions in Driving Dilemmas . . . . . . . . . . . . . . . . . . . . 82
Holly Wilson and Andreas Theodorou
1
AISafety 2019 Table of Contents
Understanding Bias in Datasets using Topological Data Analysis . . . . . . . . . . . . . . . . . . . . . . . . . 91
Ramya Srinivasan and Ajay Chander
Poster Papers
Computational Strategies for the Trustworthy Pursuit and the Safe Modeling of
Probabilistic Maintenance Commitments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
Qi Zhang, Edmund Durfee and Satinder Singh
Categorizing Wireheading in Partially Embedded Agents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
Arushi Majha, Sayan Sarkar and Davide Zagami
Adversarial Exploitation of Policy Imitation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112
Vahid Behzadan and William Hsu
The Challenge of Imputation in Explainable Artificial Intelligence Models . . . . . . . . . . . . . . . . 119
Muhammad Ahmad, Carly Eckert and Ankur Teredesai
On the importance of system testing for assuring safety of AI systems . . . . . . . . . . . . . . . . . . . . 123
Franz Wotawa
Towards Empathic Deep Q-Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130
Bart Bussmann, Jacqueline Heinerman and Joel Lehman
Watermarking of DRL Policies with Sequential Triggers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
Vahid Behzadan and William Hsu
2