Vol-2429
urn:nbn:de:0074-2429-0


Vol-2429/paper7

Learning Reliable Policies in the Bandit Setting with Application to Adaptive Clinical Trials