Vol-3884⫷ Vol-3885 ⫸Vol-3886
urn:nbn:de:0074-3885-0


Vol-3885/paper17⫷Vol-3885/paper18⫸Vol-3885/paper19

Inference Acceleration for Large Language Models Using "Stairs" Assisted Greedy Generation