Vol-4123
urn:nbn:de:0074-4123-0


Yubin Kim, Arthur Maciejewicz, Brandon Beveridge

Lessons from the bleeding edge: large-scale production inference of LLMs