Vol-4111⫷ Vol-4112 ⫸Vol-4113
urn:nbn:de:0074-4112-0


Vol-4112/67_main_long⫷Vol-4112/68_main_long⫸Vol-4112/69_main_long
Enrico MensaLorenzo ZaneCalogero Jerik ScozzaroMatteo DelsantoTommaso MilaniDaniele P. Radicioni

Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark