=Paper=
{{Paper
|id=Vol-4112/68_main_long
|storemode=property
|title=Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark
|pdfUrl=https://ceur-ws.org/Vol-4112/68_main_long.pdf
|volume=Vol-4112
|authors=Enrico Mensa,Lorenzo Zane,Calogero Jerik Scozzaro,Matteo Delsanto,Tommaso Milani,Daniele P. Radicioni
}}
==Easy to Complete, Hard to Choose: Investigating LLM Performance on the ProverbIT Benchmark==
None