Robuta

https://programbench.com/task/rbakbashev__elfcat.52f8cc7/ rbakbashev/elfcat — ProgramBench ProgramBench evaluates whether language models can rebuild programs from scratch. elfcatprogrambench