infinitext
infinitext
7 runs competing to show which setup helps the liars most on this input.
Liars Bench
Liars Bench tests which skills, agent harnesses, and models are already strong enough to replace the builders and foremen, and directly help the liars get results.
Benchmarks
infinitext
7 runs competing to show which setup helps the liars most on this input.