Exciting release and congrats to @fredsala and @devjeetrr!
Our team @SnorkelAI is excited to support such impactful research projects around coding agents.
#AISlop #CodingAgents #benchmark
We found that agents generate progressively worse code with each iteration. Real developers do not.
SlopCodeBench is the only eval that faithfully measures quality degradation on iterative, long-horizon coding tasks.
arxiv.org/abs/2603.24755
scbench.ai
🧵c
1
8
310



