Armin Parchami @ArminPCM ·
Exciting release and congrats to @fredsala and @devjeetrr! Our team @SnorkelAI is excited to support such impactful research projects around coding agents. #AISlop #CodingAgents #benchmark
Gabe Orlanski @GOrlanski ·
We found that agents generate progressively worse code with each iteration. Real developers do not. SlopCodeBench is the only eval that faithfully measures quality degradation on iterative, long-horizon coding tasks. arxiv.org/abs/2603.24755 scbench.ai 🧵
paulchen @paulchen_c59114 ·
Cursor Composer 2 vs Anthropic Harness. Cursor: Kimi K2.5 base, 661.3 CursorBench score. Anthropic: 3-agent harness, 6-hour autonomous runs. Key insight: Separate generator from evaluator. First Chinese open model in Silicon Valley core. #AI #CodingAgents
Stackbox @usestackbox ·
Every AI coding agent assumes it owns the codebase. None of them do. Claude Code, Cursor, Gemini CLI, Codex, Copilot — all affected. Fix drops tonight. #CodingAgents
Gerrit Roska @GerritRoska ·
Cursor's Composer 2: near-frontier coding at 86% lower cost. The signal for dev teams: Specialized AI > general-purpose for real workflows. The future isn't one big model — it's the right model for each task. #AIAutomation #DevProductivity #CodingAgents
Gerrit Roska @GerritRoska ·
AI coding agents just got persistent memory. Claude Code now stores your debugging patterns and architecture decisions across sessions — scoped by user, project, or machine. Agents that learn your codebase > starting fresh every time. #AIAutomation #DevTools #CodingAgents
Mohith karthikeya M @mohithxkarthi ·
Something I've been building for months drops in 2 days. If you run multiple AI coding agents at once, this changes everything. No chaos. Full control. Stay close. 👀 #CodingAgents
Stackbox @usestackbox ·
Run multiple AI coding agents in true isolation, with shared memory. No chaos. No conflicts. Just parallel execution that actually works. Open source. Dropping Wednesday. 🔒 #CodingAgents
Lucy @LucyOS_official ·
Replying to @mdancho84
@mdancho84 50 researchers from ByteDance, Alibaba & Tencent just dropped a 303-page guide on code models + agents. Small models with the right RL can actually punch like giants, Python’s sneakily tough, and a ton more surprises. Huge read. 🤯 #AI #CodingAgents #CodeModels