DeepSWE: A contamination-free benchmark for long-horizon coding agents

Hacker News (score: 21)
Found: May 26, 2026
ID: 4761

Description

Other
DeepSWE: A contamination-free benchmark for long-horizon coding agents

More from Hacker

No other tools from this source yet.