Anthropic recently scored a huge win on OpenAI's turf by achieving SOTA on -their- SWE-Bench Verified benchmark, using an upgraded Claude 3.5 Sonnet. For the first time, they spill the beans.
The new Claude 3.5 Sonnet, Computer Use, and…
Anthropic recently scored a huge win on OpenAI's turf by achieving SOTA on -their- SWE-Bench Verified benchmark, using an upgraded Claude 3.5 Sonnet. For the first time, they spill the beans.