Anthropic's Claude Sonnet 4.5 now scores 77% on a key software engineering benchmark and can work autonomously for over 30 ...
One of the hottest markets in the artificial intelligence industry is selling chatbots that write computer code.
Now, Claude Sonnet 4.5 has lapped that last model, outperforming it on the SWE-bench Verified evaluation, a human-filtered subset of SWE-bench. Claude Sonnet 4.5 also outperformed leading models ...
Anthropic has formally announced Claude Sonnet 4.5, a new AI model specifically made for coding. Anthropic didn’t mince any ...
NetEase-backed study shows language model agents may detect bugs faster and with greater coverage than existing tools.