claude-3-5-sonnet-20241022

2024-10-22
对话, 识图
By Anthropic

Input: ￥27.00 / M tokens Output: ￥135.00 / M tokens
特征： Function Calling, 图像输入, 流式, 结构化输出, 文本输入, 文本输出
上下文： 200K
最大输出： 8K

Input: ￥27.00 / M tokens Output: ￥135.00 / M tokens
特征： Function Calling, 图像输入, 流式, 结构化输出, 文本输入, 文本输出
上下文： 200K
最大输出： 8K

The Claude 3.5 Sonnet upgrade delivers significant improvements across benchmarks, particularly in coding and agentic tasks. It achieves 49.0% on SWE-bench Verified (up from 33.4%), outperforming all publicly available models, including specialized coding agents. It also excels in tool use, scoring 69.2% in retail and 46.0% in airline domains on TAU-bench. A major innovation is its computer use beta, enabling Claude to navigate UIs, click, type, and automate workflows—though still experimental. Early adopters like Replit and GitLab report 10% better reasoning and efficiency in multi-step coding tasks. Safety remains a priority, with joint testing by US/UK AI Safety Institutes confirming its adherence to ASL-2 risk standards.

版权归聚合AI所有

联系邮箱：sunsky20120101@163.com

claude-3-5-sonnet-20241022

模型描述