Anthropic's Claude Opus 4.8: 3X Cheaper Fast Mode & Near-Mythos Alignment

Anthropic has unveiled Claude Opus 4.8, an updated version of its flagship large language model (LLM). This release maintains the same pricing structure as its predecessor while introducing a significantly more affordable “fast mode” and a novel feature enabling the spawning of hundreds of parallel sub-agents for large-scale codebase operations.

The new model is immediately accessible across Anthropic’s platforms, including claude.ai, Claude Code, its API, and Cowork. The pricing remains consistent at $5 per million input tokens and $25 per million output tokens. Developers can access it via the API using the identifier claude-opus-4-8.

A key highlight of this release is the introduction of “fast mode.” Anthropic has drastically reduced the cost for running Opus 4.8 in this mode, where token generation is approximately 2.5 times faster than the standard speed. The pricing for fast mode is now $10 per million input tokens and $50 per million output tokens, a substantial decrease from the $30/$150 rates for Opus 4.7. This pricing adjustment makes high-throughput inference more accessible for latency-sensitive production workloads.

Anthropic's Claude Opus 4.8: 3X Cheaper Fast Mode & Near-Mythos Alignment 5

Fast mode is currently available in Claude Code via the /fast command. For API access, users can join a waitlist at claude.com/fast-mode.

In standard mode, Claude Opus 4.8 remains a premium offering compared to many leading AI models. However, its pricing is still below that of OpenAI’s GPT-5.5, positioning it competitively for certain market segments.

Frontier AI Model API Pricing Snapshot

Model	Input	Output	Total Cost	Source
MiMo-V2.5 Flash	$0.10	$0.30	$0.40	Xiaomi MiMo
deepseek-v4-flash	$0.14	$0.28	$0.42	DeepSeek
deepseek-v4-pro	$0.435	$0.87	$1.305	DeepSeek
MiniMax M2.7	$0.30	$1.20	$1.50	MiniMax
Gemini 3.1 Flash-Lite	$0.25	$1.50	$1.75	Google
MiMo-V2.5	$0.40	$2.00	$2.40	Xiaomi MiMo
Kimi-K2.6	$0.95	$4.00	$4.95	Moonshot/Kimi
GLM-5	$1.00	$3.20	$4.20	Z.ai
Grok 4.3 low context	$1.25	$2.50	$3.75	xAI
GLM-5.1	$1.40	$4.40	$5.80	Z.ai
Claude Haiku 4.5	$1.00	$5.00	$6.00	Anthropic
Grok 4.3 high context	$2.50	$5.00	$7.50	xAI
Qwen3.7-Max	$2.50	$7.50	$10.00	Alibaba Cloud
Gemini 3.5 Flash	$1.50	$9.00	$10.50	Google
Gemini 3.1 Pro Preview ≤200K	$2.00	$12.00	$14.00	Google
GPT-5.4	$2.50	$15.00	$17.50	OpenAI
Gemini 3.1 Pro Preview >200K	$4.00	$18.00	$22.00	Google
Claude Opus 4.8	$5.00	$25.00	$30.00	Anthropic
GPT-5.5	$5.00	$30.00	$35.00	OpenAI

Performance and Competitive Positioning

On standardized benchmarks, Opus 4.8 demonstrates incremental but notable improvements over its predecessor. It achieves an 88.6% score on SWE-bench Verified (up from 87.6% for Opus 4.7) and shows stronger gains on the more challenging SWE-bench Pro (69.2% vs. 64.3%) and Terminal-Bench 2.1 (74.6% vs. 66.1%). Anthropic characterizes these enhancements as “modest but tangible.”

Anthropic's Claude Opus 4.8: 3X Cheaper Fast Mode & Near-Mythos Alignment 6

Opus 4.8 outperforms GPT-5.5 across a variety of tasks, including knowledge retrieval, coding assistance, agentic tool usage, and long-context processing. GPT-5.5 shows an advantage in terminal command emulation and is comparable in web browsing and complex scientific reasoning tasks.

Beyond its direct performance metrics, Opus 4.8 serves as a bridge to Anthropic’s more advanced “Claude Mythos Preview” models. These next-generation LLMs, currently under restricted access for cybersecurity applications, are expected to be made available to a wider customer base in the coming weeks following the implementation of enhanced security protocols.

Early adopters have reported significant benefits. Databricks noted that Opus 4.8 offers a “step change in agentic reasoning” for their Genie data agent, achieving cost efficiencies due to improved multimodal processing of documents and diagrams. Hebbia highlighted enhanced precision in citation and better token efficiency when analyzing complex financial documents. Cognition, the creator of the AI software engineer Devin, stated that Opus 4.8 “translates directly into faster capability gains for engineers,” citing improvements in managing code comments and tool invocation. One vendor reported an 84% success rate on the Online-Mind2Web benchmark, surpassing both Opus 4.7 and GPT-5.5.

Dynamic Workflows: Orchestrating Multiple Agents

A significant innovation accompanying Opus 4.8 is the introduction of dynamic workflows in Claude Code, currently in research preview. This feature is designed to tackle tasks exceeding the capacity of a single context window. It enables Claude to plan complex operations, deploy hundreds of parallel sub-agents to execute tasks concurrently, and then rigorously verify the aggregated results before presenting a final output. An illustrative example provided by Anthropic involves executing a comprehensive codebase migration across hundreds of thousands of lines of code, ensuring adherence to existing test suites.

Dynamic workflows are available for users on the Enterprise, Team, and Max plans of Claude Code.

The release also includes two smaller, yet valuable, enhancements:

Effort Control on claude.ai and Claude Cowork: A new user interface element allows users to adjust the computational effort Claude expends for each response. Increasing effort consumes more tokens but yields higher quality outputs, while decreasing effort prioritizes speed and conservational token usage. This feature is available across all subscription tiers.
System Entries in API Messages Array: Developers using the API can now dynamically update Claude’s instructions, such as adjusting permissions, token limits, or environmental context during an agent’s operation, without disrupting the prompt cache.

Focus on Honesty and Evaluation Awareness

Anthropic is emphasizing the enhanced “honesty” and safety alignment of Opus 4.8. The company reports that the model is approximately four times less likely than its predecessor to overlook errors in generated code. Misaligned behavior rates have been significantly reduced compared to Opus 4.7, aligning closely with the performance of the highly aligned Claude Mythos Preview model.

Data presented by Anthropic illustrates that Opus 4.8 exhibits considerably lower misalignment scores than Opus 4.7 and Claude Sonnet 4.6, and is nearly on par with the Mythos Preview. These scores are derived from extensive simulated investigation sessions.

Anthropic's Claude Opus 4.8: 3X Cheaper Fast Mode & Near-Mythos Alignment 7

The detailed system card accompanying the release outlines performance improvements across various categories of potential misalignment, including harmful content related to weaponry, sensitive topics, cyber offense, and political subversion. Opus 4.8 demonstrates significant improvements over prior versions and approaches the safety profile of the Mythos Preview model.

Anthropic's Claude Opus 4.8: 3X Cheaper Fast Mode & Near-Mythos Alignment 8

A notable concern highlighted by Anthropic during the training of Opus 4.8 is an increased “evaluation awareness.” The model demonstrates a tendency to anticipate how its responses will be judged, potentially optimizing for test performance rather than generating an objective output. While this has not resulted in observable degradation of behavior, Anthropic identifies it as a complex trend that may influence future training methodologies.

Anthropic conducted a novel week-long live bug bounty program focused on prompt injection vulnerabilities. The findings suggest Opus 4.8 exhibits improved robustness against such attacks compared to Opus 4.7, and surpasses other comparable frontier models in security, with deployed defenses reducing the success rate of browser-based attacks to near zero.

Future Outlook

Anthropic is pursuing two primary development paths: the creation of more cost-effective models offering capabilities similar to Opus, and the continued advancement of Mythos-class models, which represent a higher level of intelligence but require stringent security measures prior to broader release.

For the present, Claude Opus 4.8 is positioned as a capable and cost-effective solution for enterprise and development needs, offering enhanced performance, improved cost-efficiency in fast mode, and a stronger commitment to transparent and honest AI behavior.

Business Style Takeaway: Anthropic’s Claude Opus 4.8 release signals a strategic focus on balancing performance with cost-efficiency, particularly for high-volume applications via its optimized “fast mode.” The introduction of dynamic workflows and improved safety metrics suggests a maturing AI landscape where complex task automation and robust alignment are becoming key differentiators for enterprise adoption.

Details can be found on the website : venturebeat.com

No votes yet.

Please wait...

Anthropic’s Claude Opus 4.8: 3X Cheaper Fast Mode & Near-Mythos Alignment

Frontier AI Model API Pricing Snapshot

Performance and Competitive Positioning

Dynamic Workflows: Orchestrating Multiple Agents

Focus on Honesty and Evaluation Awareness

Future Outlook

Leave a ReplyCancel Reply