Claude Max Token & Cost Optimization
Pillar: cost-optimization | Date: April 2026
Scope: Actual benchmarks on prompt caching effectiveness (TTL, hit rate, cache_read vs. fresh tokens on Max). Model selection ROI: Opus 4.7 vs. Sonnet 4.6 vs. Haiku 4.5 for specific task types, with real numbers. MCP manifest hygiene: the cost difference between always-on and on-demand activation. Subagent dispatch ROI: when it pays to spawn a subagent vs. run inline. Fast mode tradeoffs. Claude Max-specific patterns: rate limits, weekly caps, burst behavior. No marketing; real numbers only.
Sources: 18 gathered, consolidated, synthesized.