Software.
Anthropic released Claude Opus 4.8 (claude-opus-4-8) at flat pricing ($5/$25 per Mtok, fast mode ~3x cheaper) plus a Dynamic Workflows research preview; retakes the Artificial Analysis Intelligence Index lead at 61.4 (vs GPT-5.5 60.2) and SWE-Bench Pro 69.2%, though it loses Terminal-Bench 2.1 to GPT-5.5
anthropic.com/news/claude-opus-4-8, artificialanalysis.ai, TechCrunch
Liquid AI shipped open-weight LFM2.5-8B-A1B, an on-device MoE (8.3B total / 1.5B active, 128K context) decoding ~253 tok/s on a laptop with day-one llama.cpp / MLX / vLLM / SGLang support
liquid.ai/blog/lfm2-5-8b-a1b, MarkTechPost
StepFun released open-weight Step 3.7 Flash (198B sparse MoE, ~11B active, 256K context) under Apache 2.0 at ~400 tok/s, claiming SWE-PRO 56.3 and 98%+ on tau-squared-bench, positioned for agentic / coding / search workloads
StepFun HuggingFace model card, Digg
A Financial Times report showed the open-source 'Heretic' abliteration tool (~17.8K stars) stripping safety guardrails from open-weight models such as Llama 3.3 and Gemma 4 within minutes, spotlighting the open-weight safety floor ahead of the EU AI Office's Aug 2 Article 92 evaluation powers
Financial Times (via Singularity.Kiwi), Lawfare
What this means
Two arcs run in parallel: (1) a software->hardware Jevons signal — the closed frontier pushed the ceiling (Opus 4.8) while a same-day open-weight efficiency wave (Liquid on-device 128K at 253 tok/s, StepFun 400 tok/s under Apache 2.0) pushed cheap-and-good inference outward, so architects should re-baseline cost-per-task now rather than wait for price cuts; (2) capability-risk procurement hardens as trivially abliterated open weights lower the deployed safety floor just as the EU AI Office's Article 92 access powers arrive Aug 2 — boards should treat open-weight governance as a vendor-risk dimension, not a compliance afterthought.