brianletort.ai
All issues

The Model Pulse

Issue 08 · Week 24 of 2026.

/Weekly read/~6 min read/Public sources onlyDownload brief

The Big Read

A new closed frontier shipped and was switched off in the same week; the durable open progress was a diffusion efficiency path.

The thesis this issue defends

W24's model story had two halves. Anthropic shipped Claude Fable 5 — a new Mythos-class tier above Opus — and it debuted #1 on the independent Artificial Analysis Intelligence Index at 64.9, roughly five points ahead of GPT-5.5. Three days later (Jun 12), a U.S. government export-control directive forced Anthropic to disable Fable 5 and its safeguards-lifted Mythos 5 sibling for every customer, routing queries back to Opus 4.8. So the public closed-frontier leader for half a week is, at week's end, unavailable — the working default is again Opus 4.8. Meanwhile the open layer's real progress was efficiency, not a new crown: Google DeepMind released DiffusionGemma, an Apache-2.0 text-diffusion model that generates ~1,000+ tok/s on an H100, and Cohere shipped North Mini Code, a cheap self-host coding MoE; MiniMax-M3 was announced (AA Index 55) but its weights are still pending. Gemini 3.5 Pro remained not-GA. The procurement implication is sharper than another leaderboard reshuffle: a top-tier closed model can now be revoked by a third party, so production systems need a standardized eval harness and a hard fallback router, with cheap open/local models carrying routine work and the closed frontier reserved for high-value reasoning.

Tree delta

What changed in the tree.

3 models added, 0 updated.

Three W24 additions: Claude Fable 5 (new Mythos-class closed frontier), DiffusionGemma (open text-diffusion), and Cohere North Mini Code (efficient open coding MoE).

Added (3)

  • claude-fable-5
  • diffusiongemma
  • north-mini-code

Updated

None this period.

Mythos 5 is the same model as Fable 5 with cyber safeguards lifted (government-only) and is not a separate tree row. MiniMax-M3 is excluded until its weights are released, and Gemini 3.5 Pro is excluded until Google publishes a GA model card or API identifier.

Explore the LLM Evolutionary Tree

Frontier movements

Flagship-class releases.

3 releases this period.

Vendor-stated frontier capability. The releases that reset the closed-source ceiling.

  • /Anthropic/Frontier/Reasoning

    Claude Fable 5

    New Mythos-class tier above Opus; debuted #1 on the Artificial Analysis Intelligence Index at 64.9, priced $10/$50 per MTok

    Fable 5 is the new public top of the closed frontier on independent indices, but it is priced as an async heavy-lift tool (~2x Opus 4.8) and was free on paid tiers only through Jun 22 before reverting to usage credits. Architects should reserve it for high-value reasoning and codebase-scale orchestration, not default every agent call to it, and should re-benchmark on cost-per-task rather than headline scores.

    Anthropic launch; Artificial Analysis Intelligence Index

  • /Anthropic/Frontier/Reasoning

    Claude Mythos 5 (and the Jun 12 takedown)

    Same model as Fable 5 with safeguards lifted for a government program; both force-disabled globally by a U.S. export-control directive three days after launch

    This is the first known government-forced takedown of a deployed frontier model, and it reframes model selection: a top-tier closed model is now a sovereign/regulatory single point of failure that a third party can switch off. Architects and operators should build hard fallbacks (queries already revert to Opus 4.8) and avoid single-sourcing the frontier for any production-critical path.

    VentureBeat; MarkTechPost (Reuters/BBC/Axios cited secondhand)

  • /Google DeepMind/Frontier/Reasoning

    Gemini 3.5 Pro

    Still pending at W24 close: limited Vertex enterprise preview, no public API identifier, price row, or independent benchmark despite a Google-stated June target

    A frontier movement by absence for the second consecutive issue. Buyers should keep the launch on the June watchlist but not pause current baselines on an unreleased SKU; the likely enterprise pattern remains task routing (Gemini for huge-context/multimodal, Anthropic/OpenAI for coding and agentic reliability) rather than a universal replacement.

    TechTimes; Presenc launch brief

Open weights

Open-frontier and open-source drops.

3 releases this period.

Open-weights releases that change procurement options. Pull these into pilot when score parity meets license parity.

  • /Google DeepMind/Specialist/MoE

    DiffusionGemma

    Apache-2.0 open text-diffusion model: 26B / ~3.8B-active MoE on a Gemma 4 backbone, multimodal-in, 256K context, ~1,000+ tok/s on an H100 with native vLLM

    DiffusionGemma is the week's most architecturally significant open release: a credible open path to low-latency, local interactive inference via block diffusion rather than autoregressive decoding. Teams building latency-sensitive local agents should evaluate it as an efficiency option and watch whether the autoregressive-vs-diffusion split becomes a real procurement axis.

    Google blog; Hugging Face model card

  • /Cohere/Edge / small/MoE

    North Mini Code

    Small open-weight coding MoE (~30B / ~3B active) scoring 27.6 on the Artificial Analysis Intelligence Index

    North Mini Code is not frontier-competitive, but it is a cheap self-host candidate for routine code and text sub-agent calls. Teams trying to cut orchestration cost should treat it as a routing node for low-risk repeated work, reserving closed flagships for high-stakes reasoning.

    Artificial Analysis

  • /MiniMax/Open frontier/MoE

    MiniMax-M3 (announced; weights pending)

    Announced at 55 on the Artificial Analysis Intelligence Index — would be the leading open-weights model once weights release, but is not yet downloadable

    MiniMax-M3 is the strongest competitively-scoring open item of the window, but with weights pending it is a signal, not a deployable asset. Procurement teams tracking an open-frontier alternative should prepare an evaluation slot but not plan a migration until the weights and an independent benchmark land.

    Artificial Analysis

Architecture watch

Patterns to track.

3 patterns reshaping the canopy.

Architectural patterns that crossed multiple vendors this period. Each pattern lists exemplar releases and what it changes for deployment, cost, or capability.

  • Government as a model kill switch

    Claude Fable 5 takedownClaude Mythos 5 (Project Glasswing)

    The Jun 12 export-control directive that forced Anthropic to disable Fable 5 and Mythos 5 globally turns regulatory access into an architecture variable. A model that is best-in-class on Monday can be unavailable by Friday for reasons outside the vendor's control. Enterprise architecture should now treat top-tier closed models as revocable dependencies and design fallback routing and capability-degradation plans accordingly.

    VentureBeat; MarkTechPost

  • Text diffusion enters open weights

    DiffusionGemmaGemma 4 backbone

    DiffusionGemma brings block-diffusion text generation into the open-weight mainstream with native vLLM support and ~1,000+ tok/s throughput. The pattern matters because it offers a different latency/throughput profile than autoregressive decoding for interactive and local workloads. Architects evaluating local inference should add a diffusion option to their benchmark set rather than assuming autoregressive is the only path.

    Google blog; Hugging Face

  • Frontier shipping concentrated in one vendor

    Anthropic Fable 5Gemini 3.5 Pro pendingGPT-5.6 rumor only

    For the second straight issue, the only new frontier model came from Anthropic, while Google's Pro slipped and OpenAI's next model stayed a rumor. That concentration means the frontier's cadence — and now its availability — depends heavily on a single lab. Buyers should diversify model routing across vendors and keep a credible open/local tier so a single vendor's release or takedown cannot stall production.

    Anthropic launch; Artificial Analysis; TechTimes

Benchmark moves

Where the leaderboard moved.

3 benchmarks shifted.

Benchmark deltas that change a procurement read. Scores reflect public leaderboards or vendor model cards as of publication.

  • Artificial Analysis Intelligence Index

    Claude Fable 5 debuted #1 at 64.9, roughly five points ahead of GPT-5.5, and set the highest score on five of the ten underlying benchmarks

    • Claude Fable 564.9
    • GPT-5.5~60
    • Gemini 3.1 Pro~57

    Artificial Analysis

  • SWE-Bench Pro (coding)

    Fable 5's vendor-scaffold 80.3% widens the headline closed lead, but it is unreplicated by Scale SEAL/Epoch; independent vals.ai shows 95% on SWE-Bench Verified

    • Claude Fable 5 (vendor scaffold)80.3%
    • Claude Opus 4.869.2%
    • GPT-5.558.6%

    morphllm / vals.ai; vendor harness (contested)

  • Open-weight intelligence

    MiniMax-M3 (55, weights pending) would lead open weights once released; North Mini Code lands at 27.6 in the efficient tier

    • MiniMax-M3 (pending)55
    • North Mini Code27.6

    Artificial Analysis

Tier scorecard

Who leads, who pushes.

6 tiers · leaders as of Jun 13, 2026.

A snapshot of leader-vs-challenger by tier. Useful for procurement shortlists when matching workload to model class. Pair with the benchmark moves above for the underlying scores.

  • Closed frontier

    Leader: Claude Opus 4.8

    Challenger: Claude Fable 5 (suspended)

    Fable 5 debuted #1 (AA 64.9) but was force-disabled Jun 12; Opus 4.8 is the working default again until access is restored.

  • Open frontier

    Leader: DeepSeek V4-Pro

    Challenger: MiniMax-M3 (weights pending)

    No open frontier text model displaced the April leaders in-window; MiniMax-M3's 55 AA Index would contend once weights ship.

  • Reasoning

    Leader: Claude Opus 4.8

    Challenger: GPT-5.5

    Fable 5 set the reasoning bar before its takedown; with it suspended, Opus 4.8 leads the available closed reasoning tier.

  • Coding

    Leader: Claude Opus 4.8

    Challenger: GPT-5.5

    Fable 5's 80.3% SWE-Bench Pro is a contested vendor figure and the model is suspended; Opus 4.8 remains the dependable coding default.

  • Multimodal

    Leader: Gemini 3.1 Pro

    Challenger: DiffusionGemma

    Gemini remains the general multimodal reference; DiffusionGemma opens a low-latency open diffusion branch.

  • Edge / small

    Leader: Mellum2

    Challenger: North Mini Code

    Efficient open coding/sub-agent models keep widening; North Mini Code joins Mellum2 as a cheap self-host routing node.

Vendor signals

Pricing, gating, deprecation.

4 non-release signals worth tracking.

The non-release moves that shift vendor risk — pricing, deprecations, gating decisions, license changes — with a one-line procurement read.

  • /Anthropic

    A U.S. export-control directive forced Anthropic to disable Fable 5 and Mythos 5 for all customers; queries fall back to Opus 4.8

    Top-tier model availability is now a sovereign/regulatory variable, not just an uptime SLA. Buyers should require contractual clarity on access continuity and design fallback routing before standardizing on any single closed flagship.

    VentureBeat; MarkTechPost

  • /Anthropic

    Fable 5 priced at $10/$50 per MTok (~2x Opus 4.8); included on paid tiers only through Jun 22, then reverting to usage credits

    The top-tier price ceiling moved up and the free-access window is short. Operators should model the post-Jun 22 cost cliff and route only high-value work to Fable 5, keeping cheaper models for routine calls.

    Anthropic launch

  • /xAI

    Launched the Grok Build Plugin Marketplace bundling skills, MCP servers, and agents (commit-SHA-pinned), with partners MongoDB, Vercel, Sentry, Chrome DevTools, and Cloudflare

    xAI is competing on agent tooling and distribution rather than a new frontier model this window. Platform teams should weigh marketplace lock-in and pinning guarantees when evaluating agent ecosystems.

    x.ai

  • /OpenAI

    Acquired Ona (Gitpod) for cloud sandboxes that keep long-running agents alive, folding it into Codex (5M+ weekly users)

    Durable long-running-agent infrastructure is being absorbed into the model layer, narrowing 'agent persistence' as a standalone wedge. Architects building long-horizon agents should expect this capability to become a platform feature rather than a separate purchase.

    SiliconANGLE

Watchlist

On the radar next.

4 catalysts to watch, starting Jun 14-30.

Specific model-side catalysts in the next 7–30 days that would change the read materially. Watching these tells us whether the canopy is widening or thinning.

  • Jun 14-30

    Fable 5 / Mythos 5 access restoration

    Whether Anthropic restores a top-tier model — and with what geo-gating or KYC controls — sets the precedent for sovereign access risk and determines whether Fable 5 re-enters the closed-frontier scorecard.

  • Jun 14-30

    Gemini 3.5 Pro GA and first independent benchmark

    Google's Pro has slipped to the edge of its June commitment. A GA with an independent Artificial Analysis pass will show whether it lands above or below Fable 5 and GPT-5.5.

  • Jun-Jul

    MiniMax-M3 weights release

    If MiniMax ships the weights at its announced 55 AA Index, it becomes the leading open-weights model and a credible self-host alternative to closed frontiers.

  • Jun-Aug

    Independent SWE-Bench Pro replication of Fable 5

    Fable 5's 80.3% is a vendor-harness figure. An Epoch or Scale SEAL replication will confirm or deflate the coding-lead claim that buyers are being asked to act on.

Edits this issue

  • Added Claude Fable 5, DiffusionGemma, and North Mini Code to the LLM tree; reframed W24 around a frontier launch-and-takedown and an open text-diffusion efficiency path.

About The Model Pulse

A weekly read on the software side of the AI stack. Anchored to the LLM Evolutionary Tree, which the brief annotates each week. The cross-stack flywheel (capital, hardware, networking) is covered in The AI Stack Weekly.

Authorship and sources

Compiled from public model cards, vendor blogs, leaderboards, and official lab announcements. Written by Brian Letort. Independent analysis. Not investment guidance.

Operate. Publish. Teach.