brianletort.ai
All issues

The AI Stack Weekly

Issue 08 · Week 24 of 2026.

/Industry brief · ~7 min read/Public sources onlyDownload brief

The Bottom Line

The frontier became a sovereign and capital-markets object, not just a benchmark race.

Flywheel arcAll three lenses

W24 was the week the model layer stopped behaving like a product cycle and started behaving like critical infrastructure with a kill switch. Anthropic shipped Claude Fable 5, a new "Mythos-class" tier above Opus, and it debuted #1 on the independent Artificial Analysis Intelligence Index at 64.9 — then, three days later (Jun 12), a U.S. government export-control directive forced Anthropic to disable Fable 5 and Mythos 5 for every customer because it could not filter foreign-national access in real time; queries now fall back to Opus 4.8. That is the first known government-forced takedown of a deployed frontier model, and it converts "which model is best" into "which model can be switched off by a third party." The same week, the capital story went public: SpaceX, with xAI embedded, priced the largest IPO ever (~$75B raised) and closed near a $2.1T valuation, giving the market its first traded read on a frontier lab, while OpenAI's confidential S-1 (Jun 8) kept the next two labs at the pre-public stage. Underneath, demand concentrated visibly — Oracle booked an RPO of $638B (+363% YoY), TSMC printed a record month with CoWoS sold out through 2027, and d-Matrix put a merchant inference ASIC into volume production explicitly to route around the HBM chokepoint. Net/net: boards should treat top-tier closed models as single points of failure and mandate hard fallbacks; investors now have a public frontier-lab proxy to price against; architects should design multi-model routing and second-source compute; operators should assume the binding constraints are regulatory access, power, and packaging, not raw intelligence.

JevonsMetcalfeGilderSoftwareJevonsHardwareHuangNetworkingMetcalfe + Gilder

The three lenses

What moved this week, and what to do about it.

10 events across the flywheel — 4 software, 3 hardware, 3 networking.

Software.

  • Anthropic shipped Claude Fable 5 (and Mythos 5), a new Mythos-class tier above Opus, GA across the Claude API, Bedrock, Vertex AI, and Microsoft Foundry; it debuted #1 on the Artificial Analysis Intelligence Index at 64.9

    Anthropic launch; Artificial Analysis

  • A U.S. government export-control directive forced Anthropic to disable Claude Fable 5 and Mythos 5 globally for all customers; unable to filter foreign-national access in real time, Anthropic routed queries back to Opus 4.8 (still suspended at week's end)

    VentureBeat, MarkTechPost (Reuters/BBC/Axios cited secondhand)

  • Google DeepMind released DiffusionGemma, an Apache-2.0 open text-diffusion model (26B / ~3.8B active MoE, multimodal-in, 256K context) generating ~1,000+ tok/s on an H100 with native vLLM support

    Google blog; Hugging Face model card

  • Gemini 3.5 Pro remained not-GA at W24 close: still a limited Vertex enterprise preview with no public API identifier, price row, or independent benchmark, despite a Google-stated June target

    TechTimes; Presenc launch brief

What this means

The model layer's news was binary: a new closed leader arrived and then was switched off by a government within 72 hours, while the durable open progress was an efficiency path (text diffusion) rather than another frontier crown. Architects should standardize an evaluation harness and a fallback router now so a top-tier model can be slotted — or revoked — without freezing deployments; see The Model Pulse for the full architecture read.

Hardware.

  • Oracle reported Q4 FY26 with remaining performance obligations of $638B (+363% YoY, +$85B QoQ), OCI/IaaS +93% to $5.8B, ~1GW of new AI capacity targeted in Q1 FY27, and a ~$70B FY27 net capex guide

    Oracle Investor Relations; CNBC

  • TSMC reported record May revenue of NT$416.98B (+30.1% YoY), confirming sustained AI/HPC pull-through at the foundry chokepoint, with CoWoS advanced packaging sold out through 2027

    TSMC press release; SEC Form 6-K

  • d-Matrix moved its Corsair inference accelerator into volume production — a digital in-memory-compute ASIC (TSMC N6) that the company says runs inference ~10x faster and ~3x cheaper than a standalone GPU while explicitly architecting around the HBM/DRAM chokepoint

    CNBC; d-Matrix

What this means

Hardware's signal this week was demand and supply, not silicon launches: Oracle proves AI compute is being booked as long-duration backlog, TSMC proves it is foundry- and packaging-gated, and d-Matrix proves the inference layer is diversifying away from merchant GPUs and HBM. Investors with concentrated GPU exposure should track contracted-backlog conversion and ASIC ramps; operators should secure packaging-bound capacity early.

Networking.

  • Arista launched its first 1.6T rack-scale switch portfolio (7060XE7) built on Broadcom's Tomahawk 6 — a fresh Broadcom 1.6T production design win for AI back-end fabric

    Arista; Broadcom

  • Broadcom disclosed a scale-up Ethernet design win with inference-ASIC startup FuriosaAI, extending AI-Ethernet beyond hyperscaler back ends into merchant accelerator racks

    Broadcom; FuriosaAI

  • Marvell's pending S&P 500 inclusion (replacing Campbell Soup, ~late June) marked capital-markets validation of the custom-silicon and AI-networking designer thesis

    The Next Platform

What this means

1.6T fabric is now shipping in production, but the in-window proof points all ran through Broadcom — the marquee CPO/optics catalysts from Marvell, Credo, Coherent, and Lumentum fell just outside the week. Network architects should treat 1.6T Ethernet as deployable today while pressing for a second silicon source before committing a fabric roadmap to one vendor.

Capital flow

Money in, revenue out.

4 categories tracked. Capital deployment up in 3 of 4; revenue follows at multiples of 0.21 to 0.6.

The four-category scorecard. Where capital is going in, where revenue is coming out, and how much of it is real. The one chart for the boardroom.

  • Frontier Labs

    OpenAI, Anthropic, Google DeepMind, xAI

    Capital In

    ~$95B

    vs ~$90B

    Revenue Out

    ~$21B

    vs ~$20B

    Burn / Rev

    ~1.3x

    Movement

    The frontier-lab capital story went public: SpaceX, with xAI embedded, priced the largest IPO ever and closed near a $2.1T valuation, the first traded read on a frontier lab.

  • Hyperscaler-Hosted

    Azure-OpenAI, AWS-Anthropic, Google Cloud-Gemini, Oracle-OCI

    Capital In

    ~$185B

    vs ~$181B

    Revenue Out

    ~$62B

    vs ~$60B

    Burn / Rev

    ~0.3x

    Movement

    Oracle's $638B RPO and ~$70B FY27 capex guide made hosted AI demand visible as contracted backlog, but negative free cash flow sharpened the can-it-be-funded debate.

  • Neoclouds

    CoreWeave, Nscale, Crusoe, Lambda, Fluidstack, IREN

    Capital In

    ~$12B

    vs ~$12B

    Revenue Out

    ~$5B

    vs ~$5B

    Burn / Rev

    ~3x

    Movement

    CoreWeave priced $3.25B of senior notes at 8.5-9.625%, putting a hard number on the neocloud cost of unsecured capital.

  • On-Prem / Hybrid

    Enterprise GPU clusters, sovereign and national programs, Cisco / Dell / HPE

    Capital In

    ~$93B

    vs ~$91B

    Revenue Out

    ~$36B

    vs ~$35B

    Burn / Rev

    ~2x

    Movement

    Sovereign and power-first capacity advanced: NAVER and NVIDIA committed to a gigawatt-scale Korean AI factory, and Stark Power moved to acquire a 5.6GW gas-paired data-center pipeline.

Burn-to-Revenue is revenue divided by committed capital. Lower means more capital is going out than coming in.

Signal vs noise

What’s real, what’s noise.

5 claims this week — 3 signal, 2 noise.

Each claim is scored 1–5 on source quality and triangulation. Anything 2 or below is flagged as noise. Where consensus is wrong, we say so.

  • 5 / 5

    Anthropic shipped Claude Fable 5, a new Mythos-class tier, and it debuted #1 on the independent Artificial Analysis Intelligence Index at 64.9.

    Sources: Anthropic launch announcement; Artificial Analysis (independent pre-release access); arena.ai/LMArena board adds. Caveat: vendor pricing, independent index.

    SIGNAL. A genuine new top of the closed frontier — but priced as an async heavy-lift tool ($10/$50 per MTok, ~2x Opus 4.8). Architects should route Fable 5 to high-value reasoning, not default every call to it.

  • 4 / 5

    A U.S. government export-control directive forced Anthropic to disable Fable 5 and Mythos 5 for all customers within three days of launch.

    Sources: VentureBeat and MarkTechPost reporting; Reuters/BBC/Time/Axios cited secondhand. Caveat: the directive letter itself is not public.

    SIGNAL. The first known government-forced takedown of a deployed frontier model. This is a concrete sovereign/regulatory single point of failure — boards should mandate hard model fallbacks and avoid single-sourcing the top tier.

  • 5 / 5

    SpaceX, with xAI embedded, priced the largest IPO ever and closed near a $2.1T valuation, putting a frontier lab inside a public security.

    Sources: SEC registration; CNBC live IPO coverage. Multi-source, audited filing.

    SIGNAL. Investors now have a public-market frontier-lab proxy (SPCX) that OpenAI and Anthropic roadshows will be measured against. The private-mark-only era for frontier labs is ending.

  • 2 / 5 — noise

    Claude Fable 5 is the definitive coding leader at 80.3% on SWE-Bench Pro.

    Sources: Vendor-scaffold figure; independent vals.ai shows 95% on SWE-Bench Verified, but Scale SEAL has no Fable entry and an Epoch replication is pending.

    NOISE until replicated. The AA Index lead is real and independent; the headline SWE-Bench Pro number is a vendor-harness result. Do not anchor a migration decision on it before third-party replication.

  • 1 / 5 — noise

    GPT-5.6 is imminent and will reset the frontier this month.

    Sources: Backend codenames and prediction-market odds only; no system card, model page, or API contract exists.

    NOISE. A rumor with no primary artifact. Keep current GPT-5.5 baselines and do not delay procurement on an unreleased SKU.

Early warning panel

The levers we monitor.

10 metrics tracked — 5 rising, 0 falling, 5 steady.

Current vs prior period. Each metric has a threshold where the read materially changes — this panel flags the inflection before it lands in headlines. Click any metric for the methodology and this-week read.

  • Frontier lab cash position (avg months runway, top 3)

    ~34-37 mo (xAI now public via SpaceX IPO)vs ~33-36 mo

    Threshold: <18 mo triggers re-rating risk

    What this measures

    Top 3 frontier labs (OpenAI, Anthropic, Google DeepMind) by disclosed runway, with xAI now a public-market entity inside SpaceX (SPCX). The SpaceX/xAI IPO and OpenAI's confidential S-1 extend the category's access to capital. Boards should not assume frontier-lab funding pressure as a forcing function for short-term commercial concessions — capital access just widened.

  • Hyperscaler capex / AI revenue ratio (top 4 weighted)

    ~5.0-5.3 (Oracle FY27 ~$70B guide; top-4 flat)vs ~5.0-5.2

    Threshold: >6.0 invites investor pushback at next earnings

    What this measures

    Top 4 hyperscalers (MSFT, GOOG, META, AMZN) weighted aggregate of total capex divided by AI-attributable revenue. No new top-4 prints in-window, but Oracle's ~$70B FY27 net capex guide and -$24B FY26 free cash flow push the broader hosted-capex ratio up and re-open the funding-vs-demand debate. Investors should keep the bubble hypothesis on funding/conversion, not demand.

  • CoreWeave revenue backlog

    $99.4B (next print Q2, early Aug)vs $99.4B

    Threshold: Conversion velocity matters more than gross figure

    What this measures

    Booked but unrecognized revenue. The $99.4B audited figure (Mar 31) is unchanged; the in-window event was financing, not backlog — CoreWeave priced $3.25B of senior notes at 8.5-9.625%. Operators should watch the Q2 backlog print in early August and the cost-of-capital spread versus GPU-backed facilities.

  • NVIDIA Q-over-Q data center revenue

    $75.2B (Q1 FY27); Q2 guide $91B, reports Aug 26vs $75.2B (Q1 FY27); Rubin production ramp confirmed

    Threshold: Q2 FY27 guide $91B implies further +21% QoQ

    What this measures

    Q1 FY27 Data Center revenue of $75.2B was reported May 20; the Q2 guide is $91B and the next earnings print is Aug 26. No within-window change. Adjacent demand (Oracle RPO $638B, TSMC record month, CoWoS sold out through 2027) supports the guide; packaging and power remain the binding constraints, not demand.

  • Open vs closed gap on SWE-Bench Pro (coding)

    Closed lead widened on paper (Fable 5 80.3% vendor scaffold) vs open ~58.6%; contestedvs Closed +~19pp (no new Pro challenger yet)

    Threshold: Sustained open lead reshapes enterprise procurement

    What this measures

    Claude Fable 5's vendor-scaffold SWE-Bench Pro figure (80.3%) widens the headline closed-over-open gap, but it is not yet replicated by Scale SEAL or Epoch; independent vals.ai shows 95% on SWE-Bench Verified. The strongest open item, MiniMax-M3 (AA Index 55), has not released weights. Architects should treat the widened gap as provisional and keep open self-host options in pilot.

  • Sovereign AI commitments (count / aggregate $)

    ~14 / ~$180B+ (NAVER+NVIDIA 1GW added)vs ~13 / ~$160B+; power-first gating rising

    What this measures

    Analyst-curated count of sovereign/national AI-compute commitments. NAVER + NVIDIA's gigawatt-scale Korean AI factory (roadmap to 1GW; ~$18B revenue goal by 2030) was added in-window. Operators with APAC workloads should treat Korea as a credible sovereign landing zone while pricing in multi-year build timelines.

  • PJM 2026/27 capacity auction price ($/MW-day)

    $329.17 (2028/29 BRA clears Jun 30-Jul 7)vs $329.17

    Threshold: 11x in 24 months — power is the new binding constraint

    What this measures

    The 2026/27 BRA cleared at the FERC cap ($329.17) and took effect June 1, 2026; the 2027/28 BRA cleared at $333.44. The next signal is the 2028/29 BRA, scheduled Jun 30-Jul 7, 2026, with a FERC collar capping it near $325. Architects should not assume near-term price relief; budget capacity at-cap through 2028.

  • Time-to-power, busiest US markets (months)

    60-84 (new PJM); 2027/28 auction cleared short of targetvs 60-84 (new PJM); power-first campuses rising

    What this measures

    Months from new-load interconnection request to energization. PJM's 2027/28 base auction cleared ~6,600MW short of its reliability requirement — the first RTO-wide shortfall, with a large share driven by data-center load. Architects should pre-commit power and long-lead grid equipment before pre-committing GPU SKUs.

  • Cost-per-task, frontier reasoning model

    ~$0.10-$0.15 (effective); Fable 5 sets a 2x-Opus top tier ($10/$50 per MTok)vs ~$0.10-$0.15 (effective; unchanged)

    Fable 5 raises the top-tier list price ~2x; effective per-task cost depends on routing

    What this measures

    Median cost across frontier-tier reasoning models for a benchmark complex task. Claude Fable 5 lists at $10 in / $50 out per MTok (~2x Opus 4.8), pushing the ceiling up, but effective cost-per-task depends on how much work is routed to the top tier versus cheaper models. Operators running agents at scale should re-benchmark on cost-per-task, not list price.

  • Custom silicon share of incremental AI compute

    ~33-36%; d-Matrix Corsair inference ASIC in volume productionvs ~33-36%; Broadcom AI revenue +143% YoY

    Threshold: >35% materially compresses merchant GPU pricing

    What this measures

    d-Matrix's Corsair inference ASIC entered volume production in-window — a concrete merchant non-GPU data point, not a projection — and Marvell's pending S&P 500 inclusion validates the custom-silicon designer thesis. Investors with concentrated NVIDIA exposure should diversify into ASIC co-design (Broadcom, Marvell) and inference-specialist silicon, and into advanced packaging / power.

Predictions

What we expect next.

6 predictions for the next 30-90 days, confidence 58%-65%.

Each prediction is falsifiable, time-bounded, and tied to a specific signal we will watch. Future issues score these hit, miss, partial, or pending and build a public track record.

Prediction 01

58%

confidence

Software

Gemini 3.5 Pro reaches public GA by June 30, 2026 but debuts below Claude Fable 5 on the Artificial Analysis Intelligence Index in its first independent pass.

Deadline: By June 30, 2026

Trigger: Google AI Studio / Vertex GA announcement plus Artificial Analysis leaderboard update.

Prediction 02

60%

confidence

Software

Anthropic restores public (non-government) access to a Fable-5-class model — with geo-gating or KYC controls — by July 31, 2026.

Deadline: By July 31, 2026

Trigger: Anthropic status page, model documentation, or press confirming restored availability.

Prediction 03

65%

confidence

Hardware

At least one more merchant inference-ASIC vendor (Groq, Cerebras, Tenstorrent, or d-Matrix) announces a volume-production or named-hyperscaler milestone before August 31, 2026.

Deadline: By August 31, 2026

Trigger: Vendor press release, earnings disclosure, or customer design-win announcement.

Prediction 04

60%

confidence

Networking

A second of (Marvell, Credo) cites a 1.6T or co-packaged-optics production design win by August 31, 2026, joining Broadcom's Tomahawk 6.

Deadline: By August 31, 2026

Trigger: Earnings call, product release, or customer design-win disclosure.

Prediction 05

62%

confidence

Capital

Neither OpenAI nor Anthropic converts its confidential draft to a publicly visible S-1 on SEC EDGAR before August 31, 2026.

Deadline: By August 31, 2026

Trigger: SEC EDGAR public filings; Reuters / Bloomberg / The Information confirmation of public S-1 vs confidential DRS.

Prediction 06

60%

confidence

Power

A hyperscaler announces a >500MW power-first AI campus or behind-the-meter generation deal by September 30, 2026.

Deadline: By September 30, 2026

Trigger: Hyperscaler energy/data-center announcement; utility or developer disclosure.

Track record

Scoring prior predictions.

5 prior predictions: 0 hit, 0 miss, 1 partial, 4 pending. Hit rate 0%.

5 predictions across issues so far. Hit rate: 0%. Hits 0, misses 0, partials 1, pending 4.

Prediction 01

60%

confidence

Software

Gemini 3.5 Pro reaches public GA by June 30, 2026, but does not exceed Claude Opus 4.8 on SWE-Bench Pro in its first independent Artificial Analysis run.

Deadline: By June 30, 2026

Trigger: Google AI Studio / Gemini API changelog plus Artificial Analysis leaderboard update.

pendingAs of W24 close (Jun 13), Gemini 3.5 Pro is still not GA — limited Vertex preview, no public benchmark. Deadline June 30 not reached; the GA leg is now at risk and the bar has moved (Fable 5, not Opus 4.8, is the new top).

Prediction 02

70%

confidence

Hardware

At least one major OEM announces customer shipment or formal order availability for Vera Rubin NVL72-class systems before September 30, 2026.

Deadline: By September 30, 2026

Trigger: Dell, HPE, Lenovo, Supermicro, or NVIDIA customer-shipment announcement.

pendingTracking ahead: the first operational Vera Rubin rack (Dell PowerEdge XE9812 -> CoreWeave) shipped Jun 1, with standalone Vera CPU order availability opening ~August and full systems at fall GA. Deadline Sep 30 not reached.

Prediction 03

65%

confidence

Hardware

Before August 31, 2026, at least one memory supplier or supply-chain analyst reports HBM4 allocation tightness despite three-supplier qualification.

Deadline: By August 31, 2026

Trigger: SK hynix, Samsung, Micron, TrendForce, or Bloomberg/Reuters supply-chain reporting.

pendingNo new in-window allocation print, but Samsung's reported Gwangju advanced-packaging review and pending 2027 HBM4 price hikes reinforce the thesis. Deadline Aug 31 open.

Prediction 04

65%

confidence

Networking

Broadcom, Marvell, or NVIDIA announces a new CPO/1.6T production design win or revenue guide uplift tied to AI networking before August 31, 2026.

Deadline: By August 31, 2026

Trigger: Earnings call, product release, or customer design-win disclosure.

partialArista's 1.6T 7060XE7 portfolio on Broadcom's Tomahawk 6 (Jun 9) is a fresh Broadcom 1.6T production design win, satisfying the 1.6T leg; no co-packaged-optics production win or vendor revenue-guide uplift yet. Tracking to a full hit by deadline.

Prediction 05

60%

confidence

Power

A hyperscaler announces another >500MW power-first AI campus or behind-the-meter generation deal by September 30, 2026.

Deadline: By September 30, 2026

Trigger: Hyperscaler energy/data-center announcement; utility or developer disclosure.

pendingIn-window power-first activity (Stark/Sagebrush 5.6GW gas-paired pipeline; NAVER+NVIDIA 1GW sovereign factory) was not a hyperscaler behind-the-meter deal. Deadline Sep 30 open.

Watchlist

On the radar this week.

5 catalysts to watch, starting Jun 14-30.

Specific catalysts that would change the read materially. Watching these tells us whether the thesis is strengthening or weakening.

  • Jun 14-30

    Anthropic Fable 5 / Mythos 5 access restoration

    Whether and how Anthropic restores a top-tier model after the government takedown will set the template for sovereign access risk. Watch for geo-gating, KYC, or a permanent capability split between civilian and government tiers.

  • Jun 14-30

    Gemini 3.5 Pro GA and first independent benchmark

    Google's Pro has slipped to the edge of its June commitment. A GA with an independent Artificial Analysis pass would show whether June is a real frontier reset or a third-place launch behind Fable 5 and GPT-5.5.

  • Jun 30-Jul 7

    PJM 2028/29 base residual capacity auction

    After two near-cap clears and a 2027/28 reliability shortfall, the 2028/29 auction (collar near $325/MW-day) will show whether power price pressure persists. It is the single most important power-cost signal of the quarter.

  • Jun-Aug

    Second 1.6T / CPO design win and inference-ASIC adoption

    The networking and custom-silicon theses need a second source. Marvell, Credo, Groq, Cerebras, or Tenstorrent commentary will show whether 1.6T fabric and merchant inference ASICs broaden beyond Broadcom and d-Matrix.

  • Jun-Jul

    SPCX as frontier-lab proxy; OpenAI/Anthropic public S-1

    SpaceX/xAI's post-IPO trading is now the public benchmark for frontier-lab value. A move to a publicly visible S-1 by OpenAI or Anthropic would be the next regime change for AI capital markets.

Companion reads

The rest of the spine.

The AI Stack Weekly is the cross-stack flywheel read. Pair it with the model-and-tree spine and the working framework to get the full picture.

Edits this issue

  • Reframed W24 around concentration and sovereign/regulatory risk at the model layer: Claude Fable 5's launch and the first government-forced frontier takedown, the SpaceX/xAI IPO, and Oracle's $638B RPO.

About this brief

Compiled from public announcements, SEC filings, earnings transcripts, and official lab and vendor publications. Every quantitative claim is graded 1–5 on source quality. Claims graded 2 or below are flagged as noise. The thesis the brief defends is published separately and updated only when a hypothesis materially changes.

Authorship

Written by Brian Letort. Independent analysis. All sources cited are public. Not investment guidance.

Operate. Publish. Teach.