June 11, 2026 · 7 min read

OpenAI and Anthropic Both Filed S-1s: What CTOs Must Do

OpenAI confirmed on June 8, 2026 that it submitted a confidential S-1 to the SEC. Anthropic had already done the same a week earlier. For the first time in the industry's short history, you are building on top of two vendors who are simultaneously sprinting toward a public listing — and one of them is projecting a $14 billion loss in 2026 while telling investors it won't be profitable until 2029. That combination — massive scale, massive losses, public shareholders arriving — changes the rules of the game you've been playing.

This isn't an investment piece. I'm not going to tell you whether to buy either stock. What I will tell you is that the IPO filings are the most important architecture signal of 2026, and most engineering teams are going to miss it entirely because it looks like a finance story.

The Subsidized Pricing Window Is Closing — Model That Now

Here's the uncomfortable truth: the API pricing you've been building your unit economics around was never designed to be permanent. It was designed to win market share. Both OpenAI and Anthropic have been advised that early mover advantage is at stake — the first to list sets the terms for how investors categorize the AI sector. Public-market investors don't fund indefinite below-cost pricing; they fund paths to margin.

OpenAI reported $20 billion in ARR for 2025 and is still projecting a nine-figure annual loss. Anthropic is filing at a reported $965 billion valuation. Neither of those numbers lives comfortably alongside API pricing that stays flat or keeps declining. When quarterly earnings calls begin — and they will, once these companies are public — every analyst will be asking about gross margin on inference. The answer to that question will move prices.

The decision rule here is simple: any product whose margin depends on current LLM API pricing is exposed. Run the scenario now: if input/output token costs double in 18 months, what happens to your unit economics? If the answer is "we break even or go negative," that is a week-one engineering and commercial problem, not a future roadmap item.

Three Structural Risks That Compound After an IPO

IPO filings change vendor behavior in predictable ways. I've watched this pattern play out across enterprise software repeatedly. Here's what degrades, specifically:

1. Pricing stability disappears. Pre-IPO, both companies can absorb below-cost API pricing as a customer acquisition cost. Post-IPO, every pricing decision gets scrutinized against margin targets in the quarterly 10-Q. Expect tiered pricing, volume commitment requirements, and the slow sunset of the most generous rate tiers. This isn't speculation — it's how every SaaS company that went public behaved after their S-1 landed.

2. Product roadmap priorities shift toward enterprise revenue. Features that enterprise customers pay for (audit logs, compliance tooling, SLAs with teeth, dedicated capacity) get prioritized over features that developer communities love (cheap, fast, experimental endpoints). If your architecture depends on experimental APIs or beta features, that risk just increased materially.

3. Vendor lock-in leverage inverts. Right now, OpenAI and Anthropic need your usage numbers for their S-1 narratives. After IPO, the power dynamic shifts — they have shareholders, you have contracts. The negotiating window for favorable enterprise terms, rate locks, and committed use agreements is right now, before the prospectus is public and before bankers set the floor price.

What Abstraction Layer You Actually Need

The standard advice is "build an abstraction layer so you can swap models." That advice is correct but almost always implemented too shallowly. Teams build a thin wrapper that routes requests to one provider and call it done. That doesn't protect you from the risks above.

A real multi-vendor abstraction has four components:

python
# Minimal viable abstraction — what most teams build (insufficient)
def call_llm(prompt: str, model: str) -> str:
    if model.startswith("gpt"):
        return openai_client.chat(prompt)
    elif model.startswith("claude"):
        return anthropic_client.messages(prompt)

# What you actually need: provider-agnostic interface with
# cost tracking, fallback routing, and latency SLOs baked in
class LLMGateway:
    def complete(
        self,
        prompt: str,
        max_cost_per_1k_tokens: float,  # enforced budget ceiling
        latency_slo_ms: int,            # route to faster provider if needed
        fallback_chain: list[str],      # ordered list of providers
        ) -> LLMResponse:               # normalized response + cost metadata
        ...

The four components that matter:

Normalized cost metadata on every call — you cannot fix what you cannot measure
Fallback routing that activates on latency spikes, errors, or cost threshold breaches
Provider-agnostic prompt testing — your evals must run against any model in the chain
Contract-aware routing — route to the provider you have committed volume with first, to hit contractual minimums before overflow goes elsewhere

Building this gateway properly takes days with current tooling, not months. The blocker is rarely engineering hours — it's getting alignment on which providers belong in the fallback chain and what the cost ceiling policy is. That's a CTO decision, not an engineering one.

The Open-Source Hedge Is Real, But Has a Cost

Every time I make this argument, someone in the room says "we'll just self-host Llama or Mistral." That's a legitimate hedge, but be clear-eyed about what you're buying.

Imagine a team that moves a mid-complexity classification and summarization pipeline from GPT-4o to a self-hosted open-weight model. The API cost exposure disappears. What appears in its place: GPU infrastructure cost and operational overhead, a model evaluation cycle every time a new open-weight release lands, and a fine-tuning pipeline if the base model quality gap matters for your use case. For some tasks — especially high-volume, well-scoped ones — that tradeoff is excellent. For tasks requiring frontier reasoning or broad world knowledge, the quality gap is real and degrades outcomes in ways that are hard to measure until they surface in production.

The framework I use:

Task type	Open-weight viable?	Notes
Classification, extraction, routing	Yes, often superior	Fine-tune on your data
Summarization, structured output	Yes, with eval investment	Watch for hallucination on edge cases
Complex reasoning, coding agents	Frontier still leads	Gap narrowing fast
Customer-facing generation at scale	Depends on quality bar	A/B test before committing

The IPO filings make the open-source hedge more attractive in 2026 than it was in 2024. The tooling for self-hosting (vLLM, llama.cpp, SGLang) has matured substantially. The decision to move some workloads off proprietary APIs is now an engineering question, not a research question.

Contracts: The Window That Closes Fast

I want to be direct about timing here. Enterprise agreements with rate locks or committed pricing are negotiated, not listed on a website. Both OpenAI and Anthropic currently have sales teams that are motivated to close large enterprise commitments — those commitments help their S-1 ARR narrative. That motivation evaporates the moment the IPO roadshow begins and their attention shifts to institutional investors.

If you are consuming more than a few hundred thousand dollars annually in API costs, you should be in a conversation with your account representative this week about what a multi-year committed use agreement looks like. The worst outcome is they say no. The more likely outcome is that you lock in pricing before the first post-IPO earnings call triggers a rate review.

If you're not at that spend level yet, the relevant action is contract language, not price negotiation: ensure your current terms don't allow unilateral pricing changes without notice, and understand your data processing and model training opt-out clauses before public shareholders start asking questions about data monetization.

What to Actually Do

Run the pricing stress test this week. Model your unit economics at 1.5x and 2x current token costs for every LLM-dependent product line. If you don't have cost-per-call instrumentation today, that's your day-one task — you cannot have this conversation with your board without it.
Audit your abstraction layer. If you have a single-provider dependency with no fallback, allocate a sprint to the gateway pattern described above. Days of engineering, not weeks. The decision bottleneck is the fallback chain policy, so make that call now.
Classify your workloads by open-weight viability. Use the table above. Any high-volume, well-scoped task is a candidate for migration. Start an eval, not a migration — eval first, then decide.
Get into a commercial conversation before the roadshow. If your spend justifies it, talk to your account rep this week. Ask specifically about multi-year rate locks and committed use tiers. The window is open now and will narrow fast once bankers get involved.
Update your vendor risk register. Both vendors moving from private to public is a material change in your technology risk profile. Your board and your auditors will ask about single points of failure in your AI infrastructure. Have the answer before they ask.

The IPO filing is a finance story that has an engineering punchline: the era of below-cost inference as a stable foundation for product economics is ending. Build like it.

Working on something like this? I take on a few fractional-CTO and AI engagements at a time.

Book a call Ask my AI twin

The AI CTO playbook

Get my AI playbooks — straight to your inbox

Practical notes on shipping production AI, scaling teams, and the calls a CTO actually has to make. A few times a month. No spam, no fluff.