So You Want to Sell Inference
Reselling inference is a zero-margin commodity. The margin question comes down to cost-plus versus value-based pricing : one anchors to the inference line, the other decouples from it.
Reselling inference is a zero-margin commodity. The margin question comes down to cost-plus versus value-based pricing : one anchors to the inference line, the other decouples from it.
Databricks runs at $6.9b ARR growing 80%, widening the gap over Snowflake to $1.6b. Its AI products sit on the token path, where companies grow explosively even at scale.
A 500+ comment HN discussion reveals the current state of local AI coding - Qwen 3.6 35B dominates model mentions while Pi leads agents.
Three weekend developments — the Fable retraction, Satya's ecosystem thesis & Salesforce's $3.6B Fin acquisition — reveal why the moat has shifted from models to harnesses.
Guardrails on frontier AI were inevitable given the pace of adoption. But the capabilities available to corporations remain tremendous.
Frontier model prices keep rising while open-source crosses the good enough line. Coinbase, Lindy, Harvey & Cursor are substituting — & the savings go straight back into more tokens.
Microsoft's MAI-Code-1-Flash matches Claude Haiku 4.5 on SWE-Bench Verified using a third of the tokens. The right metric for AI in production is intelligence per dollar, value per token spent.
OpenRouter token data shows open-weight models overtaking closed models as provider leadership fragments across Anthropic, DeepSeek, Google, OpenAI, & a long tail.
Short interest in public AI stocks has risen modestly overall, but the pressure is concentrated in AI cloud and smaller software names.
How a personal AI agent built on markdown skills lets a frontier model teach smaller, local models to do real work, without retraining.