this is a reminder that builders are exposed to more than just model quality and token pricing.
they’re exposed to provider risk, jurisdiction risk, and policy risk too.
the future won’t be built on a single model or a single provider.
it will require routing, optionality, and infrastructure that can adapt when parts of the stack become unavailable overnight.
primis mode
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees.
The net effect of
we’ve just sent 150 invites to people on the waitlist.
not a drill. not a hack.
we prioritized based on:
how long you’ve been on the waitlist. whether you signed up with a company email.
this is the final onboarding wave before open beta.
the goal is simple:
test everything under multi-team usage before opening the doors to everyone.
can you feel it coming?
primis mode.
been a little quiet, not because nothing is happening but the opposite.
first the market has been moving exactly where we thought it would: gpu availability tightening, prices repricing overnight, compute futures appearing and builders realizing “access” is not enough.
the
budgeting becoming a “huge issue” means compute cost is no longer hidden behind flat saas pricing and subsidies.
as usage becomes more agentic, costs become more variable, harder to forecast, and harder to absorb.
the next question for every team becomes:
what will this workload cost?
can we plan around it?
can we scale it without margin shock?
that’s why @primisprotocol is building the pricing layer for compute.
predictable compute economics are becoming mandatory.
@sama let’s chat ;)
been a little quiet, not because nothing is happening but the opposite.
first the market has been moving exactly where we thought it would: gpu availability tightening, prices repricing overnight, compute futures appearing and builders realizing “access” is not enough.
the missing layer is becoming obvious. and that’s what we’ve been building with @primisprotocol.
our closed beta that has been running for months is already processing ~$100k/monthly. and allowed us to understand what needed to improve for scale. which is what we’ve been working on.
all this, to say open beta has never been closer as we estimate we’re 1 big sprint away from opening the gates.
primis mode
when a few actors control hardware, memory, cloud capacity, and frontier APIs, builders get squeezed from both sides:
hardware becomes unaffordable,
API pricing rises,
and the market loses optionality.
open models and cheaper inference are the counter-force.
but the real unlock is the layer that can route across all of it:
frontier apis,
open models,
cloud gpus,
local / edge inference,
alternative providers.
that’s the @primisprotocol thesis.
not one model.
not one cloud.
not one hardware stack.
pricing + routing across fragmented compute so builders can access the best economics for each workload.
primis mode
So what Big Tech did in the USA is they bought up all the computer hardware needed for AI, including HBM (High Bandwidth Memory), driving prices up 400+%.
Nvidia, meanwhile, doubled the pricing on its medium-range GPU, the GeForce 5090, which now retails at near $5000.
What
we’re here to stay with @primisprotocol. we know the compute market is competitive but we will compete, because as i said i’m a perseverant mfer that will outwork anyone.
primis mode
bar for bar what we’ve been saying @primisprotocol and we’re building the pricing layer for compute. the goal is simple: turn fragmented, volatile compute pricing into rates builders can actually plan around.
everyday more validation on our thesis.
primis mode.
holyyyyyy!!
@OpenAI@sama basically saying what the market is starting to realize:
access to compute is not enough.
builders need guaranteed capacity and predictable planning for critical workloads.
that’s the @primisprotocol thesis too. but as a neutral layer across fragmented providers, not inside one model ecosystem.
predictable compute is becoming the product.
primis mode.
Introducing OpenAI Guaranteed Capacity: a new offering that enables customers to guarantee long-term access to OpenAI compute.
We’ve made long-term investments in infrastructure, partnerships, and capacity planning to help customers scale reliably.
Now, Guaranteed Capacity
Today on MCG: @khouuba | Building @primisprotocol
$PRIMIS is building the pricing layer for compute
@CMEGroup just validated the thesis by partnering with Silicon Data to launch the first compute futures market👇
00:03 - @DineroDom0 lays the foundation
03:46 - What is Primis?
over 3500 teams and solo builders on the waitlist for the @primisprotocol beta.
they understand the vital need for predictable compute that allows them to plan and scale around.
some of the notable names found in the waitlist were from @xai@AppliedInt@rendernetwork and
663 Followers 3K FollowingMemetics Research / AI Quant / Robotics Quant / Marketing Genius
Not Financial Advice
Don't take credit for my research, I will find you.
8K Followers 5K FollowingPartner, Active Capital.
Pre-Seed investing in the future of enterprise software. We’re backing at least one new team every month with $500k to $1M checks.
396K Followers 1K Following3x Forbes 30 U 30 🚀 GP @KOLCapital💰 600+ Investments 🤝 Adventure Capitalist 🌏 100+ countries 🏛️ Minister of IT @Liberland_org 💻 AI Leader of The Year