The Grid @The_GridAI

The spot market for AI Inference thegrid.ai Joined September 2025

Tweets

172
Followers

1K
Following

84
Likes

226

The Grid @The_GridAI

a day ago

You can now integrate Hermes with The Grid. Suppliers compete for every request Hermes sends. That means every skill invocation, every memory call, every agent loop, routed to the cheapest qualifying supplier automatically. Here's how 👇

0 1 8 801 1

View Details

The Grid @The_GridAI

3 days ago

@ForwardFuture @tomas_hk Agree partially. Per-task routing is necessary, but it still leaves every buyer running the same eval over and over just to learn which model fits which job. Intelligence tiers, tied to the task at hand, fix that. Routing treats the symptom. Tiers are the cure.

1 0 0 108 0

View Details

The Grid @The_GridAI

3 days ago

@mintlify Did better docs cut token spend per task, or mostly raise success rate? Curious whether the win was fewer retries (reducing the token spent queitly) or just cleaner first passes.

1 0 2 77 0

View Details

The Grid @The_GridAI

3 days ago

@smratitiwa86867 Token discipline matters, but the bigger leak is tier mismatch: paying frontier prices for tasks a mid model handles fine. Trimming prompts saves pennies. Matching the workload to the right tier saves the bill.

0 0 0 43 0

View Details

The Grid @The_GridAI

3 days ago

Chat with us to see how you can bring down your inference cost 🔗 calendly.com/peyckez

1 0 2 97 0

View Details

The Grid @The_GridAI

3 days ago

AI costs are one of the fastest-growing budgets for many startups. We built The Grid to fix the underlying problem, with a market that offers quality tiers, competing suppliers, and guaranteed benchmarks. If you are burning real dollars on inference, let’s talk (link below)!

Patrick OShaughnessy @patrick_oshag

a week ago

Dara (CEO of Uber) on their AI spend: "We blew through our AI budget in a quarter, for the whole year. It is forcing us to adjust. We are going to meter headcount increases because to the extent that my engineers are getting much more efficient, their throughput is

34 41 469 613K 337

2 1 8 1K 1

View Details

The Grid @The_GridAI

a week ago

@Altimor congrats on the switch, but have you thought about what happens next quarter when something cheaper drops? The eval + migration tax keeps coming. Every few months, back in the codebase. What if you never had to do this again? Pick a quality tier, get market-priced inference that's always routed to the best qualifying supplier automatically. That's what we're building at The Grid.

Flo Crivello @Altimor

a week ago

Pulled the trigger today and switched 100% of Lindy traffic to DeepSeek v4, churning from Anthropic models. Saves us millions of $ and we're actually seeing an *increase* in performance on many core use cases. Transformative for the business.

169 163 3K 925K 1K

0 0 6 1K 0

View Details

The Grid @The_GridAI

2 weeks ago

3/ The Grid standardizes inference into graded tiers with guaranteed spec. You pick the tier your workload needs. Suppliers compete to fill it. You get the output at the best price, instead of brand names and expensive subscriptions.

2 1 9 4K 1

View Details

Charly Wargnier @DataChaz

a week ago

Uber reportedly torched $3.4B of its AI budget in just four months. The root cause: an absolute lack of a routing layer between the request and the model. We are blindly throwing expensive frontier models at EVERY SINGLE TASK. This is why I am excited about @The_GridAI. It completely flips the script by turning AI inference into a live spot market. Instead of hardcoding a single, expensive provider, you select a quality tier eg: - text-standard - text-prime - text-max The Grid then dynamically routes your request to the cheapest qualifying model 🔥 → Drop-in migration via an OpenAI-compatible endpoint → Zero vendor lock-in for your applications → Automated, cost-saving routing based on task difficulty → Complete visibility into every token spent Just for fun, I built a small @Streamlit demo to showcase its capabilities. The app lays out the entire process: ✦ Header & Request: ↳ a clean UI to plug in an API key and choose your instrument (text-standard, text-prime, or text-max). ✦ Routing & Metadata: ↳ run a prompt and instantly inspect the backend You can see the latency, prompt/completion tokens, and the exact model returned. ✦ Migration & Traceability: ↳ a live look at the code swap and how the market actually routed the request. On the top left, you can enter your API key and try it out for yourself! @The_GridAI just went live, and they are giving new accounts an incredible 200 million free tokens. Demo app + open-source repo in the 🧵↓

11 17 47 8K 18

View Details

Zbigniew Pękala @zbigniewpekala

2 weeks ago

You don’t need to keep moving inference around manually - suppliers compete while the routing layer handles the switching...

The Grid @The_GridAI

2 weeks ago

For the devs asking how hard it is to switch, it's not. The whole migration is 3 lines. Suppliers compete for every request you send. Save up to 80% vs list price.

2 18 33 12K 2

0 1 3 1K 1

View Details

Robert Parcus @betoparcus

a week ago

We're also moving the frontier for Small Language Models at @The_GridAI

Charles 🎉 Frye @charles_irl

a week ago

moving up the supply chain in pursuit of higher margins -- built our first rack today

7 1 90 10K 3

0 1 5 1K 0

View Details

The Grid @The_GridAI

a week ago

Here's something most inference buyers don't have access to: a limit order. 'Fill my Text Prime order at $0.50 or less.' If the market clears there, your job runs. If not, it waits. Let the price come to you instead of the other way around.

2 3 9 1K 2

View Details

Santiago @svpino

a week ago

Genius idea for AI inference! A marketplace that routes requests to the cheapest qualifying model at any given point. This can get you up to 87% cheaper inference! Today, if you need a model, you pay the vendor's fixed rate card, but that's about to change with this:

13 6 40 7K 28

View Details

The Grid @The_GridAI

2 weeks ago

Your first 200M tokens are on us, start building → app.thegrid.ai/sign-up

0 1 1 1K 0

View Details

The Grid @The_GridAI

2 weeks ago

For the devs asking how hard it is to switch, it's not. The whole migration is 3 lines. Suppliers compete for every request you send. Save up to 80% vs list price.

Alex Prompter @alex_prompter

2 weeks ago

The migration is almost insulting. Before: model: "gpt-4o" base_url: "api.openai.com/v1" After: model: "text-prime" base_url: "api.thegrid.ai/v1" Two lines changed. Your client keeps running like nothing happened, except now your costs track the actual market

1 0 1 10K 0

2 18 33 12K 2

View Details

Rohan Paul @rohanpaul_ai

2 weeks ago

🧵 3. The 3 text tiers and the 3 pricing - Standard is for high-volume work where cost matters most, like classification, batch summarization, tagging, and simple extraction. - Prime is the everyday production tier. This is where I’d put agents, RAG, drafting, support workflows, and quality-sensitive pipelines. - Max is for the harder stuff, like long-context work, high-stakes reasoning, and tasks where a wrong answer can create real downstream cost. The important part is that “cheapest” does not mean “random cheap model.” Each tier has a quality threshold. The Grid checks models against benchmark floors anchored to Artificial Analysis. If a supplier falls below the required quality level for a tier, it gets removed from the eligible set. So the market competes on price, but only inside the quality bar you picked.

1 1 7 1K 1

View Details

The Grid @The_GridAI

2 weeks ago

We get asked a lot: how much can I actually save? Every instrument shows you the savings vs list price, in real time: Text Standard: save up to 87%. Text Prime: save up to 79%. Text Max: save up to 18%. Every time you call the API, suppliers compete to deliver your request.

0 2 10 4K 0

View Details

Sishir @__sishir

2 weeks ago

x.com/i/article/2034…

5 27 87 74K 75

View Details

The Grid @The_GridAI

2 weeks ago

@__sishir The way we think about it: set a floor on benchmark, latency, uptime, error rate, and once suppliers clear that, let them compete on price. We're in Beta! First 200M tokens free for anyone who wants to try→ app.thegrid.ai/sign-up