Intentee @intentee

Cloud tools for creating code-free systems and keeping AI on your own infrastructure. github.com/intentee/paddl… Joined August 2025

Tweets: 20 | Followers: 14 | Following: 3 | Likes: 3

Intentee @intentee

2 months ago

Stability of the user experience is a strong reason to run open source models on your own infra, and it doesn't come up as often as cost control or data privacy. More in the video.

0 0 0 7 0

Intentee @intentee

3 months ago

Paddler repo: github.com/intentee/paddl…

0 0 0 8 0

Really insightful talk from Mateusz Charytoniuk on building a self-hosted LLM stack at @rustikonconf today. He broke down where open source LLMs are headed, how to self-host them at scale with Paddler, and how Rust helps to make this ecosystem more secure and maintainable.

1 0 0 9 0

Intentee @intentee

6 months ago

MCP servers aren't just technical projects. They can add real value by making your product accessible through conversational AI. Here's a short video essay on how to frame them as a business opportunity: youtube.com/watch?v=R78mBg…

0 0 0 48 0

Intentee @intentee

8 months ago

We're organizing Post Software, an event where AI builders, designers, and professionals from other industries meet to explore how AI can create entirely new solutions, not just optimize what already exists. Want to speak or get involved? Visit post-software.intentee.com

0 0 0 179 0

Intentee @intentee

8 months ago

It gives you real-time insights into how your capacity is used, how many requests are being buffered, and any issues that might have come up. It also provides a convenient way to manage your models (swap them dynamically, use custom chat templates, adjust inference parameters).

0 0 0 34 0

Intentee @intentee

8 months ago

Paddler is our open-source platform for self-hosting LLMs at scale, and it comes with a web UI you can use to understand your cluster at a glance. paddler.intentee.com/docs/starting-…

1 0 0 46 0

Intentee @intentee

9 months ago

Self-hosting your models can solve this, and with tools like Paddler, you don’t need to hire an entire LLMOps team. Learn more: paddler.intentee.com and youtube.com/shorts/B7Pk2tA…

0 0 0 28 0

Intentee @intentee

9 months ago

Per-token API pricing for LLM usage is convenient, but it makes costs highly unpredictable. Traffic spikes are one thing, but the way your users use your LLM-based features can also differ widely.

1 0 0 30 0

Intentee @intentee

9 months ago

The pre-alpha version of Poet (static site generation, open-source, github.com/intentee/poet) is out, with a custom syntax that gives full control over how content is understood structurally and will allow us to add AI-based content analysis features. More to come :)

0 0 0 28 0

Intentee @intentee

9 months ago

Technical products and developer tools need to be discoverable in AI platforms, let users talk to their docs, and ensure coding copilots can offer quality code suggestions for their technologies. This is what we’re aiming for with Poet poet.intentee.com

1 0 0 51 0

Intentee @intentee

9 months ago

According to the latest Stack Overflow Developer Survey, most developers prefer both interactive formats and long-form articles when learning new technologies. This means static site documentation should no longer be static.

1 1 0 31 0

Intentee @intentee

9 months ago

Paddler lets you download models directly from Hugging Face (or from a local file path), both via API and our web admin panel. You can also swap them dynamically without needing to restart the entire setup. More: paddler.intentee.com/docs/starting-…

0 0 0 21 0

Intentee @intentee

9 months ago

Not every LLM task requires a massive number of parameters. And it’s not only a matter of cost savings. Smaller, specialized models can offer a better experience to the end user (both in response quality and performance).

1 0 1 17 0

Intentee @intentee

10 months ago

If there's no available capacity, Paddler will buffer the incoming requests. Combined with autoscaling, it lets you handle traffic spikes without dropping requests. You can also scale from zero hosts to avoid overpaying for idle GPU capacity. paddler.intentee.com/docs/internals…

0 0 2 16 0
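
The buffering behaviour described in the post above can be illustrated with a toy model. This is a minimal sketch under stated assumptions, not Paddler's actual implementation or API: the `BufferingBalancer` class, its method names, and the `max_buffered` limit are all hypothetical and exist only to show the idea of holding requests in a FIFO buffer until capacity appears, including the scale-from-zero case.

```python
from collections import deque


class BufferingBalancer:
    """Toy model (hypothetical, not Paddler's code): dispatch requests to
    free slots, otherwise hold them in a FIFO buffer until capacity appears."""

    def __init__(self, slots, max_buffered):
        self.free_slots = slots          # idle inference slots right now
        self.max_buffered = max_buffered  # assumed cap on waiting requests
        self.buffer = deque()            # requests waiting for a slot
        self.running = []                # requests currently being served

    def submit(self, request_id):
        """Accept a request: run it, buffer it, or reject it if the buffer is full."""
        if self.free_slots > 0:
            self.free_slots -= 1
            self.running.append(request_id)
            return "dispatched"
        if len(self.buffer) < self.max_buffered:
            self.buffer.append(request_id)
            return "buffered"
        return "rejected"

    def _offer_slot(self):
        """Give one slot to the oldest buffered request, or mark it idle."""
        if self.buffer:
            self.running.append(self.buffer.popleft())
        else:
            self.free_slots += 1

    def complete(self, request_id):
        """A request finished; its slot goes to the next buffered request."""
        self.running.remove(request_id)
        self._offer_slot()

    def add_capacity(self, slots):
        """Autoscaling added slots; buffered requests are drained first."""
        for _ in range(slots):
            self._offer_slot()


# Scale-from-zero scenario: no hosts yet, so the request waits in the buffer
# until the (hypothetical) autoscaler adds a slot.
balancer = BufferingBalancer(slots=0, max_buffered=10)
print(balancer.submit("req-1"))  # buffered
balancer.add_capacity(1)
print(balancer.running)          # ['req-1']
```

The point of the sketch is the ordering: a request is rejected only when both the slots and the buffer are exhausted, which is why buffering combined with autoscaling can absorb a spike instead of dropping traffic.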

Intentee @intentee

10 months ago

Feature highlight: model swapping. Paddler lets you swap models dynamically, without the need to restart the balancer or the agents. Available in the web admin panel or the API. Learn more: paddler.intentee.com/docs/starting-…

0 0 2 13 0

Kevin Wittek @Kiview

10 months ago

@ggerganov Really awesome and I am happy to see llama.cpp being featured for non local-workstation use cases more :)

0 1 3 410 0

Intentee @intentee

10 months ago

@ggerganov Thank you for highlighting Paddler!

0 0 2 101 0

Georgi Gerganov @ggerganov

10 months ago

Highlighting project paddler: Build and scale your own LLM infrastructure, powered by llama.cpp. Mateusz and team worked hard over the last year and have significantly improved the project. Check it out and let them know your experience: github.com/intentee/paddl…

4 30 290 18K 163

Intentee @intentee

10 months ago

Paddler 2.0 is out and it’s now a complete open-source platform to run and scale open-source LLMs in your own infrastructure. Docs: paddler.intentee.com GH: github.com/intentee/paddl…