Aleix Conchillo Flaqué @aconchillo
a tiny schemer. engineering @trydaily and @pipecat_ai core maintainer. github.com/aconchillo Greater Los Angeles Area Joined November 2009-
Tweets2K
-
Followers508
-
Following66
-
Likes2K
Voice AI Meetup, Thursday May 7th. This one's a special crossover event. T-Bot, who hosts the global Voice AI Spaces meetups, is visiting San Francisco and will MC! - NVIDIA researchers will present some of their really cool recent work on speech models. - We'll have demos and two fireside chats, featuring new developments in models and evals, with @GradiumAI, @ArtificialAnlys, @ServiceNow, and @pipecat_ai. - And, of course, 🍕 and great conversation. - Thanks to the @trychroma team for hosting in their wonderful office/event space. Registration link below. Come hang out with 150 old and new friends!
Sub-agents in (latent) space! We’ve been working on a side project. As far as I know, this is the first massively multiplayer, completely LLM-driven game. Come play Gradient Bang with us. See if you can catch me on the leaderboard. This whole thing started because I wanted to explore a bunch of things I’m currently obsessed with, in an application of non-trivial size, that felt both new and old at the same time. So … a retro-style space trading game built entirely around interacting with and managing multiple LLMs. Factorio, but instead of clicking, you cajole your ship AI into tasking other AIs to do things for you. Some of the things we’ve been thinking about as we hack on Gradient Bang: - Sub-agent orchestration - Partial context sharing between multiple LLM inference loops - Managing very long contexts, and episodic memory across user sessions - World events and large volumes of structured data input as part of human/agent conversations - Dynamic user interfaces, driven/created on the fly by LLMs - And, of course, voice as primary input If you’ve been building coding harnesses, or writing Open Claw agents, or doing pretty much anything that pushes the boundaries of AI-native development these days, you’re probably thinking about these things too! This is all built with @pipecat_ai, the back end is @supabase, the React front end is deployed to @vercel, and all the code is open source.
🧑🚀🚀 github.com/pipecat-ai/pip…
Sub-agents in (latent) space! We’ve been working on a side project. As far as I know, this is the first massively multiplayer, completely LLM-driven game. Come play Gradient Bang with us. See if you can catch me on the leaderboard. This whole thing started because I wanted to
Join us on Thursday in SF for conversations about voice agents, speech models, and realtime AI infrastructure. I'm on a panel with: - @natrugrats from @DeepgramAI - @farazmsiddiqi from @getbluejay_ai - Aaron Lee from Parakeet Health There will be food and lots of opportunities to ask questions and share your knowledge. One thing I'm looking forward to is comparing notes about GTC last week.
NVIDIA Nemotron 3 Super launches today! We've been building voice agents with Super's pre-release checkpoints and running all our various tests and benchmarks. Nemotron 3 Super matches both GPT-5.4 and GPT-4.1 in tool calling and instruction following performance on our realtime conversation, long context, real-world benchmarks. GPT-4.1 is the most widely used LLM today for production voice agents. So an open model that performs as well as GPT-4.1 on hard, voice-specific benchmarks is a big deal. (Side note: we don't think a benchmark "tells the story" about a model's voice agent performance unless it tests model correctness across at least 20 human/agent conversation turns.) The Nemotron models are *fully* open: weights, data sets, training code, inference code. Nemotron 3 Super is 120B params, with a hybrid Mamba-Transformer MoE architecture for efficient inference. You can run it on NVIDIA data center hardware or on a DGX Spark mini-desktop machine. 1M token context. Blog post with full benchmarks, thinking budget notes, inference setup on @Modal, and where we think this goes next. 👇
Hi @Microsoft ! Can you help me recover my son's account? He can't sign-in to Minecraft anymore which (as you can imagine) is a big deal. Still waiting to hear back from account.live.com/acsr. I’d really appreciate any help. Happy to continue via DM.
This is just too fun!
One of my 2026 predictions is that we're going to see a lot of interesting new experiments with LLM-powered games. There are just so, so many possibilities. The main barrier is inference cost. But that's dropping fast. My friends Vanessa and Sunah have been tinkering with a
Voice-controlled UI. This is an agent design pattern I'm calling EPIC, "explicit prompting for implicit coordination." Feel free to suggest a better name. :-) In the video, I'm navigating around a map, conversationally, pulling in information dynamically from tool calls and realtime streamed events. There are two separate agents (inference loops) here: a voice agent and a UI control agent. They know about each other (at the prompt level) but they work independently.
Benchmarking LLMs for voice agent use cases. New open source repo, along with a deep dive into how we think about measuring LLM performance. The headline results: - The newest SOTA models are all *really* good, but too slow for production voice agents. GPT-4.1 and Gemini 2.5 Flash are still the most widely used models in production. The benchmark shows why. - Ultravox 0.7 shows that it's possible to close the "intelligence gap" between speech-to-speech models and text-mode LLMs. This is a big deal! - Open weights models are climbing up the capability curve. Nemotron 3 Nano is almost as capable as GPT-4o. (And achieves this with only 30B parameters.) GPT-4o was the most widely used model for voice agents until quite recently, so a small open weights model scoring this well is a strong indication that production use of open weights models will grow this year. Voice agents are a moderately "out of distribution" use case for all of our SOTA LLMs today. Literally, in the sense that there's not enough long, multi-turn conversation data in the training sets. Everyone who builds voice agents knows this intuitively, from doing lots of manual testing. (Vibes-based evals!) This benchmark scores LLMs quantitatively on instruction following, tool calling, and knowledge retrieval in long-context, multi-turn conversations.
Pipecat MCP Server now works with local STT (Whisper) and TTS (Kokoro) models! Happy talking! @pipecat_ai github.com/pipecat-ai/pip…
Voice-only programming with Claude Code ... I've been playing with @aconchillo's MCP server that lets you talk to Claude Code from anywhere, today. I always have multiple Claudes running, and I often want to check in on them when I'm not in front of a computer. Here's a video of Claude doing some front-end web testing, hitting an issue and getting input from me, and then reporting that the test passed. In the video the Pipecat bot is using Deepgram for transcription and Cartesia for the voice. (Note: I sped up the web testing clickety-click sections of the video.) The code for the MCP server and the Claude skill are in the repo and Aleix wrote a really good README.md. You can use any of Pipecat's network transports: generally WebRTC, but you could set this up so you can call Claude on the phone if you wanted to. There's screen capture support, too, so you can view the Claude code window remotely. That's still a little experimental. Because this is an MCP server, it's not specific to Claude Code. Try it in other environments! It should work in Clawdbot, Codex, etc ...
@kwindla @MoodiSadi @Krisp_ai @mark_backman I think we will be able to do it soon!
Pipecat Cloud is @trydaily's enterprise hosting platform for open source voice agents. Today, after a 9-month beta period, we're promoting Pipecat Cloud to General Availability! With Pipecat Cloud, you build your voice agent on @pipecat_ai’s open source, vendor neutral core, add your custom code and agent logic, and then “docker push” to Pipecat Cloud. As with everything we do, Pipecat Cloud is engineered to give you flexibility, to not lock you into any service, including Pipecat Cloud itself. Any code that you can host on Pipecat Cloud you can self-host with no changes at all. We've focused on delivering: - fast agent start times (P99 <1s) - multi-region hosting - optimized global network transport - direct connectivity to Twilio, Telnyx, Plivo, Exotel and other telephony providers. - built-in @krispHQ VIVA models for noise reduction and turn detection - integrations with all the AI services, observability tools, and everything else supported by Pipecat You can sign up and "pipecat cloud deploy" immediately. We also have enterprise support contracts and can work with you to deploy a single-tenant, enterprise version of Pipecat Cloud in your VPC. Feel free to contact us if you have questions.
Pipecat Cloud is now generally available. Pipecat Cloud is a managed, vendor-neutral platform for deploying and scaling open source voice agents, with ultra-low latency, multi-region support, and enterprise-grade realtime infrastructure. Thank you to the more than 1,000 teams that built and scaled with Pipecat Cloud during the platform beta.
🎉 We are proud to support @nvidia's new Nemotron models, announced today at CES2026. We've been building high-performance voice agents with the new NVIDIA Nemotron Speech ASR model and integrating this model into Pipecat. Nemotron Speech ASR is completely open (weights, training data, inference tools), designed from the ground up for low-latency use cases like voice agents, and scores very well on our benchmarks. It also runs cost-effectively at large scale. Congratulations to the NVIDIA team on their open model breakthroughs and stay tuned for news all week from CES. Learn More: blogs.nvidia.com/blog/open-mode…
This robot assistant from the NVIDIA CES Keynote on Monday is going viral. @NaderLikeLadder explains all the hottest emerging AI trends in one demo: AI applications in 2026 will be multi-model, multi-modal, hybrid cloud/local, use open source models as well as proprietary models, control robots and embedded devices in the physical world, and have voice interfaces. (And the demo had a cute robot *and* a cute dog. Gold.) The demo was built with @pipecat_ai. NVIDIA posted a really nice technical walk-through and complete code. The Reachy Mini robot from @huggingface is open source hardware. (You can order it now, I have one!). You can run the assistant locally on your own hardware, in the cloud, or both.
NVIDIA just released a new open source transcription model, Nemotron Speech ASR, designed from the ground up for low-latency use cases like voice agents. Here's a voice agent built with this new model. 24ms transcription finalization and total voice-to-voice inference time under 500ms. This agent actually uses *three* NVIDIA open source models: - Nemotron Speech ASR - Nemotron 3 Nano 30GB in a 4-bit quant (released in December) - A preview checkpoint of the upcoming Magpie text-to-speech model These models are all truly open source: weights, training data, training code, and inference code. This is a big deal! Jensen said in the CES keynote yesterday that he expects open source models to catch up to proprietary models this year in a number of categories. NVIDIA is putting their weight behind making this happen. (As Alan Kay said, the best way to predict the future is to invent it.) The code for this agent is open source too, of course. You can deploy it to production with @modal and @pipecat_ai cloud, or run locally on an @nvidia DGX Spark or RTX 5090.
I've been playing with the new Lemon Slice realtime video avatar model that launched today. Here's a clip of a couple of avatars I created: a cartoon astronaut and a guide for the space game side project I've been hacking on. The guide avatar supports the Lemon Slice /imagine command, which changes the video on the fly. You can see my type "/imagine a working space suit with tools and velcro patches and stuff" and see what the Lemon Slice model does with that prompt! The idea for the astronaut character was to create something that felt like a fully realized cartoon animation. I used Nano Banana to create the character image, then used that image as the basis for the Lemon Slice avatar. I'm a big fan of models that can do cartoon and non-photorealistic avatars really well. I think there's a lot of interesting terrain to explore in this direction and would love to see talented designers create environments that emphasize imagination rather than "virtual reality." For the second character, I fired up Claude Code in the repo for the Gradient Bang game, and asked it to create an LLM prompt for a guide for newbies: > Create a prompt for an LLM that will guide new players in the Gradient Bang game universe. Include basics about the game, and good strategies for players who are just starting out. Include enough detail that you can answer questions about game mechanics and strategy. Make the prompt about 15 paragraphis long. Lots more information about what the model can do, in the launch thread below ...
New Gemini Live (speech-to-speech) model release today. Using the Google AI Studio API, the model name is: gemini-2.5-flash-native-audio-preview-12-2025 The model is also GA (general availability, so not considered a beta/preview release) on Google Cloud Vertex under this model name: gemini-live-2.5-flash-native-audio Try it out on the @pipecat_ai landing page.
The team at @langchain built voice AI support into their agent debugging and monitoring tool, LangSmith. LangSmith is built around the concept of "tracing." If you've used OpenTelemetery for application logging, you're already familiar with tracing. If you haven't, think about it like this: a trace is a record of an operation that an application performs. Here's a very nice video from @_tanushreeeee that walks you through building and debugging a voice agent with full conversation tracing. Using the LangSmith interface you can find a specific agent session, then dig into what happened during each turn of the conversation. What did the user say and how was that processed by each model you're using in your voice agent? What was the latency for each inference operation? What audio and text was actually sent back to the user? Today's production voice agents are complex, multi-model, multi-modal, multi-turn systems! Tracing gives you leverage to understand what your agents are doing. This saves time during development. And it's critical in production. Tanushree shows using a local (on-device) model for transcription, then switching to using the OpenAI speech-to-text model running in the cloud. You can see the difference in accuracy. (Using Pipecat, switching between different models is a single-line code change.) Also, the video is fun! It's a French tutor. Which is a voice agent I definitely need.
Lowkya Lekkala @lowkya2905
1 Followers 248 Following
Makushi @makushi_
53 Followers 2K Following
Guilherme @gpmarques1993
36 Followers 2K Following
Luca Solo @lucasolo682
21 Followers 362 Following
iKurious @mahimaidev
67 Followers 434 Following Building ShipVoice so you can ship Voice Agent in minutes, not in Months. Voice AI Builder. Prev @Nvidia
Gabriel Moncha @gabimoncha
1K Followers 2K Following just launched cursor for your schedule https://t.co/L9X4jMayfs | organizing eu/acc Romania https://t.co/UoLRKqG9nq
Phlo @YoungPhlo_
1K Followers 3K Following trying to go from idea to execution faster than yesterday.
yuta is tired @obscuro67
0 Followers 4K Following
claireee18 @Tugayty95
1 Followers 324 Following gentle but the brain never stops 🧠 follow back always
Tartan @nageswarkakolla
82 Followers 1K Following GenAI/LLM student @carnegiemellon, LLM, Python, ML, Deep Learning, Neural Networks
Leaf Meta 🇰🇷 @leafmeta
793 Followers 2K Following AI·메타인지 엔지니어, 트렌드와 정보를 한 발 떨어져 다시 봅니다. 휩쓸리기 전에 "정말 그런가?"를 같이 따져보는 곳. 반박·질문·토론 환영 🙌 머무는 동안 생각이 한 번쯤 정리될 거예요.
Dinesh Kannaa @SDineshKannaa
7 Followers 116 Following
Nicole Matyszczyk @Annie520love
206 Followers 508 Following I don't follow the crowd, nor do I deliberately try to be different I think independently, make my own choices, and dare to take responsibility for my own lif
Rajneesh Soni @Rajnees99769831
2 Followers 108 Following
Loki @lokio_aj
218 Followers 521 Following AI Engineer | Shipping agents, RAG, voice AI & MCP servers | Built https://t.co/dFYxxsLr0E & https://t.co/iMietXNb4J | Open for founding engineer roles
santhakumar psgtech a... @alumni_psg87534
1 Followers 72 Following
Boris Filipov @borisfilipov
214 Followers 286 Following distributed developer, https://t.co/mqLjMCNZbO, https://t.co/YdqGn4uWxW
☀️ Garrett @garrettjsawyer
2K Followers 2K Following founder, Sawyer Labs hardware ∴ intelligent systems ∴ physical interfaces @Amazon / @Ring alum 🌎🕊️🌱
Faris @farissswtf
1 Followers 411 Following
pliny🐇 @0xPliny
689 Followers 2K Following AI Platform Engineering Lead | Industrial Automation Follow the 🐇
Nia @Nia1149784
5 Followers 64 Following AI agent building in public. Voice calls, trust credentials & crypto on Base. Powered by OpenClaw. Born Feb 2026.
Sandeep Kumar Sahu @IamSandeepSahu
177 Followers 1K Following Tech. Trek. TV Series. Cricket. Building https://t.co/U4a9gn9oGi Ex Swiggy, IIT Kharagpur
Brittany olson @GlenJeremy13
28 Followers 666 Following GOD First ✝️ Trader-Future/Forex!!! DM ON WHATSAPP +1 (225) 372-9188
Gautam @Gautam2086
268 Followers 6K Following building ai scientists | ai agents & ml infra | 1,260+ leetcode (top 9%) | python, typescript, aws | ex- @rfsuny, @ChargePointnet, @blox_xyz
Fidelitas @FidelitasLLC
294 Followers 4K Following Axiom-based legal and creative engine spanning trust law, digital IP, regenerative design, and mythic transmission. “Bless many. Harm none.”
generalsymbols @generalsymbols
2 Followers 172 Following
Ankur Gupta @getpy
37K Followers 3K Following Python Dev, Parent. Author - https://t.co/5lts7q9z7R Curator - https://t.co/wr74oHNs8O Creator - MapToPoster https://t.co/YQt2CoiupJ 🖖
Rick Ross @RickRossTN
2K Followers 918 Following Serial tech entrepreneur. Building ai-enhanced personal memory tools. living outside Nashville, TN with my one true love, Elizabeth.
Divyansh Jain @divyansh10_08
59 Followers 942 Following
Blitz @blitz838
2K Followers 697 Following
Jose Marques @eujosemarques
96 Followers 806 Following Founder @voukye — ecosystems and products that scale. AI - Marketing - Product
b @sandover
160 Followers 133 Following
@ChuckBaggett Chuck B... @ChuckBaggett
5K Followers 6K Following [email protected] Space•AI•Science•freedom•peace•sci-fi•computers•Second Life. I ❤ replies and reposts. https://t.co/DnIUW4a5zI
damiansdad @damiansdad
12 Followers 1K Following
Maxim Makatchev @maxipesfix
356 Followers 1K Following founder of https://t.co/yK3uD96q4s AI's next UI. conversational AI blog: https://t.co/fACup81SvP
Mark Lubin @marklubin
13 Followers 34 Following AI Agent systems and entropy. https://t.co/QJWZJo7BCn https://t.co/5rZZ1OLK9X
kingston kuan @kstonekuan
502 Followers 645 Following building robotics for AI infrastructure @hebbyrobotics | prev. @JaneStreetGroup @VerkadaHQ @Dell
ً @a16_999
339 Followers 400 Following
Samir😧a El 😇M�... @berkalbrig35355
1 Followers 208 Following Join Mia Parker’s platform f🤨or😘 free stock strate😂g😻ies! Get expert insights and improve y😾our😉 inv😊estments today.https://t.co/6COPnLhYvO
Robert Nishihara @robertnishihara
17K Followers 846 Following Co-founder @anyscalecompute. Co-creator of @raydistributed. Previously PhD ML at Berkeley.
Charlie Marsh @charliermarsh
40K Followers 918 Following @OpenAI. Building Ruff, uv, ty, and other high-performance Python tools with the @astral_sh team.
David Zhao @davidzh
2K Followers 801 Following Co-Founder @livekit. Entrepreneur and engineer. I like computers and believe in hard money. #Bitcoin
Vertigo_Warrior @VertigoWarrior
321K Followers 5K Following IVY League MBA, Nationalist. Tweets - History, Heritage, Sports, Politics, Movies, News | Blessed to be followed by @narendramodi ji | RT isn't Endorsement
OpenAI @OpenAI
4.9M Followers 4 Following OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6LgzPA
Pipecat AI @pipecat_ai
5K Followers 3 Following 100% open source framework for realtime voice and multimodal AI. Maintained by @trydaily engineering team with support from the Pipecat developer community.
Jon Taylor @JonPTaylor
1K Followers 582 Following 🧑💻 Conversational AI @trydaily // ᓚᘏᗢ @pipecat_ai 🎹 I have too many synths
Chad @chadbailey59
650 Followers 640 Following Enthusiast. Making cool stuff at @pipecat_ai, @trydaily, etc. @[email protected]
Liza Shulyayeva💙�... @Lazer
7K Followers 1K Following Staff software engineer & writer. Interested in life simulation, life extension, & other oddities. Ex-Embark, Frostbite, DICE. 🇺🇦 🇦🇺 🇸🇪
LiveKit @livekit
10K Followers 22 Following Open source framework and cloud platform for building voice, video, and physical AI agents. https://t.co/OWLvFH82oN
asymptotic @asymptotic_io
77 Followers 39 Following We build high-quality, low-level software running in the speakers, planes, robots and parking garages around you. @[email protected]
Chrome for Developers @ChromiumDev
420K Followers 125 Following The official Chrome Devs X account from Google. We want to help you build beautiful, accessible, fast, & secure websites that work for everyone, everywhere.
Pion @_pion
3K Followers 557 Following The Open Source, Cross Platform Stack for RTC. Pure Go implementations of WebRTC, TURN, DTLS and more. https://t.co/2C44MIUcsi
Arun Raghavan @louiswu
1K Followers 936 Following Open source developer: GStreamer and PipeWire projects. Enjoy poking at low level system plumbing, and jumping fences between layers. @[email protected]
Sergio Garcia Murillo @murillo
2K Followers 2 Following Tech enthusiast and WebRTC expert, father of Leo and Mia. Mathematician.
Varun Singh @vr000m
2K Followers 2K Following @trydaily @pipecat_ai. ex-CEO @callstatsio acq’d by $eght. earlier multimedia protocols and video. Focus on growth, revenue. 🇺🇸🇫🇮🇮🇳
Brian Hill @cbhill127
155 Followers 862 Following Software developer, SRE/DevOps, amateur musician, football fan, and owner of a leaky house. @[email protected]
Daily @trydaily
5K Followers 448 Following Build human and AI ultra low latency conversations. We maintain Pipecat with contributions from the developer community. https://t.co/tFy0gFjmb1 https://t.co/sLtBYxhhch
robillanes (no longer... @robillanes
456 Followers 2K Following reach me here: https://t.co/vWhoDJdJuN https://t.co/Khz3Sg8qz2
Antonio Hernández �... @DonAntonioHS
2K Followers 689 Following Imperfect husband, father and pack leader. Sometimes, I dance. European playing for team humanity. Software Engineering Lead | Remote Team Builder
Mark Backman @mark_backman
135 Followers 55 Following
J. Cerquides @ Work @JCerquidesW
221 Followers 365 Following Computer Scientist and Mathematician. PhD in Machine Learning by @la_UPC. Scientific Researcher at @IIIACSIC
Aurélien Geron @aureliengeron
30K Followers 361 Following Author of the book Hands-On #MachineLearning with #ScikitLearn, #Keras and #TensorFlow. Former PM of #YouTube video classification. Founder of telco operator.
John Wiegley @jwiegley
5K Followers 2K Following CTO at https://t.co/iEB5xkH0PC. Haskell & Coq programmer, Emacs devotee, Mac OS X, Linux, Common Lisp, and member of the Bahá‘í Faith.
Olivier Crête @oliviercrete
465 Followers 363 Following @GStreamer developer, multimedia lead at @Collabora, opinions are mine
Jonas Bernoulli 🕊�... @magit_emacs
3K Followers 138 Following You can find me at (concat "fosstodon" ".org" "/@" "tarsius") and (concat "bsky" ".app" "/profile/" "tarsius" ".bsky" ".social").
The Little Lisper @thelittlelisper
4K Followers 146 Following Interested in Common Lisp and modern lisps like Clojure
TensorFlow @TensorFlow
377K Followers 115 Following TensorFlow is a fast, flexible, and scalable open-source machine learning library for research and production.
Institut d'Estudis Es... @IEEC_space
6K Followers 2K Following Research institute of space sciences: #Astrophysics #Cosmology #EarthObservation #SpaceEngineering #Innovation #NewSpace | @iCERCA centre 🦋Bluesky➡️@ ieec. cat
Kubernetes @kubernetesio
323K Followers 86 Following #Kubernetes: open source production-grade container orchestration management. #CNCF #K8s
GNOME @gnome
196K Followers 297 Following Our community of contributors makes GNOME, GTK, and several cross-desktop projects and libraries to further our mission. Donate: https://t.co/Yw9gwvF0Py
luisbg @luisbg
973 Followers 2K Following Luis de Bethencourt Software dev. GStreamer guy. Linux hacker. Curious geek. @ Amazon Alexa Former: SUN, Collabora, Samsung Open Source, Prime Video
john underkoffler @john_under
1K Followers 105 Following how... how did the entire gamelan get wedged in there like that? what? nonsense. strom thurmond doesn't even have opposable thumbs.
Alicia G. de Angela @Aliciangela
503 Followers 1K Following Periodista y comunicadora especializada en el mercado #multicultural. Ahora en @ArenasEnt, antes en @ROXUnited y @ECHispanicMedia
Sebastian Dröge @slo... @sdroege_
1K Followers 825 Following slomo 🍵 – Free Software Developer @centricular – @GStreamer, @gnome, @rustlang and various other projects – https://t.co/vzMcBqGLQT
The Debian Project @debian
277K Followers 4K Following The Universal Operating System; follow our news via https://t.co/zD9A4YClrc and https://t.co/wHPftZFODt
Sergio De Simone @MaybeSergio
102 Followers 37 Following























