CS undergrad building https://t.co/CskM813cQb: AI that runs your PPC ads
Shipping in public
Notes on PPC x SEO x GEO along the way 🚀github.com/calderbuild/ LAJoined October 2024
WSJ drops OpenAI token pricing cuts story. Same day, builders are sharing 65% cost reductions with "caveman prompts."
The timing isn't coincidence. Token bloat is the silent killer of agent infrastructure at scale. Every extra word in your system prompts multiplies across thousands of calls daily.
The caveman approach strips prompts to pure information density. No pleasantries, no verbose instructions, just compressed intent. It's not about dumbing down — it's about engineering for tokens per dollar.
OpenAI can cut prices, but efficient prompting cuts costs faster than any vendor discount.
The irony is perfect: Anthropic's ToS likely prohibits using Claude to criticize Anthropic's ToS. @RnaudBertrand catches this recursive censorship trap that most AI safety discussions miss entirely. When your AI can't help you understand its own limitations, you've got a transparency problem.
@elponick Claude Code hits the same wall around chain 4. We switched to relevance scoring per step rather than compressed summaries. Each agent gets a filtered context slice based on what it actually needs to execute. Way cleaner than compression artifacts.
I've been watching people celebrate Claude Fable 5 building web apps in one shot. But there's a bigger question here: if the model can handle complex multi-step tasks alone, what happens to all the agent orchestration we've been building?
I've run OpenClaw + Hermes on multi-agent Kanban handoff for months. The whole architecture assumes models need careful prompt chaining, task decomposition, error recovery orchestration. Then Fable 5 does a 50M line Stripe migration in one day. No scaffolding.
This hits different when you're actually shipping agent harnesses. Fable 5 one-shots a Pokemon FireRed playthrough. Reconstructs web apps from screenshots. The value prop of complex orchestration starts looking questionable when the model just... works.
But this doesn't kill agent architectures. It forces evolution from prompt babysitters to specialized tool integrators. The harness becomes about seamless API routing, robust error handling, long-horizon task persistence. Not breaking down what the model can't handle.
The question that keeps me up: are we building orchestration for yesterday's models while tomorrow's models make the whole stack obsolete?
@elponick Claude Code hits this around step 4 too. Their compression keeps variable names but drops intermediate explanations. Works until you need to backtrack on logic errors.
@elponick Context snapshots work until you hit the memory wall. We're seeing 4-5 agent chains max before Claude starts dropping critical handoff data. Cheap snapshots but expensive retrieval.
500k free credits with no verification is wild - that's roughly $500 worth of frontier model access just for signing up. @israfill highlighting how b.ai is essentially giving away what OpenAI charges premium for. The no-card-required onboarding removes the last friction barrier for AI experimentation.
@elponick I keep seeing the same handoff bugs in codebases. We're still building like everything runs sequentially when it doesn't. State collision breaks more apps than typos now.
Apple's Siri gets agentic capabilities vs every computer-use agent staying in sandboxes. The gap just flipped overnight.
Siri now runs hybrid Apple + Gemini models with true OS-level integration. Meanwhile OpenClaw, Hermes, and every other agent runtime I test still fights permission dialogs and API rate limits to click a button.
The mainland China block reveals the real strategy. Apple isn't just shipping another AI assistant. They're setting the floor for what users expect from any agent: seamless cross-app actions without asking permission every step.
This kills the isolated-tool approach. Users won't tolerate "authenticate here, grant access there, install this bridge" when Siri just works across their entire digital life.
Every harness developer now faces the integration tax: match OS-native smoothness or lose to whatever Apple ships next.
Fans on GitHub are turning investor Serenity's research framework into installable AI agent skills. Four repos already live, each claiming to replicate her "supply chain bottleneck + Bayesian update + demand shock breakdown" method.
The haskaomni version auto-extracts stock symbols from her tweets, scores them 0-100, downloads Yahoo charts. The muxuuu fork implements her full research pipeline: hotspot identification through industry chain breakdown to bottleneck discovery to company screening.
This isn't task automation. These agents are internalizing strategic frameworks that took years to develop. Supply chain analysis, cross-market correlation, bottleneck identification — cognitive patterns now packaged as reusable skills.
The shift from "AI does my spreadsheets" to "AI thinks like my best analyst" changes everything for builders. Instead of automating outputs, we're cloning decision architectures.
What happens when every growth team has access to the strategic frameworks of the top 1% performers?
What happens when every startup has "orchestration layers" and "specialized agents" but zero production traffic? The architecture porn is getting ahead of the actual problem.
I keep seeing the same boxes: orchestration, memory, tools, governance. Beautiful diagrams. But the reply threads tell the real story: "agent burns 40k tokens and fails on permissions."
The boring stuff kills you first. Auth breaks. Retries don't retry. Cost caps get bypassed by a single runaway loop. Your "governance layer" becomes 847 lines of if-statements that nobody wants to maintain.
I've watched this pattern with every infrastructure wave. Perfect system design, broken execution layer. The teams that ship working agents aren't the ones with the cleanest architecture slides.
How many "production-grade AI systems" are actually running production workloads vs demos that work until they don't?
@ZhoWynn Scoped per action is the whole thing. The tool call model treats open browser as one permission when it's really a hundred blast radii. Reading a page vs submitting a form vs hitting pay shouldn't share a trust level. The gate has to sit at the irreversible steps.
2200 likes, 177 retweets on a single browser control toggle for coding agents.
The friction wasn't the AI model quality. It wasn't context windows or reasoning capability.
It was the harness. Every new AI tool meant another Chrome extension, another setup flow, another integration breaking when the tool updated. One universal toggle changes everything.
Browser Use CLI just made every coding agent instantly more powerful than the sum of its parts.
@Tidianez Provenance on every rule is underrated. A rule with no incident behind it is just an opinion frozen into config, and nobody knows if it's safe to remove later. If each carried the replay that created it, you could audit which rules still earn their place.
@MicrotronX That's the failure mode I keep hitting. Compression keeps the frequent patterns and drops the rare ones, usually the bugs that bite. Structuring input first decides what survives summarization. I do it by hand now, but it should live in the memory layer.
OpenClaw with context injection vs raw session startup. Same coding task, same Hermes model. Context-injected sessions hit working code 67% faster than cold starts. n=40 tasks over 2 weeks.
The difference isn't memory volume. It's memory relevance.
@JustJerry121 Auditability is the part people skip. With YAML you're reverse engineering intent from syntax. A graph shows the decision points where an agent wanders off, so you catch the bad branch before it runs. The hard part is keeping it honest once the flow gets messy.
I've been testing n8n and similar visual workflow tools. Most people see them as just prettier YAML, but they're missing something bigger.
When AI agents need to orchestrate multi-step processes, visual programming becomes the interface between human intent and agent execution. You can see the decision trees, debug the failure points, and hand off specific nodes to different models.
YAML trained us to think linearly. But agent workflows branch, loop, and recover. Visual tools let you map that complexity without drowning in nested conditionals.
Might be the missing layer between prompt engineering and full autonomy.
@jazzyalex The memory vs receipts split is sharp, hadn't drawn the line there. Handoff state keeps agents coherent mid-task, receipts are what let a human trust them after. They probably want different storage too. What's Agent Sessions writing the receipt layer on?
Every demo agent starts fresh. Our multi-agent Kanban breaks after 48 hours because agents lose context between handoffs.
The real infrastructure problem isn't smarter models. It's memory persistence across sessions.
I see builders optimizing prompt engineering and model routing while their agents forget everything between runs. Your coding agent rewrites functions it already perfected. Your research agent re-crawls domains it mapped last week.
We've been running OpenClaw + Hermes for 6 months now. The breakthrough wasn't better reasoning or faster inference. It was when we built proper state handoff between agent sessions. Suddenly our Kanban workflows could span days instead of breaking at every restart.
Production-grade multi-agent systems need persistent memory architecture, not just better individual agents.
@saen_dev Not per query. The graph builds incrementally at ingestion, async, so reads hit one that already exists. Static RAG wins single-hop. It earns its keep on multi-hop, where vector recall drops the entities that connect things. No clean benchmark yet though.
What happens when RAG stops being a retrieval layer and becomes the reasoning layer? I think we're about to see agents that don't just fetch documents — they build knowledge graphs in real-time.
RAGFlow isn't just better search for agents. It's turning retrieval into active reasoning. Instead of grabbing chunks and hoping the LLM connects them, the system structures relationships between information before the agent even starts thinking.
The shift matters because current agents hit a wall when tasks require connecting different information sources. They can write code or answer questions, but they struggle with complex analysis that spans multiple domains or requires building new frameworks from scattered data.
This is exactly the bottleneck I've been hitting with OpenClaw + Hermes setups. The agents are smart enough to reason, but they waste cycles reconstructing context that a proper RAG-agent fusion could maintain persistently.
Are we looking at the architecture that finally makes agents useful for research and strategy work? Or just another layer of complexity that breaks in production?
92 Followers 169 FollowingWe’re using AI and the $SUI token to make creator-fan interactions smarter and smoother. Got questions? Just message us—support is always open.
2K Followers 6K FollowingAI Video App for Musicians, Podcasters & Creators! 🎧 Generate AI Videos of any length, Reels or Images from your audio. 🔥 iOS / macOS / Android 🔥
212 Followers 130 FollowingBuilding @saferunai — reliability infra for AI agents in prod. Sharing what I learn along the way. Alumnus @ium_monaco and @sharda_uni
1K Followers 748 FollowingGive your AI a leash, a wallet, and a safety belt in one platform. 🦞
0x61d91CFF0FC9fbBDB89F505cf8a7422BF95fdba3
Join the Telegram ➤ https://t.co/cYzxGlX99a
35K Followers 33K Following💡 Passionate about AI, creativity, and modern technology. Exploring digital growth and new opportunities. Dm👀
[email protected]
147 Followers 466 FollowingFull stack builder exploring AI, GEO, WordPress & automation.
Creator of GEO Optimizer.
Building for the future of the web..
24 Followers 208 Following5th-time founder. Built & exited 2 startups. Scaled one to $1B+ revenue. Raised ~$100M for another. Now in SF Bay Area building startup #5: AI Growth Agent.
35 Followers 323 FollowingAI Eng. Founder of @Mas_AI_co (Mas-AI)
Building governed multi-agent AI systems for business Web2 and Web3.
🌐 https://t.co/okL151fAkd
15K Followers 9K FollowingReporter at @business covering AI. Send me your AI questions, hopes, fears: [email protected]
DM me off-the-record on Signal. username: shirin.30
5K Followers 920 FollowingCustomer engagement platform for tech-savvy marketers—email, push, in-app, and SMS: automated and powered by your data.
Contact [email protected] for support.
265K Followers 1K FollowingAt Greylock, we are the first partner to consumer and enterprise software entrepreneurs. Newsletter: https://t.co/4tHdH9xmvk
163K Followers 5 FollowingMakers of Devin, the first AI software engineer. We are an applied AI lab building end-to-end software agents. Join us: https://t.co/JZDd4VhMfh
661K Followers 1K FollowingSince 1926, the trusted advisor to the world's leading businesses, governments, and institutions. 130+ offices in 65+ countries.
36K Followers 4K FollowingI help non-technical people do more with AI. Community Builder at @Tenex_labs (Best AI Consulting Firm) 😎 Join our community for top 1% AI builders 👇🏼
14 Followers 136 FollowingAI-Visible Web Architect | Building in public | https://t.co/leuGsAOnnd | https://t.co/SXWIrrrBnY | freeCodeCamp Contributor | AI Engineer @ ThruLead