Tweets: 2K · Followers: 76K · Following: 483 · Likes: 2K
Today we're releasing HiL-Dynamics, the first open-source tool that measures how production agents actually collaborate with humans under uncertainty. Not just whether they got the answer. Now you can measure exactly when your agent asks for help, when it makes assumptions, and when it'll confidently ship the wrong answer. Our findings 🧵
To understand our story, you have to go back to the beginning. It started with self-driving cars. Ten years later, it's the architecture underneath AI that actually works, across frontier labs, enterprises, governments, and mission-critical systems around the world.
The humans stay. That’s the idea behind @scale_ai's new brand campaign. 10 years of building AI has taught us something: the most important decisions belong to humans. The AI that works in decisions of consequence keeps humans at the center. Going live in SF and NYC. Where to next? 👀
It's our birthday. 🎂 scale.com/blog/ten-years…
🚨 JUST IN: Scale AI milestone incoming. Stay tuned.
This month we turn 10. The hard work started in 2016, and it hasn’t stopped. Shortcuts are for losers. Winners welcome. scale.com/careers
Today we’re releasing Refactoring, the final leaderboard of our SWE Atlas suite. This new leaderboard is the ultimate test of an agent's ability to restructure code without breaking the system. Claude Opus 4.7 with Claude Code takes the top spot 🥇
Proud to share @CDAODoW has expanded its enterprise agreement with Scale AI, raising the ceiling from $100M to $500M. This expansion reflects our continued commitment to accelerating the adoption of AI capabilities across the Pentagon to help America stay prepared, resilient, and strong. scale.com/blog/Scale-ai-…
AI pretenders vs. AI contenders. It's those who still haven’t realized reliability is the product vs. those who can deliver reliability and outcomes. That's what the enterprise AI race comes down to. Here's a note I sent the Scale team this week.
We recently built HiL-Bench, the first benchmark to test a critical question: do AI agents know what they’re missing and when to ask? Frontier models perform well with perfect specs. But remove a few key details, and they confidently guess and ship plausible wrong answers. We just added GPT-5.5, Opus 4.7, and Kimi K2.6 to the leaderboard. Here’s what we’re seeing ⬇️🧵
Scale AI has acquired ICG Solutions, a defense technology firm specializing in real-time streaming data analytics. This is another step forward in how we support the U.S. defense and intelligence community with AI systems built to serve America’s most important national security missions. scale.com/blog/scale-acq…
Paper: static.scale.com/uploads/67a153…
Data: huggingface.co/datasets/Scale…
Leaderboard: labs.scale.com/leaderboard/hil
Code & Harness: github.com/hilbenchauthor…
Key takeaway for model builders: capability and judgment are orthogonal axes. Scaling SWE-Bench alone won't close this. Current post-training doesn’t penalize an agent for confidently solving the wrong problem. Ask-F1 is the first verifiable signal that does, and it transfers across domains. The goal isn't full autonomy. It's selective escalation: agents that know what they don't know.
New @ScaleAILabs Research: Your AI agent just gave you an answer, but did it actually solve the problem, get lucky, or just sound right? Today’s benchmarks can’t tell. We built HiL-Bench (Human-in-Loop Benchmark) to test a critical skill: does your agent know what it’s missing and when to ask for clarification? 🧵
SWEchella