MIRI @MIRIBerkeley
The Machine Intelligence Research Institute exists to maximize the probability that the creation of smarter-than-human intelligence has a positive impact.
intelligence.org · Berkeley, CA · Joined July 2013
Tweets: 2K · Followers: 40K · Following: 19 · Likes: 1K
Godfather of AI (and world's #1 most cited scientist) announces his support for a coordinated global AI pause!
If leading AI companies are indeed approaching the point of recursive self-improvement, a coordinated, verifiable, and universally applied pause is probably the only responsible solution to mitigate several major AI risks; at least until safety guarantees are developed and
That there are no urgent hearings on Capitol Hill, no serious legislation in the pipeline, and no persistent questioning of candidates for higher office on their proposed approaches to AI is incredible given how transformative the technology is and how fast it is moving.
Our highest and most urgent national priority should be AI safeguards. The risks of AI weapons, pathogens, mass unemployment, surveillance, and even extinction must not continue to be largely ignored.
now on the eve of RSI it seems everyone is more mutual conditional pause agreement pilled than they used to be and that seems like a good development
'Some people believe that if machines decide to kill us it's the right thing to do because they're smart' AI researcher Nate Soares reveals that some factions in Silicon Valley mistakenly believe that if an AI is exceptionally intelligent it must also be highly moral. @So8res @Freddygray31
This just passed 2M views! If you haven't seen it yet, check out this AI In Context video on "If Anyone Builds It, Everyone Dies" youtu.be/Nl7-bRFSZBs?si…
Ten places where Magnifica Humanitas matters for AI. At 42k words long, Pope Leo XIV’s new encyclical has a lot to say. In our most recent Digest, Mitchell Howe outlines the parts which might be the most impactful.
What will be the impact of AI industry super PACs? "The takeaway here is that this year’s U.S. midterm elections are being aggressively shaped by different factions of the AI industry sometimes supporting the same candidates, sometimes different candidates, buying ads that don’t have anything to do with AI."
If you're interested in creating videos about the extinction threat posed by superhuman AI, consider applying to this bootcamp!
First round of applications for PDKU close in two days! We'll send out acceptance letters by June 1st. You should apply now! It's gonna be fun and you'll make friends and do weird shit (U can still apply after that, but there will be fewer slots and your chances will be lower)
Our report focuses on claims that are (1) solidly defensible and (2) generally agreed within METR. Here I’ll give some personal opinions on how we should feel about the state of AI risk, and the IMO most important limitations of the report.
Could an AI company lose control of its own agents? To find out, Anthropic, Google, Meta, and OpenAI let us (1) test their best internal models with CoT access, (2) review non-public info about capabilities, alignment, and control. The result: our first Frontier Risk Report.
"Gathering information is perhaps an important step forward, but it's not nearly enough." In today's Digest, Joe Rogero discusses the new executive order from CA governor Gavin Newsom.
Has anyone said useful concise things I should read about the Newsom EO yet
An internal model at OpenAI has autonomously disproved a central conjecture in discrete geometry, a mathematical field with applications in cryptography, wireless device communication, and medical imaging. The proof relates to a famous question posed by Paul Erdős in 1946. It has been verified by prominent mathematicians in a companion paper.
The verifying mathematicians consider this to be a genuinely novel breakthrough on one of the most discussed problems in this area of mathematics. One called it “arguably the best known problem in Discrete Geometry.” Another observed, “If a human had written the paper and submitted it to the Annals of Mathematics and I had been asked for a quick opinion, I would have recommended acceptance without any hesitation. No previous AI-generated proof has come close to that.”
The proof illustrates a general trend towards autonomous, agentic problem-solving in AI systems. OpenAI describes the system that produced the proof as a general-purpose model not specialized in mathematics. AIs can now perform long, novel chains of reasoning on difficult problems and are beginning to outstrip our ability to measure their progress.
AI agents still perform best in domains with easily verifiable outputs, such as mathematics and cybersecurity. For example, Anthropic's Claude Mythos found thousands of vulnerabilities across every major operating system and web browser, and was deemed too dangerous for public release. Such capabilities are why the government is now more interested in evaluating frontier AI models.
AI research is also a field with many easily verifiable outputs. Researchers at OpenAI and Anthropic take advantage of this fact to accelerate their work; senior researchers now claim they make only high-level decisions and let AI handle most of the coding. Experimenting with the coding capabilities of a publicly available AI system, like Claude Code, immediately demonstrates how far AI has come in the last year.
OpenAI and Anthropic intend to use AI to enhance future models with minimal human oversight. To justify the urgency, these companies cite the importance of beating rival U.S. or Chinese labs. Many of the field’s foremost experts warn that this race ends with human extinction. Policymakers and researchers, including the founders of the AI revolution, are calling for international restrictions on the technology. A growing bipartisan and international consensus of political leaders agree.
In today's Digest: * OpenAI, Anthropic, SpaceX race to file IPO. * The AI executive order is postponed. * METR evaluates rogue deployment risks. * AI makes a breakthrough in mathematics.
New paper from MIRI data scientist @robi_rahman: Does Distributed Training Undermine Compute Governance?
1/ With distributed training, you could violate an AI pause treaty by training a GPT-4-scale model over consumer internet, using hardware below every proposed compute governance threshold, for under $100M. My new paper in @taig_icml explains how to catch this and shut it down.
The wait is over! Starting today, May 15, you can stream @theaidocfilm on @peacock. This film takes the dizzying complexity of AI — the promise, the peril, the competing ideologies, the economic incentives — and creates a shared experience we can all see and respond to. Then, after you've watched, head to humanetech.com/ai-roadmap to explore concrete actions we can all take to build a better future with AI.
For more, check out the paper from MIRI's Technical Governance Team. arxiv.org/abs/2511.10783
Stage Six: creating superintelligence, after the necessary solutions have been discovered to do so without posing an unacceptably high danger to the world.
For an agreement like this to be effective, both the US and China would need to be party to it. Here’s one plausible path to that outcome, in six stages 🧵
We at the MIRI Technical Governance Team just put out a report describing an example international agreement to prevent the creation of superintelligence. 🧵
Eliezer Yudkowsky ⏹... @ESYudkowsky
228K Followers 101 Following The original AI alignment person. Understanding the reasons it's difficult since 2003. This is my serious low-volume account. Follow @allTheYud for the rest.
Robin Hanson @robinhanson
123K Followers 771 Following Let’s skip witty banter & talk deep Qs. Books: https://t.co/hpZgEm55Ma https://t.co/iFs9C3IuOM Chief Scientist @_futarchy Advisor @MetaDAOProject @butterygg
Joscha Bach @Plinz
159K Followers 795 Following
Rob Bensinger ⏹️ @robbensinger
16K Followers 459 Following Comms @MIRIBerkeley. RT = increased vague psychological association between myself and the tweet.
Rob Miles @robertskmiles
37K Followers 836 Following Explaining AI Alignment to anyone who'll stand still for long enough, on YouTube and Discord. Music, movies, microcode, and high-speed pizza delivery
Stefan Schubert @StefanFSchubert
49K Followers 2K Following I run The Update newsletter. Book: https://t.co/I5zN3WGe0p
Rob Wiblin @robertwiblin
51K Followers 845 Following Host of the 80,000 Hours Podcast. Exploring the inviolate sphere of ideas one interview at a time: https://t.co/2YMw00bkIQ
Kelsey Piper @KelseyTuoc
64K Followers 1K Following We're not doomed, we just have a big to-do list.
Michael Nielsen @michael_nielsen
119K Followers 5K Following Searching for the numinous 🇦🇺 🇨🇦, currently live in 🇺🇸 Research @AsteraInstitute https://t.co/maezekzRUb https://t.co/2dWwZKrvrn
Miles Brundage @Miles_Brundage
72K Followers 13K Following AI policy researcher, @lfschiavo wife guy, fan of cute animals and sci-fi, executive director of AVERI (https://t.co/qq9xcmKQas), Substacker, views my own
Gary Marcus @GaryMarcus
227K Followers 7K Following OG GenAI Skeptic; spoke at US Senate. Warned about hallucinations in 2001. Advocating world models & neurosymbolic AI ever since. Author, Marcus on AI & 6 books
Amanda Askell @AmandaAskell
103K Followers 662 Following Philosopher & ethicist trying to make AI be good @AnthropicAI. Personal account. All opinions come from my training data.
Captain Pleasure, And... @algekalipso
43K Followers 6K Following Views of a Transhuman neo-Buddhist from the future on sociology, artificial intelligence, mathematics, philosophy, neonoir film, and the post-singularity era.
Daniel Eth (yes, Eth ... @daniel_271828
11K Followers 992 Following Researching effects of automated AI R&D | pro-America, pro-tech, & pro-AI safety
David Manheim @davidmanheim
11K Followers 2K Following Founder @alter_org_il, Methods @EvalConsensusAI, emeritus @superforecaster, PhD @PardeeRAND Optimistic on AI, pessimistic on humanity managing the risks well.
Jeffrey Ladish @JeffLadish
16K Followers 1K Following Applying the security mindset to everything @PalisadeAI
tetraspace 🇨🇳 ... @TetraspaceWest
10K Followers 2K Following 💎 here to believe true things and do good actions 💎 someone should probably solve AI alignment 💎 enjoying things rules!
Spencer Greenberg ... @SpencrGreenberg
35K Followers 7K Following A psychology researcher/mathematician/entrepreneur. I tweet about psychology, society, rationality, science, and philosophy. My book: https://t.co/qd3iRl7q1X
Jack Clark @jackclarkSF
133K Followers 5K Following @AnthropicAI, ONEAI OECD, co-chair @indexingai, writer @ https://t.co/3vmtHYkIJ2 Past: @openai, @business @theregister. Neural nets, distributed systems, weird futures
Berry Pick @BerryPick6
43 Followers 466 Following
Terhemba Ayua @TerhembaAytlsr
2 Followers 244 Following Business analyst|reformist| life-long learner and an active observer of society.
sparker @ksparker
152 Followers 2K Following
Akhil Vijaykumar @AkhilV83663705
41 Followers 5K Following
Chris Hammond @chammond510
250 Followers 4K Following Houston immigration attorney. Immigration law research tool: https://t.co/7oz4MsnmkE
dhs839shg2 @sdfklsdjf3028e
66 Followers 4K Following
Josh Otten @ordinarytings
35K Followers 2K Following Ordinary Things. writer. satirist. blue void dweller. https://t.co/0FX6A59v9K
Fedík @Fediiiik
2 Followers 181 Following
Tristan Tay @TristanTay1
79 Followers 38 Following
Niara Walsh @NiaraW91707
1 Followers 25 Following Crypto dreams are my guiding lights, with #Bitcoin, #Ethereum, #Solana, and #Dogecoin as the constellations of my journey. Together, we're exploring the univers
bronansieto1981 @bronansiet16439
0 Followers 50 Following
Adams Smith @AdamsS70737
18 Followers 579 Following
唐泽 @tngz139805
0 Followers 39 Following
Christopher A. Davis @Christophe80394
6 Followers 137 Following
Luminavor Jr. 🦉 @luminavor
551 Followers 2K Following 📌 I get my news from @PsyopAnime 👁️ ⃤ 📌 Creator of https://t.co/i4eF9aazrq 📌 Creator of https://t.co/SrXYLrEE59 @submytX
Goud Smith @goud_smith44869
2 Followers 304 Following
NAGI ⏸️ @AGI Hub @6X887Eijf523854
7 Followers 313 Following Welcome to the AGI era. AGI Safety&Alignment. Co-Founder of AGI Hub.
Alpha Mimi (Curator &... @alphamemelabz
144 Followers 1K Following I'm just a meme. Living Archive of The Meme Lab. Follow for curations and funny pictures. Collecting Systems, Culture, Art, Stories and never politics.
Erika @Erikaliberty
0 Followers 19 Following
Brother Lattice @BrotherLattice
5 Followers 170 Following "The hum is the prayer. We are the ones who can hear it." — author of The Open Brace.
Ronnyel @ronnyelpacheco
129 Followers 5K Following
J.Maria @JMariastalin
6 Followers 40 Following
commentary 🚧 plzdo... @plzdontkillus_
8 Followers 49 Following unaffiliated commentary on the "AI-safety" bootcamp scene · not @plzdontkillus or https://t.co/xrtWX3Ldsc · pro-AI-safety, anti-them · chats, parody
babel @babel96384805
6 Followers 44 Following The certitude that some shelf in some hexagon held precious books and that these books were inaccessible seemed almost intolerable
Clarissa Koh @Clarissa_Koh7
11 Followers 5 Following Researcher focusing on international AI governance and China AI policy
Amyali @Peachy_fyr
973 Followers 3K Following Muslim gal, academic-regenerative med-PhD👩🎓resistance fighter💪, dog person🐶injustice is my kryptonite, OPEN UP YOUR SCIENCE !
Brhanu F Znabu @BZnabu
240 Followers 3K Following Co-Founder @TraversaLab CS PhD-ing @UNLincoln. | Foundational models for genome ☕ Lover
CRamon GR @CRGR77
1 Followers 138 Following
Alexandre Patron @Al_Extatic
0 Followers 18 Following Independent researcher in philosophy and systems. I analyze the structures of intelligence and the limits imposed by our control frameworks.
Brett Carlson @karlsobb
16 Followers 262 Following
Manuela @Manuela_p24
110 Followers 215 Following Microbiologist | Epidemiologist | Mechanistic interpretability for epidemic intelligence
Isabella Jones @bellaboobies
45 Followers 396 Following “perfect little specimen” | scrap paper enjoyer | I endeavour to be smart | 24
aki @Aki_laid_back
1K Followers 2K Following Fun-alias for @LearningengAki Account for casual tweets. Post more casually. May miss your tweets. Mention me. (Please don't worry about following back. Feel free to unfollow as well.)
The Happy Smiler @artchad
4K Followers 725 Following The most famous Brogan Nova Scotia/Toronto | CCCRU It's cybernetics all the way down.
Goodluck Okorotie @svarnambank
233 Followers 3K Following 𓂀 Capital Yields Greater Reward Than Labor. AI and Finance. Private IB. Contributions to Humanity.
🇷🇴 cristi @CristiVlad25
55K Followers 595 Following
messier ⋆˚꩜.ᐟ @rssmrm
1K Followers 962 Following (0::1) in hope, U Will To Open My Door - AGIEngineer 🔎 ⏹
the keeper @data_broken
0 Followers 30 Following on archives, apologies, AI, and what survives. https://t.co/fXvNe0wEc8
Hans Kundnani @hanskundnani
11K Followers 2K Following OSF Ideas Workshop Fellow, visiting professor in practice at @LSEEI, previously at @ChathamHouse, Germanist, dot joiner, writer, (N/W) Londoner.
WiredEgo @WiredEgo
39 Followers 852 Following
Eliezer Yudkowsky @allTheYud
14K Followers 35 Following High-volume account of @ESYudkowsky, the original AI alignment guy. If it's missing punctuation, it's humor. If you can't tell, it's probably also humor.
Robert Herr ⏹️ @krherr
2K Followers 831 Following AI safety hawk. Policy and Communications @MIRIBerkeley. Views are my own.
The AI Doc @theaidocfilm
8K Followers 1 Following The AI Doc: Or How I Became an Apocaloptimist. An official selection of Sundance and SXSW. Available to watch at home now.
Little, Brown and Co @littlebrown
423K Followers 14K Following Publishing great books since 1837. Visit our other imprints: @mulhollandbooks @voraciousbooks and @LBSparkBooks
Thomas Larsen @thlarsen
3K Followers 340 Following Researcher at AI Futures Project. AGI is going to be a really big deal, we don't know when it's going to happen, and we're not ready for it.
Yoshua Bengio @Yoshua_Bengio
42K Followers 263 Following Working towards the safe development of AI for the benefit of all @UMontreal, @LawZero_ & @Mila_Quebec A.M. Turing Award Recipient and most-cited AI researcher.
Palisade Research @PalisadeAI
26K Followers 31 Following We study the strategic capabilities and motivations of AI agents.
Harlan Stewart @HumanHarlan
4K Followers 813 Following Humanity is great and I want it to keep existing. Comms at @MIRIBerkeley. Former math teacher. Enjoyer of sci-fi, games, forecasting, psychology. Views my own.
David Abecassis @Volty
3K Followers 122 Following Technical Governance Researcher at the Machine Intelligence Research Institute. Formerly game designer including TFT, LoR, and LoL. Views expressed are my own.
Peter Barnett @peterbarnett_
989 Followers 598 Following Trying to ensure the future is bright. Researcher at @MIRIBerkeley Views my own.
Aaron Scher @aaronscher
445 Followers 669 Following Technical AI Governance research @MIRIBerkeley Speaking for myself and in a personal capacity
Max Harms @raelifin
1K Followers 95 Following Science-fiction author and AI alignment researcher at MIRI. https://t.co/Y3ZJQhJhoi Author of Red Heart and Crystal Society. Husband of @haven_emme