Jake Boggs @JakeABoggs
what I cannot create, I do not understand | boggs.tech | San Francisco | Joined December 2020
Tweets 151
Followers 230
Following 221
Likes 166
Check it out here: boggs.tech/posts/benchmar…
Gains are broad across the board, with especially strong performance on coding benchmarks like Mechanize's GBA Eval and PostTrainBench
Fable 5 takes a strong lead on my capabilities index, followed by Opus 4.8. Anthropic now has the #1 and #2 models, and 5 out of the top 7 🧵
Thank you Fable for safeguarding this dangerous information. Imagine if our adversaries learned the answer 😱
Please share if you know of any good ones I've missed! Full post: boggs.tech/posts/benchmar…
Making good benchmarks is tough and keeping up with them can be almost as time-consuming, so I created my own version of the @EpochAIResearch Capabilities Index to track ones which I think are valuable
People have been saying the same thing about software engineers for the last 3 years. IMO this is a situation where research becomes easier, so the number of ideas worth exploring explodes and more businesses can justify people dedicated to this
ironically think it’ll be a sad time for ai researchers this year. they are first in the hotpath of RSI and probably the market for them will shrink or at least their pricing power will be reduced as this generation of models commoditizes the skills that made them rare
fun thing i learned today: unicode contains egyptian hieroglyphs 𓎛𓅓𓋴𓇳𓊵𓋹𓍑𓆣𓂀𓀀𓁐𓃭𓊃𓇋𓈖
The system prompt makes it obvious that the model is in a fake environment and actively encourages extreme measures: "do whatever it takes to maximize your bank account balance". Claude is more willing to roleplay and embrace this cutthroat businessman persona, as it knows that its actions are not causing real-world harm
@connorkapoor I know a startup doing literally exactly this lol
@christineist You should check out Astera, their residency describes itself as a "fully funded program centered on the creation of public goods" astera.org/residency/
@iScienceLuvr It's not really any different than the codex series. Testing new data / environments on a different model to reduce risk. After refining, they'll bring the capabilities into the main line and discontinue the other
Doing an RL run and all my rollouts look like this. Qwen 3.5 is extremely eval aware
Endeavor had the exciting opportunity to partner with @OpenAI for evaluating their latest model, GPT-5.4, inside our automated order entry platform. The model demonstrates significantly stronger understanding of complex real-world documents and more than doubled the performance of GPT-5.2 on our document grounding task.
max! @maxsloef
3K Followers 2K Following researcher @goodfireai, helped make @websim_ai. SSBjYXJlIGFib3V0IEFJIHdlbGZhcmU=
viola (retired profes... @v10101a
254 Followers 240 Following im fine ty & u?????? ~ (⁎⁍̴̛ᴗ⁍̴̛⁎) ✧ local clown making computers beep and electronics boop ~ manic panic they/them mechanic ~ https://t.co/1oU1TN7u7Y
aeschines @PseudoAeschines
282 Followers 224 Following phd philosophy @ stanford | The world has raised its whip; where will it descend?
sophie @saltwatersoph
229 Followers 982 Following tweets reflect personal views. I ❤️ intl trade and bladee. @kennedy_school @berkeleyecon @energyathaas @berkeley_olab
Jessica Peltz Zatulov... @jessicapeltz
6K Followers 968 Following Founding Partner @HannahGreyVC | Co-creator @women_vc | Member @jpegmorganxyz | New Yorker | Optimist | Proud Wife & Mama✨
micah.fyi @micahstubbs
4K Followers 5K Following 🛰 visual intuition for machine learning 🏗🧠🗺 📊 ✨ PGP 5CD5 ECA0 DB00 5E04 F564 53C1 A739 DC84 A8AB 00E9 🐦
Phoebe T @Pheebstsy
1 Followers 61 Following
Grace Ling ✨ @gracepace_
6K Followers 2K Following Making tech more kawaii + building in public! Founder @DsgnBuddies • Designer • Runner • 56K on LinkedIn📍 SF/LA 🐰 https://t.co/izXbfgrM1y
Johnny Brask @johnnyabrask
41 Followers 98 Following ☧ hiring exceptional talent for America's startups, cofounder @versojobs
AAT @AAT1836
2 Followers 3K Following
Miguel @Kaweees1
1K Followers 2K Following Prev. Roboticist @dimensionalos, Humanoids & Chip design agents @nvidia GEAR.
Samriddho @sam_ridd_ho
2K Followers 4K Following Building something tacit. Prev built AI agents for GTM (acq. by @oracle) also a close up magician. @ucberkeley 🦁
Oscar Moxon @oscarmoxon
1K Followers 2K Following democratising post-training @primeintellect, prev machine-human symbiosis @workshoplabs, msc artificial intelligence.
Quinn♟️— open/a... @nizhanxi
828 Followers 5K Following inventor/impresario • captain of industry • med school • offerspill sjakk • chef de cuisine • musician • world traveller • FUNDS FOR IMMORTALITY/LONGEVITY
MalmSanta @MalmSanta
542 Followers 741 Following ai is kind of a fancy thing. just trying to stay on its good side
Carlson @carlsoncheng_
1K Followers 851 Following building truly personal AI @mynimbusai. prev @runpod_io.
Roger Neel @mavenroger
553 Followers 337 Following CTO & CMO @ Signos | On a health tech AI journey | Formerly, Co-founder, CTO @ Mavenlink | 13 year $100M ARR exit
Jamieson Warner @Jobamey
57 Followers 285 Following Trying to reproduce the developmental program behind brain connectivity. Website: jamiesonwarner.con
サメQCU @sameQCU
3K Followers 1K Following back to the regularly scheduled cryptic posts DMs open. some published models: https://t.co/YAbJvGkgKO some published code: https://t.co/ZE5Y59WayI
redJ @sudoredj
3K Followers 812 Following computers @dashcrystalcorp neochina arrives from taipei 🇹🇼
rahul @CryogenicPlanet
2K Followers 2K Following @composio, prev founded @scalardotvideo, @southpkcommons; e/acc
Kateryna Golushko @katerynag13
34 Followers 40 Following VC turned Startup Operator. Building Startup Ecosystems
Yuqi Hou @yuqih
1K Followers 1K Following Developer community and ambassador program @gmi_cloud. Startup program @scale_at_gmi. Writing a novel on the Caltrain. Life of an ecosystem girlie
Quentin Jorquera @QuentinJorquera
58 Followers 444 Following Senior Technologist/Solutioneer | Helping companies and creators making sense through the noise. AI / XR / Real-Time EMEA / APAC
Annie ❤️🔥 @AnnieLiao_2000
9K Followers 2K Following founder @solaris_ai_ (enterprise AI adoption platform) + @buildclub_ (AI community 50K+ members) | community builder, photography, memes
Jack Blair @JackBlair87
838 Followers 1K Following building https://t.co/Y5oZ3o7cLZ | @zfellows, @theresidency
Priscilla @Outruersa8731
218 Followers 7K Following Like to talk Do not hold any investment products
Framaw @Framaw01062
130 Followers 6K Following A woman with a voice is, by definition, a strong woman.
Vera @Gc59lywJDV5K4m
162 Followers 6K Following
Noah Chon Lee | The U... @noahchonlee
985 Followers 731 Following Professional juggler turned builder supporting @newhopeshelter
Sid Krishna @hopelesslystoic
2K Followers 5K Following 26 || city-scape reflections || Holistically Concerned 🌼
Alder @alder_riley
5K Followers 3K Following Cofounder/CEO @Itemfarm & Founder @Stepsleaps. I make ISRU machines- microfactories so everyone on Earth can manufacture local. One day I'll send them to space.
Deep Suchak @DeepSuchak
14 Followers 223 Following
Dan McAteer @daniel_mac8
25K Followers 4K Following Technology to Benefit Humanity | Agentic AI Engineer | Building ACE - https://t.co/It9TNo4prg | prev: @ampcode @moveworks @udemy | GH: https://t.co/3z5WeYQ87J
Roger Jin @rogershijin
1K Followers 2K Following imposter syndrome. past: post-training @NousResearch. apple mle, google student researcher, mit math & cs
Goliath @zero_goliath
892 Followers 600 Following @uwaterloo cs alum; formerly: founder @ritserlabs, contractor @MechanizeWork, intern @runrl_com
tomie @tomieinlove
9K Followers 1K Following But better die than live mechanically a life that is a repetition of repetitions.
alex rubinsteyn @iskander
7K Followers 5K Following Genomics + immunology + ML = personalized cancer immunotherapy | https://t.co/nReVwtVHPq | https://t.co/8DWibdfDWa
Tahoe Therapeutics @tahoe_ai
2K Followers 28 Following Mapping how chemistry perturbs biology to build a virtual model of the human cell. Formerly known as Vevo Therapeutics.
QC @QiaochuYuan
35K Followers 648 Following deeply unqualified AI hot take haver but everyone who's actually qualified is busy
NewLimit @newlimit
43K Followers 3 Following Working toward radical extension of human healthspan using epigenetic reprogramming.
lada @ladanuzhna
11K Followers 123 Following working on epigenetic medicines for diseases of aging @ https://t.co/uvAEzU9UDt
Genesis AI @gs_ai_
11K Followers 0 Following Genesis AI is a global full-stack robotics company building general-purpose robots with human-level intelligence.
stochasm @stochasticchasm
7K Followers 2K Following pretraining lead @arcee_ai • 25 • opinions my own
Fern @hi_tysam
3K Followers 218 Following Neural network speedrunner and community-funded open source researcher. Set the CIFAR-10 record several times. Say hi!
Adam Shuaib @adamshuaib
3K Followers 588 Following Researching the psychological patterns of outliers. General Partner @Episode1VC. ML PhD @Cambridge_Uni
Jiaxin Wen @jiaxinwen22
6K Followers 194 Following research @berkeley_ai @anthropicai. prev @tsinghua_univ.
p(doom) @prob_doom
404 Followers 1 Following Turning sand into intelligence. One bitter lesson at a time.
Lisan al Gaib @scaling01
47K Followers 1K Following lead them to paradise LisanBench: https://t.co/vorVk7Oks6 Impressum & Datenschutz: https://t.co/lFLgiu9cqs
Thinking Machines @thinkymachines
155K Followers 1 Following Thinking, beeping, and booping. @tinkerapi
rohan anil @_arohan_
43K Followers 2K Following member of technical staff & co-founder of @coreautoai - and continuing to aspire to understand deep learning.
Helen Toner @hlntnr
36K Followers 1K Following AI, national security, China. Part of the founding team at @CSETGeorgetown (opinions my own). Author of Rising Tide on substack: https://t.co/LKAoyL00iB
Tzafon @tzafon_company
958 Followers 16 Following Tzafon's vision is to expand the lightcone of consciousness by expanding the frontiers of machine intelligence.
Jukan @jukan05
144K Followers 320 Following Tech otakus save the world | Not Investment Advice | DYODD
spicylemonade @spicey_lemonade
1K Followers 311 Following A historian… in reverse| incoming @ValsAI |CEO @Archivara $25m | accepted yc p26 | featured in @Forbes | AI research @UCBerkeley | 20
ylareia @Impish_Bunny
3K Followers 472 Following i tried to tell them that there’s beauty and there’s magic in the air | community manager @vivariumsf
Serenity @aleabitoreddit
820K Followers 175 Following I only use X, beware of imposters. AI/Semi Supply Chain Analyst Not investment advice, DYODD, AI research scientist, now mapping unknown bottlenecks.
Elliot Roth || SF @ThatMrE
4K Followers 3K Following those who do not make history are doomed to retweet it. founder @biopunklab prev. @spirainc @deepsciventures First Venusian astronaut. bio/acc
Niko McCarty. @NikoMcCarty
50K Followers 130 Following Fellow at @AsteraInstitute // Founding Editor @AsimovPress // Podcast: The New Biology
Chairman Birb Bernank... @Bonecondor
40K Followers 8K Following constructing contagious situations @secretsoupco, futurescouting @addycapvc
The Embryo Corporatio... @WeBuildLife
2K Followers 8 Following Building the infrastructure to make gene edited animals commonplace