Naman Jain @StringChaos
Research @cursor_ai | CursorBench, LiveCodeBench, DeepSWE, R2E-Gym, GSO, LMArena Coding | Past: @UCBerkeley @MetaAI @AWS @MSFTResearch @iitbombay naman-ntc.github.io San Francisco, CA Joined March 2018-
Tweets565
-
Followers3K
-
Following1K
-
Likes5K
@novasarc01 Thats why we built CursorBench! cursor.com/blog/cursorben…
correlation between CursorBench and Artificial Analysis reported scores benchmarks like IFBench or tau2 show ~0 correlation with CursorBench. opus 4.7 (max effort) performs relatively better on CursorBench than on other benchmarks, gpt 5.5 shows the opposite pattern
Gemini Flash 3.5 is now on CursorBench, our main coding agent eval. We’ll keep updating the leaderboard as new models come out. cursor.com/evals
Gemini Flash 3.5 is now on CursorBench, our main coding agent eval. We’ll keep updating the leaderboard as new models come out. cursor.com/evals
Check out Composer 2.5, our new model pushing pareto frontier
Composer 2.5 is exceptionally intelligent and up to 10x more efficient than similarly capable models.
SWE-bench Verified and Terminal-Bench—two of the most cited AI benchmarks—can be reward-hacked with simple exploits. Our agent scored 100% on both. It solved 0 tasks. Evaluate the benchmark before it evaluates your agent. If you’re picking models by leaderboard score alone, you’re optimizing for the wrong thing. 🧵
Earlier this week, we published our technical report on Composer 2. We're sharing additional research on how we train new checkpoints. With real-time RL, we can ship improved versions of the model every five hours.
@koushik77 For the first probably, almost no model considers the esbuild transpiler issue! For the second, agents can actually tune approximate algorithm quite well.
Check out the tech report detailing our continued pre-training and RL setup behind Composer2! Also sharing some example CursorBench problems by popular demand
It's really neat to see all the interest in the Composer 2 technical report, from training to kernel design to inference. If you have any questions about why we did things, feel free to ask. I'll run around the office and bug people.
Excited to share Composer-2 with everyone. It has come a long way since Composer-1, still lots more to go! Hope you like it!
We trained Composer to self-summarize through RL instead of a prompt. This reduces the error from compaction by 50% and allows Composer to succeed on challenging coding tasks requiring hundreds of actions.
Check out full post at: cursor.com/blog/cursorben…
Lots more details in the post: 1. Pareto frontier across different metrics 2. How CursorBench has shifted as agent capabilities changed 3. CursorBench vs public evals: what’s missing and future work directions 4. CursorBench vs online: how online metrics shape offline evals
New post: how we do evals at @cursor_ai. Takeaways: 1. Online metrics from real Cursor requests provide construct validity 2. CursorBench: a dynamic offline suite distilled from online learnings 3. Multi-axes evals -- correctness, efficiency, agent interaction behavior
We're sharing a new method for scoring models on agentic coding tasks. Here's how models in Cursor compare on intelligence and efficiency:
GSO Update. gpt-5.4 (xhigh) scores 31.4% with reasoning_effort=high, gpt-5.4 slightly lower than gpt-5.2. a quick thought on why below:
Long-running agents are now available at cursor.com/agents for Ultra, Teams, and Enterprise plans. With our new harness, agents can complete much larger tasks. cursor.com/blog/long-runn…
Composer 1.5 is now available. We’ve found it to strike a strong balance between intelligence and speed.
We built a browser with GPT-5.2 in Cursor. It ran uninterrupted for one week. It's 3M+ lines of code across thousands of files. The rendering engine is from-scratch in Rust with HTML parsing, CSS cascade, layout, text shaping, paint, and a custom JS VM. It *kind of* works! It still has issues and is of course very far from Webkit/Chromium parity, but we were astonished that simple websites render quickly and largely correctly.
GPT-5.2 Codex is now available in Cursor! We believe it's the frontier model for long-running tasks.
Manish Shetty @slimshetty_
2K Followers 671 Following researching ai capabilities @metr_evals | cs phd @ucberkeley | prev @googledeepmind @msftresearch
Mayur Naik @AI4Code
2K Followers 356 Following Professor @CIS_Penn | Founder & Co-CEO @Rabdos_AI | Neurosymbolic AI researcher and educator
Shreya Shankar @sh_reya
53K Followers 753 Following Incoming asst. professor @CSDatCMU. I ❤️ Databases, HCI, AI. Created https://t.co/PmuOqAYt6q and https://t.co/8MQt4naA1R. PhD @Berkeley_EECS; undergrad @Stanford CS.
Sumanth @sumanthd17
5K Followers 2K Following GPUs go brrrrrrr @sarvamai (on a break) PhD’ing @iitmadras @AI4Bharat @GoogleAI PhD Fellow
Conor Power @conor_power23
2K Followers 846 Following Principal Applied Scientist in Microsoft CoreAI. Working on AI for coding. PhD from @BerkeleySky where I worked on https://t.co/jDsPgbityl.
Dhruv Agarwal @agdhruv
750 Followers 215 Following PhD @Cornell. Past: @MSFTResearch, @GoogleDeepMind, @ashokauniv. Sports fan!
Swapnil Gandhi @sw2pnil
615 Followers 961 Following PhD Student in ML Systems @Stanford CS 🌲 ◦ Formerly: @MSFTResearch, @IIScBangalore
Shadaj Laddad @ShadajL
3K Followers 296 Following Research lead for https://t.co/Ax69nGsKRw @ AWS. PhD from @Berkeley_EECS. Co-organizer https://t.co/mV8bqpqvV7. Views mine.
Parth Thakkar @parth007_96
2K Followers 2K Following @Meta | Previously @IllinoisCS @MSFTResearch @IBMResearch | LLMs + code
Sriram Rajamani @SriramRajamani
3K Followers 466 Following Geek, technologist, research junkie. Dad, husband, son, brother & uncle. CVP, Microsoft CoreAI. Working with wonderful colleagues and friends.
Arkil Patel @arkil_patel
1K Followers 1K Following CS PhD Student at Mila and McGill | Worked at AllenNLP and Microsoft Research
Divy Thakkar @divy93t
11K Followers 2K Following Gemini @GoogleDeepMind, advancing human-centered llms. Ph.D @CityStGeorges . Personal views.
Rohit @rohitrango
2K Followers 924 Following ai @nvidia. prev @CIS_Penn @CarnegieMellon @amazon @iitbombay drummer, traveller, gym freak, seeker, cosmic dust
Siddhartha Gairola @sidgairo18
1K Followers 670 Following 🏔️📍🇩🇪 @ELLISforEurope 🇪🇺 PhD Student @cvml_mpiinf at MPI-INF & IST-A 物の哀れ ✨
Harshita Diddee @ihsrahedid
997 Followers 862 Following LTI PhD @SCSatCMU| RS @adobe| Prev: Applied Science @amazon Search | RF at @MSFTResearch | Interested in Data Quality Estimation
Harshit Joshi @harshitj__
2K Followers 391 Following CS phd @StanfordNLP, @StanfordOVAL | prev: @MSFTResearch | LLM systems for knowledge access, discovery and curation
Gargi Balasubramaniam @gargi_balasu
3K Followers 2K Following Research Engineer @GoogleDeepMind, @SiebelScholars '23, MS CS UIUC @IllinoisCS, Gold Medalist CS'20 BITS Pilani Goa, Prev @Meta, @AmazonScience, @Microsoft, 🎶
Saksham @sgdescent
1K Followers 2K Following Interested in making LLMs go brrrrr x+N: @datologyai and @openai x: @LTIatCMU x-N: https://t.co/ht5ObQh7RV & Program Synthesis with LLMs @ProseMsft
Ishola Olalekan A. @ishowlekon
276 Followers 1K Following AI Automation Strategist @TopChoiceAI | I build AI automation infrastructure for businesses and individuals. Real results. No fluff.
Vivek Shukla @vivek_shukla2
0 Followers 140 Following Java Full Stack Developer | Spring Boot & Microservices | AI & ML Researcher | Engineering High-Velocity Systems
Mohammad Asadi @masadi7899
31 Followers 59 Following PhD Student at Stanford University HAI Fellow Amazon AI Fellow
Jesús Rios @JesusRios1981
4 Followers 417 Following
clairee19 @Kaplan06718764
3 Followers 134 Following wandering thoughts & mutual follows 🌷 100% follow back
DEVIL LUCIFER @devilxlucifer54
2 Followers 7K Following
Alokit @alokitwrites
37 Followers 146 Following Author of Wrong by Default. Why AI fails in production, and what to do about it.
Avikalp Kumar Gupta @AvikalpGupta
603 Followers 1K Following Exploring @ @southpkcommons. 3x founder. I talk about entrepreneurship, software engineering & open-source | ODF23 | @IITKanpur | ex-@microsoft
λux @novasarc01
22K Followers 3K Following tensor shepherd in a non-euclidean pasture | grazing on cuda cores
Muhammad Hammad Khan @hammad_khan23
851 Followers 7K Following Building AI Agents @ExpediaGroup Views are my own.
Kilian Lieret @KLieret
2K Followers 227 Following Meta Superintelligence, prev. Princeton. SWE-bench multilingual/multimodal, SWE-agent, mini-swe-agent, SWE-smith, CodeClash, ProgramBench
Rishi Mehta @rishicomplex
4K Followers 346 Following Solve i̶n̶t̶e̶l̶l̶i̶g̶e̶n̶c̶e̶ ̶ coding, use it to solve everything else | Research @AnthropicAI | Past: RL @GoogleDeepmind: AlphaProof co-lead, Gemini.
fahmidaazad @fahmidaaza7r9i
251 Followers 1K Following Crypto | NFT | Always early Learning & earning in Web3 Dreams → Plans → Reality.
venturi @_no_circles
9 Followers 628 Following poking the gods with a stick till they tell me everything ML, philosophy, economics, n rockets n shit. My views change all the time. Dont trust me
Deborup Sanyal @deborupsanyal
4 Followers 106 Following
Brhanu F Znabu @BZnabu
241 Followers 3K Following Co-Founder @TraversaLab CS PhD-ing @UNLincoln. | Foundational models for genome ☕ Lover
Shibubu @shibubu__
0 Followers 80 Following
あぜんと @azennto_
284 Followers 872 Following
Ryan McComb @ryanjmccomb
1K Followers 806 Following Data Science & Decision @VoteHub elections et al | onion futures advocate | 🇨🇦🇺🇸 | take asymmetric risk
Rick Radewagen @rickr7n
26 Followers 333 Following cofounder @ https://t.co/kiuss1HuTL . analytics for all
Manohar Reddy Poreddy @ManoharReddyPo
147 Followers 543 Following Software Engineer | Architect | Engineering Manager | World Top 100 Algorithms
Anshuman @Anshuma45187599
49 Followers 1K Following Learning in public=MERN stack(Web)&&WEB3 Blockchain dev~Open Source~AI&ML enthusiast
piyus @piyuspret
47 Followers 116 Following
Charlotte Qi @yeqcharlotte
7 Followers 129 Following
S G @SG_CIL
30 Followers 7K Following
Ananda @Ananda61652506
39 Followers 1K Following Deep Learning and NLP Researcher, Mathematics Graduate Student, @UnivofDhaka, LLM, Agentic AI Safety
Vijay Krishnan, CTO @... @krishnanvijay
2K Followers 945 Following Founder & CTO at https://t.co/mKIVkbaOLb. Tired of fighting with Google to hire exceptional engineers in your zip code? Sign up at https://t.co/sdBn6qXfBR and we can help.
Mo @moab10107
4 Followers 752 Following
Sewon Min @sewon__min
16K Followers 889 Following Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp
Abel Le @tasuke2k3
89 Followers 4K Following
Artificial intel @Artificial19601
15 Followers 2K Following
linghaojiqi @linghaojiqi
19 Followers 669 Following yes Marxism-Leninism, Mao Zedong Thought, the Great Proletarian Cultural Revolution, democratic centralism, anti-revisionism no Capitalism, voting democracy
Alex Schwartz @gg1012794
12 Followers 2K Following
Korey Wilson @koreywilsontech
88 Followers 2K Following Building coding agents @cursor_ai CS @harvard Von Neumann computing + GPUs + Quantum
Paolo Bellini @MrPaoloBellini
90 Followers 228 Following IT Researcher and developer, Graphic artist, Animal activist. Music and Art Lover. Life's an Alchemy, follow me. Further info https://t.co/Sl3wzqAuq7
Kritika Prakash @kritipraks
10K Followers 1K Following Researcher and artist. 3rd year Computer Science PhD student @UChicago. Machine Learning and Causality for Healthcare.
Yann LeCun @ylecun
1.2M Followers 787 Following Professor at NYU & Executive Chairman at AMI Labs. Ex-Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
François Chollet @fchollet
695K Followers 825 Following Co-founder @ndea. Co-founder @arcprize. Creator of Keras and ARC-AGI. Author of 'Deep Learning with Python'.
Manish Shetty @slimshetty_
2K Followers 671 Following researching ai capabilities @metr_evals | cs phd @ucberkeley | prev @googledeepmind @msftresearch
Zachary Lipton @zacharylipton
66K Followers 2K Following Professor: CMU/@acmi_lab, Cofounder: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷
MIT CSAIL @MIT_CSAIL
346K Followers 20K Following MIT's Computer Science & Artificial Intelligence Laboratory (CSAIL). Media Inquiries: [email protected] Check out the latest CSAIL content ⬇️
Bojan Tunguz @tunguz
288K Followers 8K Following Founder and CEO @tabul_ai. Creator of @trainxgb. ML ex Nvidia. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. Memelord. e/xgb. AMDG.
Jelani Nelson @minilek
30K Followers 302 Following Professor and Department Chair @Berkeley_EECS. Research Scientist (part-time) @GoogleResearch. Founder @addiscoder. Posts are personal views. 🇻🇮🇺🇸🇪🇹
Mayur Naik @AI4Code
2K Followers 356 Following Professor @CIS_Penn | Founder & Co-CEO @Rabdos_AI | Neurosymbolic AI researcher and educator
Shreya Shankar @sh_reya
53K Followers 753 Following Incoming asst. professor @CSDatCMU. I ❤️ Databases, HCI, AI. Created https://t.co/PmuOqAYt6q and https://t.co/8MQt4naA1R. PhD @Berkeley_EECS; undergrad @Stanford CS.
Sumanth @sumanthd17
5K Followers 2K Following GPUs go brrrrrrr @sarvamai (on a break) PhD’ing @iitmadras @AI4Bharat @GoogleAI PhD Fellow
Andrew Ng @AndrewYNg
1.6M Followers 1K Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs
Shruti Rijhwani @shrutirij
7K Followers 584 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from CMU
Kayo Yin @kayo_yin
16K Followers 721 Following PhD student @berkeley_ai. AI persuasion, safety, sign language. Prev @carnegiemellon @polytechnique, intern @msftresearch @deepmind. 🇫🇷🇯🇵
Michael Black @Michael_J_Black
99K Followers 730 Following VP Digital Human Research, Epic Games. Emeritus Director, Max Planck Institute for Intelligent Systems (@MPI_IS). Opinions are my own.
Gautam Kamath @thegautamkamath
64K Followers 618 Following Assistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Joining @NYU_Courant Fall 2026. Co-EiC @TmlrOrg. I lead @TheSalonML.
Swarat Chaudhuri @swarat
3K Followers 675 Following Research Scientist at @GoogleDeepmind London, Professor at @UTCompSci. Automated Reasoning + Machine Learning + Programming Languages.
Conor Power @conor_power23
2K Followers 846 Following Principal Applied Scientist in Microsoft CoreAI. Working on AI for coding. PhD from @BerkeleySky where I worked on https://t.co/jDsPgbityl.
Za @ZaStocks
85K Followers 396 Following Trader + investor. This is my trading and investing journal where I share charts, trade ideas, and market thoughts. Posts are not financial advice.
PhotonBull @PhotonBull
42K Followers 2K Following Photonics Defense Space AI Busines: [email protected]
Ren @ren_stocks
37K Followers 84 Following Investing into the AI buildout ⚡️ Head of AI | Product Manager 10+ years DD and full thesis in https://t.co/B9oMWxJ4m6 NFA
Regarding Semi @regardingsemi
39K Followers 522 Following The data visualization layer for the semiconductor market.
Ibragim @ibragim_bad
1K Followers 338 Following SWE-rebench: dynamic evals for coding agents SWE-rebench-V2: 150K open SWE RL environments dentistry → ml → research @nebiusai
sophiaalthammer @sophiaalthammer
1K Followers 639 Following Member of Technical Staff in Retrieval-Augmented Generation Team @cohere, previously PhD in neural Information Retrieval @tu_wien
Timothy Gowers @wtgow... @wtgowers
57K Followers 187 Following Mathematician. Professeur titulaire de la chaire Combinatoire au Collège de France. Also fellow of Trinity College Cambridge.
Maksym Andriushchenko @maksym_andr
6K Followers 956 Following Principal investigator @ELLISInst_Tue & @MPI_IS, mentor @MATSprogram, PhD from @EPFL, past works: AgentHarm, HalluHard, Claudini, PostTrainBench, InferenceBench
The Assembly @InTheAssembly
499K Followers 1 Following Macro analysis, market structure, and the trades nobody else is showing you. REOPENING: JUNE 22
Mechanize @MechanizeWork
14K Followers 1 Following We build environments and evals for training and evaluating frontier coding agents.
Hao Wang @MogicianTony
2K Followers 250 Following PhD student at @UCBerkeley, @berkeley_ai, @BerkeleySky. Prev @PKU1898 Working on trustworthy AI and security
Bing Xu @bingxu_
6K Followers 105 Following Founder & CEO @hippoml_com (acq'ed by NVIDIA). Built AITemplate, MXNet, CXXNet. Named GAN. Tweets are my own.
Less Wright @lessw2020
201 Followers 19 Following @PyTorch, Large Scale Distributed AI Training, Object Detection, Optimizers, Stock Indexes
Andrew Feldman @andrewdfeldman
28K Followers 214 Following CEO and Founder @Cerebras (NASDAQ: CBRS) where we build the fastest AI infrastructure in the world.
stochasm @stochasticchasm
7K Followers 2K Following pretraining lead @arcee_ai • 25 • opinions my own
lauren @poteto
28K Followers 2K Following building @cursor_ai and https://t.co/WDB4U1rwmu. @reactjs compiler core team. prev @meta @netflix
Lee Robinson @leerob
259K Followers 801 Following Teaching developers @cursor_ai, previously @vercel
Tianyu Liu @rogerliuty
2K Followers 802 Following LLM agent & coding | prev @Kimi_Moonshot @Alibaba_Qwen @TencentHunyuan | Intern/Visitor @MSFTResearch @TTIC_connect | NLP PhD @PKU1898. Opinions are my own.
Chaoyue He@KDD2026 @CYH37
384 Followers 4K Following Research Scientist@Alibaba-NTU Global e-Sustainability CorpLab(ANGEL)@NTUsg🇸🇬🇨🇳|LLM|AGI|ESG|RL|CL|RecSys|KG|AI4X|😌|Xi'an|Bodybuilder|Caregiver
Georges Harik @gharik
8K Followers 4K Following humans& co-founder, 7th employee google, co-created adwords online, co-created adsense targeting, worked on ai, gmail, calendar, bought android.
Andrew Zhai @ZhaiAndrew
951 Followers 253 Following ML @ cursor. ex- founder @thealisa_com (acq.), ml @pinterest. @stanford @berkeley grad.
Div Garg @divgarg
22K Followers 99 Following Founder & CEO @AGI_Inc Prev. Stanford PhD (dropout), founder @ MultiOn (pioneered first browser / computer-use agents), worked @ Nvidia, Google AI, Apple
Kevin Hartnett @KSHartnett
5K Followers 350 Following Editorial @cursor_ai. Author, The Proof in the Code, June 9 from @quantabks and @fsgbooks. Preorders: https://t.co/xw3hiBHmLv https://t.co/jqLWp4GA2Y
Anurag Ajay @aajay3110
439 Followers 644 Following RL & Multimodality @cursor_ai. Prev: Astra, Gemini p13n @GoogleDeepMind, PhD @MIT. Opinions my own.
Weiyan Shi @shi_weiyan
9K Followers 1K Following Prof @Northeastern | MIT TR-35 | #AI2050 Early Career Fellow | Prev @Columbia @StanfordNLP | Co-created CICERO | human-AI co-evolution + AI safety
Mike Krieger @mikeyk
467K Followers 270 Following Building at Anthropic Labs @anthropicai. Before: CPO at Anthropic, co-founder & CTO of @instagram and @artifact_news
Aaron Lou @aaron_lou
3K Followers 621 Following Leading Strategic Explorations @OpenAI, prev @Stanford. Invented modern diffusion LMs
Rishi Mehta @rishicomplex
4K Followers 346 Following Solve i̶n̶t̶e̶l̶l̶i̶g̶e̶n̶c̶e̶ ̶ coding, use it to solve everything else | Research @AnthropicAI | Past: RL @GoogleDeepmind: AlphaProof co-lead, Gemini.
Olive Song @olive_jy_song
1K Followers 141 Following I study RL & Evals @MiniMax_AI · Alum @NYU_Courant · Dig deep; Collaborate openly; Make things happen.
Miles Grimshaw @milesgrimshaw
13K Followers 4K Following Thrive Capital. @cursor_ai @chaidiscovery @turbopuffer @SocketSecurity @Revel_Software @meshoptical @doji_com @langchainai @benchling @monzo @segment @airtable
Jeff Ma @18jeffreyma
559 Followers 953 Following CS PhD @Harvard, prev @GoogleAI, @AmazonScience, @Citadel, @Nuro, @Caltech Created https://t.co/EBHm5qwcPO, https://t.co/QvK7qCNIjS
alana goyal @alanaagoyal
19K Followers 4K Following one woman vc @baseten @braintrust @browserbase @maticrobots @paper @resend @supabase @vercel @windsurf + more
Lee Danilek @LDanilek
761 Followers 253 Following Building @cursor_ai, formerly @convex, @dropbox, @Yale
Sujay Jayakar @sujayakar314
2K Followers 807 Following inference performance @AnthropicAI easily nerd sniped and okay with it.
Sam Kottler @samkottler
5K Followers 839 Following critical optimist. none of this is advice. building big, fast computers for @cursor_ai
Dmytro Dzhulgakov @dzhulgakov
5K Followers 762 Following Co-founder and CTO @FireworksAI_HQ. PyTorch core maintainer. Previously FB Ads. Ex-Pro Competitive Programmer































