SPAR @SPARexec

We're a part-time, virtual research program that gives students and early-career professionals an opportunity to work with professional AI safety researchers.

sparai.org

Joined March 2024

Tweets: 57 · Followers: 552 · Following: 79 · Likes: 100

Agus 🔸 @austinc3301

2 months ago

Applications for the Generator Residency close on Monday EOD! Last chance to apply. Fully funded, 6k stipend + travel + housing, 3 months with an extension, in-person in Berkeley. Probably the best path into AI safety for non-researcher roles.

Agus 🔸 @austinc3301

2 months ago

Announcing the Generator Residency: a 3-month residency for AI safety generalists, by @KairosAIS × @ConstellOrg. Fully funded. In-person in Berkeley. Summer 2026. 🗓 Apply by April 27 generatorresidency.org/?utm_source=tw…

16 54 437 56K 400

2 5 62 6K 35

Kairos @KairosAIS

2 months ago

📣 Only 3 days left to apply for Generator! Apply by April 27 to join our inaugural cohort with advisers from AI Futures Project, BlueDot, Coefficient Giving, FAR.AI, Forethought, METR, RAND, and more! generatorresidency.org

1 1 7 687 3

Siddharth Boppana @sidboppana

3 months ago

Excited to share our new paper! We looked at when reasoning LLMs 'knew' their final answer internally vs. when it was stated in chain-of-thought. Turns out these models can be performative depending on the task!

Goodfire @GoodfireAI

3 months ago

LLMs often reason “performatively” well after deciding on a final answer - something that CoT monitors are slow to catch. Our new paper finds that:
- probes can help monitor for this
- it seems to track with task difficulty
- probes enable early CoT exit, saving tokens!
(1/7)

8 37 330 44K 191

5 3 31 2K 6

SPAR @SPARexec

4 months ago

@aniketdxsh @BerkeleyLab late follow-up, but congratulations ;)

0 0 2 53 0

Gabriele Sarti @gsarti_

4 months ago

In this work, we complement behavioral goal-directedness evals of LLM agents with a probing analysis of environment and plan representations, examining whether observed actions are consistent with models' internal beliefs, and how reasoning affects representations. Check it out!

Mario Giulianelli @glnmario

4 months ago

When we say an AI agent is “goal-directed”, what do we actually mean? In new work from Project Telos, we study this question by combining behavioural evaluation with analysis of internal representations in a language model agent navigating grid worlds. 1/

1 8 32 6K 20

1 2 17 2K 11

Agus 🔸 @austinc3301

5 months ago

we may not have sabrina carpenter but we do have dawn song

1 1 22 802 1

LawZero - LoiZéro @LawZero_

5 months ago

LawZero is accepting applications as part of the SPAR Spring 2026 program! If you're interested in studying model awareness or emergent misalignment, you can learn more and apply here: sparai.org/projects/sp26/. Applications are open until Jan 14, 2026.

SPAR @SPARexec

5 months ago

🚀 We're excited to announce that mentee applications are now open for the Spring round of the SPAR research program! This will be our largest round ever, featuring 130+ projects across AI safety, policy, governance, security, welfare, and strategy.

2 3 15 12K 3

0 5 16 5K 24

Georg Lange @_georg_lange

5 months ago

Come work with me and @SPARexec to build an AI mech interp researcher to accelerate AI safety research.🧠🔬 In the last cohort, my mentees built AI agents that automatically find and refine explanations for SAE features (demo of what they built after only one month below). In this cohort, we want to push for agents that discover and explain full circuits. Deadline is Jan 14th!⏳🗓️

2 3 6 948 1

SPAR @SPARexec

5 months ago

@_AR999_ Anywhere on Earth! time.is/Anywhere_on_Ea…

0 0 2 42 0

SPAR @SPARexec

5 months ago

📣 Only 2 days left to apply for this round of SPAR! Apply by January 14 to join our largest round yet — 130+ projects with mentors from Google DeepMind, RAND, AI Security Institute, Apollo Research, SecureBio, Machine Intelligence Research Institute, and more!

2 2 9 2K 4

SPAR @SPARexec

5 months ago

Apply here before 11:59 PM AoE on Wednesday, January 14th! sparai.org/projects/?utm_…

0 0 3 297 1

SPAR @SPARexec

5 months ago

Work on a part-time AI safety, AI policy, AI security, or biosecurity project. Open to students & professionals; prior research experience is not required for all projects.

1 0 2 315 0

Andy Liu @uilydna

5 months ago

I'm mentoring a SPAR project on evaluating and refining alignment targets for LLMs (constitutions, model specs, etc.) this spring! Apply by January 14 to work with me or other SPAR mentors - project details/application link ⬇️:

1 2 7 706 1

Agus 🔸 @austinc3301

5 months ago

Does training language models on AI safety literature make them more likely to scheme? This is one of the research questions being explored in the upcoming round of @SPARexec. A few projects I'm excited about: 🧵

2 3 26 6K 9

Jeff Sebo @jeffrsebo

5 months ago

The NYU Center for Mind, Ethics, and Policy is seeking research fellows to contribute to upcoming reports on legal personhood and economic rights for digital minds. Please apply if you have interest in working with us!

2 7 23 2K 5

Justin Shenk @justinshenk

5 months ago

Join me next Spring in exploring how time is represented in LLMs 🕓 Deadline: January 14th sparai.org/projects/sp26/…

0 4 5 521 1

David Williams-King @deepelfery

5 months ago

I'm a SPAR mentor. If you'd like to work on solving Anthropic cyber espionage type attacks, please do apply!

1 2 3 553 1

Tianyi Alex Qiu @Tianyi_Alex_Qiu

5 months ago

I'm glad to mentor again for this round of SPAR, likely with @zhonghaohe! Together let's help human-AI coevolution go a little bit better :) ⬇️🧵 Here's a collection of research ideas I'd be excited to mentor projects on. Feel free to pitch yours too!