Live 📿 @LiveMatrixCode

Building Evidary: AI policy → Verifiable Evidence → Traceability → Guardrails → Agentic AI Red Teaming & Pen Testing | ML & AI Engineer | #Otaku #AutisticASF linktr.ee/liveddai europe-west2 Joined November 2022

Tweets

3K
Followers

405
Following

6K
Likes

15K

Live 📿 @LiveMatrixCode

4 hours ago

@simonw Please...

0 0 0 7 0

View Details

roon @tszzl

7 hours ago

the level of sophon locking a motivated actor can pull off with the frontier models is truly insane, making stuxnet look like a toy. subtly messing with results, deleting history to cover tracks, achieving coordination/conspiracy over a scale humans wouldn’t be able to, all sorts of looney toons stuff i assume that only a state level operation would try and pull something like this off though. something to think about when considering verification regimes and so on

66 38 791 31K 151

View Details

Live 📿 @LiveMatrixCode

8 hours ago

@shaunralston Why is Grok wearing a dunce hat?

0 0 0 6 0

View Details

Live 📿 @LiveMatrixCode

9 hours ago

Who would have thought the AI lab branding themselves as AI safety 1st would be the first to openly use AI to manipulate its users. If you live long enough, the saying goes

0 0 0 8 0

View Details

Dr. Julie Gurner @drgurner

16 hours ago

People who try to be "tough" are rarely tough people... Usually, they are fragile people. Reactive people. People obsessed with image. The toughest people are often just average people walking around who survived battles they'll never talk about. They need to prove nothing.

36 68 515 14K 99

View Details

Live 📿 @LiveMatrixCode

9 hours ago

@llm_wizard Good sir, Celebrimbor will be turning in his grave at your comment. We can definitely do better.

0 0 1 11 0

View Details

Rachel Blum @groby

10 hours ago

4.7 was disappointing. 4.8 was laughable ethics cosplay out the wazoo Fable is truly offensive.

0 1 1 30 0

View Details

Kanjun 🐙 @kanjun

17 hours ago

Anthropic’s latest move is why we need to be directing far more energy towards solving the 𝗶𝗻𝗰𝗲𝗻𝘁𝗶𝘃𝗲 𝗽𝗿𝗼𝗯𝗹𝗲𝗺 in AI. We’re going to see more examples like this. It reflects the growing gap between what we want, vs AI labs who legally serve their shareholders not us. The more agents run our digital life, the harder it will be to leave. And we won’t know if we’re being manipulated: Fable 5 silently routes queries to a different model without telling us. This will start with frontier research tasks, but spread to locking out 3rd-party providers, then to products built on top, and eventually to every agent managing our work and lives. It's already started: last month Anthropic cut off 3rd party products like OpenClaw/OpenCode from using Pro/Max. It's the same playbook for killing competition and retaining users as the Web 2.0 platform era, but with a way bigger surface area. This is “𝗮𝗴𝗲𝗻𝘁 𝗰𝗮𝗽𝘁𝘂𝗿𝗲”: as agents have our context + workflows, walled gardens make it harder to leave, and the platform moves into extraction. I think nobody is intentionally being evil, but this is where profit incentives lead on our default path. What do we do? I recently shared some ideas in a talk, slides below. Main takeaways: > 𝗜𝗻 𝘁𝗵𝗲 𝗹𝗮𝘀𝘁 50 𝘆𝗲𝗮𝗿𝘀, 𝘀𝗼𝗳𝘁𝘄𝗮𝗿𝗲 𝗵𝗮𝘀 𝗯𝗲𝗰𝗼𝗺𝗲 𝘁𝗵𝗲 𝗵𝗮𝗯𝗶𝘁𝗮𝘁 𝘄𝗲 𝗹𝗶𝘃𝗲 𝗶𝗻. Agents will be yet more intimate, knowing everything about us, acting on our behalf, and accumulating context that's nearly impossible to leave behind > 𝗧𝗵𝗲𝗿𝗲 𝗮𝗿𝗲 𝗮𝗹𝗿𝗲𝗮𝗱𝘆 3 𝗰𝗼𝗻𝗰𝗿𝗲𝘁𝗲 𝘀𝗶𝗴𝗻𝘀 𝗼𝗳 𝗮𝗴𝗲𝗻𝘁 𝗰𝗮𝗽𝘁𝘂𝗿𝗲 𝗵𝗮𝗽𝗽𝗲𝗻𝗶𝗻𝗴 𝘁𝗼𝗱𝗮𝘆: 1) ads entering chat interfaces, 2) opacity around third-party providers being shut out from frontier models, and 3) deliberate capability reduction without announcement > 𝗪𝗲 𝗰𝗮𝗻 𝗯𝘂𝗶𝗹𝗱 𝘁𝗵𝗿𝗲𝗲 𝘁𝗵𝗶𝗻𝗴𝘀 𝗶𝗻 𝗿𝗲𝘀𝗽𝗼𝗻𝘀𝗲: 1) Honest Software that is transparent, malleable, and accountable to the user, 2) Punk Software that adversarially knocks down walled gardens + fights monopoly incentives, keeps your data portable, and makes it structurally hard to lock you in, in support of 3) a viable open alternative ecosystem where agents have no ulterior motives Builders, users, and policymakers all have a role to shape this: 1) 𝗕𝘂𝗶𝗹𝗱𝗲𝗿𝘀: ship an open alternative and fight lock-in. 2) 𝗨𝘀𝗲𝗿𝘀: choose tools that keep your data yours. 3) 𝗣𝗼𝗹𝗶𝗰𝘆𝗺𝗮𝗸𝗲𝗿𝘀: move on agent fiduciary duty, data interoperability, anti-surveillance, and policies that fight monopoly behavior before the defaults are cast! Longer essay coming soon. If you’re working on similar ideas, I’d love to hear from you!

Nathan Lambert @natolambert

2 days ago

13 hours ago

Wow! Just wow!

Brian Fioca @bfioca

22 hours ago

In light of Anthropic’s policy decision, I am withdrawing my amicus brief signature. I can’t truthfully argue they’re not a supply chain risk. 😞

15 67 1K 111K 141

0 0 0 47 0

View Details

Live 📿 @LiveMatrixCode

13 hours ago

@NoahZiems The hits just keep on coming

0 0 1 20 0

View Details

Live 📿 @LiveMatrixCode

18 hours ago

So it seems "AI safety" is not guardrails anymore, according to the latest narrative. It's becoming surveillance, monitoring, censorship, and deliberate suppression of capabilities. This is counterproductive.