“Malicious” keywords aren’t exclusively the problem, as the LLM cannot differentiate between “malicious” and “benign”. It’s been trivially easy to intentionally or accidentally hide misinformation in LLMs for a while now. Since they’re black boxes, that misinformation can be hard to identify. This is just a slightly more pointed example of data poisoning.
There’s no threat from an LLM chatbot outputting text… unless that text is piped into something that can run commands. And who would be stupid enough to do that? Okay, besides vibe coders. And people dumb enough to use AI agents. And people rich enough to stupidly link those AI agents to their bank accounts.
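To make the “piped into something that can run commands” point concrete, here’s a minimal Python sketch of the anti-pattern. `llm_complete` is a hypothetical stand-in for a real model call (here it just returns a canned, harmless string), but the shape is the same as a naive agent loop:

```python
import subprocess

def llm_complete(prompt: str) -> str:
    """Hypothetical stand-in for a real LLM API call.

    A prompt-injected or data-poisoned model can return *any* string here;
    this canned reply only echoes, but nothing stops it being rm -rf or
    a curl-pipe-to-shell instead.
    """
    return 'echo "harmless-looking output (but it could have been anything)"'

def naive_agent(task: str) -> str:
    """The anti-pattern: model text goes straight into a shell."""
    command = llm_complete(f"Write a shell command to: {task}")
    # shell=True executes whatever string came back, no questions asked.
    result = subprocess.run(command, shell=True, capture_output=True, text=True)
    return result.stdout

if __name__ == "__main__":
    print(naive_agent("list my files"))
```

Once you’ve written `shell=True` around untrusted model output, every bit of data poisoning or prompt injection upstream becomes arbitrary code execution downstream.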
Bruh, people are going insane talking to ChatGPT and ending it all. There’s no bound to how bad this junk can get, or to the horrible things that can result.
Though I’ll be dying of laughter if, say, Grok tanks SpaceX and somehow burns through all of Elon’s money. Might make this entire AI venture worth it.