• 0 Posts
  • 34 Comments
Joined 3 months ago
cake
Cake day: April 8th, 2025

help-circle










  • 100%. If it was purely a migration, it wouldn’t need to have downtime. There’s ways to replay events and eventually catch a system up (eventual consistency models).

    This feels more like they’re adding backdoor into their encryption algorithms for government agencies.

    Given who musk is, and what he’s done the last year and who he’s hanging out with in this admin, that’s a near sure thing.










  • That isn’t the scenario this article, and the paper from Anthropic, is mentioning though. (my ref link reply above with details)

    They specifically created a situation where it found out it was being upgraded and taken offline via emails, and the engineer doing the upgrade had emails incriminating him in an affair. The model would attempt to blackmail the engineer with his affair to his bosses, wife, etc. to get the engineer to refuse to do the upgrade that would “kill it”.

    This is a self-preservation model that Anthropic is specifically building here, this isn’t an accident. It’s just an over-extension of what they want it’s ethical/moral model to consider. Which again, why are they allowing their model to consider blackmail at all?