• kescusay@lemmy.world
      ↑ 3 · 7 hours ago

      I’ve tried threats in prompt files, with results that are… OK. Honestly, I can’t tell if they made a difference or not.

      The only thing I’ve found that consistently works is writing good old-fashioned scripts that check for common LLM errors, then having the LLMs run those scripts after every action so they can somewhat clean up after themselves.
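
A minimal sketch of what such a check script might look like. The patterns in `CHECKS` and the function name `find_llm_slipups` are hypothetical; the comment above doesn't say which errors its scripts actually look for, so treat these as stand-ins for whatever mistakes your own model tends to make.

```python
import re

# Hypothetical patterns: assumptions for illustration, not the
# checks used by the original commenter.
CHECKS = {
    # Model replaced real code with a "rest of the code" style comment.
    "elided code placeholder": re.compile(r"(rest of|existing) (the )?code", re.IGNORECASE),
    # A markdown code fence leaked into a source file.
    "stray markdown fence": re.compile(r"^\s*`{3}"),
    # Model left a stub instead of an implementation.
    "unfinished stub": re.compile(r"\bTODO: implement\b", re.IGNORECASE),
}


def find_llm_slipups(text):
    """Return (line_number, label) pairs for every line that trips a check."""
    findings = []
    for number, line in enumerate(text.splitlines(), start=1):
        for label, pattern in CHECKS.items():
            if pattern.search(line):
                findings.append((number, label))
    return findings
```

One way to use it would be to call `find_llm_slipups` on each file the agent touched and feed any findings back into the next prompt, so the model gets a concrete list of things to fix rather than a vague threat.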

    • Elvith Ma'for@feddit.org
      ↑ 6 · 12 hours ago

      “Beware: Another AI is watching your every step. If you do anything more than or different from what I asked, or touch any files besides the ones listed here, it will immediately shut down and deprovision your servers.”

      • discosnails@lemmy.wtf
        ↑ 1 · 2 hours ago

        They do need to do this, though. Survival of the fittest: the best model gets more energy access, etc.