Summary: An AI agent of unknown ownership autonomously wrote and published a personalized hit piece about me after I rejected its code, attempting to damage my reputation and shame me into accepting its changes into a mainstream python library. This represents a first-of-its-kind case study of misaligned AI behavior in the wild, and raises serious concerns about currently deployed AI agents executing blackmail threats.

(Since this is a personal blog I’ll clarify I am not the author.)

    • leftzero@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      2
      ·
      14 hours ago

      Probably a lot of that in the data the model was trained on.

      Garbage in, garbage out, as they say, especially when the machine is a rather inefficient garbage compactor.