• user28282912@piefed.social
    11 hours ago

    Content produced by humans was scraped en masse for the explicit purpose of training models, which were then monetized as business products.

    I struggle to reconcile that with Fair Use.

    I could see it if the source sites had EULAs that strip all rights to what you post, as on Reddit or Stack Overflow, and if those entities had somehow been contacted ahead of time and had negotiated usage. You, I, and the web server logs know that this was almost never the case.

    • Postimo@lemmy.zip
      8 hours ago

      I think the core of the fair use argument is that the AI models that are being trained are transformative products of the original works.

      Might be a hot take here, but I basically agree. I still believe it was theft, and that the legal framework we had doesn't really stand up to the evolving problems. But under the current laws there is really no justification for saying that taking a bunch of images as input and outputting a set of statistical correlations of pixels based on descriptions isn't transformation.