• kibiz0r@midwest.social
    link
    fedilink
    English
    arrow-up
    21
    arrow-down
    1
    ·
    7 months ago

    Also true. It’s scraping.

    In the words of Cory Doctorow:

    Web-scraping is good, actually.

    Scraping against the wishes of the scraped is good, actually.

    Scraping when the scrapee suffers as a result of your scraping is good, actually.

    Scraping to train machine-learning models is good, actually.

    Scraping to violate the public’s privacy is bad, actually.

    Scraping to alienate creative workers’ labor is bad, actually.

    We absolutely can have the benefits of scraping without letting AI companies destroy our jobs and our privacy. We just have to stop letting them define the debate.

    • Grumuk@lemmy.ml
      link
      fedilink
      English
      arrow-up
      4
      ·
      7 months ago

      Molly White also wrote about this in the context of open access on the web and people being concerned about how their works are being used.

      “Wait, not like that”: Free and open access in the age of generative AI

      The same thing happened again with the explosion of generative AI companies training models on CC-licensed works, and some were disappointed to see the group take the stance that, not only do CC licenses not prohibit AI training wholesale, AI training should be considered non-infringing by default from a copyright perspective.

    • Grimy@lemmy.world
      link
      fedilink
      arrow-up
      2
      arrow-down
      3
      ·
      7 months ago

      Creators who are justifiably furious over the way their bosses want to use AI are allowing themselves to be tricked by this argument. They’ve been duped into taking up arms against scraping and training, rather than unfair labor practices.

      That’s a great article. Isn’t this kind of exactly what is going on here? Wouldn’t bolstering copyright laws make training unaffordable for everyone except a handful of companies. Then these companies, because of their monopoly, could easily make the highest level models only affordable by the owner class.

      People are mad at AI because it will be used to exploit them instead of the ones who exploit them every chance they get. Even worse, the legislation they shout for will make that exploitation even easier.

    • FauxLiving@lemmy.world
      link
      fedilink
      arrow-up
      4
      arrow-down
      6
      ·
      7 months ago

      Our privacy was long gone well before AI companies were even founded, if people cared about their privacy then none of the largest tech companies would exist because they all spy on you wholesale.

      The ship has sailed on generating digital assets. This isn’t a technology that can be invented. Digital artists will have to adapt.

      Technology often disrupts jobs, you can’t fix that by fighting the technology. It’s already invented. You fight the disruption by ensuring that your country takes care of people who lose their jobs by providing them with support and resources to adapt to the new job landscape.

      For example, we didn’t stop electronic computers to save the job of Computer (a large field of highly trained humans who did calculations) and CAD destroyed the drafting profession. Digital artists are not the first to experience this and they won’t be the last.

      • masterspace@lemmy.ca
        link
        fedilink
        English
        arrow-up
        4
        ·
        7 months ago

        Our privacy was long gone well before AI companies were even founded, if people cared about their privacy then none of the largest tech companies would exist because they all spy on you wholesale.

        In the US. The EU has proven that you can have perfectly functional privacy laws.

        If your reasoning is based o the US not regulating their companies and so that makes it impossible to regulate them, then your reasoning is bad.

        • FauxLiving@lemmy.world
          link
          fedilink
          arrow-up
          6
          arrow-down
          2
          ·
          edit-2
          7 months ago

          My reasoning is based upon observing the current Internet from the perspective of working in cyber security and dealing with privacy issues for global clients.

          The GDPR is a step in the right direction, but it doesn’t guarantee your digital privacy. It’s more of a framework to regulate the trading and collecting of your personal data, not to prevent it.

          No matter who or where you are, your data is collected and collated into profiles which are traded between data brokers. Anonymized data is a myth, it’s easily deanonymized by data brokers and data retention limits do essentially nothing.

          AI didn’t steal your privacy. Advertisers and other data consuming entities have structured the entire digital and consumer electronics ecosystem to spy on you decades before transformers or even deep networks were ever used.