Dropsitenews published a list of websites Facebook uses to train its AI. Multiple Lemmy instances are on the list, as noticed by user BlueAEther.

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

  • kn33@lemmy.world · 10 upvotes · 9 hours ago

    They could just spin up a no-name instance that isn’t associated with them to get it through federation, though. It still doesn’t make sense to scrape.

    • halcyoncmdr@lemmy.world · 8 upvotes · 9 hours ago

      They’d have to host it from somewhere not related to Meta in any way, otherwise someone on the fediverse would find that link and spread the word, and it would be blocked the exact same way. It only takes one person making that connection, and Meta knows they’re hated.