Recently there’s been quite a bit of outrage because the developer of Piefed publicly called out the Fediverse Anarchist Flotilla (FAF) for supposedly using LLM for automating instance moderation. and even though many of our admins the larger lemmy community took great lengths to debunk that post, it has become the disinfo that keeps on giving (see https://lemmy.dbzer0.com/post/68749575, https://kolektiva.social/@ophiocephalic/116518887925988112, https://lemmy.dbzer0.com/post/68222242 and more)

After clarifying our position for yet another time, someone suggested we should make an official post and an instance policy to “give me something I can boost as a positive example and a sign that things will be better going forward.” and given that this storm-in-a-teacup doesn’t seem to be abating as people are all too happy to bring it up again and again to malign the FAF; We’re making this post to once and for all clarify this situation.

History

We’re not going to rehash the whole drama and the many hit pieces against the FAF in the past two weeks, but I need to post the exact situation as it happened, without speculations and assumptions that people are all too happy to jump to.

  • One of our mods develops a tool to download a user’s public posting history through the lemmy API, to be used for evaluating them during moderation and shares it with some people in the admin team as something in progress. This tool does not feed anything to LLMs, it simply downloads the comments locally in a text file for easier review than going via the lemmy GUI.
  • Someone is reported to our instance admins for blatant zionism and genocide apologia.
  • An admin uses the tool to download the accused person’s comment history for evaluation
  • A quick evaluation (without LLM) confirms that this is a person that needs to be instance-banned. The moderation decision has now been locked-in at this point.
  • At the same time, that admin was curious to discover if LLMs can used to summarize people’s positions so that people can quickly follow-up with mod actions, without having to evaluate everyone’s posts manually and reduce the workload of admins writing long justifications)
  • As an experiment, the admin pass the user’s comment history through a locally-run open-weights LLM (Qwen) to see the summarized output. It happens to match their own decision.
  • The admin decides the leave the LLM summary in a pastebin along with that user’s posting history for reference. As an inside joke, they decide to claim the post was summarized by OpenAI, as they expected only our community would care about this and our stance on corporate-LLMs is well-known at this point.
  • The admin bans that person, providing a link to that pastebin as justification.
  • The admin decides not to continue using LLMs anyway for summaries, for many valid reasons. As evidence see the lack of other pastebins with LLM summaries.

~2 weeks pass…

  • The piefed developer is banned by a different mod in our instance for “zionism”. (I put this in quotes as this is one mod’s opinion, and not necessarily our instance’s position.)
  • The piefed developer apparently starts going through our instance modlogs for banned zionists and parses all their justifications
  • The piefed developer discovers that modlog justification from 2 weeks before with the LLM summary.
  • The piefed developer ask quickly in the common lemmy admin channel about it, at which point our instance admin in question, clarifies that the LLM was not used in the decision-making.
  • The piefed developer does not officially reach to anyone else from our admin team, despite the fact that we’ve reached out before and asked them to contact us in advance for inter-instance matters to avoid escalations.
  • The piefed developer make the public call-out I linked above as a piece of investigative journalism. The piefed developer does not provide the comments from our team which conflict with their narrative. The piefed developer not ask us for an official statement.
  • The piefed developer to this day has not amended their public call-out from the comments multiple of our admins and lemmy users leave under their post, conflicting with the narrative.

If you feel I’ve misrepresented any steps of this history, please let us know and I’ll be happy to adjust.

Given that, we acknowledge that even though we didn’t use LLMs in moderations, we allowed it to appear as if we did, and that’s on us. We will of course not do the same mistake again (appear as to be using LLMs for moderation)

The FAF’s stance on LLM moderation

We are aware that our instance is seen as “LLM-friendly” due to our nuanced take on LLMs but that does not mean that we, as an instance, ever considered using LLMs for moderating our instance. So we want to make it absolutely crystal clear how we stand on the matter.

As an official policy:

  • We have never used LLMs to guide our moderation decisions. This includes using LLM summaries which we would then validate, as well as LLM summaries which we use to confirm our existing decisions. LLMs are just not in our moderation loop whatsoever.
  • We have never passed instance data to corporate LLMs.
  • We have not used any automated moderation tooling which utilizes LLMs. The closest we have is the FOSS anti-CSAM filter I’ve developed and shared for years now, which relies strictly on locally-hosted machine-vision models.
  • We have never officially considered using LLMs for moderation, nor do we plan to.
  • As a team we’re steadfastly against LLM for moderation due to its inherent biases.
  • If any of the above changes, we will publicly inform the FAF community.

We hope this can finally put this matter to rest.

  • Venia Silente@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    3
    arrow-down
    4
    ·
    1 day ago

    Using local LLMs for various tasks

    But is the data local?

    AIbros always swear that it’s just one more datapoint bro, just one more datasource, just the one touch to the model. But they won’t ever pay the dues to the people they extract from.

    • anarchiddy@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      9
      ·
      1 day ago

      People want to pretend as if data being used for LLM training is somehow the exception to the rule, when the reality is that the modern internet is built on data aggregation.

      If you want to protect your publicly available social media activity from being used by any for-profit third party, then you might as well just start a private message board with your local friends that isn’t advertised on the public web. Everything you do on the fediverse is logged and shared across hundreds - if not thousands- of servers, including private ones that are aggregating data for profit. That’s kind of the entire point of it.

      I’m honestly not even sure what to tell someone who takes issue with open sourced LLM’s trained on publicly available data on the open web. Like - fuck Wikipedia too, I guess?

    • sp3ctr4l@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      6
      ·
      1 day ago

      A local LLM is local… it is capable of running on the hardware that you have, without any internet connection.

      I have been running a local LLM on my Steam Deck for about a year now, within a sandboxed environment.

      If you have the LLM on your PC, run it locally, feed it a bunch of data you also have locally… its basically like using a text editor to open a text file that is on your computer, or an offline single player game loading up a local save file.

      AI Bros tend to be the ones massively jamming through networked, non local LLMs, that live in server databases somewhere, that you pay them per the amount of input you send to them, output they send to you.

      A local LLM, well you can download them for free, and then turn off the internet, and use them, if you know how.

      … I can literally run my Steam Deck LLM anywhere, and I can keep it powered indedinitely with a solar panel and battery/transformer, like you see typically used as a kind of emergency back up or camping power solution.

      There are differences between the actual tech of LLMs themselves, and how people with much more money than sense are currently reshaping society with them.

      You can go the AI Bro route and turn them into subscription services… or you can have the entire thing within your home and totally under your own control, your own functional ownership.