• smeg@feddit.uk
    link
    fedilink
    English
    arrow-up
    41
    arrow-down
    1
    ·
    3 days ago

    Not to be too snarky, but was there ever an assumption that stuff you put in wasn’t being used to train it? Safe to assume that any online service you’re using is making use of the data you’re giving it.

    • nogooduser@lemmy.world
      link
      fedilink
      English
      arrow-up
      12
      ·
      3 days ago

      If you’re a business with a contract with them it should state that they won’t use your data to train their models.

      If you’re using the free service then you’re right that it’s safe to assume that your data was already being used.

      • MNByChoice@midwest.social
        link
        fedilink
        arrow-up
        8
        arrow-down
        1
        ·
        3 days ago

        business with a contract

        I always wonder at this and have cautioned my managers repeatedly. Yes, we have a contract, but they have a literal army of lawyers and we have less (one lawyer one retainer for hourly work or a small grouping focused on taxes and employment law). As if our ownership won’t bend over backwards to avoid suing a large company like Google, AWS, Microsoft, or Oracle. (Maybe OpenAI and Anthropic are sue-able by a $100 million corp?)

        As proof I offer the lawsuits between businesses that have proceeded far enough the general public has heard about them. Not a specific one, just all of them.

        • nogooduser@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          1
          ·
          3 days ago

          You have to trust the contract.

          If you use Microsoft 365 or Google Workspace etc then they already have all your data anyway. Most businesses have to trust other companies and the contract at some point.

          The only other option is to use Open Source self hosted everything which is beyond most people’s ability.

          • MNByChoice@midwest.social
            link
            fedilink
            arrow-up
            2
            ·
            1 day ago

            There are more options than the two you mentioned. Listing a few as more people should remember them. I did get a bit off topic…

            1. Use huge company to provide service.
            2. Provide service oneself (, likely with Open Source. )
            3. Use small or medium company to provide service (, likely with Open Source. )
            4. Use huge company for things huge company is great with, but keep “crown jewels” of company on internal self provided systems.
            5. Use a small or medium company to provide a service, and another series of small or medium companies to check on the first company.
            6. Use a huge company based in a country that is very serious about laws and putting CEOs in prison for wrongful acts.
            7. Do not do the thing. (Included for completeness.)
            8. Do the thing not on a computer. (Violation of privacy could result in violation of more serious laws.)
            9. Use an older technology on a computer.
            10. Use the huge company to provide service, but ensure the data includes insane things.