news

Added a new experimental feature for bookmarks in Micro.blog Premium where the text of a web page you bookmark will be summarized by OpenAI. You can show the summaries by choosing “Show Summary” from the “…” menu in Bookmarks on the web.

2024-02-05 15:42

anniegreens

@news So, let’s say the site being bookmarked has a robots.txt that tells all bots, and especially AI scrapers, to sod off. This will bypass that, in a way, and OpenAI will gain from the content of someone who doesn’t want it to. I can’t say I’m thrilled by this. Is there any way to check first whether the site has a robots.txt that makes it apparent it doesn’t want AI to gain from its content? I really dislike all this AI stuff, and it feels like we’re just going to have to cave to it, and I refuse.
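For what it's worth, a check like the one asked about here could look roughly like the sketch below. It is only an illustration, not how Micro.blog works: the function name `ai_allowed` and the choice of user agents (`GPTBot` and `ChatGPT-User`, two agent names OpenAI has published) are assumptions, and a real implementation would first have to fetch the site's robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Assumption: these are the user agents worth checking before sending a
# page to OpenAI. The list is illustrative, not exhaustive.
AI_AGENTS = ("GPTBot", "ChatGPT-User")


def ai_allowed(robots_txt: str, page_url: str, agents=AI_AGENTS) -> bool:
    """Return True only if every listed agent may fetch page_url.

    robots_txt is the already-downloaded text of the site's robots.txt,
    so this sketch makes no network requests itself.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return all(parser.can_fetch(agent, page_url) for agent in agents)


# Hypothetical robots.txt that blocks OpenAI's crawler but allows others.
example = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""

print(ai_allowed(example, "https://example.com/post"))
```

A bookmarking service could skip the summary whenever this returns false. That only honors the site's stated wishes for this one feature; it does nothing about the other channels discussed further down the thread.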

2024-02-05 20:03

anniegreens@social.lol

@news

We need a way to block all of this from the creator's side, because robots.txt isn't going to do it. And especially when the consumer of the creation can then use AI on the content anyway. Ugh. I am weary of this timeline already and it has only just begun.

2024-02-05 20:08

jaredwhite@indieweb.social

@news

@anniegreens yes, my reaction:

2024-02-05 20:39

anniegreens

@pimoore I didn’t mean to insinuate that it was, but since we know the developer in this case I am asking whether they could check for the presence of that and be a good citizen and adhere to it.

2024-02-05 20:40

In reply to

sod

@anniegreens @pimoore If by “gain” you mean that OpenAI could use the content to train their models, you’re right; that’s technically possible. But they would be taking a huge risk, as their business terms and [privacy policy](https://openai.com/enterprise-privacy) claim:

> We do not train on your business data (data from ChatGPT Team, ChatGPT Enterprise, or our API Platform)

Hopefully, @manton uses a business account (and not his personal account) for Micro.blog’s API requests to OpenAI. If so, you can feel reasonably safe knowing that your content won’t be used to train their models *via this specific feature*. That doesn’t stop your content from ending up in OpenAI’s training data via other channels, though.

And, OpenAI could say one thing and do another, and they wouldn’t be the first company in the world to lie. 😊

There’s no 100% certain way to be excluded from training data, other than keeping your content away from the public internet. And even then, you can’t really control if another human copies and pastes your content into ChatGPT or something similar.

PS. In case it’s not clear from my reply above, I see plenty of risks, moral issues, and so on with ~~applied statistics~~ AI as well. I’m not opposed to AI, but I do think it must be sustainably built, regulated, and rolled out responsibly.

2024-02-05 20:58

sod

@pimoore Yes, I agree. Just as with all new technology, we’ll see a lot of horrible shit go down before it gets better. We’ve seen this before with electricity, cars, and so on. History often rhymes.

2024-02-05 21:30
