On Lemmy, all you need to do is follow every community you can find, and you’ll get a stream of posts, comments, voting behaviour, edits, and even admin behaviour, all raw and unprocessed, with all the metadata you could hope for, without paying a penny.
I’m not saying every Lemmy server is being used to train AI models, but I’m sure the big ones are.
Presumably most of the current AI models have already had access to Reddit data in the past, so I am a bit confused about why they would pay 60 million for it now.