News: 0175478979

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

Bluesky Says It Won't Train AI On Your Posts

(Friday November 15, 2024 @10:30PM (BeauHD) from the what-not-to-expect dept.)


Bluesky, the social network [1]surging in popularity , says it has " [2]no intention" of training AI tools on users content . "The social network made the announcement on the same day that X (formerly Twitter) is implementing its new [3]terms of service that allow the platform to use public posts to train AI," notes TechCrunch. From the report:

> "A number of artists and creators have made their home on Bluesky, and we hear their concerns with other platforms training on their data," Bluesky said in [4]a post on its app. "We do not use any of your content to train generative AI, and have no intention of doing so." The company went on to note that it uses AI internally to help with content moderation and that it also uses the technology in its "Discover" algorithmic feed. However, Bluesky says "none of these are Gen AI systems trained on user content."



[1] https://tech.slashdot.org/story/24/11/14/0017205/bluesky-crosses-the-15-million-user-mark

[2] https://techcrunch.com/2024/11/15/unlike-x-bluesky-says-it-wont-train-ai-on-your-posts/

[3] https://x.com/en/tos

[4] https://bsky.app/profile/bsky.app/post/3layuzbti6s2x



OK, say I believe them (Score:5, Informative)

by Baron_Yam ( 643147 )

Everyone else with a scraper will train their AI on your posts, and good luck catching them at it. And if you do, good luck trying to reverse that or get a judgement against them.

Anything you post on the Internet as an individual is available for a corporation to steal and there is almost nothing you can do about it. And they WILL steal it.

Re: (Score:2)

by dfm3 ( 830843 )

If it can be seen it can be copied, yes, but there's a world of difference between someone scraping a service, and that service using your raw data via the back end along with all the additional metadata and account data that's associated with it.

FWIW, there's been a mass exodus of artists and creative types from X/Twitter over to Bsky lately, along with their followers, causing their numbers to grow quickly from the 6 digits over a million and now into the tens of millions in just a few short months, and

Re: (Score:2)

by martin-boundary ( 547041 )

The piece that is currently missing and needs to be created by the community is a way to automate DMCA takedowns on AI owning corporations. The tool needs to communicate with the AI, save the output of the interaction, and run it against a community database of artworks and forum comments with attributions. If a close enough match is found to suspect infringement, the artist is automatically alerted and a DMCA takedown is generated, ready to be sent out if and only if the artist gives the go-ahead.

This AI

Re: (Score:2)

by ArmoredDragon ( 3450605 )

> Anything you post on the Internet as an individual is available for a corporation to steal and there is almost nothing you can do about it. And they WILL steal it.

While I get that it's popular here to bash the evil corporations for any reason you can fathom, even when it doesn't make sense to do so. I can't help but wonder...this is stealing...how...? Reminds me of stuff like this:

[1]https://www.youtube.com/watch?... [youtube.com]

You know you're an asshole when, any time somebody does something that pisses you off, you feel you need to invent a law against it. Sure, I'm not a fan of relying on AI models that rely on data harvested from shitposting trolls like me. But at the same time

[1] https://www.youtube.com/watch?v=ag8HC8oCzuI

Are posts visible in BlueSky (Score:2)

by luttapi ( 312138 )

If the posts are visible - they will be scraped and used for any purpose whatsoever... including training AI. Maybe they are saying that nobody can see posts made on BlueSky yet....

Partial truth? (Score:5, Insightful)

by GoJays ( 1793832 )

While Bluesky says it won't train AI with your posts... it doesn't mean Bluesky won't sell your data to companies that WILL use it to train AI. Technically Bluesky isn't the company training the AI so it is true when they say; "Bluesky has no intention of using user data to train AI."

Re:Partial truth? (Score:4, Insightful)

by bill_mcgonigle ( 4333 ) *

The whole protocol and post stream is free and open.

100% chance somebody will.

Decentralized is great but tradeoffs exist.

Re: (Score:2)

by kmoser ( 1469707 )

Just because they don't have the intention now doesn't mean they won't have it tomorrow.

Muat be in the TOS (Score:2)

by cygnusvis ( 6168614 )

This must be in the TOS or it doesn't count. WHile where at it make sure that when the TOS changes to allow AI, its opt in

Re: (Score:2)

by martin-boundary ( 547041 )

Fuck man, you're deep in the weird shit. Log off for a bit, and find a flower. Maybe study it closely for 10 minutes.

WTF is this "bluesky"? (Score:2)

by Mr. Dollar Ton ( 5495648 )

How is it better than Mastodon, which is completely free, federated and not reliant on any one company? And how does it get traction here?

The most advantageous, pre-eminent thing thou canst do is not to exhibit
nor display thyself within the limits of our galaxy, but rather depart
instantaneously whence thou even now standest and flee to yet another rotten
planet in the universe, if thou canst have the good fortune to find one.
-- Carlyle