News: 1774484016

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

GitHub hits CTRL-Z, decides it will train its AI with user data after all

(2026/03/26)


Microsoft's GitHub next month plans to begin using customer interaction data – "specifically inputs, outputs, code snippets, and associated context" – to train its AI models.

The code locker’s revised policy applies to Copilot Free, Pro, and Pro+ customers, as of April 24. Copilot Business and Copilot Enterprise users are exempt thanks to the terms of their contracts. Students and teachers who access Copilot will also be spared.

Those affected have the option to opt out in accordance with "established industry practices" – meaning according to US norms as opposed to European norms where opt-in is commonly required. To opt out, GitHub users should visit [1]/settings/copilot/features and disable "Allow GitHub to use my data for AI model training" under the Privacy heading.

[2]

Mario Rodriguez, GitHub's chief product officer, would rather you didn't.

[3]

[4]

"By participating, you'll help our models better understand development workflows, deliver more accurate and secure code pattern suggestions, and improve their ability to help you catch potential bugs before they reach production," he wrote in a [5]blog post .

To excuse its covetous behavior, GitHub in its FAQs notes that [6]Anthropic , [7]JetBrains , and corporate parent [8]Microsoft operate similar opt-out data use policies.

[9]

The rationale for the change, according to Rodriguez, is that interaction data makes company AI models perform better. Adding interaction data from Microsoft employees has led to meaningful improvements, he claims, such as an increased acceptance rate for AI model suggestions.

The data GitHub wants includes:

Model outputs that have been accepted or modified;

Model inputs including code snippets shown;

Code context surrounding your cursor position;

Comments and documentation you've written;

File names and repo structure;

Interactions with Copilot features (e.g. chats); and

Feedback (e.g. thumbs up/down ratings).

[10]AI supply chain attacks don't even require malware…just post poisoned documentation

[11]Dell slims down business laptops, fattens up cooling and battery life

[12]Jen Easterly, cybersecurity's 'relentless optimist,' hopes feds come back to RSAC next year

[13]Oracle: AI agents can reason, decide and act - liability question remains

The policy shift does somewhat change the meaning of GitHub [14]private repositories , which are notionally "only accessible to you, people you explicitly share access with, and, for organization repositories, certain organization members." These might be more accurately described as "GitHub private * repositories," with the asterisk to denote the limits of GitHub’s definition of the word "private."

As the FAQs explain: "If a Copilot user has their settings set to enable model training on their interaction data, code snippets from private repositories can be collected and used for model training while the user is actively engaged with Copilot while working in that repository."

Recent banter in the GitHub community doesn’t include much enthusiasm for the plan. To judge by emoji votes alone, users have offered 59 thumbs-down votes and just three rocket ships, which we understand signal some measure of excitement.

[15]

But among the [16]39 posts commenting on the change at the time this article was filed, no one other than Martin Woodward, GitHub VP of developer relations, has really endorsed the idea.

User indignation might be somewhat mitigated if GitHub users recognized that OpenAI's [17]Codex – [18]used in GitHub Copilot – is "a GPT language model fine-tuned on publicly available code from GitHub." That verbiage shows the data-gorged AI horse is already out of the barn, so to speak.

Shutting the doors at this point won't change the fact that the AI industry is built on data gathered without asking for a strong indicator of enthusiastic consent. ®

Get our [19]Tech Resources



[1] https://github.com/settings/copilot/features

[2] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/devops&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=2&c=2acS9dnawnUc4bfpYhiRI6AAAAFM&t=ct%3Dns%26unitnum%3D2%26raptor%3Dcondor%26pos%3Dtop%26test%3D0

[3] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/devops&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=4&c=44acS9dnawnUc4bfpYhiRI6AAAAFM&t=ct%3Dns%26unitnum%3D4%26raptor%3Dfalcon%26pos%3Dmid%26test%3D0

[4] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/devops&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=3&c=33acS9dnawnUc4bfpYhiRI6AAAAFM&t=ct%3Dns%26unitnum%3D3%26raptor%3Deagle%26pos%3Dmid%26test%3D0

[5] https://github.blog/news-insights/company-news/updates-to-github-copilot-interaction-data-usage-policy/

[6] https://www.anthropic.com/news/updates-to-our-consumer-terms

[7] https://blog.jetbrains.com/blog/2025/09/30/detailed-data-sharing-for-better-ai/

[8] https://www.microsoft.com/en-us/microsoft-copilot/blog/2024/08/16/transparency-and-control-in-consumer-data-use/

[9] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/devops&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=4&c=44acS9dnawnUc4bfpYhiRI6AAAAFM&t=ct%3Dns%26unitnum%3D4%26raptor%3Dfalcon%26pos%3Dmid%26test%3D0

[10] https://www.theregister.com/2026/03/25/ai_agents_supply_chain_attack_context_hub/

[11] https://www.theregister.com/2026/03/25/dell_business_laptops/

[12] https://www.theregister.com/2026/03/25/jen_easterly_interview/

[13] https://www.theregister.com/2026/03/25/oracles_apps_and_agents_push/

[14] https://docs.github.com/en/repositories/creating-and-managing-repositories/about-repositories#about-repository-visibility

[15] https://pubads.g.doubleclick.net/gampad/jump?co=1&iu=/6978/reg_software/devops&sz=300x50%7C300x100%7C300x250%7C300x251%7C300x252%7C300x600%7C300x601&tile=3&c=33acS9dnawnUc4bfpYhiRI6AAAAFM&t=ct%3Dns%26unitnum%3D3%26raptor%3Deagle%26pos%3Dmid%26test%3D0

[16] https://github.com/orgs/community/discussions/188488?sort=top

[17] https://arxiv.org/abs/2107.03374

[18] https://microsoft.github.io/prompt-engineering/

[19] https://whitepapers.theregister.com/



Smell that?

Sorry that handle is already taken.

That's desperation.

Re: Smell that?

cyberdemon

That's [1]enshittification

[1] https://m.youtube.com/watch?v=T4Upf_B9RLQ

Re: Smell that?

ecofeco

When the shit hits your eye

Like a big shitzza pie

That's enshitification.

No it doesn't ryhme. So sue me . :)

Re: Smell that?

cyberdemon

And er, antitrust. They should never have been allowed to buy GitHub (or LinkedIn), many of us said at the time that they will take everyone's private/proprietary code as their property, and now they are doing exactly that and more. (if they haven't already of course. This policy change could just be a fig-leaf to justify private-data-slurping that has already occurred)

Here's hoping that microslop will be forced to delete whatever plagiarism-model results from this. It is absolutely NOT OK to steal people's private repos (and even their code review styles) without permission.

And thanks for the settings link. Although I wouldn't be surprised if it is nothing more than an an additional bitfield in my data profile, treated with no more respect than a "do not track" marker in a HTTP request..

If you're still on GitHub you probably deserve this

sarusa

I moved all my (and my companies') repositories over to Gitlab 6 months ago, which is arguably even late to the game, and deleted everything we had on GitHub. Of course nothing is ever truly deleted, but...

It's been 100% predictable (and was predicted) where all this has been going since GIthub got bought by Microslop. There was no other reason for MS to buy it other than to hoover up all your code for slop training and it was inevitable all your stuff would go to train their shit. Any exceptions for now are just part of the slow erosion, they can't last. Because literally the only thing Slopya cares about is slop. Good thing his parents named him that, huh?

Thanks for paragraph 3

DrewPH

Both options are now disabled.

(I would never use Github to host my own code but I like to follow issues sometimes)

WHCOULDAKNOWED?!

ecofeco

Bwhahahahaahahahahahahahahahahahahahaa

SUCKERS!

You. Were. Warned.

Anonymous Coward

I've never been tempted to use GitHub, but... I do have some really bad code in some old git repos that I had abandoned but which I could move across...

Microslop is welcome…

Bebu sa Ware

to any of the smelly old code of mine that I have dropped on github and forgotten to flush.

Should poison the spring from which Crapilot is drawing its buckets of wisdom.

Some of it is very old indeed like a /sbin/mount wrapper for SGI Irix 6.x that performed Solaris type loop back mounts (mount.lofs) — can see that being really useful. ;) As too some I2O stuff for an obscure Adaptec scsi adaptor twenty+ years ago.

Exciting!

DJSpuddyLizard

Code snippets ?

So the plan is to make the AI generate Stack Exchange-level garbage code ?

::shrugs::

jake

Just about everybody I know pulled everything out of github when redmond took over ... Easier to put it in the rear-view on one's own terms, before the inevitable happens.

Groo The Wanderer - A Canuck

No need to opt out. I stopped using GitHub Copilot after experimenting with it and a few different LLMs and their code suggestions for a couple weeks to deal with one particular issue I had. I didn't like the way I found it working, so I haven't used it for anything else - I consider it a fraudulent technology. Sure it spits out text super-fast, but it isn't sane text, and you have to spend more time fixing the mistakes it made than it would have taken to just do it from scratch without making mistakes in the first place.

Talk about a balloon waiting to be popped; this whole "agentic" mission is just more Bafflegab for the Peons who Pay The Bills.

And man are those bills ever getting big considering it doesn't even work as promised...

Sex dumps core
(Sex is a Simple editor for X11)
-- Seen on debian bugtracking