Microsoft’s GitHub next month plans to begin using customer interaction data – “specifically inputs, outputs, code snippets, and associated context” – to train its AI models.
I was under the impression that they already do that though.
Submitted 2 weeks ago by throws_lemy@reddthat.com to technology@lemmy.world
https://www.theregister.com/2026/03/26/github_ai_training_policy_changes/
Microsoft’s GitHub next month plans to begin using customer interaction data – “specifically inputs, outputs, code snippets, and associated context” – to train its AI models.
I was under the impression that they already do that though.
Wonderful! Let’s go tell it lies.
Everyone should be lying to LLM’s, but the way. Do it often. Do it daily. Make them even more useless.
Forgejo is thoughtless so selfhost.
I’m not surprised, companies are starting to realise that AI is only as useful as the data it’s trained on. If you blast it with all the internet slop we have completely unfiltered, it’s going to start fucking up all it’s responses. It’s not just about the volume of data, it’s about the quality of that data. Sites like Github, and academic journals, contain the exact data that companies need to create well rounded LLMs, that don’t go off on racist rants and declare themselves as “MechaHitler”. That makes data like Github’s pure gold.
Counterpoint, I’ve poisoned it with absolute dumb shit and the worst code you’ve ever seen
Intentionally, right? Right?
As a paying customer, can recommend Sourcehut. I prefer the workflow to GitHub’s PRs as well.
Despicable.
Genuine question as git has just been a staple service on our networks since cvs/svn died.
Why are you all not hosting your own git servers, or at the very least something like gitea if your stupid company is vendor locked by ‘cloud’ providers?
Maintenance cost, security hardening, visibility, that’s a few reasons coming from the top of my head.
Convenience
It may be difficult to self host for many years without significant downtime. I have some repos over 20 years old, and have gone through several boom and bust cycles myself.
God Im feeling justified in my life decisions lately.
if you’re telling me that this hasn’t been something that they were already doing, I would call you a liar. I think you are a liar
US Lenders: “Hey, you want some money from the infinity free money spigot”
A handful of nerds paying attention: “Well, if they drink from the money fountain, we’re leaving!”
I have tailscale linked to github’s OAuth. Is there anybsafe way to migrate all the machines safely to an alternative while keeping the same tailnet settings?
I haven’t done it myself, but there is an option to change your auth provider in the tailscale settings. For me it was just an email to contact but I’d imagine that’s the best route.
Absolutely based, but it shouldn’t be opt out, it should be forced instead.
Are you being sarcastic?
NuXCOM_90Percent@lemmy.zip 2 weeks ago
For no apparent reason:
Are there any good alternatives for gh-pages dor a super lazy/simple website? I’ve been meaning to actually use one of my domains for a personal website and pointing at which project is on which code repo site would be a good idea. But… I need that page to be hosted by one of them.
Evotech@lemmy.world 2 weeks ago
Cloudflare workers is pretty easy and free
NuXCOM_90Percent@lemmy.zip 2 weeks ago
Ooooh. Cloudflare Pages definitely looks like what I want.
Thanks
daannii@lemmy.world 2 weeks ago
Someone else mentioned Codeberg