As of April 24 you’ll be feeding the Octocat unless you opt out
The data GitHub wants includes:
- Model outputs that have been accepted or modified;
- Model inputs including code snippets shown;
- Code context surrounding your cursor position;
- Comments and documentation you’ve written;
- File names and repo structure;
- Interactions with Copilot features (e.g. chats); and
- Feedback (e.g. thumbs up/down ratings)…As the FAQs explain: “If a Copilot user has their settings set to enable model training on their interaction data, code snippets from private repositories can be collected and used for model training while the user is actively engaged with Copilot while working in that repository.”
driving_crooner@lemmy.eco.br 2 weeks ago
What if people start sending malicious code to github? Poisoning the AI model
poop@lemmy.zip 2 weeks ago
That’s what I tell myself I’m doing when I push more poorly written code to one of my repos