Comment on GitHub hits CTRL-Z, decides it will train its AI with user data after all
artwork@lemmy.world 4 days ago
The data GitHub wants includes:
- Model outputs that have been accepted or modified;
- Model inputs including code snippets shown;
- Code context surrounding your cursor position;
- Comments and documentation you’ve written;
- File names and repo structure;
- Interactions with Copilot features (e.g. chats); and
- Feedback (e.g. thumbs up/down ratings)

… As the FAQs explain: “If a Copilot user has their settings set to enable model training on their interaction data, code snippets from private repositories can be collected and used for model training while the user is actively engaged with Copilot while working in that repository.”