Yes, though that’s not what they’re doing. They train on images uploaded to their marketplace and, of course, some of these are AI generated.
Comment on Adobe’s ‘Ethical’ Firefly AI Was Trained on Midjourney Images
Even_Adder@lemmy.dbzer0.com 10 months ago
Supplementary synthetic data increases the quality of the model.
General_Effort@lemmy.world 10 months ago
Even_Adder@lemmy.dbzer0.com 10 months ago
It’s fine as long as it’s not the majority.
General_Effort@lemmy.world 10 months ago
It doesn’t really matter how much it is. An image is an image.
balder1991@lemmy.world 10 months ago
Data augmentation has been a thing for a long time, but of course if the majority of your data is synthetic, your model will suck on real-world data. Though as these generative models get better and better at mimicking real-world data, and we select the results we want to use (removing the nonsense and hallucinations), we’re still feeding it “more data”.
I guess we’ll have to wait and see what effect it has on future models.
Even_Adder@lemmy.dbzer0.com 10 months ago
I’m just talking about how synthetic images affect model quality.
SomeGuy69@lemmy.world 10 months ago
Correct. To a certain extent, one can add AI data into AI; too much and you add noise, making the result worse, like a copy of a copy.
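For anyone who wants to see the "fine as long as it's not the majority" idea in concrete terms, here is a rough Python sketch of capping the synthetic share of a training set. The function name `mix_with_synthetic_cap`, the `max_synthetic_fraction` parameter, and the 0.25 default are all made up for illustration; nothing here reflects how Adobe or anyone else actually builds their datasets.

```python
import random


def mix_with_synthetic_cap(real, synthetic, max_synthetic_fraction=0.25, seed=0):
    """Return a shuffled list that keeps every real item and adds synthetic
    items only up to max_synthetic_fraction of the final total.

    The 0.25 default is an arbitrary illustrative value, not a recommendation.
    """
    if not 0 <= max_synthetic_fraction < 1:
        raise ValueError("max_synthetic_fraction must be in [0, 1)")

    # We want s / (len(real) + s) <= f, which rearranges to
    # s <= f * len(real) / (1 - f).
    max_synth = int(max_synthetic_fraction * len(real) / (1 - max_synthetic_fraction))

    rng = random.Random(seed)  # local RNG so the mix is reproducible
    chosen = rng.sample(synthetic, min(max_synth, len(synthetic)))

    mixed = list(real) + chosen
    rng.shuffle(mixed)
    return mixed
```

With 75 real images and a 0.25 cap, at most 25 synthetic ones are added no matter how many are available. This only controls the ratio; it does not do the quality filtering (removing nonsense and hallucinations) mentioned above, which would have to happen before the synthetic list is passed in.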