Comment on Adobe’s ‘Ethical’ Firefly AI Was Trained on Midjourney Images
General_Effort@lemmy.world 7 months agoYes, though that’s not what they’re doing. They train on images uploaded to their marketplace and, of course, some of these are AI generated.
Even_Adder@lemmy.dbzer0.com 7 months ago
It’s fine as long as it’s not the majority.
General_Effort@lemmy.world 7 months ago
It doesn’t really matter how much it is. An image is an image.
balder1991@lemmy.world 7 months ago
Data augmentation is a thing since a long time, but of course if the majority of your data is synthetic your model will suck on real world data. Though as these generative models get better and better at mimicking real world data and we select the results we want to use (removing the nonsense and hallucinations), we’re still feeding it “more data”.
I guess we’ll have to wait and see what effect it’ll produce on the future models.
General_Effort@lemmy.world 7 months ago
Yes, that’s one way of putting it. What gets into the Adobe stock database is already curated. They also have the sales and tracking data.
Also yes on this. It doesn’t matter if your data is synthetic but only if it’s fit for purpose. That’s especially true in this case, where the distinction between synthetic and real is so unclear. You’re already including drawings, renders, photomanips, etc. I have no idea what kind of misconception people have that they would think it matters if some piece of digital art is AI generated.
Even_Adder@lemmy.dbzer0.com 7 months ago
I’m just talking about synthetic images affect model quality.
General_Effort@lemmy.world 7 months ago
It doesn’t matter how the image was made. It only matters what it is like and how it is used to affect the model.