And you quickly realize that when you generate things from similar prompts over and over the model gives you the same results but slightly adjusted.
That’s quite true, however it’s worth keeping in mind that this is largely not due to a limit in the model itself, but a limit how that model interfaces with the human. Text just isn’t very good to get specific results, especially not when it lacks the incremental refinement that you can do in ChatGPT with follow up prompts.
On the other side if I take StableDiffusion with ControlNet, instead of just a text prompt, I can generate far more specific results, as I can feed other images and sketches into the generation.
but I think once the new toy factor wears off people will realize they aren’t as good as they seem.
Quite the opposite, there is a ton of hidden potential still left to uncover. We have barely even started to train them on video or 3D data, integration of image models with newer language models is also a work in progress and integration into old-school image manipulation tools has just began as well.
Worth keeping in mind that Dalle-1 isn’t even three years old. We are basically still in the Atari2600 days of image generation.
Meanwhile Dalle-3 comes along and can produce this level of quality with a complete generic prompt: “A fan-art of Guardians of the Galaxy Vol. 3” on the first try.
I think the next “revolution” in art is going to be having human art as a selling point
The big problem for artists is that AI art drives the value of art down to zero. It’ll be hard to convince anybody to pay hundred of dollars for something when AI can produce something similar in 30sec for free. Worse yet, AI can take any existing image and remix it. The whole idea of a singular static images feels quite restrictive once you played around with AI art for a while, as everything is just a few clicks away from being something different.
I think the idea of AI art as just generators for stock images doesn’t capture the magnitude of the changes that are coming. We are straight up heading into Holodeck territory where you tell the computer what you want and you get it. The AI generators won’t be a tool for the artists, but go right to the users. There won’t be an static image that comes out the other end, the AI will be the medium of media consumption. Just like people today can flip through TikTok, future people will flip through a AI generated stream of content custom made for them.
Wanna play some 2D game with snow and ice? Tell the computer, a couple seconds waiting, and boom here it is. First try. Want lava instead? Done. How do you compete with that as a human when AI can pull that out of thin air in seconds?
thehatfox@lemmy.world 1 year ago
I wish more people realised this. It’s much harder to create very specific images with the current image generation tools than most people seem to think, which is creating an inaccurate view of the technology in the public eye.
The generator will create something inspired by the prompt it is given, but it can be very hard to make it match the output the prompt writer imagines when writing the prompt. There are various tools that can refine and narrow the generator’s output, to try and control things like posing, composition, style etc and to redraw details. But even then it’s often pot luck as to the output. The generated images aren’t necessarily bad, just not what was wanted.
I think the comparison to stock photo images is apt, current image generators are great for creating themed but somewhat generic images. The tools are going to continue to advance, and they are useful in for some applications already. But they are still a long way off from truly replacing human artistry.
lloram239@feddit.de 1 year ago
The crux with that argument is that the artists is the only one that cares about specific output, meanwhile the art consumer doesn’t. When somebody plays a game or watch a movie, they don’t know what to expect, that’s part of the fun, they just care about it being good. So as long as the output is good enough for the consumer, whatever the artists thinks about it, really doesn’t matter, assuming they still have a job to begin with.