Image generation tech has gone crazy over the past year and a half or so. At the speed it’s improving, I wouldn’t rule out the possibility.
Here’s a paper from this year discussing text generation within images (it’s very possible these methods aren’t SOTA anymore – that’s how fast this field is moving): …thecvf.com/…/Rodriguez_OCR-VQGAN_Taming_Text-Wit…
deranger@sh.itjust.works 1 year ago
Bing Image Creator uses the new DALL-E model, which does hands and text pretty well.
dotMonkey@lemmy.world 1 year ago
People forget just how fast this tech is evolving.
S_H_K@lemmy.fmhy.net 1 year ago
Absolutely. SDXL with LoRAs can already do a lot of what was thought impossible.
seralth@lemmy.world 1 year ago
Yeah, every time I’ve seen anyone say “I’ve never seen it,” it makes it really obvious how little people actually know about the tech or follow it.
They basically saw it once a year ago and think it’s still the same.