I’ve kinda just had a thought; I don’t know if it’s horrible or not, so you tell me.

  • 3D terrain in video games commonly starts with a fractal Perlin noise function.
  • This doesn’t look very good on its own, so extra passes are applied to improve it, such as rescaling the height distribution.
  • One such technique is hydraulic erosion, a heavy GPU simulation that carves riverbeds and folds into the terrain.
  • However, hydraulic erosion is VERY slow, and as such isn’t viable for a game like Minecraft that generates terrain in real time. It also doesn’t chunk well: a droplet’s path can depend on terrain outside the chunk it started in.
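For concreteness, the base pass above looks roughly like this. It’s a minimal sketch, not anyone’s actual implementation: I’ve used hashed value noise instead of true gradient Perlin noise to keep it short, and all function names and constants here are my own. The hashed integer lattice is what makes it deterministic and chunk-friendly, since any (x, y, seed) gives the same height regardless of which chunk asks.

```python
import math

def value_noise(x, y, seed=0):
    """Smoothly interpolated noise from a deterministic hash of lattice points."""
    def h(ix, iy):
        # arbitrary mixing constants; returns a pseudo-random value in [0, 1]
        n = ix * 374761393 + iy * 668265263 + seed * 1442695040888963407
        n = (n ^ (n >> 13)) * 1274126177
        return ((n ^ (n >> 16)) & 0xFFFFFFFF) / 0xFFFFFFFF

    ix, iy = math.floor(x), math.floor(y)
    fx, fy = x - ix, y - iy
    # smoothstep fade so cell borders join without visible creases
    u, v = fx * fx * (3 - 2 * fx), fy * fy * (3 - 2 * fy)
    a = h(ix, iy) + u * (h(ix + 1, iy) - h(ix, iy))
    b = h(ix, iy + 1) + u * (h(ix + 1, iy + 1) - h(ix, iy + 1))
    return a + v * (b - a)

def fbm(x, y, octaves=5, lacunarity=2.0, gain=0.5, seed=0):
    """Fractal (fBm) noise: sum octaves of noise at rising frequency, falling amplitude."""
    total, amp, freq, norm = 0.0, 1.0, 1.0, 0.0
    for _ in range(octaves):
        total += amp * value_noise(x * freq, y * freq, seed)
        norm += amp
        amp *= gain
        freq *= lacunarity
    return total / norm  # normalized height in [0, 1]
```

Sampling `fbm(x, y)` per column is what gives the classic rolling-hills heightmap that the erosion pass then has to make believable.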

But what if it didn’t have to? Why not train something like a diffusion image model on thousands of pre-rendered high-quality simulations, and then have it transform the output of a function like fractal Perlin noise? Basically “baking” a terrain pass inside a neural network. Inference would still be slow, but surely not slower than simulating thousands of rain droplets? It could easily be made deterministic and seamless across chunk borders, too. You could even train on real-world GIS data.
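To make “simulating thousands of rain droplets” concrete, here’s a toy sketch of the kind of pass that would be baked into the network. This is my own heavily simplified scheme, not a real simulator: production erosion also tracks water volume, velocity, and evaporation per droplet, which is where the cost explodes. Even this stripped-down version is droplets × path-length iterative work over the heightmap.

```python
import random

def erode(h, size, droplets=2000, capacity=0.05, seed=0):
    """Toy droplet-based hydraulic erosion on a flat size*size heightmap, in place."""
    rng = random.Random(seed)
    for _ in range(droplets):
        x, y = rng.randrange(size), rng.randrange(size)
        sediment = 0.0
        for _ in range(64):  # cap each droplet's lifetime
            # find the steepest downhill 4-neighbour
            best, bx, by = 0.0, x, y
            for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                nx, ny = x + dx, y + dy
                if 0 <= nx < size and 0 <= ny < size:
                    drop = h[y * size + x] - h[ny * size + nx]
                    if drop > best:
                        best, bx, by = drop, nx, ny
            if best == 0.0:
                break  # local minimum: nowhere downhill to go
            carve = min(best * 0.5, capacity)  # erode proportional to slope
            h[y * size + x] -= carve
            sediment += carve
            x, y = bx, by
        h[y * size + x] += sediment  # deposit whatever the droplet still carries
```

Note that every droplet reads and writes shared state in sequence, which is exactly why this parallelizes awkwardly and why a single feed-forward pass over a heightmap image is an appealing trade.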

Has this been tried before?