Comment on Spotify is going to clone podcasters’ voices — and translate them to other languages

Pete90@feddit.de ⁨1⁩ ⁨year⁩ ago

I looked at your sources, or at least one of them. The problem is that, as you said, I am a layman, at least when it comes to AI. I do know how fMRI works, though.

And I stand corrected. Some of those pictures do closely resemble the original. Impressive, although not all subjects seem to produce the same level of detail and accuracy. Unfortunately, I have no way to verify the AI side of the paper. It is mind-boggling that such images can be constructed from voxels of that size. A 1.8 mm voxel contains close to 100k neurons and even more synapses. And the fMRI signal itself is only a blood oxygen overshoot in these areas, not a direct measurement of neural activity. It makes me wonder what constraints and tricks had to be used to generate these images. I guess combining the semantic meaning of the image with its broader appearance helped. That is, inferring pixel color (e.g. mostly blue with some gray in the middle), then adding the semantic meaning (plane), and then combining the two.

Truly amazing, but I do remain somewhat sceptical.
