Comment on Microsoft inks deal to restart Three Mile Island nuclear reactor to fuel its voracious AI ambitions

areyouevenreal@lemm.ee 1 month ago

I am not talking about things like ChatGPT, which rely more on raw compute and scaling than some other approaches and are hosted in massive data centers. I actually find that approach wasteful as well. I am talking about some of the open-weights models that use a fraction of the resources for similar quality of output. According to some industry experts, that will be the way forward anyway, since purely making models bigger has limits and is hella expensive.

Another thing to bear in mind is that training a model is more resource intensive than using it, though people have been working on that too.
