Comment on DeepSeek might not be such good news for energy after all
wewbull@feddit.uk 3 weeks ago
This is more about the “reasoning” aspect of the model, where it outputs a bunch of “thinking” before the actual result. In a lot of cases it easily adds 2-3x onto the number of tokens that need to be generated. That “thinking” isn’t really useful output in itself. It’s the model getting into a state where it can better respond.
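To put rough numbers on that, here is a minimal sketch of the token arithmetic. It assumes "adds 2-3x onto" means the reasoning tokens come on top of the answer tokens; the function name and the 500-token answer length are made up for illustration, not measured from any model.

```python
def total_generated_tokens(answer_tokens: int, reasoning_multiplier: float) -> int:
    """Total tokens generated when reasoning adds `reasoning_multiplier` times
    the answer length on top of the answer itself (an assumed reading)."""
    reasoning_tokens = round(answer_tokens * reasoning_multiplier)
    return answer_tokens + reasoning_tokens


answer_tokens = 500  # hypothetical answer length, for illustration only

for multiplier in (2, 3):
    total = total_generated_tokens(answer_tokens, multiplier)
    print(f"reasoning adds {multiplier}x: {total} tokens generated "
          f"for a {answer_tokens}-token answer")
```

Under that reading, the model generates three to four times as many tokens as the answer alone would need.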