Comment on New York state bans DeepSeek from government devices
Hackworth@lemmy.world 21 hours agoIt’s a free o1/o3 equivalent at a time when you’d have to pay otherwise. But in the short interim, Google’s made their r model free to use. And the distillations aren’t half bad.
count_dongulus@lemmy.world 16 hours ago
Lol have you not used o1/o3? They show the inner monologue too. Fun little pretend detail to keep you entertained while the model takes 30 seconds to respond.
Hackworth@lemmy.world 9 hours ago
o1/o3 use a smaller model to summarize the reasoning, but they don’t show the actual CoT generation the way deepseek does.