Comment on 60% of Teachers Used AI This Year and Saved up to 6 Hours of Work a Week
RageAgainstTheRich@lemmy.world 5 days agoWouldn’t it then help to run the smaller ones locally instead of using the big ones like ChatGPT?
I read that one called Deepmind or something in china took a lot less to train and is just as strong. Is that true?
What do people usually use LLM’s for? I know they suck for most things people are using them for like coding. But what do people use them for that justifies all the hype?
Again, please don’t think i am trying to justify it. I just don’t know super much about them.
ExLisper@lemmy.curiana.net 5 days ago
Small models can only handle limited set of tasks. To cover a lot of different tasks you would need a lot of small models. What DeepSeek did was build a lot of small models with each acting as an expert on one topic (more or less). It’s more energy efficient to train but not necessarily to run as you have to chain a lot of small models to get good results.
What do people use LLM for? Asking questions you would normally ask Google. Google sucks now so it’s easier to ask ChatGPT. You can also use it for simple tasks like checking text for grammar errors, writing emails and so on.