Comment on Nvidia loses $500 bn in value as Chinese AI firm jolts tech shares

<- View Parent
UnderpantsWeevil@lemmy.world ⁨3⁩ ⁨days⁩ ago

No I dont have thousands of almost top of the line graphics cards to retain an LLM from scratch

Fortunately, you don’t need thousands of top of the line cards to train the DeepSeek model. That’s the innovation people are excited about. The model improves on the original LLM design to reduce time to train and time to retrieve information.

Contrary to common belief, an LLM isn’t just a fancy Wikipedia. Its a schema for building out a graph of individual pieces of data, attached to a translation tool that turns human-language inputs into graph-search parameters. If you put facts about Tianamen Square in 1989 into the model, you’ll get them back as results through the front-end.

You don’t need to be scared of technology just because the team that introduced the original training data didn’t configure this piece of open-source software the way you like it.

that’s still no excuse to sweep under the rug blatant censorship of topics the CCP dont want to be talked about.

Wow ok, you really dont know what you’re talking about huh?

source
Sort:hotnewtop