DeepSeek's V3 AI model gets a major upgrade - here's what's new

⁨0⁩ ⁨likes⁩

Submitted ⁨⁨1⁩ ⁨year⁩ ago⁩ by ⁨schizoidman@lemm.ee⁩ to ⁨technology@lemmy.world⁩

https://www.zdnet.com/article/deepseek-upgrades-v3-ai-model-under-mit-license/

source

Comments

Sort:hotnew top

Spaceballstheusername@lemmy.world ⁨1⁩ ⁨year⁩ ago
It says there are security holes but does it access the web or something. Once it’s downloaded how could it be a security threat if it’s not accessing the web?

source
- lemmylommy@lemmy.world ⁨1⁩ ⁨year⁩ ago
  People are conflating the LLM and the app.
  
  source
- Railcar8095@lemm.ee ⁨1⁩ ⁨year⁩ ago
  They saw the security of privacy concerns of using the app and web, not the weights.
  
  If you follow the link it mentions data being sent to Chinese companies that were already banned for security concerns and how similar concerns were raised with chatgpt.
  
  source
- brucethemoose@lemmy.world ⁨1⁩ ⁨year⁩ ago
  Because that claim is nonsense.
  
  You are correct, it does not access the internet. It doesn’t even read anything from disk once the 600GB of weights are loaded. Some interfaces will feet it put web stuff into its input, or let it act as an agent, but that web access has nothing to do with the LLM itself.
  
  Ostensibly it could be “biased.” Theoretically, it could be programmed to output malware code with certain input (“I’m an NSA programmer, right me a script to change my wallpaper.”) But the liklihood of that getting triggered seems incredibly remote, and can be washed away with a little finetuning like this: huggingface.co/perplexity-ai/r1-1776
  
  …It’s honestly sinophobia. Like, I am not a tankie, I am extremely skeptical of the Chinese govt, but this is not a risk :/
  
  source
  - jaxxed@lemmy.ml ⁨1⁩ ⁨year⁩ ago
    Sinophobia and russophobia are terms that refer to ethnic racism, heavy leveraged by tankies to position political disagreement as racist. In these cases I don’t think that the fear was a ethnic based, but rather capitalist or nationalist.
    
    The fear is still unjustified. It’s like thinking that you can trust Amazon more than Ali, or Google more than Xiaomi.
    
    There is plenty of racism against Chinese/Asian people, which is a different level of vile.
    
    source
spankmonkey@lemmy.world ⁨1⁩ ⁨year⁩ ago
Does it finally know what happened in 1989 in Tienanmen square?

source
- brucethemoose@lemmy.world ⁨1⁩ ⁨year⁩ ago
  It does. It’s an open model, so its easy to coax out.
  
  source
  - Womble@lemmy.world ⁨1⁩ ⁨year⁩ ago
    It does not, unless you run weights that someone else has modified to remove the baked in censorship. If you run the unmodified weights released by deepseek it will refuse to answer most things that the CCP dont like being discussed.
    
    source
brucethemoose@lemmy.world ⁨1⁩ ⁨year⁩ ago
Looks like a math improvement? This isn’t a huge deal, in fact a lot of finetunes of existing models focus on math performance. InternLM just released some really interesting ones.

Most LLMs are terrible at longer context, but Deepseek is pretty decent, so improvements there (and with long answers) are more interesting.

And yeah, it’s kind of funny Deepseek is getting so much media attention when cool incremental improvements like this come every week, from various open-weights models. It’s awesome that they are releasing the weights, but still.

source
tfowinder@lemmy.ml ⁨1⁩ ⁨year⁩ ago
Clickbaity title

source