It’s distilled, so it’s going to be smaller than any non-distilled model of the same quality.
Comment on DeepSeek's distilled new R1 AI model can run on a single GPU | TechCrunch
blarth@thelemmy.club 1 week ago
7b trash model?
knighthawk0811@lemmy.world 1 week ago
LainTrain@lemmy.dbzer0.com 1 week ago
I’m genuinely curious what you do that a 7b model is “trash” to you? Like yeah sure a gippity now tends to beat out a mistral 7b but I’m pretty happy with my mistral most of the time if I ever even need ai at all.
TropicalDingdong@lemmy.world 1 week ago
Yeah idk. I did some work with deepseek early on. I wasn’t impressed.
HOWEVER…
Some other things they’ve developed like deepsite, holy shit impressive.
double_quack@lemm.ee 1 week ago
Save me the search, please. What’s deepsite?
TropicalDingdong@lemmy.world 1 week ago
Above is what I can do with deepsite by pasting in the first page of your lemmy profile and the prompt:
“This is double_quack, a lemmy user on Lemmy, a new social media platform. Create a cool profile page in a style that they’ll like based on the front page of their lemmy account (pasted in a ctrl + a, ctrl + c, ctrl + v of your profile).”
double_quack@lemm.ee 1 week ago
Excuse me… what? Ok, that’s something…
vhstape@lemmy.sdf.org 1 week ago
Most models come in 1B, 7-8B, 12-14B, and 27+B parameter variants. According to the docs, they benchmarked the 8B model using an NVIDIA H20 (96 GB VRAM) and got between 144 and 1,198 tokens/sec. Most consumer GPUs probably aren’t going to be able to keep up with that.
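If you want to sanity-check your own hardware, here’s a rough sketch of measuring tokens/sec locally, assuming an Ollama server on the default port and a deepseek-r1:8b tag (the model tag is an assumption; swap in whatever you’ve actually pulled):

```python
# Minimal sketch: measure local generation throughput via Ollama's HTTP API.
# Assumes Ollama is running on localhost:11434 and "deepseek-r1:8b" has been pulled
# (the tag is an assumption; use whatever model name you actually have).
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "deepseek-r1:8b",
        "prompt": "Prove that sqrt(2) is irrational.",
        "stream": False,
    },
    timeout=600,
)
data = resp.json()

# eval_count = number of generated tokens; eval_duration is reported in nanoseconds.
tokens = data["eval_count"]
seconds = data["eval_duration"] / 1e9
print(f"{tokens} tokens in {seconds:.1f}s -> {tokens / seconds:.1f} tok/s")
```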
avidamoeba@lemmy.ca 1 week ago
It proved sqrt(2) irrational with 40tps on a 3090 here. The 32b R1 did it with 32tps but it thought a lot longer.
vhstape@lemmy.sdf.org 1 week ago
On my Mac mini running LM Studio, it managed 1702 tokens at 17.19 tok/sec and thought for 1 minute.
brucethemoose@lemmy.world 1 week ago
Depends on the quantization.
7B is small enough to run in FP8 or a Marlin quant with SGLang/vLLM/TensorRT, so you can probably get very close to the H20 on a 3090 or 4090.
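For example, a minimal vLLM sketch under those assumptions (the HF repo id, context length, and sampling settings here are guesses, not anything DeepSeek documents; on pre-Hopper cards like a 3090, FP8 ends up as weight-only quantization via the Marlin kernels as far as I know):

```python
# Minimal sketch: serve the 8B distill with FP8 quantization in vLLM.
# Repo id, max_model_len, and sampling settings are assumptions; adjust to taste.
from vllm import LLM, SamplingParams

llm = LLM(
    model="deepseek-ai/DeepSeek-R1-0528-Qwen3-8B",  # assumed HF repo id
    quantization="fp8",
    max_model_len=8192,
)

params = SamplingParams(temperature=0.6, max_tokens=2048)
outputs = llm.generate(["Prove that sqrt(2) is irrational."], params)
print(outputs[0].outputs[0].text)
```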