Comment on ChatGPT's new browser has potential, if you're willing to pay
brucethemoose@lemmy.world 22 hours agoI have access to GLM 4.6 through a service but that’s the ~350B parameter model and I’m pretty sure that’s not what you’re running at home.
It is. I’m running this model, with hybrid CPU+GPU inference, specifically: huggingface.co/…/GLM-4.6-128GB-RAM-IK-GGUF
You can likely run GLM Air on your 3060 if you have 48GB+ RAM. Heck. I’ll make a quant just for you, if you want.
MagicShel@lemmy.zip 22 hours ago
I’m going to upgrade my ram shortly because I found a bad stick and I’m down to 16GB currently. I’ll see if I can swing that order this weekend.
brucethemoose@lemmy.world 21 hours ago
To what?
64G would be good, as that’s enough to fit GLM Air. There are some good 2x64GB kits for 128GB as well.
MagicShel@lemmy.zip 20 hours ago
I’ll see about 128, then, but I’ll probably do 64. Just depends on cost. Any recs?
brucethemoose@lemmy.world 19 hours ago
For DDR5? Depends how much you care about latency:
pcpartpicker.com/products/memory/#ff=ddr5&Z=13107…
The $342 Crucial kit is kinda a no-brainer. Its timings aren’t great when overclocked, but it’s 5600 MHz out of the box, low voltage, and significantly cheaper per gigabyte than many 64GB/96GB kits. See for yourself:
www.igorslab.de/en/…/4/
The overclockability matters even less if you are on a 7000 series CPU.
I got the 1.25V Flare X5 kit because I wanted tighter timings for sim games, albeit at a MUCH lower price ($390).
RAM prices seem to be rising (hence the price of my kit spiked), so now is not a bad time to buy.