brucethemoose
@brucethemoose@lemmy.world
- Comment on public services of an entire german state switches from Microsoft to open source (Libreoffice, Linux, Nextcloud, Thunderbird) 10 hours ago:
Pytorch Nightly: pytorch.org/blog/compromised-nightly-dependency/
theregister.com/…/pypi_pytorch_dependency_attack/
The malicious binary would upload files ranging in size up to 99,999 bytes and send the contents to a specified domain.
Was pretty scary from my perspective.
- Comment on public services of an entire german state switches from Microsoft to open source (Libreoffice, Linux, Nextcloud, Thunderbird) 10 hours ago:
PyTorch Nightly:
- Comment on public services of an entire german state switches from Microsoft to open source (Libreoffice, Linux, Nextcloud, Thunderbird) 11 hours ago:
There absolutely are. I barely missed a linux virus from a hijacked python package what… two years ago?
- Comment on The Witcher 3 is getting cross-platform mod support 3 days ago:
I have lost track of them, lol. Isn’t that just SE underneath… I think I inherited that too, somehow.
- Comment on The Witcher 3 is getting cross-platform mod support 3 days ago:
Skyrim Special Edition is the last stop for Skyrim modding, isn’t it? I somehow got that even though I only bought the game once, heh.
- Comment on Why doesn't Nvidia have more competition? 3 days ago:
I dunno. From my more isolated perspective on GitHub and small LLM testing circles, I see a lot of 3090s, 4090s, sometimes arrays of 3060s/3090s or old P40s or MI50s, which people got basically for the purpose of experimentation and development because they can’t drop (or at least justify) $5K.
They would 100% drop that money on a 7900 48GB instead (as the sheer capacity is worth it over the speed hit and finickiness), and then do a whole bunch of bugfixing/testing on them. I know I would. Hence the Framework Strix Halo thing is sold out even though it’s… rather compute-lite compared to a 3090+ GPU.
It seems like a tiny market, but a lot of the frameworks/features/models being developed by humble open source devs filter up to the enterprise space.
- Comment on Why doesn't Nvidia have more competition? 3 days ago:
WRT pricing, I’m pretty sure AMD is typically a fraction of the price of Nvidia hardware on the enterprise side
I’m not as sure about this, but seems like AMD is taking a fat margin on the MI300X (and its sucessor?), and kinda ignoring the performance penalty. It’s easy to say “build it yourself!” but the reality is very few can, or will, do this, and will simply try to deploy vllm or vanilla TRL or something as best they can (and run into the same issues everyone does).
The ‘enthusiast’ side where all the tinkerer devs reside is totally screwed up though. AMD’s mirroring Nvidia’s VRAM cartel pricing when they have absolutely no reason to. It’s completely bonkers. AMD would be in a totally different place right now if they had sold 40GB/48GB 7900s for an extra $100 or $200.
The biggest culprit from what I can gather is that AMD’s GPU firmware/software side is basically still ATI camped up in Markham, divorced from the rest of the company in Austin that is doing great work with their CPU-side.
Yeah, it does seem divorced from the CPU division. But a lot of the badness comes from business decisions, even when the silicon is quite good, and some of that must be from Austin.
- Comment on Cyberpunk 2 is now in preproduction, CD Projekt says 3 days ago:
That’s what I get for not clicking through!
Good! I can see a ton of gamers complaining about this, but switching to anything but in-house is a great move IMO.
- Comment on Why doesn't Nvidia have more competition? 3 days ago:
Except they didn’t.
They repeatedly fumble the software with little mistakes (looking at you, Flash Attention). They price the MI300X and any high VRAM GPU through the roof, when they have every reason to be more competitive and undercut Nvidia. They have sad, incomplete software efforts divorced from what devs are actually doing, like their quantization framework or some inexplicably bad LLMs they trained themself.
They give no one any reason to give them a chance, and wonder why no one comes. Lisa Su could fix this with literally like two phone calls (remove VRAM restrictions on their OEMs, and fix stupid small bugs in ROCM), but they don’t.
- Comment on DeepSeek's distilled new R1 AI model can run on a single GPU | TechCrunch 3 days ago:
Depends on the quantization.
7B is small enough to run it in FP8 or a Marlin quant with SGLang/VLLM/TensorRT, so you can probably get very close to the H20 on a 3090 or 4090.
- Comment on The Witcher 3 is getting cross-platform mod support 3 days ago:
No offense, but it feels a little late in the game’s life cycle to hit “critical mass” for modding. I mean, I guess it has a long sales tail and other adaptations will drive people to the game.
Still, this is good! Better now than never.
- Comment on Hideo Kojima casually reveals a Death Stranding anime is in the works 3 days ago:
- Comment on Cyberpunk 2 is now in preproduction, CD Projekt says 3 days ago:
Spicy take: I hope they dump 2077’s engine and go Unreal.
I recently followed this guide to try and set up “optimized” PT in 2077, and on my lowly RTX 3090 it runs like cold molasses. Not a chance. RT reflections is all I can manage, and it looks… good.
Meanwhile, I’ve also been playing Satisfactory (an Unreal Engine game from a comparatively microscopic studio), and holy moly. Unreal’s dynamic lighting looks scary good. Like, I get light bounces and reflections and everything, and it runs at like quadruple the FPS in a massively complex scene.
- Comment on I just came across an AI called Sesame that appears to have been explicitly trained to deny and lie about the Palestinian genocide 1 week ago:
Probably just “safety” data snuck into its alignment training + an overly zealous system prompt on political topics: I bet it blocks anything it considers “political” or sensitive.
There are models out of Israel that could have a more explicit slant (try Jamba), but this doesn’t seem to be one of them.
To me, a fundamental problem is hiding technical knobs from users. Logprobs, sampling, the system prompt, starting replies for it to continue: there are tons of ways to “jailbreak” LLMs and get them to have an open “discussion” about (say) Palestine, but they’re all hidden here.
- Comment on EA never grasped Dragon Age's value as an RPG, says Inquisition writer 1 week ago:
Side note, but even with all their troubles/turnover, I still love RPS’s hint of bite in their news writing (outside the columns).
- Comment on [deleted] 2 weeks ago:
Certain subreddits used to be like this.
But all my favorites have taken one of two paths:
-
Get algorithmically deprioritized (due to a “bug” as an admin told a mod), and hemorrhage users. The collective ‘intelligence’ of the sub in particular drains; interesting intellectual discussions are gone. One such example: /r/localllama
-
The sub gets huge. Bots repost memes as attention farms. It doesn’t feel like a small town anymore. Deeper discussions drain away in favor of shallow repetition of the same things over and over again. One example of this for me is /r/thelastairbender.
-
- Comment on Microsoft's Xbox Handheld: Switch-Like Dock and Multi-Platform Support 2 weeks ago:
The base M4 is a very small chip with a modest memory config. Don’t get me wrong, it’s fantastic, but it’s more Steam Deck/laptop than beefy APU (which the M4 Pro is a closer analogue to).
- Comment on Microsoft's Xbox Handheld: Switch-Like Dock and Multi-Platform Support 2 weeks ago:
Yeah, that would be perfect!
Or (alternatively) they could majorly underclock the a shrunken series X chip to make it equivalent to an S.
- Comment on Microsoft's Xbox Handheld: Switch-Like Dock and Multi-Platform Support 2 weeks ago:
Games are complex. Qualcomm/MS may tune it for the most popular titles, but I just don’t see how they can catch up to years of desktop GPU driver development.
- Comment on AMD Ryzen AI Max+ "Strix Halo" Delivers Best Performance On Linux Over Windows 11 - Even With Gaming 2 weeks ago:
Yeah, any framework with a “big” GPU is just so expensive.
- Comment on AMD Ryzen AI Max+ "Strix Halo" Delivers Best Performance On Linux Over Windows 11 - Even With Gaming 2 weeks ago:
Eh, yeah, and it’s backordered.
Ideally I’d like a full x16 slot too (or at least electrical x8), but perhaps that’s asking too much.
- Comment on Microsoft's Xbox Handheld: Switch-Like Dock and Multi-Platform Support 2 weeks ago:
I will believe it when I see it. I hope so.
Qualcomm makes a lot of hype/noise but historically tends to overpromise, and also makes some unforced blunders.
- Comment on Microsoft's Xbox Handheld: Switch-Like Dock and Multi-Platform Support 2 weeks ago:
It means emulation with pretty much every current title, and graphics driver issues and sluggish game out of the wazoo (as Qualcomm is very different than AMD/Intel/Nvidia).
ARM being more power efficient is also kind of a meme. Intel/AMD can be extremely good when clocked low (which they can do since there’s no emulation overhead), with both the CPU/GPU. Apple just makes x86 look bad because they burn a ton of money on power efficiency, but Qualcomm is more in the “budget” space.
- Comment on Microsoft's Xbox Handheld: Switch-Like Dock and Multi-Platform Support 2 weeks ago:
With a Qualcomm chip though… there will be some teething issues, best case.
- Comment on AMD Ryzen AI Max+ "Strix Halo" Delivers Best Performance On Linux Over Windows 11 - Even With Gaming 2 weeks ago:
These things are awesome.
My dream is:
-
Selling one embedded onto an ITX board.
-
An SKU with a single (8 core (ideally X3D?)) CCD but the full GPU.
-
- Comment on Microsoft's Xbox Handheld: Switch-Like Dock and Multi-Platform Support 2 weeks ago:
Using Qualcomm chips
Oof.
Why didn’t they go AMD, or hell, even Intel? They have big APUs in the pipe that would mostly just work.
- Comment on ChatGPT does not fuck around 2 weeks ago:
Ahh…
- Comment on ChatGPT does not fuck around 2 weeks ago:
I mean, there’s no way that address is really OOPs, heh, unless it got it from the IP (which could be injected into the chat I suppose).
- Comment on ChatGPT does not fuck around 2 weeks ago:
A federal agent to inject themselves into a random chat? I find that extremely unlikely.
It’s possibly an existing joke it found in a web search with similar coordinates? That it can do. Or maybe it got lucky and stumbled upon them.
- Comment on YouTube's new ad strategy is bound to upset users: YouTube Peak Points utilise Gemini to identify moments where users will be most engaged, so advertisers can place ads at the point. 2 weeks ago:
Google’s been deploying engagement models before anyone even knew the name OpenAI.
This is oldschool machine learning. Gemini is just a brand.