Comment on This happens after 3-4 days of running the server, then I have to restart it manually.
bulwark@lemmy.world 3 months ago
I’ve never seen this particular error, but CPU stall warnings seem like a fairly common thing. I wouldn’t jump straight to hardware fault, but it’s a possibility.
catloaf@lemm.ee 3 months ago
I’d lean toward bad hardware.
Try stress testing the CPU and RAM. See if you can get it to happen more frequently. Also see if you can disable that CPU core, either in the BIOS or in the OS, to see if the problem goes away.
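For the OS route on Linux, here is a minimal sketch, assuming the kernel exposes the usual sysfs CPU hotplug files under `/sys/devices/system/cpu`. The helper names `online_path` and `set_core_online` are made up for illustration, and writing to the real file needs root.

```python
from pathlib import Path

# Standard Linux sysfs location for per-CPU controls.
SYSFS_CPU = Path("/sys/devices/system/cpu")


def online_path(core: int, root: Path = SYSFS_CPU) -> Path:
    """Return the hotplug control file for one core, e.g. .../cpu3/online."""
    return root / f"cpu{core}" / "online"


def set_core_online(core: int, online: bool, root: Path = SYSFS_CPU) -> None:
    """Write 1 (online) or 0 (offline) to the core's control file."""
    path = online_path(core, root)
    if not path.exists():
        # Some cores (often cpu0) have no 'online' file and cannot be unplugged.
        raise FileNotFoundError(f"{path} not found; core {core} may not be hot-pluggable")
    path.write_text("1" if online else "0")
```

Running `set_core_online(3, online=False)` as root would take core 3 offline until the next reboot. The 3 is only a placeholder, so use whichever core the stall warning names.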
loganb@lemmy.world 3 months ago
I’m with catloaf. Consistent CPU soft locks point to a possible bad memory module or CPU.
Clear CMOS.
Try removing one memory module at a time.
See if there is an option to disable hyperthreading in the BIOS.
Another thing to try is to remove the CPU, being careful not to damage the LGA pins on the motherboard, and clean the CPU contacts with alcohol. Ground yourself and the case before handling the CPU out of its socket.
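If the BIOS has no such option, Linux can usually turn SMT off at runtime instead. A rough sketch, assuming the kernel provides `/sys/devices/system/cpu/smt/control`. The function names are made up for illustration, and the write needs root.

```python
from pathlib import Path

# Kernel SMT control file; reads back values such as "on", "off" or "notsupported".
SMT_CONTROL = Path("/sys/devices/system/cpu/smt/control")


def smt_state(control: Path = SMT_CONTROL) -> str:
    """Return the current SMT state string reported by the kernel."""
    return control.read_text().strip()


def disable_smt(control: Path = SMT_CONTROL) -> bool:
    """Turn SMT off if it is currently on. Returns True if a write was made."""
    if smt_state(control) != "on":
        # Already off, or not supported on this CPU; nothing to do.
        return False
    control.write_text("off")
    return True
```

The change lasts until reboot, which is enough to see whether the stalls stop with hyperthreading out of the picture.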
possiblylinux127@lemmy.zip 3 months ago
Don’t try to clean CPU pins. That is a very bad idea.
loganb@lemmy.world 3 months ago
I mean, speaking from experience, it’s resurrected a couple of problematic CPUs for me. CPU pins, no; pads on an LGA-style CPU, sure.
admin@sh.itjust.works 3 months ago
The CPU in this has no pins; it’s just contacts on the chip. The pins are on the motherboard, like with the new 7000 series Ryzen.