Comment on Hexbear has been temporarily defederated - UPDATED now federated
db0@lemmy.dbzer0.com 1 month ago
Thanks so much for doing the legwork on this. I was going nuts trying to figure out where seemingly random downtimes were coming from. It felt like a DOS and this cause explains why.
Out of curiosity, how did you trace this root cause?
Demigodrick@lemmy.zip 1 month ago
I noticed in the logs before every timeout there were lots of “verify” words appearing, and in each iteration of that statement there were more and more verify words. Honestly had no idea what it meant at the point, only that I didn’t recognise it from looking at lemmy logs previously, it always appeared before a crash, and it felt suspicious.
Here’s an example from some logs before a crash:
I pinged some logs over to Nutomic on matrix, who thought it might have been related to nested comments, and then I noticed Dessalines had made the linked thread, which matched pretty much with what I was seeing behaviour and logs-wise.
Usefully the logs link the object it’s fetching, and 9 times out of 10 its a deeply nested hexbear thread! Or someone from another instance commenting on a nested hexbear thread. Nutomic confirmed the behaviour based on the logs in the issue, and I’m pulling the logs when I get chance to see what other threads are causing it to crash, although hopefully the fix will make it’s way into 0.19.18 beta 3 so I can stop worrying about it!
Did you also see the db cpu spiking during this period?
Demigodrick@lemmy.zip 1 month ago
No, no meaningful cpu spikes I could make out anywhere, although admittedly I was focusing on the lemmy server container mostly
mathemachristian@lemmy.blahaj.zone 1 month ago
There has to be a better way to gain visibility for mutual_aid posts because good god
frongt@lemmy.zip 1 month ago
Voting and sorting by top.