Researchers published a massive database of more than 2 billion Discord messages that they say they scraped using Discord’s public API. The data was pulled from 3,167 servers and covers posts made between 2015 and 2024, the entire time Discord has been active.
Though the researchers claim they’ve anonymized the data, it’s hard to imagine anyone is comfortable with almost a decade of their Discord messages sitting in a public JSON file online. Separately, a different programmer released a Discord tool called “Searchcord” based on a different data set that shows non-anonymized chat histories.
asbestos@lemmy.world 4 weeks ago
Probably our only chance to find solutions to problems with open source software that uses Discord as their forum
boatswain@infosec.pub 4 weeks ago
Seriously. It’s beyond painful when some open source project only uses Discord for communication. You have to hope that you post your question at a time when the right people are online, and that there’s not a more interesting conversation going on, otherwise it just gets lost. Index that whole dataset.
ALostInquirer@lemm.ee 4 weeks ago
Given some similar issues, why is it some projects still use IRC then?
Peffse@lemmy.world 4 weeks ago
I’ve always wanted to contribute to The Cutting Room Floor wiki but they hide registration behind a Discord server bot that will give the registration code.
Ulrich@feddit.org 3 weeks ago
I’ve seen a few projects doing just that with www.answeroverflow.com and they have come up in my web searches. Not really a solution but at least a stopgap.
Dojan@pawb.social 4 weeks ago
I spent nearly three hours today between discord and matrix trying to figure out how to get these two pieces of software to talk using a certain protocol.
Imagine if there were online indexable platforms where people could publish this information so it’s easily accessible rather than having to scour through message logs hoping to find the right keywords. Such a technology surely doesn’t exist already, right?
I hate discord.
dual_sport_dork@lemmy.world 4 weeks ago
I don’t hate Discord, I simply hate that so many projects and companies have unanimously decided to use it as the wrong tool for the wrong job.
It’s fine for its intended use case, which is bickering with my friends about video games and fiction, and spamming each other with .gifs and meme images.
MDCCCLV@lemmy.ca 4 weeks ago
Yeah, but then you have something like when people protest deleted their history on reddit which is fine as a protest tactic but leaves a hole where your specific question came up but now there’s nothing there.
spiderhamster@lemmy.world 4 weeks ago
you get it to work? i didnt have time to get it working in both directions. matrix to discord worked fine but not the other way.
interdimensionalmeme@lemmy.ml 4 weeks ago
You mean NNTP ?
nawa@lemmy.world 4 weeks ago
Lol, I’ve read this headline and thought “thank fuck, probably the only option to have Discord’s content readable”, I like how universal this opinion is