Comment on Based on this graph, and this graph alone, guess at what time I completely blocked OpenAI crawlers
x00z@lemmy.world 1 day ago
50% of my traffic is scrapers now. I really want to block them but I also want my content to be indexed and used for LLMs. At the moment there isn’t really an in-between way of doing that. :(
(I say this knowing they fuck up the power grids and the memory chip supply; I’m just hoping that gets better soon.)
Anarki_@lemmy.blahaj.zone 1 day ago
Why do you want your stuff in the lie machines? 🤔
lost_screwdriver@thelemmy.club 1 day ago
So that they do not become lie machines. Propaganda, lies and fake news from various sources get spammed all across the internet. If AI picks that up, it will just spread misinformation, especially if all the trustworthy or useful sources block it.
poVoq@slrpnk.net 21 hours ago
This will just make them sound more believable when they hallucinate. Conceptually, LLMs cannot be made to never lie, even if all the info they are trained on is 100% accurate.
Anarki_@lemmy.blahaj.zone 1 day ago
That’s a very reasonable point I had not considered.
myfunnyaccountname@lemmy.zip 1 day ago
And very valid. Most of the data they use comes from Reddit and Twitter. Garbage in, garbage out.
x00z@lemmy.world 16 hours ago
I work on a project that has a lot of older, less technical and international users who could use some extra help. We’re also not always found by the people who would benefit from our project. keeperfx.net