douglasg14b
@douglasg14b@lemmy.world
- Comment on if portals are invented, will I be able to eat out myself? 1 day ago:
Love this
- Comment on If the color of the Sun was orange, wouldn't the clouds and everything white also be orange? My friend is adamant that 30 years ago the "real" Sun was orange but got replaced with a white LED. 1 day ago:
You can’t fix stupid, it will only drag you down with it.
- Comment on Microsoft CEO warns that we must 'do something useful' with AI or they'll lose 'social permission' to burn electricity on it 3 days ago:
It is, just not for the pleb class.
- Comment on A Project to Poison LLM Crawlers 1 week ago:
I assume that the gitea instance itself was being hit directly, which would make sense. It has a whole rendering stack that has to reach out to a database, get data, render the actual webpage through a template…etc
It’s a massive amount of work compared to serving up static files from say Nginx or Caddy. You can stick one of these in front of your servers, and cache http responses (to some degree anyways, that depends on gitea)
Benchmarks like this show what kind of throughput you can expect on say a 4 core VM just serving up cached files: blog.tjll.net/reverse-proxy-hot-dog-eating-contes…
90-400MB/s derived from the stats here on 4 cores. Enough to saturate a 3Gb/s connection. And caching intentionally polluted sites is crazy easy since you don’t care if it’s stale or not. Put a Cloudflare cache in front of it and it’s even easier.
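For anyone who wants to check the unit conversion, here is a rough sketch. It assumes decimal units (1 MB = 1,000,000 bytes, 1 Gb = 1,000,000,000 bits); the function name is just for illustration.

```python
def mb_per_s_to_gbit_per_s(mb_per_s: float) -> float:
    """Convert megabytes per second to gigabits per second (decimal units)."""
    return mb_per_s * 8 / 1000


# Low and high ends of the 90-400MB/s range quoted above.
low_gbit = mb_per_s_to_gbit_per_s(90)
high_gbit = mb_per_s_to_gbit_per_s(400)
print(f"{low_gbit:.2f} to {high_gbit:.2f} Gb/s")
```

The top of that range works out to a little over 3Gb/s, which is where the saturation figure comes from.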
- Comment on A Project to Poison LLM Crawlers 1 week ago:
This is assuming aggressively cached, yes.
Also “Just text files” is what every website is sans media. And you can still, EASILY get 10+ MB pages this way between HTML, CSS, JS, and JSON. Which are all text files.
A gitea repo page for example is 400-500KB transferred (1.5-2.5MB decompressed) of almost all text.
If you have a repo with 150 files, and the scraper isn’t caching assets (many don’t), then you just served up 60MB of HTML/CSS/JS alongside the actual repository assets.
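The back-of-the-envelope math, as a small sketch. It assumes one page load per file at the 400-500KB transferred size mentioned above, and decimal units; the function name is made up for the example.

```python
def total_transfer_mb(pages: int, kb_per_page: float) -> float:
    """Total data transferred in MB when every page is fetched without cached assets."""
    return pages * kb_per_page / 1000


# 150 file pages at the low and high end of the per-page size.
low_mb = total_transfer_mb(150, 400)
high_mb = total_transfer_mb(150, 500)
print(f"{low_mb:.0f} to {high_mb:.0f} MB")
```

So 60MB is the low end; at 500KB per page it is closer to 75MB.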
- Comment on 1 week ago:
Fair fair. I missed that
- Comment on A Project to Poison LLM Crawlers 1 week ago:
I can get a 50Gb/s residential link where I am, and have a whole rack of servers.
Sounds like a good opportunity to crowdfund thousands and thousands of common, scrapable instances that have random poisoning.
- Comment on Digg launches its new Reddit rival to the public 1 week ago:
Low key win for kink communities.
- Comment on 1 week ago:
Yeah but that was before you had billionaires of this size able to manipulate entire markets in this capacity.
- Comment on 1 week ago:
- Comment on A Steam dev is deleting his own game after girlfriend made him realize AI is bad 1 week ago:
Controversial opinion:
This is kind of a valid take or use I suppose.
And it’s something I struggle with as well.
I know how to program and I can make games with really shitty assets that no one would want to play because it looks like crap. I’ve tried many times and I don’t seem to have the skill set for making good assets. I’ve tried dozens of times to find and pay people on sites like fiverr, with extremely disappointing results.
And as a hobby I can’t just afford to pay thousands of dollars to have someone make passable art either.
So what do???
- Comment on The U.S. Government Just Followed Through on Its Ban of DJI Drones—and It’s So Much Worse Than We Thought 1 week ago:
This is also a strategic way to prevent resistance from Americans against an authoritarian regime.
Drones would be a significant part of that.
- Comment on Self-host Reddit – 2.38B posts, works offline, yours forever 1 week ago:
Yeah, it should balloon out to 15TB or more I think
- Comment on Self-host Reddit – 2.38B posts, works offline, yours forever 1 week ago:
It literally says in the link.
- Comment on State of the Fin 2026-01-06 | Jellyfin 2 weeks ago:
Really wish they would take security vulnerabilities seriously 😞
Because they are significant, and broad reaching.
- Comment on YSK to get a passport in the US, you need to have access to information about your parents and most recent ex-spouse 3 weeks ago:
In the US
Notice how it’s not “Us”, it’s “US”, and the sentence asserts that it is a place?
I’m not sure if you are acting dumb or not, if you are, it’s embarrassing.
- Comment on YSK to get a passport in the US, you need to have access to information about your parents and most recent ex-spouse 3 weeks ago:
It’s literally in the title. What else do you want?
- Comment on Investors are buying close to half the empty lots in LA burn zones, report says 3 weeks ago:
So even more of the land in this country will be owned by corporations and not people.
Lovely
- Comment on How much money should one person realistically make or have? 3 weeks ago:
I mean, maybe I want to go after a passion project and not waste away on someone else’s dream of what having money looks like?
$5mill ain’t gonna start & fund an engineering focused company if that’s my retirement savings.
- Comment on Nearly all of Spotify has been scraped and is available via torrents 4 weeks ago:
None of these are audio torrents.
That’s not released yet.
- Comment on Mattermost restricted access to old messages after 10000 limit is reached 4 weeks ago:
About 8MB if you assume the average message size is 200 UTF-16 characters.
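A quick sketch of that estimate, assuming the 10,000-message limit from the title and 200 characters per message. UTF-16 encodes a character in either 2 bytes or 4 bytes (a surrogate pair), so both cases are shown; the names here are only for illustration.

```python
MESSAGES = 10_000          # message limit from the post title
CHARS_PER_MESSAGE = 200    # assumed average message length


def history_size_mb(bytes_per_char: int) -> float:
    """Approximate raw text size in MB (decimal) for the whole message history."""
    return MESSAGES * CHARS_PER_MESSAGE * bytes_per_char / 1_000_000


print(history_size_mb(2))  # every character fits in one 2-byte code unit
print(history_size_mb(4))  # every character needs a 4-byte surrogate pair
```

The 8MB figure matches the 4-bytes-per-character upper bound; plain text that stays at 2 bytes per character would be about half that.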
- Comment on The dominoes are falling: motherboard sales down 50% as PC enthusiasts are put off by stinking memory prices 4 weeks ago:
Seriously.
They are already doing this through regulatory capture and corruption. Let’s not give them more power
- Comment on The dominoes are falling: motherboard sales down 50% as PC enthusiasts are put off by stinking memory prices 4 weeks ago:
It’s a play to make at home compute unachievable, forcing people to pay for subscription cloud services and cloud compute in walled gardens.
- Comment on Google Removes Sci-Hub Domains from U.S. Search Results Due to Dated Court Order 4 weeks ago:
Would recommend. Been using it for a couple years now, and it actually feels gross when I end up on Google.
You will hear shit from a small group on Lemmy about how they also use Yandex for search results, but it’s a pretty hollow argument that keeps being used as some big “gotcha”. But if that’s a turn off for you, it is what it is.
- Comment on Activist group says it has scraped 86m music files from Spotify 4 weeks ago:
Yeah, but it’s still operated and organized by people, people who, if they are within US jurisdiction, can be punished and made “an example of”. Effectively killing the archive by cutting off its organization.
- Comment on YSK about Psyllium husk 4 weeks ago:
Nope.
- Comment on YSK about Psyllium husk 4 weeks ago:
You know what’s good at removing lead from the body? Fiber.
Citation Needed
- Comment on Firefox Will Ship with an "AI Kill Switch" to Completely Disable all AI Features - 9to5Linux 4 weeks ago:
Pretty much this.
They are making market based decisions because they have to, and all the users bitching and moaning about them making financially driven decisions don’t donate anyways.
- Comment on Firefox Will Ship with an "AI Kill Switch" to Completely Disable all AI Features - 9to5Linux 4 weeks ago:
Firefox just can’t win with their users.
- Mozilla makes decisions based on market data
- Users complain they never wanted those features
- Mozilla makes a decision based on user feedback
- Users shit on them for backpedaling or damage control
It’s absurd.
- Comment on Backing up Spotify 5 weeks ago:
Yeah, it’s a wild move admitting that they are the source of pirated content for music here.
We don’t need Anna’s Archive to go under as a result of Sony going after them because of this…