Be Your Own Privacy-Respecting Google, Bing & Brave
… by running your own instance of the free and open-source federated metasearch engine SearXNG on OpenBSD!
Submitted 3 weeks ago by mesamunefire@piefed.social to selfhosted@lemmy.world
https://xn--gckvb8fzb.com/be-your-own-privacy-respecting-google-bing-brave/
Be Your Own Privacy-Respecting Google, Bing & Brave
… by running your own instance of the free and open-source federated metasearch engine SearXNG on OpenBSD!
I run it in a container on Kubernetes. Definitely recommend.
I still don’t understand how Searx is able to operate for free. Don’t the API calls cost money?
From what I’ve read, I believe it’s a combination of donations. sponsors, volunteer hosting from like minded organizations.
If I had to guess, they probably don’t use the APIs, inside using scrapping of some sort.
My understanding is it scrapes what it can’t meaningfully get out of an API. Public instances run into rate limiting, but private instances don’t really have that problem.
look at that domain name! respect
Usually its a sign of a scale. Looks like a ransomware c2 domain
If you’re interested in this, the term you’re looking for is punycode
For anyone wondering xn–gckvb8fzb.com is マリウス.com
Dos it not resolve on certain browsers or something? I usually just copy/paste or use a firefox plugin to generate posts for lemmy/piefed/fediverse.
It resolves well but punnycode is disabled in some browsers or profiles for security enhanced profiles, so that you can easier detect punnycode domains that try to fake other domains.
Is metasearch really the best we can do? What about YaCy, or something else more like that?
The search engines that searxng interact with still track you. For this reason I will always use a public instance to mix up the tracking with everyone else using it.
Explain?
Using a public instance is more private than using a private selfhosted instance.
This is exactly what I came here to find. Thank you for posting it. If I can be so bold selfhosters should really be leaning this way searxng is great but it still uses big tech.
The other thing we need is a way to identify good crawling agents or *smol agents over corporate bots that just steal content.
If selfhosters can unite and build a good index perhaps searching can go back to the way it was vs a vector to sell you more and collect your data.
How good are the results compared to Google/Duckduckgo?
You make your own index with this one
I personally love yacy.
Brave is a search engine?
That’s news to me.
Brave have they’re own search engine
it’s no kagi, but its ok
I used to self-host searxng for a while, but somehow the search results where always off and mixed with to much non-relevant results :/.
It’s not about searxng itself… Rather how the most relevant info gets drown into AI slope and non-sense bullshit. The best blogposts/info are transmitted from people to people…
I’m kinda sad to admit that stupid AI “solved” this issue and had better results :/
You can self host that too ;)
OpenWebUI + Ollama + SearxNG. OpenWebUI can do llm web search using the engine of your choice (even self hosted SearxNG!). From there it’s easy to set the default prompt to always give you the top (10, 20, whatever) raw results so you’re not confined to ai results. It’s not quite duck.ai slick but I think I can get there with some more tinkering.
Is there a guide on how to do this on Linux + 16GB Radeon?
Ohoho? That’s interesting. I don’t have the horse power to selfhost an AI, but that’s good to know !
Thanks for the pointer !!!
I used to self-host searxng for a while, but somehow the search results were always off and mixed with to much non-relevant results :/.
I mean, getting non-relevant results happens with every search engine anymore.
The days of your search results being relevant, and on the first page, are long dead thanks to SEO and other factors.
Yeah you’re right ! However, ages ago, I still remember how you could go to page 20+ and still find some really interesting things !
Here, past page 2 it’s just some random shit…
I don’t know any other search that lets you block urls from results. Blocking stuff like social media and Amazon cleans up the results very well.
It’s not federated tho?
Why do they mean when they call it that?
Thanks for posting, both a great reminder to try setting this up on my unraid, and also to add the RSS feed of that site to Feeder.
I just added it too, I had read a few articles of them already
BSD FTW
yodeljunkmanenvy@piefed.social 3 weeks ago
There is also a list of publicly operated SearXNG instances at https://searx.space/. We host the one at https://search.freestater.org/, and there are plenty of other good ones.
bVuZT3n0X1rOSXk.jpg
Appoxo@lemmy.dbzer0.com 3 weeks ago
Who says they are securely operated and don’t store any data??
jackr@lemmy.dbzer0.com 3 weeks ago
who says that about any search engine? can you trust them? searXNG is usually run by random people who are less likely to use your data than a larger search company
yodeljunkmanenvy@piefed.social 2 weeks ago
The devs at SearXNG have a bot that regularly scans the public instances for changes to the source code and delists them as a public instance if it’s altered.
The software is free and open source. You are encouraged to inspect the code yourself to make sure no data is collected!
quick_snail@feddit.nl 3 weeks ago
If you’re using a shared IP, it doesn’t matter.
You are using a VPN or Tor, right?!?