This is a case of corporation taking advantage of technically idiotic userbase, which is most of the general public. OpenAI using a dark pattern so that users can’t easily unchecked that box nor making that text that says “this can be indexed by search engines” brightly visible.
Comment on Chatgpt shared link searchable
corroded@lemmy.world 3 weeks ago
If you don’t want your conversations to be public, how about you don’t tick the checkbook that says “make this public.” This isn’t OpenAI’s problem, its an idiot user problem.
zerozaku@lemmy.world 3 weeks ago
panda_abyss@lemmy.ca 3 weeks ago
I don’t think OpenAI gets anything from this, I think they just failed to realize how stupid the average person is.
AnarchistArtificer@lemmy.world 3 weeks ago
They get more human written text, which is one of the most powerful things in their doomed attempt to forestall model collapse
panda_abyss@lemmy.ca 3 weeks ago
They already have the text
FauxLiving@lemmy.world 3 weeks ago
If you don’t want corporations to use you chats as data, don’t use corporate hosted language models.
Even non-public chats are archived by OpenAI, and the terms of service of ChatGPT essentially give OpenAI the right to use your conversations in any way that they choose.
You can bet they’ll eventually find ways to monetize your data at some point in the future. If you think GoogleAds is powerful, wait until people’s assistants are trained with every manipulative technique we’ve ever invented and are trying to sell you breakfast cereals or boner pills…
You can’t uncheck that box except by not using it in the first place. But people will sell their soul to a company in order to not have to learn a little bit about self-hosting
Electricd@lemmybefree.net 3 weeks ago
This is basically a “if you don’t want your data to be used, run your own internet” comment
It’s just not doable for pretty much everyone
Allero@lemmy.today 3 weeks ago
Modern LLMs can serve you for most tasks while running locally on your machine.
Something like GPT4ALL will do the trick on any platform of your choosing if you have at least 8gb of RAM (and for most people nowadays it’s true).
It has a simple, idiot-proof GUI and doesn’t collect data if you don’t allow it to. It’s also open source.
Electricd@lemmybefree.net 3 weeks ago
If you want actual good features like deep research or chain of thought, eh, not sure it’s a good choice
The models will also not be very powerful
puck@lemmy.world 3 weeks ago
Hi there, I’m thinking about getting into self-hosting. I already have a Jellyfin server set up at home but nothing beyond that really. If you have a few minutes, how can self-hosting help in the context of OPs post? Do you mean hosting LLMs on Ollama?
BreadstickNinja@lemmy.world 3 weeks ago
Yes, Ollama or a range of other backends (Ooba, Kobold, etc.) can run LLMs locally. Huggingface has a huge number of models suited to different tasks like coding, storywriting, general purpose, and so on. If you run both the backend and frontend locally, then no one monetizes your data.
The part I’d argue that the previous poster is glazing over a little bit is performance. Unless you have an enterprise-grade GPU cluster sitting in your basement, you’re going to make compromises on speed and/or quality relative to the giant models that run on commercial services.
tal@lemmy.today 3 weeks ago
It’s also going to cost more, because you almost certainly are only going to be using your hardware a tiny fraction of the time.
puck@lemmy.world 3 weeks ago
Thanks for the info. Yeah, I was wondering what kind of hardware you’d need to host LLMs locally with decent performance and your post clarifies that. I doubt many people would have the kind of hardware required.