Doesn’t Twitch own all data that is written and their TOS will state something like you can’t store data yourself locally.
Comment on What exactly is a self-hosted small LLM actually good for (<= 3B)
CrayonDevourer@lemmy.world 9 months agoIf I say my name is Doo doo head, in a public park, and someone happens to overhear it - they can do with that information whatever they want. Same thing.
catty@lemmy.world 9 months ago
CrayonDevourer@lemmy.world 9 months ago
I’m not storing their data. I’m feeding it to an LLM which infers things and storing that data.
catty@lemmy.world 9 months ago
Was this system vibe coded? I get the feeling it was…
CrayonDevourer@lemmy.world 9 months ago
There’s not actually that much code. It’s like 8 lines for an AI ‘agent’, and maybe another 16 lines for ‘tools’, and I’m using Streamlink for grabbing the audio stream, and pulseaudio has a ‘monitor’ device you can use to listen to what’s playing on the speakers.
catty@lemmy.world 9 months ago
lol. Way to contradict yourself.
aksdb@lemmy.world 9 months ago
So, buzzer WRONG.
Quite arrogant after you just constructed a faulty comparison.
If I say my name is Doo doo head, in a public park, and someone happens to overhear it - they can do with that information whatever they want. Same thing.
That’s absolutely not the same thing. Overhearing something that is in the background is fundamentally different from actively recording everything going on in a public space. You film yourself or some performance in a park and someone happens to be in the background? No problem. You build a system to identify everyone in the park and collect recordings of their conversations? Absolutely a problem, depending on the jurisdiction. The intent of the recording(s) and the reasonable expectations of the people recorded are factored in in many jurisdictions, and being in public doesn’t automatically entail consent to being recorded.
See for example www.freedomforum.org/recording-in-public/
(And just to clarify: I am not arguing against your explanation of Twitch’s TOS, only against the bad comparison you brought.)
kattfisk@lemmy.dbzer0.com 9 months ago
You’re both getting side-tracked by this discussion of recording. The recording is likely legal in most places.
It’s the processing of that unstructured data to extract and store personal information that is problematic. At that point you go from simply recording a conversation of which you are a part, to processing and storing people’s personal data without their knowledge, consent, or expectation.
shnizmuffin@lemmy.inbutts.lol 9 months ago
This was my main thrust.
aksdb@lemmy.world 9 months ago
True.
Although in Germany for example it can also be an issue when recording. If you have a security camera pointed at a public space (that can include the sidewalk infront of your house), passersby can sue you to take it down and potentially get you fined. Even pretending to constantly record such an area can yield that result.
tfm@europe.pub 9 months ago
I’m not a lawyer but I suppose it would depend on the ToS and if the user agrees to the recording and processing. But if it allows the extraction of the real identity of the user it’s probably a GDPR issue.
CrayonDevourer@lemmy.world 9 months ago
You build a system to identify everyone in the park and collect recordings of their conversations? Absolutely a problem, depending on the jurisdiction.
Literally not. The police use this right now to record your location and time seen using license plates all over the nation - with private corporations providing the service.
tfm@europe.pub 9 months ago
In the US maybe but not in Germany, Austria and probably most countries in Europe.
shnizmuffin@lemmy.inbutts.lol 9 months ago
Let’s take a look at the Developer Agreement that you cited:
This very clearly states that you are disallowed from retaining chat logs for the general purpose of collecting information about Twitch’s end users.
You said that you, “store ‘facts’ about specific users so that they can be referenced quickly,” but then later in a different thread state, “I’m not storing their data. I’m feeding it to an LLM which infers things and storing that data.” You’re retrieving information about specific users at a later time. You’ve built a database of structureless PII from chat logs. You’ve chosen to store the data as inferences, which makes it a bad database, but still a database.
I have questions:
When your streamer mentions something deeply personal, like, “how their mothers surgery went,” that your tool helped them remember, do they disclose that your tool was involved in that transaction? When the viewer gets weirded out and asks your streamer to not mention that again, or forget it entirely, do you have a way to remove that information from your database and a way to prove it’s been deleted? When other people in chat think it’s gross, and ask to opt-out, can you even do it?
Regarding FrostyTools: I don’t think it’s storing the chat logs for a later time. They don’t have a data retention section in their TOS or Privacy Policy that isn’t related to the streamer. (As in, they hold on to the streamer’s Twitch account and some other information for billing, authentication, etc.) I think it’s taking the chat logs only for as long as it needs to output a response and then deleting it. Also, this excerpt from the FrostyTools TOS made me chuckle:
This leads me to believe that you can violate the Twitch TOS quoted above using FrostyTools. It is apparent that FrostyTools has positioned itself as an application that creates User Generated Content (like Photoshop or Word).
CrayonDevourer@lemmy.world 9 months ago
I’m not storing chat logs.
Not creating any kind of public database either. It’s a private tool.
Again - Not storing chat logs. They are processed for information and that information inferred.