General_Effort
@General_Effort@lemmy.world
- Comment on Reddit sues Anthropic, alleging its bots accessed Reddit more than 100,000 times since last July 49 minutes ago:
Not quite.
Generally, sites aren’t liable for user generated content as long as they follow some rules. They need to take down illegal content and provide some way of reporting such content. In the US, that’s the whole DMCA takedown thing. The whole content ID thing, that YouTube does, might not be strictly necessary, but it was rolled out in response to a high-stakes lawsuit. The EU is, as always, more strict in these matters.
People are not punished for things beyond their control (but mind that a fine is not the same as damages). If you are sent illegal content, that you have not requested, you shouldn’t expect formal punishment, though the investigation may be punishing in itself. If you simply don’t know how caching works, you’re probably in trouble.
But this was about copyright. I don’t think you get punished anywhere for holding some copyright. Say some Japanese Manga artist travels to some European state where some of their works are illegal. They’re not going to get arrested for that. Anyone who brings such illegal works into the country will not be so lucky, regardless of copyright.
- Comment on Reddit sues Anthropic, alleging its bots accessed Reddit more than 100,000 times since last July 3 hours ago:
You don’t see why you would need your own servers? Do you see why unauthorized access to a computer system might be illegal?
- Comment on Reddit sues Anthropic, alleging its bots accessed Reddit more than 100,000 times since last July 4 hours ago:
“Violating copyright without a licence” is a lovely turn of phrase. You must be the valedictorian of the Lemmy School of Copyright.
- Comment on Reddit sues Anthropic, alleging its bots accessed Reddit more than 100,000 times since last July 4 hours ago:
No. I am not aware of any law that makes you liable by holding or claiming the copyright to some content. EG you may have to pay damages for libel, but not because you have copyright to the libelous statement.
- Comment on Reddit sues Anthropic, alleging its bots accessed Reddit more than 100,000 times since last July 4 hours ago:
That would be legally possible, though, obviously, you would have to pay for your own servers.
In practice, it wouldn’t be worth anyone’s time.
- Comment on Reddit sues Anthropic, alleging its bots accessed Reddit more than 100,000 times since last July 4 hours ago:
No. Just because you own a copyright, doesn’t mean that you are entitled to free network services. If you owned the copyright to a movie, would you expect free tickets for any cinema showing it?
- Comment on lemm.ee is shutting down at the end of this month 1 day ago:
Difficult question. I think it’s realistic. Hate speech laws are enforced against individuals on a regular basis. If a German instance tolerated illegal content, then I expect the instance would sooner or later be involved in a prosecution. The prosecution would be against the user, though. The instance would only have to provide data to police. I’m not sure at what point the instance owners themselves would be found to violate these laws.
Apart from that, the instance is required to remove illegal content per the DSA. I think it’s realistic to expect that a prosecution would lead to a closer look at the instance.
- Comment on Meta and Yandex are de-anonymizing Android users’ web browsing identifiers - Ars Technica 1 day ago:
I almost didn’t copy the update because my focus was on the technical background. I did a double-check before submitting, if I caught the gist correctly, and decided that people would probably want to know that the report triggered that change.
- Comment on Meta and Yandex are de-anonymizing Android users’ web browsing identifiers - Ars Technica 2 days ago:
Useless article, but at least they link the source: localmess.github.io
We disclose a novel tracking method by Meta and Yandex potentially affecting billions of Android users. We found that native Android apps—including Facebook, Instagram, and several Yandex apps including Maps and Browser—silently listen on fixed local ports for tracking purposes.
These native Android apps receive browsers’ metadata, cookies and commands from the Meta Pixel and Yandex Metrica scripts embedded on thousands of web sites. These JavaScripts load on users’ mobile browsers and silently connect with native apps running on the same device through localhost sockets. As native apps access programatically device identifiers like the Android Advertising ID (AAID) or handle user identities as in the case of Meta apps, this method effectively allows these organizations to link mobile browsing sessions and web cookies to user identities, hence de-anonymizing users’ visiting sites embedding their scripts.
📢 UPDATE: As of June 3rd 7:45 CEST, Meta/Facebook Pixel script is no longer sending any packets or requests to localhost. The code responsible for sending the _fbp cookie has been almost completely removed.
- Comment on AI Training Slop 3 days ago:
I’m always glad when someone is interested and conscientious enough to ask for a source.
Article 4 in full:
Providers and deployers of AI systems shall take measures to ensure, to their best extent, a sufficient level of AI literacy of their staff and other persons dealing with the operation and use of AI systems on their behalf, taking into account their technical knowledge, experience, education and training and the context the AI systems are to be used in, and considering the persons or groups of persons on whom the AI systems are to be used.
AI Act -> eur-lex.europa.eu/eli/reg/2024/1689/oj/eng
BTW. That site is the official repository for EU law. It’s also how EU law is promulgated. What you find there is, by definition, the correct version (unless stated otherwise).
- Comment on Avocado 3 days ago:
Oof.
- Comment on AI Training Slop 4 days ago:
Are you in Europe? The AI Act requires some unspecified “AI literacy” from staff working with AI. Some sort of grift, I guess.
- Comment on Material scientist wet dream 5 days ago:
It actually is. Do you even nerd, bro?
- Comment on [deleted] 6 days ago:
Most people don’t care about locked-down tech. They don’t have the skills necessary to use anything open, and that’s fine. You have to pick what you do with your limited time.
OTOH, many people want to have control over their data. That means having control over other people’s computers. It’s not just the copyright industry demanding money, or Big Tech building walled gardens. You can see a lot of users on Lemmy demanding that kind of control. That means that computing devices of all kinds must become more locked-down and remote-controllable.
So that’s where I see us going.
- Comment on wake up baby new radiodont just dropped 6 days ago:
When I read new radiodont, I thought this would be about a hilarious screw-up in the radiology department.
Oh well. Radiondont stick your head in the particle accelerator.
- Submitted 6 days ago to science_memes@mander.xyz | 3 comments
- Comment on I am disappointed in the AI discourse 1 week ago:
He didn’t write that to teach but to vent. The intended audience is people who already know.
For more information on ChatGPT’s current capabilities, consult the API docs. I found that to be the most concise source of reliable information. And under no circumstances, believe anything about AI that you read on Lemmy.
Kudos for being willing to learn.
- Comment on On trees... 1 week ago:
People have experimented with that sort of thing. Here’s a DIY for going into 3rd person mode using a camera on a stick and some electronics in a backpack. Bit of googling also finds me body swap experiments, but nothing on a crotch perspective.
- Comment on Are there any initiatives aimed at training generative AI using 100% public domain works and works authorized by the creator? 1 week ago:
You mean legally? Yeah, no problem. It depends on the location, though. In the EU, the rights-holder can opt out. So if you want to do it in the EU you have to pay off Reddit, Meta, and so on. In Japan, it’s fine regardless. In the US, it should turn out similarly, but it’s up to the courts to work out the details, and it’s quite up in the air if you can trust the system to work.
- Comment on Are there any initiatives aimed at training generative AI using 100% public domain works and works authorized by the creator? 1 week ago:
The usual tends to be that the platform can do basically whatever. That shouldn’t really be surprising. But I see your point. If you literally want consent, not just legally licensed material, then you need more than just a clause in the TOS.
You could raise the same issue with permissively licensed material. People who released it may not have foreseen AI training as a use, and might not have wanted to actually allow it.
- Comment on Are there any initiatives aimed at training generative AI using 100% public domain works and works authorized by the creator? 1 week ago:
For images, yes. Most notable is probably Adobe. Their AI, which powers photoshop’s generative fill among other things, is trained on public domain and licensed works.
For text, there’s nothing similar. LLMs get better the more data you have. So, the less training data you use, the less useful they are. I think there are 1 or a few small models for research purposes, but it really doesn’t get you there.
Of course, there aren’t any such open source projects. When you take these extreme, maximalist views of (intellectual) property, then giving stuff away for free isn’t the obvious first step.
- Comment on GIVE ME 4 2 weeks ago:
Big fail. That has grenadine in it, which is non-alcoholic.
- Comment on GIVE ME 4 2 weeks ago:
So, how do you make a TS? I’m guessing it starts with breaking an egg.
- Comment on Does the average person know markdown? 2 weeks ago:
And quartz, of course.
- Comment on Is there any fundamental difference between an instance and a formal website ? 2 weeks ago:
“Instance” is programmer lingo. Roughly, it’s when you have the same piece of code running multiple times with different values (as part of the same system). More narrowly, “instance” is used in the context of classes. All lemmy instances run the lemmy code but with different users, admins, and so on. The expression makes perfect sense, but it is not used in a formal way.
A lemmy instance runs a web server. Wikipedia says that when you host a web page under a dedicated domain name, you have a website.
- Comment on EU ruling: tracking-based advertising by Google, Microsoft, Amazon, X, across Europe has no legal basis 2 weeks ago:
Instead of being silly, why don’t you just correct your disinformation and be done with it? Why these games?
- Comment on EU ruling: tracking-based advertising by Google, Microsoft, Amazon, X, across Europe has no legal basis 2 weeks ago:
so you think that an automated account like, say “no-reply@amazon.com”, is somehow personal data?
No. I don’t know why you would believe that.
- Comment on EU ruling: tracking-based advertising by Google, Microsoft, Amazon, X, across Europe has no legal basis 2 weeks ago:
So, what you are telling me, is that you are an IT professional working in Europe, and in your considered opinion, emails don’t fall under GDPR if you don’t provide your phone number or something. And that totally doesn’t sound like a joke. Is that about right?
- Comment on EU ruling: tracking-based advertising by Google, Microsoft, Amazon, X, across Europe has no legal basis 2 weeks ago:
I still don’t get where all this disinformation comes from. What do you mean by “the GDPR website”? Are you under the impression that the linked website is somehow official? Even so, the information seems solid and shouldn’t give you these ideas.
- Comment on EU ruling: tracking-based advertising by Google, Microsoft, Amazon, X, across Europe has no legal basis 2 weeks ago:
A user is typically a natural person. A username identifies that person. Any information that is directly or indirectly linked to that username is thus personal data of that person. The GDPR explicitly gives “online identifier” as an example of an identifier. I did link to the official repository, which hosts translation in all European languages. Each translation can be reached with 1 click. It cannot be a language issue. I do not understand what the problem could be.
The personal data in the OP (consent options) are linked to a person via a cookie stored in their browser. I do not understand how one could make sense of the case without understanding what personal data is.
There also appears to be some confusion between GDPR and copyright. I do not know where these strange ideas come from.