Comment on Microsoft’s AI boss thinks it’s perfectly OK to steal content if it’s on the open web

sugar_in_your_tea@sh.itjust.works 2 months ago

Yes, it kind of is. A search engine just extracts keywords and links, and that’s all it retains after crawling a site. It’s not producing any derivative works; it’s merely looking up matches in an index of keywords.

An LLM can essentially reproduce a work, and the whole point is to generate derivative works. So by its very nature, it runs into copyright issues. Whether a particular generated result violates copyright depends on the license of the works it’s based on and how much of those works it uses. So it’s complicated, but there’s very much a copyright argument there.
