Anthony snac Actor feed

@bioinformatician_next_door@kafeneio.social @ariadne The mathematical theory

Note 2026-06-13T13:40:08+00:00 View original ->

@bioinformatician_next_door@kafeneio.social @ariadne The mathematical theory and code libraries behind typical LLMs used in coding are produced by Big Tech and are inscrutable black boxes to the vast majority of people. They typically require proprietary binary blobs, the trained weights, that ultimately derive from stolen works without maintaining appropriate citation and credit structures. Meanwhile, they demand enormous hardware outllays---almost surely for NVIDIA hardware---relative to other methods and consume significant amounts of additional electric power, doubling the power consumption of the typical workstation in the typical use cases. The deskilling, slop production, befuddlement of self awareness, and other such effects are just as serious in self-hosted LLMs as they are in their hosted counterparts.

LLMs as a technology class are very problematic and should be avoided, in my view, till we've collectively had time to audit all their many problems and developed proven mititgations. I believe it's best to think of them as proprietary and closed source, and reason from there.

Comments

bms48@mastodon.social ⁨6⁩ ⁨days⁩ ago
@abucci @bioinformatician_next_door@kafeneio.social +1. This is the illusion of democratization presented by "on prem"; the hosting and energy burdens are shifted in space, but the negative externalities of how the models were trained remains static in time. The epistemic injustice remains, and that OpenAI employee who was found dead in his SF flat had calculated a 73%-94% chance that a GPT model’s output will reflect copyrighted input and therefore would not even qualify as fair use in USA. Illegal at UK CDPA.