Comment

Comment on 77% Of Employees Report AI Has Increased Workloads And Hampered Productivity, Study Finds

bitfucker@programming.dev ⁨7⁩ ⁨months⁩ ago

Yeah, for sure since programming is also a language. But IMHO, for a machine learning model the best way to approach it is not as a natural language but rather as its AST/machine representation and not the text token. That way the model not only understands the token pattern but also the structure since most programming languages are well defined.

source

Sort:hotnew top

lvxferre@mander.xyz ⁨7⁩ ⁨months⁩ ago
Note that, even if we refer to Java, Python, Rust etc. by the same word “language” as we refer to Mandarin, English, Spanish etc., they’re apples and oranges - one set is unlike the other, even if both have some similarities.

That’s relevant here, for two major reasons:

The best approach to handle one is not the best to handle the other.

LLMs aren’t useful for both tasks (translating and programming) because both involve “languages”, but because LLMs are good to retrieve information. As such you should see the same benefit even for tasks not involving either programming languages or human languages.

Regarding the first point, I’ll give you an example. You suggested abstract syntax trees for the internal representation of programming code, right? That might work really well for programming, dunno, but for human languages I bet that it would be worse than the current approach. That’s because, for human languages, what matters the most are the semantic and pragmatic layers, and those are a mess - with the meaning of each word in a given utterance being dictated by the other words there.
source
- bitfucker@programming.dev ⁨7⁩ ⁨months⁩ ago
  Yeah, that’s my point ma dude. The current LLM tasks are ill suited for programming, the only reason it works is sheer coincidence (alright, maybe not sheer coincidence, I know its all statistics and so on). The better approach to make LLM for programming is a model that can transform/“translate” a natural language that humans use to AST, the language that computers use but still close to human language. But the problem is that to do such tasks, LLM needs to actually have an understanding of concepts from the natural language which is debatable at best.
  
  source
  - lvxferre@mander.xyz ⁨7⁩ ⁨months⁩ ago
    Sorry - then I misread you. Fair point.
    
    source