Comment

Comment on The productivity paradox of AI coding assistants

tal@lemmy.today ⁨1⁩ ⁨month⁩ ago

I keep seeing the “it’s good for prototyping” argument they post here, in real life.

There are real cases where bugs aren’t a huge deal.

Take shell scripts. Bash is designed to make it really fast to write throwaway, often one-line software that can accomplish a lot with minimal time.

Bash is not, as a programming language, very optimized for catching corner cases, or writing highly-secure code, or highly-maintainable code. The great majority of bash code that I have written is throwaway code, stuff that I will use once and not even bother to save. It doesn’t have to handle all situations or be hardened. It just has to fill that niche of code that can be written really quickly. But that doesn’t mean that it’s not valuable. I can imagine generated code with some bugs not being such a huge problem there. If it runs once and appears to work, that may be totally fine.

Or, take test code. I’m not going to spend a lot of time making test code perfect. If it fails, it’s probably not the end of the world. There are invariably cases that I won’t have written test code for. “Good enough” is often just fine there.

And it might be possible to, instead of (or in addition to) having human-written commit messages, generate descriptions of commits or something down the line for someone browsing code.

I still feel like I’m stretching, though. Like…I feel like what people are envisioning is some kind of self-improving AI software package, or just letting an LLM go and having it pump out a new version of Microsoft Office. And I’m deeply skeptical that we’re going to get there just on the back of LLMs. I think that we’re going to need more-sophisticated systems.

I remember working on one large, multithreaded code base where a developer who isn’t familiar with or isn’t following the thread-safety constraints would create an absolute maintenance nightmare for others, where you’re going to spend way more time tracking down and fixing breakages induced than you saved by them not spending time coming up to speed on the constraints that their code needs to conform to. And the existing code-generation systems just aren’t really in a great position to come up to speed on those constraints. Part of what a programmer does is, when writing code, is to look at the human-language requirements, and identify that there are undefined cases and go back and clarify the requirement with the user, or use real-world knowledge to make reasonable calls. Training an LLM to map from an English-language description to code is creating a system that just doesn’t have the capability to do that sort of thing.

But, hey, we’ll see.

source

Sort:hotnew top

artwork@lemmy.world ⁨1⁩ ⁨month⁩ ago
I am sorry, but I am not sure what tells you how Bash “was designed” or not. Perhaps you haven’t yet written anything serious in Bash…
Have you checked out Bash PitFalls at Wooledge, at least?
Bash, or the most shells, including Posix, or even Perl, are some of the most complex languages out there to make a mistake… since there’s no compiler to protect you from, and the, though legendary, but readline may cause the whole terminal go flying, depending on the terminal/terminfo in process…

source
- tal@lemmy.today ⁨1⁩ ⁨month⁩ ago
  The point I’m making is that bash is optimized for quickly writing throwaway code. It doesn’t matter if the code written blows up in some case other than the one you’re using. You don’t need to handle edge cases that don’t apply to the one time that you will run the code. I write lots of bash code that doesn’t handle a bunch of edge cases, because for my one-off use, that edge case doesn’t arise. Similarly, if an LLMs is generating code that misses some edge case, if it’s a situation that will never arise, and that may not be a problem.
  
  source
  - artwork@lemmy.world ⁨1⁩ ⁨month⁩ ago
    I am sorry, but I still don’t get what you mean. And why Bash and not another shell?
    Why not another shell like Korn, Ash, Dash, Zsh, Fish, or anything REPL, including PHP, Perl, Node, Python etc.
    
    Should we consider “throwaway” anything that supports interactive mode of your daily driver you chose in your default terminal prompt?
    What does “throwaway” code means in the first place?
    
    source
    caseyweederman@lemmy.ca ⁨1⁩ ⁨month⁩ ago
    I… think they might be misusing the word “bash”? Maybe?
    
    source
    -> View More Comments
    tal@lemmy.today ⁨1⁩ ⁨month⁩ ago
    
    And why Bash and not another shell?
    
    I chose it for my example because I happen to use it. You could use another shell, sure.
    
    Should we consider “throwaway” anything that supports interactive mode of your daily driver you chose in your default terminal prompt?
    
    Interactive mode is a good case for throwaway code, but one-off scripts would also work.
    
    source
    -> View More Comments