
I Went All-In on AI. The MIT Study Is Right.

1126 likes

Submitted 2 weeks ago by AutistoMephisto@lemmy.world to technology@lemmy.world

https://open.substack.com/pub/leadershiplighthouse/p/i-went-all-in-on-ai-the-mit-study


Comments

  • edgemaster72@lemmy.world 2 weeks ago

    Not immediate failure—that’s the trap. Initial metrics look great. You ship faster. You feel productive.

    And all they’ll hear is “not failure, metrics great, ship faster, productive” and go against your advice because who cares about three months later, that’s next quarter, line must go up now. I also found this bit funny:

    I forced myself to use Claude Code exclusively to build a product. Three months. Not a single line of code written by me… I was proud of what I’d created.

    Well you didn’t create it, you said so yourself, not sure why you’d be proud, it’s almost like the conclusion should’ve been blindingly obvious right there.

    • AutistoMephisto@lemmy.world 2 weeks ago

      The top comment on the article points that out.

      It’s an example of a far older phenomenon: once you automate something, the corresponding skill set and experience atrophy. It’s a problem that predates LLMs by quite a bit. If the only experience gained is with the automated system, the skills are never acquired. I’ll have to find it, but there’s a story about a modern fighter jet pilot not being able to handle a WWII-era Lancaster bomber. They don’t know how to do the things that modern warplanes do automatically.

      • logicbomb@lemmy.world 2 weeks ago

        It’s more like the ancient phenomenon of spaghetti code. You can throw enough code at something until it works, but the moment you need to make a non-trivial change, you’re doomed. You might as well throw away the entire code base and start over.

        And if you want an exact parallel, I’ve said this from the beginning, but LLM coding at this point is the same as offshore coding was 20 years ago. You make a request, get a product that seems to work, but maintaining it, even by the same people who created it in the first place, is almost impossible.

      • ctrl_alt_esc@lemmy.ml 2 weeks ago

        I agree with you, though proponents will tell you that’s by design. Supposedly, it’s like with high-level languages. You don’t need to know the actual instructions in assembly anymore to write a program with them. I think the difference is that high-level language instructions are still (mostly) deterministic, while an LLM prompt certainly isn’t.
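        To make the determinism point concrete, here is a minimal sketch (my own toy example, not from the article): compiling the same high-level source twice always yields identical bytecode, while resampling the same LLM prompt carries no such guarantee.

          # Same source in, same bytecode out, every single time.
          src = "total = 0\nfor x in range(10):\n    total += x * x"

          code1 = compile(src, "<demo>", "exec").co_code
          code2 = compile(src, "<demo>", "exec").co_code
          assert code1 == code2
          # An LLM prompt is the opposite: the same prompt, resampled at nonzero
          # temperature, can (and often does) yield a different program each time.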

      • drosophila@lemmy.blahaj.zone 2 weeks ago

        The thing about this perspective is that I think it’s actually overly positive about LLMs, as it frames them as just the latest in a long line of automations.

        Not all automations are created equal. For example, compare using a typewriter to using a text editor. Besides a few details about the ink ribbon and movement mechanisms you really haven’t lost much in the transition. This is despite the fact that the text editor can be highly automated with scripts and hot keys, allowing you to manipulate even thousands of pages of text at once in certain ways. Using a text editor certainly won’t make you forget how to write like using ChatGPT will.

        I think the difference lies in the relationship between the person and the machine. To paraphrase Cathode Ray Dude, people who are good at using computers deduce the internal state of the machine, mirror (a subset) of that state as a mental model, and use that to plan out their actions to get the desired result. People that aren’t good at using computers generally don’t do this, and might not even know how you would start trying to.

        For years ‘user friendly’ software design has catered to that second group, as they are both the largest contingent of users and the ones that needed the most help. To do this, software vendors have generally done two things: try to move the necessary mental processes from the user’s brain into the computer, and hide the computer’s internal state (so that it’s not implied that the user has to understand it, so that a user who doesn’t know what they’re doing won’t do something they’ll regret, etc.). Unfortunately this drives that first group of people up the wall. Not only does hiding the internal state of the computer make it harder to deduce, every “smart” feature they add to try to move this mental process into the computer itself only makes the internal state more complex and harder to model.

        Many people assume that if this is the way you think about software you are just an elitist gatekeeper, and you only want your group to be able to use the computer. Or you might even be accused of ableism. But the real reason is what I described above, even if it’s not usually articulated in that way.

        Now, I am of the opinion that the ‘mirroring the internal state’ method of thinking is the superior way to interact with the machine, and the approach to user friendliness I described has actually done a lot of harm to our relationship with computers at a societal level. (This is an opinion I suspect many people here would agree with.) And yet that does not mean that I think computers should be difficult to use. Quite the opposite, I think that modern computers are too complicated, and that in an ideal world their internal states and abstractions would be much simpler and more elegant, but no less powerful. (But elaborating on that would make this comment even longer.) Nor do I think that computers shouldn’t be accessible to people with different levels of ability. But just as a random person in a store shouldn’t grab a wheelchair user’s chair handles and start pushing them around, neither should Windows (for example) start changing your settings on updates without asking.

        Anyway, all of this is to say that I think LLMs are basically the ultimate in that approach to ‘user friendliness’. They try to move more of your thought process into the machine than ever before, their internal state is more complex than ever before, and it is also more opaque than ever before. They also reflect certain values endemic to the corporate system that produced them: that the appearance of activity is more important than the correctness or efficacy of that activity. But that is, again, a whole other comment.

      • Cocodapuf@lemmy.world 2 weeks ago

        Once you automate something, the corresponding skill set and experience atrophy. It’s a problem that predates LLMs by quite a bit. If the only experience gained is with the automated system, the skills are never acquired.

        Well, to be fair, different skills are acquired. You’ve learned how to create automated systems, and that’s definitely a skill. In one of my IT jobs there were a lot of people who did things manually: updated computers, installed software one machine at a time. But when someone figures out how to automate that and push the update to all machines in the room simultaneously, that’s valuable, and not everyone in that department knew how to do it.

        So yeah, I guess my point is, you can forget how to do things the old way, but that’s not always bad. Like, so you don’t really know how to use a scythe, that’s fine if you have a tractor, and trust me, you aren’t missing much.

    • boonhet@sopuli.xyz 2 weeks ago

      I forced myself to use Claude Code exclusively to build a product. Three months. Not a single line of code written by me… I was proud of what I’d created.

      Well you didn’t create it, you said so yourself, not sure why you’d be proud, it’s almost like the conclusion should’ve been blindingly obvious right there.

      Does a director create the movie? They don’t usually edit it, they don’t have to act in it, nor do all directors write movies. Yet the person giving directions is seen as the author.

      The idea is that vibe coding is like being a director or architect. I mean that’s the idea. In reality it seems it doesn’t really pan out.

      • rainwall@piefed.social 2 weeks ago

        You can vibe write and vibe edit a movie now too. They also turn out shit.

        The issue is that an LLM isn’t a person with skills and knowledge. It’s a complex guessing box that gets things kinda right, but not actually right, and it absolutely can’t tell what’s right or not. It has no actual skills or experience or humanity that a director can expect a writer or editor to have.

      • MrSmith@lemmy.world 2 weeks ago

        Wrong, it’s just outsourcing.

        You’re making a false equivalence. A director is actively doing their job; they’re a puppeteer and the rest is their puppet. The puppeteer is not outsourcing their job to a puppet.

        And I’m pretty sure you have no idea what architects actually do.

        If I hire a coder to write an app for me, whether it’s a clanker or a living being, I’m outsourcing the work; I’m a manager.

        It’s like tasking an artist to write a poem for you about love and flowers, and being proud about that poem.

    • jimmy90@lemmy.world 2 weeks ago

      yeah i don’t get why the ai can’t do the changes

      don’t you just feed it all the code and tell it? i thought that was the point of 100% AI

  • ignirtoq@feddit.online 2 weeks ago

    We’re about to face a crisis nobody’s talking about. In 10 years, who’s going to mentor the next generation? The developers who’ve been using AI since day one won’t have the architectural understanding to teach. The product managers who’ve always relied on AI for decisions won’t have the judgment to pass on. The leaders who’ve abdicated to algorithms won’t have the wisdom to share.

    Except we are talking about that, and the tech bro response is “in 10 years we’ll have AGI and it will do all these things all the time permanently.” In their roadmap, there won’t be a next generation of software developers, product managers, or mid-level leaders, because AGI will do all those things faster and better than humans. There will just be CEOs, the capital they control, and AI.

    What’s most absurd is that, if that were all true, that would lead to a crisis much larger than just a generational knowledge problem in a specific industry. It would cut regular workers entirely out of the economy, and regular workers form the foundation of the economy, so the entire economy would collapse.

    “Yes, the planet got destroyed. But for a beautiful moment in time we created a lot of value for shareholders.”

    • grue@lemmy.world 2 weeks ago

      That’s why they’re all-in on authoritarianism.

    • UnspecificGravity@piefed.social 2 weeks ago

      Yep, and now you know why all the tech companies suddenly became VERY politically active. This future isn’t compatible with democracy. Once these companies no longer provide employment their benefit to society becomes a big fat question mark.

    • HasturInYellow@lemmy.world 2 weeks ago

      According to a study, the lower top 10% accounts for something like 68% of cash flow in the economy. Us plebs are being cut out altogether.

      That being said, I think if people can’t afford to eat, things might get bad. We will probably end up a kept population in these ghouls’ fever dreams.

      • Prior_Industry@lemmy.world 2 weeks ago

        Once Boston Dynamics-style dogs and androids can operate independently over a number of days, I’d say all bets are off that we would be kept around as pets.

        I’m fairly certain your Musks and Altmans would be content with a much smaller human population existing to only maintain their little bubble and damn everything else.

      • kreskin@lemmy.world 2 weeks ago

        We’re all idiots. Even the titans of industry -- sit down with them -- they are idiots at best, narcissists and criminals at worst. At least you and I lack the power to be that awful with our idiocy.

      • krashmo@lemmy.world 2 weeks ago

        What does lower top 10% mean?

    • Randelung@lemmy.world 2 weeks ago

      Also, even if we make it through a wave of bullshit and all these companies fail in 10 years, the next wave will be ready and waiting, spouting the same crap - until it’s actually true (or close enough to be bearable financially). We can’t wait any longer to get this shit under control.

  • raspberriesareyummy@lemmy.world 2 weeks ago

    So there’s actual developers who could tell you from the start that LLMs are useless for coding, and then there’s this moron & similar people who first have to fuck up an ecosystem before believing the obvious. Thanks fuckhead for driving RAM prices through the ceiling… And for wasting energy and water.

    • psycotica0@lemmy.ca 2 weeks ago

      I can at least kinda appreciate this guy’s approach. If we assume that AI is a magic bullet, then it’s not crazy to assume we, the existing programmers, would resist it just to save our own jobs. Or we’d complain because it doesn’t do things our way, but we’re the old way and this is the new way. So maybe we’re just being whiny and can be ignored.

      So he tested it to see for himself, and what he found was that he agreed with us, that it’s not worth it.

      Ignoring experts is annoying, but doing some of your own science and getting first-hand experience isn’t always a bad idea.

      • 5too@lemmy.world 2 weeks ago

        And not only did he see for himself, he wrote up and published his results.

      • bassomitron@lemmy.world 2 weeks ago

        100% this. The guy was literally a consultant and a developer. It’d just be bad business for him to outright dismiss AI without having actual hands-on experience with said product. Clients want that type of experience and knowledge when paying a business to give them advice and develop a product for them.

      • raspberriesareyummy@lemmy.world 2 weeks ago

        Problem is that statistical word prediction has fuck-all to do with AI. It’s not AI and never will be. By “giving it a try” you contribute to the spread of this snake oil. And even if someone came up with actual AI, if it used enough resources to impact our ecosystem instead of being a net positive, and if it was in the greedy hands of billionaires, then using it is equivalent to selling your executioner an axe.

    • InvalidName2@lemmy.zip 2 weeks ago

      And then there are actual good developers who could or would tell you that LLMs can be useful for coding, in the right context and if used intelligently. No harm, for example, in having LLMs build out some of your more mundane code like unit/integration tests, have it help you update your deployment pipeline, generate boilerplate code that’s not already covered by your framework, etc. That it’s not able to completely write 100% of your codebase perfectly from the get-go does not mean it’s entirely useless.
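      To illustrate (with a function and tests invented for this comment, not taken from any real project), this is the kind of mundane scaffolding an LLM can draft and a human can quickly review:

        import re

        def slugify(title):
            """Lowercase and replace runs of non-alphanumerics with hyphens."""
            return re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")

        # LLM-drafted tests: cheap to generate, still worth a human read.
        def test_basic():
            assert slugify("Hello, World!") == "hello-world"

        def test_collapses_punctuation():
            assert slugify("a -- b") == "a-b"

        def test_pure_punctuation_becomes_empty():
            assert slugify("!!!") == ""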

      • Soggy@lemmy.world 2 weeks ago

        Other than that it’s work that junior coders could be doing, to develop the next generation of actual good developers.

      • JcbAzPx@lemmy.world 2 weeks ago

        If it’s boilerplate, copy/paste and find/replace work just as well, without needing data centers in the desert to develop them.

      • raspberriesareyummy@lemmy.world 2 weeks ago

        And then there are actual good developers who could or would tell you that LLMs can be useful for coding

        The only people who believe that are managers and bad developers.

    • khepri@lemmy.world 2 weeks ago

      They are useful for doing the kind of boilerplate boring stuff that any good dev should have largely optimized and automated already. If it’s 1) dead simple and 2) extremely common, then yeah an LLM can code for you, but ask yourself why you don’t have a time-saving solution for those common tasks already in place? As with anything LLM, it’s decent at replicating how humans in general have responded to a given problem, if the problem is not too complex and not too rare, and not much else.

      • lambdabeta@lemmy.ca 2 weeks ago

        That’s exactly what I so often find myself saying when people show off some neat thing that a code bot “wrote” for them in x minutes after only y minutes of “prompt engineering”. I’ll say, yeah, I could also do that in y minutes of (bash scripting/vim macroing/system architecting/whatever), but the difference is that afterwards I have a reusable solution that: I understand, is automated, is robust, and didn’t consume a ton of resources. And as a bonus I got marginally better as a developer.
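        For instance, a sketch of the kind of reusable y-minute script I mean (the paths and the rename rule here are made up for illustration):

          # Normalize log file names under a directory, e.g. "App Server.log" -> "app_server.log".
          # Unlike a one-off prompt, this can be reread, rerun, and version-controlled.
          import pathlib, sys

          def normalize(root):
              for path in list(pathlib.Path(root).rglob("*.log")):
                  target = path.with_name(path.name.lower().replace(" ", "_"))
                  if target != path:
                      path.rename(target)
                      print(f"{path} -> {target}")

          if __name__ == "__main__":
              normalize(sys.argv[1] if len(sys.argv) > 1 else ".")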

        It’s funny: if you stuck them in an RPG and gave them an ability to “kill any level 1-x enemy instantly, but don’t gain any XP for it”, they’d all see it as the trap it is, but they can’t see how that’s what AI so often is.

      • raspberriesareyummy@lemmy.world 2 weeks ago

        As you said, “boilerplate” code can be script-generated - and there are IDEs that already do this, but in a deterministic way, so that you don’t have to proofread every single line to avoid catastrophic security or crash flaws.
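        A toy sketch of what that deterministic generation looks like (the field spec is invented; real IDEs and codegen tools are more elaborate, but equally repeatable):

          # Same field spec in, same code out, no hallucination check required.
          FIELDS = [("id", "int"), ("name", "str"), ("email", "str")]

          lines = ["from dataclasses import dataclass", "", "@dataclass", "class User:"]
          lines += [f"    {name}: {typ}" for name, typ in FIELDS]
          print("\n".join(lines))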

    • ImmersiveMatthew@sh.itjust.works 2 weeks ago

      I really have not found AI to be useless for coding. I have found it extremely useful, and it has saved me hundreds of hours. It is not without its faults or frustrations, but it really is a tool I would not want to be without.

      • raspberriesareyummy@lemmy.world 2 weeks ago

        That’s because you are not a proper developer, as proven by your comment. And you create legacy tech that will have a net cost in terms of maintenance or downtime.

    • Randelung@lemmy.world 2 weeks ago

      Maybe they’ll listen to one of their own?

      • raspberriesareyummy@lemmy.world 2 weeks ago

        The kind of useful article I would expect, then, is one explaining why word prediction != AI.

    • jali67@lemmy.zip 2 weeks ago

      Don’t worry. The people on LinkedIn and tech executives tell us it will transform everything soon!

  • CarbonatedPastaSauce@lemmy.world 2 weeks ago

    Something any (real, trained, educated) developer who has even touched AI in their career could have told you.

    • AutistoMephisto@lemmy.world 2 weeks ago

      What’s funny is this guy has 25 years of experience as a software developer. But three months was all it took to make it worthless.

      • felbane@lemmy.world 2 weeks ago

        As someone who has been shoved in the direction of using AI for coding by my superiors, that’s been my experience as well. It’s fine at cranking out stackoverflow-level code regurgitation and mostly connecting things in a sane way if the concept is simple enough. The real breakthrough would be if the corrections you make would persist longer than a turn or two. As soon as your “fix-it prompt” is out of the context window, you’re effectively back to square one. If you’re expecting it to “learn” you’re gonna have a bad time. If you’re not constantly double checking its output, you’re gonna have a bad time.
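        A toy model of why the fix-it prompt stops working (turn counts stand in for the real token budget; the numbers are invented):

          MAX_TURNS = 4  # stand-in for the context window

          history = []

          def send(msg):
              history.append(msg)
              return history[-MAX_TURNS:]  # only this slice ever reaches the model

          send("write the parser")
          send("fix-it: dates are ISO 8601, not US format")  # your correction
          send("add CSV support")
          send("add JSON support")
          window = send("now refactor the parser")
          assert "fix-it: dates are ISO 8601, not US format" not in window
          # The model didn't "forget" the fix; it never had memory to begin with.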

    • ctrl_alt_esc@lemmy.ml 2 weeks ago

      It’s still useful to have an actual “study” (I’d rather call it a POC) with hard data you can point to, rather than just “trust me bro”.

      • prole@lemmy.blahaj.zone 2 weeks ago

        Like the MIT study that the author refers to? The one that already existed before they decided they needed to do it themselves?

    • Dojan@pawb.social 2 weeks ago

      I was in charge of an AI pilot project two years back at my company. That was my conclusion, among others.

    • some_designer_dude@lemmy.world 2 weeks ago

      Untrained dev here, but the trend I’m seeing is spec-driven development where AI generates the specs with a human, then implements the specs. Humans can modify the specs, and AI can modify the implementation.

      This approach seems like it can get us to 99%, maybe.

      • CaptDust@sh.itjust.works 2 weeks ago

        Trained dev with a decade of professional experience here: humans routinely fail to get me workable specs without hours of back-and-forth meetings. I’d say a solid 25% of my work day is spent understanding what the stakeholders are asking for and how to contort the requirements to fit into the system.

        If these humans can’t be explicit enough with me, a living, thinking human who understands my architecture better than any LLM, what chance does an LLM have?

      • Dojan@pawb.social 2 weeks ago

        Thus you get a piece of software that no one really knows shit about the inner workings of. Sure you have a bunch of spec sheets but no one was there doing the grunt work so when something inevitably breaks during production there’s no one on the team saying “oh, that might be related to this system I set up over here.”

      • Piatro@programming.dev 2 weeks ago

        How is what you’re describing different to what the author is talking about? Isn’t it essentially the same as “AI do this thing for me”, “no not like that”, “ok that’s better”? The trouble the author describes, ie the solution being difficult to change, or having no confidence that it can be safely changed, is still the same.

      • pelespirit@sh.itjust.works 2 weeks ago

        Have you used any AI to try and get it to do something? It learns generally, not specifically. So you give it instructions and then it goes, “How about this?” You tell it that it’s not quite right and to fix these things and it goes off on a completely different tangent in other areas. It’s like working with an 8 year old who has access to the greatest stuff around.

      • floofloof@lemmy.ca 2 weeks ago

        Even more efficient: humans do the specs and the implementation. AI has nothing to contribute to specs, and is worse at implementation than an experienced human. The process you describe, with current AIs, offers no advantages.

        AI can write boilerplate code and implement simple small-scale features when given very clear and specific requests, sometimes. It’s basically an assistant to type out stuff you know exactly how to do and review. It can also make suggestions, which are sometimes informative and often wrong.

    • prole@lemmy.blahaj.zone 2 weeks ago

      Also, what MIT also told them. Literally MIT lol

  • vpol@feddit.uk 2 weeks ago

    The developers can’t debug code they didn’t write.

    This is a bit of a stretch.

    • Xyphius@lemmy.ca 2 weeks ago

      Agreed. 50% of my job is debugging code I didn’t write.

    • funkless_eck@sh.itjust.works 2 weeks ago

      I mean I was trying to solve a problem t’other day (hobbyist) - it told me to create a

      async def foo(bar):
          await object.foo(bar)

      then in object

      def foo(bar):
          _foo(bar)

      def _foo(bar):
          original_object.foo(bar)  # hands the call straight back to the original foo

      like literally passing a variable between three wrapper functions in two objects that did nothing except pass the variable back to the original function in an infinite loop

      add some layers and complexity and it’d be very easy to get lost

      • theparadox@lemmy.world 2 weeks ago

        The few times I’ve used LLMs for coding help, usually because I’m curious if they’ve gotten better, they let me down. Last time it was insistent that its solution would work as expected. When I gave it an example that wouldn’t work, it even broke down each step of the function giving me the value of its variables at each step to demonstrate that it worked… but at the step where it had fucked up, it swapped the value in the variable to one that would make the final answer correct. It made me wonder how much water and energy it cost me to be gaslit into a bad solution.

        How do people vibe code with this shit?

      • vpol@feddit.uk 2 weeks ago

        As a learning process it’s absolutely fine.

        You make a mess, you suffer, you debug, you learn.

        But you don’t call yourself a developer (at least I hope) on your CV.

    • anon_8675309@lemmy.world 2 weeks ago

      Some can’t, because they never acquired the skill to read code. But most did, and can.

      • Rooster326@programming.dev 2 weeks ago

        If you’ve never had to debug code, are you really a developer?

        There is zero chance you have never written a bug. So… who is fixing them?

        Unless you just leave them because you work for Infosys or worse, but then I ask again - are you really a developer?

    • _g_be@lemmy.world 2 weeks ago

      Vibe coders can’t debug code because they didn’t write it.

      • embed_me@programming.dev 2 weeks ago

        Vibe coders can’t debug code because they can’t write code

      • Evotech@lemmy.world 2 weeks ago

        I don’t get this argument. Isn’t the whole point that the ai will debug and implement small changes too?

    • mal3oon@lemmy.world 2 weeks ago

      I think it highly depends on the skill and experience of the dev. A lot of the people flocking to the vibe coding hype are not necessarily people who know much about coding practices (including code review, etc.), nor are they experienced in directing an AI agent to achieve such goals. The result is the MIT prediction. Although, this will start to change soon.

  • pdxfed@lemmy.world 2 weeks ago

    Great article, brave and correct. Good luck, though, getting through to the same leaders who blindly believe in a magical trend for this or next quarter’s numbers; they don’t care about things a year away, let alone 10.

    I work in HR and was struck by the parallel with management jobs being gutted by major corps starting in the 80s and 90s during “downsizing”, who either never replaced those managers or offshored them. They had the Big 4 telling them it was the future of business. Know who is now providing consultation to them on why they have poor ops, processes, high turnover, etc.? The same firms. Take $ on the way in, and on the way out. AI is just the next in a long line of smart people pretending they know your business while you abdicate knowing your business or employees.

    Hope leaders can be a bit braver and wiser this go ’round so we don’t get to a cliff’s edge in software.

    • Ancalagon@lemmy.world 2 weeks ago

      Tbh I think the true leaders are high on coke.

    • mirshafie@europe.pub 2 weeks ago

      Exactly. The problem isn’t moving part of production to some other facility or buying a part that you used to make in-house. It’s abdicating an entire process that you need to be involved in if you’re going to stay on top of the game long-term.

      Claude Code is awesome but if you let it do even 30% of the things it offers to do, then it’s not going to be your code in the end.

    • ripcord@lemmy.world 2 weeks ago

      I’m trying

  • rimu@piefed.social 2 weeks ago

    FYI, this article is written with an LLM.

    [image: AI text detector result for the article]

    • AmbiguousProps@lemmy.today 2 weeks ago

      I’ve heard that these tools aren’t 100% accurate, but your last point is valid.

    • LiveLM@lemmy.zip 2 weeks ago

      Aren’t these LLM detectors super inaccurate?

    • Randelung@lemmy.world 2 weeks ago

      Yes, but also the opposite. Don’t discount a valid point just because it was formulated using an LLM.

    • prole@lemmy.blahaj.zone 2 weeks ago

      Lol the irony… You’re doing literally the exact same thing by trusting that site because it confirms your view

  • dejected_warp_core@lemmy.world 2 weeks ago

    To quote your quote:

    I got the product launched. It worked. I was proud of what I’d created. Then came the moment that validated every concern in that MIT study: I needed to make a small change and realized I wasn’t confident I could do it. My own product, built under my direction, and I’d lost confidence in my ability to modify it.

    I think the author just independently rediscovered “middle management”. Indeed, when you delegate the gruntwork under your responsibility, those same people are who you go to when addressing bugs and new requirements. It’s not on you to effect repairs: it’s on your team. I am Jack’s complete lack of surprise. The idea that relying on AI to do nuanced work like this would arrive at the exact correct answer to the problem is naive at best. I’d be sweating too.

    • fuck_u_spez_in_particular@lemmy.world 2 weeks ago

      The problem, though, with AI compared to humans: the human team learns, i.e. at some point they probably know what the mistake was and avoid doing it again. With AI instead of humans: well, maybe the next or a different model will fix it… maybe.

      And what is very clear to me after trying to use these models: the larger the code-base, the worse the AI gets, to the point of not helping at all or even being destructive. The exception is dissecting small, isolatable pieces of independent code (i.e. keeping the context small for the AI).

      Humans likely get slower with a larger code-base, but they (usually) don’t arrive at a point where they can’t progress any further.

  • Unlearned9545@lemmy.world 2 weeks ago

    Fractional CTO: some small companies benefit from the senior experience of these kinds of executives but don’t have the money or the need to hire one full time. For a fraction of their time, these executives serve as C-suite for various companies.

  • Agent641@lemmy.world 2 weeks ago

    I cannot understand and debug code written by AI. But I also cannot understand and debug code written by me.

    Let’s just call it even.

  • dsilverz@calckey.world 2 weeks ago

    @AutistoMephisto@lemmy.world @technology@lemmy.world

    I have dealt with programming since I was 9 y.o., with my professional career in DevOps starting several years later, in 2013. I dealt with lots of others' code, legacy code, very shitty code (especially done by my "managers" who cosplayed as programmers), and tons of technical debt.

    Even though I'm quite the LLM power-user (because I'm a person devoid of other humans in my daily existence), I never relied on LLMs to "create" my code: rather, what I did a lot was tinker with different LLMs to "analyze" my own code that I wrote myself, both to experiment with their limits (e.g., I wrote a lot of cryptic, code-golf one-liners and fed them to the LLMs in order to test their ability to "connect the dots" on whatever was happening behind the cryptic syntax) and to try and use them as a pair of external eyes beyond mine (due to that same ability to "connect the dots", by which I mean their ability, as fancy Markov chains, to relate tokens to other tokens with similar semantic proximity).
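    To give an invented example of the kind of one-liner I mean, plus the readable version a model would ideally "connect the dots" to:

      # Cryptic golf, fed to the LLM as-is:
      f = lambda n: [x for x in range(2, n) if all(x % d for d in range(2, int(x**0.5) + 1))]

      # The readable intent it should recover:
      def primes_below(n):
          return [x for x in range(2, n)
                  if all(x % d != 0 for d in range(2, int(x**0.5) + 1))]

      assert f(30) == primes_below(30) == [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]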

    I did test them (especially Claude/Sonnet) for their "ability" to output code, not intending to use the code because I'm better off writing my own thing, but you likely know the maxim: one can't criticize what they don't know. And I tried to know them so I could criticize them. To me, the code is… pretty readable. Definitely awful code, but readable nonetheless.

    So, when the person says...

    The developers can’t debug code they didn’t write.
    ...even though they argue they have more than 25 years of experience, it feels to me like they don't.

    One thing is saying "developers find it pretty annoying to debug code they didn't write", a statement I'd totally agree with! It's awful to try to debug others' code, because you need to try to put yourself in their shoes without knowing what their shoes are like… But it's doable, especially by people who have dealt with programming logic since their childhood.

    Saying "developers can't debug code they didn't write", to me, sounds like it comes from a layperson who doesn't belong to the field of Computer Science, doesn't like programming, and/or only pursued a "software engineer" career purely because of a money/capitalistic mindset. Either way, if a developer can't debug others' code, sorry to say, but they're not a developer!

    Don't get me wrong: I'm not intending to be prideful or pretending to be awesome; this is beyond my person, I'm nothing, I'm no one. I abandoned my career because I hate the way technology is growing more and more enshittified. Working as a programmer for capitalistic purposes ended up depleting the joy I used to have back when I coded on a daily basis. I'm not on the "job market" anymore, so what I'm saying is based on more than 10 years of former professional experience. And my experience says: a developer who can't put themselves into at least trying to understand the worst code out there can't call themselves a developer, full stop.

  • flamingo_pinyata@sopuli.xyz 2 weeks ago

    “fractional CTO”(no clue what that means, don’t ask me)

    For those who were also interested to find out what this means: a consultant and advisor in a part-time role, paid to make decisions that would usually fall under the scope of a CTO, but for smaller companies that can’t afford a full-time, experienced CTO.

  • Suffa@lemmy.wtf 2 weeks ago

    AI is really great for small apps. I’ve saved so many hours over weekends that would otherwise be spent coding some small thing I only need a few times; now I can get an AI to spit it out for me.

    But anything big and it’s fucking stupid, it cannot track large projects at all.

  • Nalivai@lemmy.world 2 weeks ago

    They never actually say what “product” they make; it’s always “shipped product”, like they’re a fucking Amazon warehouse. I suspect it’s because it’s some trivial webpage that would take a student an afternoon to whip up, which they spent three days arguing with an autocomplete to shit out.

  • Rhoeri@lemmy.world 2 weeks ago

    AI is hot garbage and anyone using it is a skillless hack. This will never not be true.

  • Evotech@lemmy.world 2 weeks ago

    Just ask the ai to make the change?

  • HugeNerd@lemmy.ca 2 weeks ago

    Computers are too powerful and too cheap. Bring back COBOL, painfully expensive CPU time, and some sort of basic knowledge of what’s actually going on.

    Pain for everyone!

  • phed@lemmy.ml 2 weeks ago

    I do a lot with AI, but it is not good enough to replace humans, not even close. It repeats the same mistakes after you tell it no; it doesn’t remember things from 3 messages ago when it should. You have to keep re-explaining the goal to it. It’s wholly incompetent. And yeah, when you have it do stuff you aren’t familiar with or didn’t create, definitely: I have it write commentary, or I take the time out right then to ask it what x or y does, then I add a comment.

  • deathbird@mander.xyz 2 weeks ago

    I think this kinda points to why AI is pretty decent for short videos, photos, and texts. It produces outputs that one applies meaning to, and humans are meaning-making animals. A computer can’t overlook or rationalize a coding error the same way.

  • lepinkainen@lemmy.world 2 weeks ago

    Same thing would happen if they were a non-coder project manager or designer for a team of actual human programmers.

    Stuff done, shipped and working.

    “But I can’t understand the code 😭”, yes. You were the project manager; why should you?

  • KazuyaDarklight@lemmy.world 2 weeks ago

    My big fear with this stuff is security. It just seems so “easy”, without knowledgeable people, for AI to write a product that functions from a user perspective but is wide open to attack.

  • SocialMediaRefugee@lemmy.world 2 weeks ago

    Just sell it to AI customers for AI cash.

  • just_another_person@lemmy.world 2 weeks ago

    No shit

  • kreskin@lemmy.world 2 weeks ago

    I work at a company that is all-in on selling AI, and we are trying desperately to use this AI ourselves. We’ve concluded internally that AI can only be trusted with small use cases that are easily validated by humans, or with fast prototyping work… hack-day stuff to validate a possibility, but not an actual high-quality, safe, and scalable implementation, or with writing tests for existing code to increase test coverage (yes, I know that’s a bad idea, but QA blessed the result… so, uh… cool).

    The use case we zeroed in on is writing well-schema’d configs in YAML or JSON. Even then, a good percentage of the time the AI will miss very significant mandatory sections, or add hallucinations that are unrelated to the task at hand. We can then use AI to test AI’s work, several times, using several AIs, and to a degree it’ll catch a lot of the issues, but not all. So we then code-review and lint with code we wrote that AI never touched, and send all the erroring configs to a human. It does work, but it can’t be used for mission-critical applications. And nothing about the AI, or the process of using it, is free. It’s also disturbingly non-idempotent. Did it fail? Run it again a few times and it’ll pass. We think it still saves money when done at scale, but not as much as we promise external AI consumers. Senior leadership knows it’s currently overhyped trash and pressures us to use it anyway, on the expectation that it’ll improve in the future, so we give the mandatory crisp salute of alignment and we’re off.
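    A minimal sketch of that last lint step (the mandatory section names here are invented; ours come from the actual schema):

      import json

      REQUIRED = {"service", "resources", "healthcheck"}  # invented mandatory sections

      def lint(config_text):
          try:
              cfg = json.loads(config_text)
          except json.JSONDecodeError as e:
              return ["unparseable: " + str(e)]
          return ["missing mandatory section: " + k for k in sorted(REQUIRED - cfg.keys())]

      errors = lint('{"service": "billing"}')  # AI output that skipped sections
      print(errors or "ok")  # non-empty -> route the config to a human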
    

    I will say it’s great for writing reviews. It adds nonsense and doesn’t get the whole review correct, but it writes very flowery stuff so managers don’t have to. So we use it for first drafts and then strip a lot of the true BS out of it. If it gets stuff wrong, oh well; human perception is flawed.

  • DupaCycki@lemmy.world 2 weeks ago

    Personally, I tried using LLMs for reading error logs and summarizing what’s going on. I can say that even with somewhat complex errors, they were almost always right and very helpful. So, basically the general consensus: use them as assistants within a narrow scope.

    Though it should also be noted that I only did this at work. While it seems to work well, I think I’d still limit such use in personal projects, since I want to keep learning more, and private projects are generally much more enjoyable to work on.

    Another interesting use case I can highlight is using a chatbot as documentation when the actual documentation is horrible. However, this only works within the same ecosystem, so for instance Copilot with MS software. Microsoft definitely trained Copilot on its own stuff and it’s often considerably more helpful than the docs.

  • m3t00@piefed.world 2 weeks ago

    ask your ai pal for help

  • minorkeys@lemmy.world 2 weeks ago

    It looks like a rigid design philosophy that requires a complete rebuild for any change. If the speed of production becomes fast enough, and the cost low enough, regenerating the entire program for every change would become feasible and cost-effective.

  • ExLisper@lemmy.curiana.net 2 weeks ago

    They shipped a product in 3 months? What the fuck was it? New “under construction” page?

  • Rentlar@lemmy.ca 2 weeks ago

    I did see someone write a post about Chat Oriented Programming; to me that appeared successful, but not without cost and extra care. Original link: checkeagle.com/…/a-month-of-chat-oriented-program… (see also the discussion thread).

    Successful in that it wrote code faster, and its output stuck to conventions better than the author would have. But they had to watch it like a hawk, with the discipline of a senior developer putting full attention on a junior: stop and swear at it every time it ignored the rules given at the beginning of each session, terminate the session when it starts an autocompactification routine that wastes your money and makes Claude forget everything, and try to dump what it has completed each time. One of the costs seems to be the sanity of the developer, so I really question whether it’s a sustainable way of doing things, from both the model side and the developer side. To be actually successful you need to know what you’re doing; otherwise it’s easy to fall into a trap like the CTO did, trusting the AI’s assertions that everything is hunky-dory.

  • ImmersiveMatthew@sh.itjust.works 2 weeks ago

    This has not been my experience at all. I have a top-rated VR app and use AI to code everything, and I change things all the time. It is not hard to understand the code, prompt the AI to change this or that, and then test to see if it got it right. If it did not, just prompt again to address it. Maybe this does not work for the author or others, but it has saved me hundreds of hours in my small app.
