• Pechente@feddit.org
    link
    fedilink
    arrow-up
    23
    arrow-down
    2
    ·
    edit-2
    4 hours ago

    LLMs often fail at the simplest tasks. Just this week I had it fail multiple times where the solution ended up being incredibly simple and yet it couldn’t figure it out. LLMs also seem to „think“ any problem can be solved with more code, thereby making the project much harder to maintain.

    LLMs won’t replace programmers anytime soon but I can see sketchy companies taking programming projects by scamming their clients through selling them work generated by LLMs. I‘ve heard multiple accounts of this already happening and similar things happened with no code solutions before.

    • Rikudou_Sage@lemmings.world
      link
      fedilink
      arrow-up
      1
      ·
      18 minutes ago

      Today I removed some functions and moved some code to separate services and being the lazy guy I am, I told it to update the tests so they no longer fail. The idiot pretty much undid my changes and updated the code to something very much resembling the original version which I was refactoring. And the fucker did it twice, even with explicit instructions to not do it.

    • TurdBurgler@sh.itjust.works
      link
      fedilink
      arrow-up
      6
      arrow-down
      17
      ·
      edit-2
      4 hours ago

      Your anecdote is not helpful without seeing the inputs, prompts and outputs. What you’re describing sounds like not using the correct model, providing good context or tools with a reasoning model that can intelligently populate context for you.

      My own anecdotes:

      In two years we have gone from copy/pasting 50-100 line patches out of ChatGPT, to having agent enabled IDEs help me greenfield full stack projects, or maintain existing ones.

      Our product delivery has been accelerated while delivering the same quality standards verified by our internal best practices we’ve our codified with determistic checks in CI pipelines.

      The power come from planning correctly. We’re in the realm of context engineering now, and learning to leverage the right models with the right tools in the right workflow.

      Most novice users have the misconception that you can tell it to “bake a cake” and get the cake ypu had in your mind. The reality is that baking a cake can be broken down into a recipe with steps that can be validated. You as the human-in-the-loop can guide it to bake your vision, or design your agent in such a way that it can infer more information about the cake you desire.

      I don’t place a power drill on the table and say “build a shelf,” expecting it to happen, but marketing of AI has people believing they can.

      Instead, you give an intern a power drill with a step-by-step plan with all the components and on-the-job training available on demand.

      If you’re already good at the SDLC, you are rewarded. Some programmers aren’t good a project management, and will find this transition difficult.

      You won’t lose your job to AI, but you will lose your job to the human using AI correctly. This isn’t speculation either, we’re also seeing workforce reduction supplemented by Senior Developers leveraging AI.

      • TeamAssimilation@infosec.pub
        link
        fedilink
        arrow-up
        17
        arrow-down
        1
        ·
        4 hours ago

        I seriously doubt your quality is maintained when an LLM writes most of your code, unless a human audits every line and understands what and why it is doing it.

        If you break the tasks small enough that you can do this each step, it is no longer writing a full application, it’s writing small snippets, and you’re code-pairing with it.

        • TurdBurgler@sh.itjust.works
          link
          fedilink
          arrow-up
          2
          arrow-down
          5
          ·
          edit-2
          3 hours ago

          Cursor and Claude Code are currently top tier.

          GitHub Copilot is catching up, and at a $20/mo price point, it is one of the best ways to get started. Microsoft is slow rolling some of the delivery of features, because they can just steal the ideas from other projects that do it first. VScode also has extensions worth looking at: Cline and RooCode

          Claude Code is better than just using Claude in cursor or copilot. Claude Code has next level magic that dispells some of the myths being propagated here about “ai bad at thing” because of the strong default prompts and validation they have built into it. You can say dumb human ignorant shit, and it will implicitly do a better job than others tools you give the same commands to.

          To REALLY utilize claude code YOU MUST configure mcp tools… context7 is a critical one that avoids one of those footguns, “the model was trained on older versions of these libraries.”

          Cursor hosts models with their own secret sauce that improves their behavior. They hardforked VSCode to make a deeper integrated experience.

          Avoid antigravity (google) and Kiro (Amazon). They don’t offer enough value over the others right now.

          If you already have an openai account, codex is worth trying, it’s like Claude Code, but not as good.

          JetBrains… not worth it for me.

          Aider is an honorable mention.

      • Eheran@lemmy.world
        link
        fedilink
        arrow-up
        3
        arrow-down
        4
        ·
        4 hours ago

        One of the rare comments here that is not acid spewing rage against AI. I too went from “copying a few lines to save some time” and having to recheck everything to several hundred lines working out of the box.

        • TurdBurgler@sh.itjust.works
          link
          fedilink
          arrow-up
          1
          arrow-down
          4
          ·
          3 hours ago

          I get it. I was a huge skeptic 2 years ago, and I think that’s part of the reason my company asked me to join our emerging AI team as an Individual Contributor. I didn’t understand why I’d want a shitty junior dev doing a bad job… but the tools, the methodology, the gains… they all started to get better.

          I’m now leading that team, and we’re not only doing accelerated development, we’re building products with AI that have received positive feedback from our internal customers, with a launch of our first external AI product going live in Q1.