_______ __ _______
| | |.---.-..----.| |--..-----..----. | | |.-----..--.--.--..-----.
| || _ || __|| < | -__|| _| | || -__|| | | ||__ --|
|___|___||___._||____||__|__||_____||__| |__|____||_____||________||_____|
on Gopher (unofficial)
HTML Visit Hacker News on the Web
COMMENT PAGE FOR:
DIR Ask HN: How to deal with long vibe-coded PRs?
EdwardDiego wrote 5 hours 32 min ago:
I wrote a lot of comments - for humans, but then I also specifically
addressed some to the bot - "Cursor, remove all emojis in log messages,
and do not use print for debugging, use a logger, where you are using a
logger you are repeatedly importing the logging library in various
conditional clauses, you should always import at the top level of the
file" etc. etc. etc. - because you know that they're going to feed my
review back to the bot.
The fact that someone submitted this PR in that state though...
jasonjmcghee wrote 8 hours 6 min ago:
That's unreasonably large. Depending on the content, PRs tend to get
harder and harder to read with every line of code.
1k added lines is imo already pushing it.
9k and 63 files is astronomical and very difficult to review.
A proper review means being able to understand the system and what's
being changed, how, and why in order to be able to judge if it was done
properly and includes everything it should and nothing it shouldn't.
9k lines is just too much to be able to do this properly.
NumberCruncher wrote 8 hours 38 min ago:
Why not fight fire with fire and use AI to:
Version A: find 100 LOC which can be reduced to 50 LOC without changing
the functionality. Then ask the author to go through the PR making sure
it's not bloated. Repeat.
Version B: find hidden bugs. Ask the author to fix them. Repeat.
Keep them occupied while saving your face. I would also fine-tune my own
agent to automate this kind of work for me.
nish__ wrote 12 hours 48 min ago:
Build it locally and QA test it.
pacifika wrote 13 hours 27 min ago:
It roughly takes an hour to review 1000 LOC. Tell your manager to book you
in for a day and a half on the review. Usually scheduling it in is a
deterrent to a quick approval.
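The scheduling arithmetic above can be sketched in a couple of lines (the 1000-LOC-per-hour rate is the comment's rule of thumb; the eight-hour working day is an assumption):

```python
# Rough review-time estimate from the rule of thumb above:
# ~1 hour per 1000 changed lines, eight-hour working days assumed.
def review_days(changed_lines: int, lines_per_hour: int = 1000,
                hours_per_day: int = 8) -> float:
    """Estimated working days needed to properly review a diff."""
    return changed_lines / lines_per_hour / hours_per_day

print(review_days(9000))  # -> 1.125, i.e. roughly "a day and a half" with slack
```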
irvingprime wrote 13 hours 30 min ago:
Reject. Period. No compromise. No friendly comments about how it can be
improved. Just reject it as unreviewable.
Then ban the idiot who submitted it.
paul_h wrote 14 hours 9 min ago:
Ask AI number 2 to summarize the intention (look at the .patch) in
markdown. Reset. Ask your AI to read the intention as if the original
author had written it, say you have grave doubts about the contribution
functionally and non-functionally, and have it help you put that into
words to feed back to the contributor. Basically the playbook from
HTML [1]: https://paulhammant.com/images/SimonSinghsFermatExcerpt.jpg
ilc wrote 14 hours 12 min ago:
The same way I would with a human:
If I thought the service should only be 1000 lines tops:
- Reject due to excess complexity.
If it is a proper solution:
- Use AI to review it, asking it to be VERY critical of the code, and
look for spots where human review may be needed, architecture wise,
design wise and implementation wise.
- Ask the AI again to do a security review etc.
- Tell the author to break the PR down into human-sized chunks using
git.
Why those things? It's likely some manager is gonna tell me to review it
anyway. And if so, I want to have a head start, and if there are
critical, shoot-down-level issues I can find with an AI quickly, I'd
just shut the PR down now.
As in any "security" situation, in this case the security of your
codebase and sanity, defense in depth is the answer.
bitbasher wrote 14 hours 19 min ago:
"CoS" - Close on Sight
dearilos wrote 17 hours 51 min ago:
Put up guardrails to enforce quality code.
Sirikon wrote 17 hours 56 min ago:
Reject them
drbojingle wrote 18 hours 4 min ago:
If they can vibe code it they can vibe disassemble it and vibe small PR
it.
jake-coworker wrote 18 hours 9 min ago:
I usually share this resource when people start doing this
HTML [1]: https://google.github.io/eng-practices/review/developer/small-...
claar wrote 17 hours 6 min ago:
Apparently "CL" is "Change List", Google speak for a Pull Request, if
anyone else was confused like I was.
ErroneousBosh wrote 18 hours 31 min ago:
Instant reject, advising them not to resubmit.
dzink wrote 18 hours 34 min ago:
With AI, code complexity is a cost bigger than money, because it takes
an unbounded amount of time from humans (maintainers, engineers) and
requires an ever-increasing amount of memory and hardware to handle
(unnecessarily). You have to account for it and hold contributors
accountable for it; otherwise any codebase will become unmanageable,
unrunnable, and un-upgradable.
javier_e06 wrote 18 hours 42 min ago:
I would request references in the PR to unit tests with 100% coverage.
Once I run them, and if they pass, I would do a spot check and
look for glaring errors. Nothing deep. Perhaps I would run lint or some
static analysis tool on the code. If the analysis tools come out
squeaky clean and the unit tests pass? Well, what's not to like? One
or more problems? Reject the whole thing.
data-ottawa wrote 18 hours 27 min ago:
I don't like the 100% test coverage approach.
The PR will be huge, plus AI is great at adding tons of shallow
tests.
I see tests as little pins that hold your codebase down. They can be
great for overall stability, but too many and your project becomes
inflexible and brittle.
In this case you'd be nailing a bunch of code that you don't want
to the code base.
thinkingtoilet wrote 18 hours 29 min ago:
The problem is you can get a very large, messy, and inconsistent code
base that eventually will slow things down. Even if tests pass, the
code can be a detriment in the long run.
self_awareness wrote 18 hours 45 min ago:
Reject, citing unnecessary technical debt. Most of the time,
custom DSLs are not needed.
The question is what was the original task that needed to be fixed? I
doubt it required a custom DSL.
Issue a research task first to design the scope of the fix, what needs
to be changed and how.
ontouchstart wrote 18 hours 46 min ago:
A more difficult question might be if it were merged now and 100 merges
later you found a serious bug with a root cause in this PR, do you ask
the same person to fix it?
dlisboa wrote 18 hours 47 min ago:
Close them. It's not a PR in good faith. A pull-request is meant to be
reviewable, 9k LOC is not.
fancyfredbot wrote 19 hours 3 min ago:
If it shouldn't be 9k LOC, and it doesn't need a DSL parser, then
reject it as overcomplicated and unmaintainable. Make it clear how
large and complex you expect it to be and where existing code or
framework should be reused so they can go away and address your
concerns and so that there's a high chance you'll be able to approve if
they do.
Above all, you aim to allow the contributor to be productive: make it
clear what constraints they need to operate under in order to use AI
codegen effectively. You want to come across as trying to help them, and
take care not to appear obstructive or dismissive.
HelloNurse wrote 19 hours 13 min ago:
Complaining about inadequate tests and documentation should be a very
efficient and effective strategy against slop.
zzzeek wrote 19 hours 24 min ago:
It's garbage, reject it. Over-engineered. Update your PR guidelines:
AI is fine for helping write code, but PRs ultimately have to be
human-designed.
drfrank3 wrote 19 hours 31 min ago:
AI creates slop of dead or inefficient code that can be cleaned up. I
think that developers that obsess over control have a difficult time
adjusting to this.
The greater danger is that AI can create or modify code into something
that is disconnected, stubbed, and/or deceptive and claim it's
complete. This is much worse because it wastes much more time, but AI
can fix this too, just like it can the slop- maybe not
deterministically, but it can.
And because of this, those that get in the way of creating source with
AI are just cavemen rejecting fire.
0x000xca0xfe wrote 19 hours 32 min ago:
Fight slop with slop. Use an AI to review it in excruciating detail and
write a lengthy justification for the rejection. Make sure to really
hit a couple thousand words.
Maybe getting their own time wasted will teach the submitter about the
value of clarity and how it feels to be on the receiving end of a
communication with highly asymmetric effort.
chuckadams wrote 19 hours 31 min ago:
Nice, but I suspect the PR submitter will simply reply with their own
AI-generated counterpoints. They'll probably come to an agreement to
turn both of you into paperclips.
0x000xca0xfe wrote 19 hours 21 min ago:
If they resubmit, you at least have proof that the conversation is
completely braindead and that the other side is happy to engage in
unprofessional behavior and generally waste time and resources.
This should be convincing enough even for a non-technical team lead
while for the initial PR it might be hard to explain objectively
why it's bad.
james_marks wrote 19 hours 47 min ago:
"This is unnecessarily complex" and cite 1-2 egregious examples,
with a LOC estimate that you think is more reasonable.
5 minutes, off the cuff.
Yizahi wrote 19 hours 48 min ago:
An alternative to the reject-and-request-rewrite approach, which may not
work in a corporate environment: schedule a really long video call with
the offending person, with an agenda politely stating that such a huge
and extensive change requires a collaborative meeting. Then notify your
lead that a huge new task has arrived which will take X hours from you,
so if he wishes to re-prioritize tasks, he is welcome to. Then, if the
meeting happens, you literally go line by line, demanding that the
author explain each one to you. And if the explanation or the meeting is
refused, you can reject the PR with a clear explanation why.
ericmcer wrote 11 hours 14 min ago:
It doesn't even need to be a long call, just a one off "hey this is a
big PR, got a sec to run me through it" works.
Usually within a few questions the answer "the AI wrote it that way"
will come out.
Which feels bananas to me. Like, you don't understand how the code in
your PR does what you want? That would feel like showing up to work
with no pants on to me.
bb88 wrote 14 hours 29 min ago:
I would recommend inviting the person, his manager, and your manager.
To start off the meeting, I would say something like, "To me
this is a surprising PR. I would expect it to be maybe 500 lines
(e.g.), including unit tests. Why does it need to be this
complicated?"
If your manager just folds over, I would just accept it, because he's
probably so beat down by the organization that he's not going to
stick up for his employees anyway. At that point, it's time to look
for another job.
But if the managers listen to their employees, and you have a better
alternative, and your arguments are reasonable, it should be fine.
tokioyoyo wrote 16 hours 43 min ago:
Honestly, this approach would probably get you fired eventually for
non-coop behaviour in every company I've worked at.
AI slop code is becoming the go-to way for a lot of written code. At
some point it'll make more sense to find a solution to the problem
("how to be confident in slop code"), rather than going against
the financial motives of the business owners (cut expenses, maximize
profit somehow through AI). I'm not sure if it's right or wrong,
but it is what it is.
embedding-shape wrote 16 hours 25 min ago:
> Honestly, this approach would probably get you fired eventually
for non-coop behaviour in every company I've worked at.
I don't think I've ever worked in a company that would fire someone
for something like that. Maybe you'd get a scheduled conversation
to talk about it, to try to resolve whatever is going on (in this
case verbose AI slop PRs), since obviously something is going wrong
when people start engaging in malicious compliance.
But then, I also never worked in a country where people can be fired
for whatever reason; it's always been a legal requirement (in the
countries I've lived in, at least) that a firing be properly
justified and explained, and firing someone like that wouldn't be
justified.
tokioyoyo wrote 7 hours 27 min ago:
It wouldn't be a direct firing, but if the majority of the
company/team is pro-AI, people will complain privately about you
"being a person hard to work with". Eventually, after 6
months of repetitive passive aggressiveness, you'll be let go
or get pay cuts.
userbinator wrote 1 hour 41 min ago:
If the company is like that, chances are you wanted to find a
better job anyway.
fifilura wrote 14 hours 43 min ago:
Wow, 636 karma points in 8 days since registering! You are on a
roll! (FWIW i upped this comment).
embedding-shape wrote 13 hours 23 min ago:
Hah, don't say that, just makes my current computer and HN
obsession even more clear, I've hidden upvotes in my browser
just to avoid realizing that! (jokes aside, I do probably spend
too much time on HN at this moment...)
iamleppert wrote 18 hours 17 min ago:
Oh, how I would love to work with you. I'd drown you in more
meetings, documentation on code (LLM generated of course) than you
could ever imagine.
You can use the LLM to generate as much documentation on the changes
as you want. Just give it your PR. If someone tries to reject your
vibe coded AI slop, just generate more slop documentation to drown
them in it. It works every time.
If they push back, report them to their manager for not being "AI
first" and a team player.
zeroCalories wrote 17 hours 0 min ago:
When my manager pings me about it I'll just show him your ai slop
and tell him we'll be liable for all the bugs and production issues
related to this, in addition to maintaining it. Then let him make
the choice. Escalate if needed.
a4isms wrote 18 hours 2 min ago:
If we look at this as a system with work flowing through it, the
"theory of constraints" quickly tells us that code review is the
bottleneck, and that speeding up code generation actually lowers
system throughput.
This is not new stuff, Goldratt warned us about this twenty+ years
ago.
HTML [1]: https://en.wikipedia.org/wiki/Theory_of_constraints
a4isms wrote 18 hours 36 min ago:
This is the answer, and it has been the answer going back to the
before times when we didn't have pull requests but we did in-person
team code reviews before merging branches (yes, really). A massive,
complicated merge without documentation and extensive support from
other interested/impacted people and teams justifying things like a
parser DSL? That is always going to be a problem whether AI generated
it or the developer locked their office door and went on a three-day
manic coding binge.
AI doesn't directly make this stuff worse, it accelerates a team's
journey towards embracing engineering practices around the code being
written by humans or LLMs.
djmips wrote 14 hours 34 min ago:
we still do this on very big fundamental changes.
watwut wrote 18 hours 41 min ago:
Sir, I see a big senior manager future in you. Hats off.
Msurrow wrote 19 hours 23 min ago:
See, now that's a proper f** you in corporate-speak.
meltyness wrote 19 hours 58 min ago:
Proof by counterexample, just find the inevitable security flaw.
giantg2 wrote 20 hours 54 min ago:
Start with the test files. There's no way the AI had meaningful and
working test cases. Pop a comment on each test file about missing tests
or expanding them. That will force the dev to review their own code and
make substantial changes.
pomarie wrote 21 hours 2 min ago:
One thing that actually works is getting AI to review the basic stuff
first so you can focus on architecture and design decisions. The irony
of using AI to review AI-generated code isn't lost on me, but it does
help.
That said, even with automated review, a 9000 line PR is still a hard
reject. The real issue is that the submitter probably doesn't
understand the code either. Ask them to walk you through it or break it
down into smaller pieces. If they can't, that tells you everything.
The asymmetry is brutal though. Takes an hour to generate 9000 lines,
takes days to review it properly. We need better tooling to handle this
imbalance.
(Biased take: I'm building cubic.dev to help with this exact problem.
Teams like n8n and Resend use it to catch issues automatically so
reviewers can focus on what matters. But the human review is still
essential.)
brutal_chaos_ wrote 21 hours 5 min ago:
Having experienced AI at $job, and having tried to make vibe coding a
thing, run when you see it. Yes, that means some good-enough AI gets
through; what's the harm in that if it works as you need it to?
reactordev wrote 21 hours 5 min ago:
Easy, auto reject and close it. If asked why, state that each feature
should be its own PR. Don't waste any more brain cells on it.
If an engineer really cared, they would discuss these changes with you.
Each new feature would be added incrementally and ensuring that it
doesnât break the rest of the system. This will allow you to
understand their end goal while giving them an avenue to achieve it
without disrupting your end goal.
mexicocitinluez wrote 21 hours 9 min ago:
The same way you would do literally any other PR. I don't know why this
is special.
If the code sucks, reject it. If it doesn't, accept it.
This isn't hard.
arianjm wrote 21 hours 11 min ago:
It always depends on your position and their position, but by the
sounds of it... I'd say it's too big for a simple "review this".
I'd ask them to write up their thought process: why they made the
decisions they made, and the need for so many files and so many
changes.
I may ask for a videoconference to understand it better, if it's a
colleague from work.
By now you should hopefully know whether their approach is valid or not.
If you're not sure yet, I'd take a look at the code, especially at
the parts they refer to most in their answers to my previous
questions. So not a detailed review, but a more general pass, to decide
if this is valid or not.
If it's a valid approach, then I guess I'd review it. If not, then give
feedback as to how to make it valid, and why it isn't.
Not valid is very subjective. From "this is just garbage", to "this is
a good approach, but we can implement this iteratively in separate PRs
that will make my life easier", again, it depends on your and their
position.
dustingetz wrote 21 hours 28 min ago:
zoom call
ask them to walk you through it
ask for design doc if appropriate
what is test plan who is responsible for prod delivery and support
(no difference from any other large pr)
fhd2 wrote 21 hours 31 min ago:
I'd say you have three options:
1. Reject it on the grounds of being too large to meaningfully review.
Whether they used AI or not, this is effectively asking them to start
over in an iterative process where you review every version of the
thing and get to keep complexity in check. You'll need the right power
and/or standing for this to be a reasonable option. At many
organisations, you'd get into trouble for it as "blocking progress". If
the people that pay you don't value reliability or maintainability, and
you couldn't convince them that they should, that's a tough one, but it
is how it is.
2. Actually review it in good faith: Takes a ton of time for large,
over engineered changes, but as the reviewer, it is usually your job to
understand the code and take on responsibility for it. You could
propose to help out by addressing any issues you find yourself rather
than making them do it, they might like that. This feels like a
compromise, but you could still be seen as the person "blocking
progress", despite, from my perspective, biting the bullet here.
3. Accept it without understanding it. For this you could _test_ it and
give feedback on the behaviour, but you'd ignore the architecture,
maintainability etc. You could still collaboratively improve it after
it goes live. I've seen this happen to big (non-AI generated) PRs a
lot. It's not always a bad thing. It might not be good code, but it
could well be good business regardless.
Now, however you resolve it, it seems like this won't be the last time
you'll struggle to work with that person. Can, and do they want to,
change? Do you want to change? If you can't answer either of these
questions with a yes, you'll probably want to look for ways of not
working with them going forward.
fifilura wrote 21 hours 32 min ago:
Is it Java/Spring? Then probably go along and be happy that a human
didn't have to write those 9000 lines for a trivial service.
Cthulhu_ wrote 20 hours 58 min ago:
Man I'd hate it even more then. If I ever get back to Java I hope
this nonsense has been resolved. I've been spoiled by Go where
they're a lot more pragmatic about things. I want to add a brouhaha
about code volume but it's compared with Java so never mind.
alganet wrote 21 hours 44 min ago:
"too big, please break it into smaller self-contained PRs"
[ Close with comment ]
Lapsa wrote 21 hours 53 min ago:
Strict enforcement of lines-of-code limits will lead to half-finished
change requests and leak technological gibberish upstream to the lovely
business folk.
phendrenad2 wrote 21 hours 57 min ago:
Are they truly vibe-coded? Or is the person simply accomplishing months
of work in one day? Do you think the submitter reviewed it themselves?
There's a difference you know. Like it or not, AI coding is not going
away.
In your case, 9000 LOC and 63 files isn't that crazy for a DSL. Does
the DSL serve a purpose? Or is it just someone's feature fever dream to
put your project on their resume?
onion2k wrote 21 hours 59 min ago:
How would you go about reviewing a PR like this?
AI is a red herring in discussions like this. How the change was authored
makes no difference here.
I wouldn't. I'd reject it. I'd reject it even if the author had
lovingly crafted each line by hand. A change request is not "someone
must check my work". It's a collaboration between an author and a
reviewer. If the author is failing to bother respecting the reviewer's
time then they don't deserve to get a review.
Cthulhu_ wrote 22 hours 1 min ago:
I wouldn't, they can start by writing requirements and a design first,
then break it up into manageable components.
Or just refuse to review and let the author take full responsibility in
running and maintaining the thing, if that's possible. A PR is asking
someone else to share responsibility in the thing.
fxtentacle wrote 22 hours 35 min ago:
"I trust that you have proof-read this" and then just merge.
When production explodes, their name will be all over "git blame".
bluerooibos wrote 22 hours 53 min ago:
I wouldn't review it - bad engineering practice to submit this much
work in one go - it puts too much expectation on the reviewer and makes
it more likely that something gets broken.
Even 1000 lines is pushing it, IMO. Tell them to split the PR up into
more granular work if they want it merged.
grodriguez100 wrote 23 hours 2 min ago:
Don't. I would refuse to review a PR with 9000 LOC and 63 new files
even if written by a human. Something that large needs to be discussed
first to agree on an architecture and general approach, then split in
manageable pieces and merged piece-wise in a feature branch, with each
individual PR having reasonable test coverage, and finally the feature
branch merged into master.
ivankahl wrote 23 hours 8 min ago:
What are your organization's expectations or policies regarding PR size
and acceptable AI usage? Even if your organization hasn't set any
expectations, what are yours, and have you communicated them to the
author?
If expectations have been shared and these changes contradict them, you
can quickly close the PR, explain why it's not acceptable, and ask them
to redo it.
If you don't have clear guidelines on AI usage or haven't shared your
expectations, you'll need to review the PR more carefully. First,
verify whether your assumption that it's a simple service is accurate
(although from your description, it sounds like it is). If it is, talk
to the author and point out that it's more complicated than necessary.
You can also ask if they used AI and warn them about the complexities
it can introduce.
ZeroGravitas wrote 23 hours 13 min ago:
How you reject the first one of these, compared with the hundredth and
the millionth(!), is probably going to be an interesting development
over the next few years.
Personally, I've felt drained dealing with small PRs fixing actual bugs
by enthusiastic students new to projects in the pre-slop era.
Particularly if I felt they were doing it more to say they'd done it,
rather than to help the project.
I imagine that motive might help drive an increase in this kind of
thing.
ethin wrote 23 hours 18 min ago:
If it's obviously AI generated and is an absurdly long PR, I'd ask them
to extensively justify the complexity (especially if it does side
quest-isms like this example where the AI created a DSL and stuff: why
exactly is the DSL required?). If the project already implements the
feature, I'd ask that they remove the re-implemented parts and use what
already exists. If one of the dependencies of the project does this,
I'd ask that they update the PR to use those instead of wholesale
redoing it. If they respond, at all, with AI-generated responses
instead of doing it themselves, or their PR description is AI
generated, or it's blatantly obvious they used AI, I would immediately
mentally classify the PR as an ultra low effort/quality PR until proven
otherwise. Might seem harsh, but I prefer PRs from people who actually
both understand the project and what the PR is trying to do. I don't
mind if people use AI to assist in that understanding; I don't even
mind if they use AI to help write parts of the PR. But if I can tell
that it's AI generated (and completely re-implementing something that
the project either has already or is in the stdlib or a dep is a very
good sign of AI generated code in my experience), I'm far more inclined
to dismiss it out of hand.
ares623 wrote 23 hours 38 min ago:
Ask them if they reviewed the AI's output before opening the PR. If
they didn't, ask them to at least review it first rather than
having you do all the work. If they did, then is a 2nd review from you
really necessary? ;)
hsbauauvhabzb wrote 23 hours 49 min ago:
"Hey ChatGPT, reject this PR for me. Be extremely verbose about the
following topics:
- Large PRs
- vibe coding
- development quality"
wiseowise wrote 23 hours 26 min ago:
Finally, advice from a 10x AI engineer.
shinycode wrote 23 hours 50 min ago:
Don't read it, approve it.
lionkor wrote 23 hours 50 min ago:
Close them and report to your boss. If your boss doesn't care, look for
a new job. Once you have a new job, quit the old and cite that specific
case as the reason.
calini wrote 23 hours 51 min ago:
Vibe merge review it using Copilot or equivalent, and then close it :)
cryptonym wrote 23 hours 40 min ago:
Prompt: be over cautious on every code line, this is junior code and
they can learn a lot from this PR. Generate many comments on why it
shouldn't be merged as-is and make sure every corner case is covered.
Be super paranoid, mistakes in the code could hurt the company or
people.
If you are lucky, they will also vibe fix it.
renewiltord wrote 23 hours 55 min ago:
It's basic engineering principle: you do not do work amplification.
e.g. debouncing, request coalescing, back-pressure are all techniques
to prevent user from making server do lots of work in response to small
user effort.
As example, you have made summarization app. User is try to upload 1 TB
file. What you do? Reject request.
You have made summarization app. User is try upload 1 byte file 1000
times. What you do? Reject request.
However, this is for accidental or misconfigured user. What if you have
malicious user? There are many technique for this as well: hell-ban,
tarpit, limp.
For hell-ban simply do not handle request. It appear to be handled but
is not.
For tarpit, raise request maker difficulty. e.g. put Claude Code with
Github MCP on case, give broad instructions to be very specific and
request concise code and split etc. etc. then put subsequent PRs also
into CC with Github MCP.
For limp, provide comment slow using machine.
Assuming you're not working with such person. If working with such
person, email boss and request they be fired. For good of org, you must
kill the demon.
smsm42 wrote 1 day ago:
The only way such a PR can be reviewed is if it's accompanied with a
detailed PRD and tech design documents, and at least half of that LOC
count is tests. Even then it requires a lot of interactive work from
both sides. I have seen PRs a third or a quarter of this size that took
weeks to properly review and bring to production quality. Unless
there's something artificially inflating the size of it (like
auto-generated files or massive test fixtures, etc.) I wouldn't ever
commit to reviewing such a behemoth without a very, very good reason.
occz wrote 1 day ago:
Easy, you reject it.
mort96 wrote 1 day ago:
Close them.
O-stevns wrote 1 day ago:
That's a lot of code for a PR, though I should admit I have made PRs
half that size myself.
Personally I think it's difficult to address these kinds of PR's but I
also think that git is terrible at providing solutions to this problem.
The concept of stacked PRs is fine up to the point where you need to
make changes throughout all your branches; then it becomes a mess.
you (like me) might have a tendency to rewrite your solution several
times before ending up with the final result, then having to split this
into several PR's does not help anyone. The first PR will likely be
outdated the moment I begin working on the next.
Open source is also more difficult in this case because contrary to
working for a company with a schedule, deadlines etc... you can't (well
you shouldn't) rush a review when it's on your own time. As such, PRs
can sit for weeks or months without being addressed. When you
eventually need to reply to comments about how, why, etc., you have
forgotten most of it and need to read the code yourself to reclaim
the reasoning. At that point it might be easier to re-read a 9000-line
PR over time than to read 5-10 PRs with maybe-meaningful
descriptions and outcomes where the implementation changes every time.
Also, if it's from a new contributor, I wouldn't accept such a PR, vibe
coded or not.
ugh123 wrote 1 day ago:
Are there tests written? You could start by demanding that the tests pass
and demonstrate some kind of coverage metric.
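One hedged way to make that coverage demand concrete, assuming the project uses coverage.py: parse the TOTAL line of `coverage report` text output and gate on it. The sample report and the 80% bar below are invented for illustration:

```python
# Hypothetical sketch of a coverage gate for a PR check: extract the total
# percentage from coverage.py's text report and fail below a threshold.
import re

def total_coverage(report_text: str) -> int:
    """Return the total coverage percentage from coverage.py text output."""
    match = re.search(r"^TOTAL\s+.*?(\d+)%\s*$", report_text, re.MULTILINE)
    if match is None:
        raise ValueError("no TOTAL line found in report")
    return int(match.group(1))

# Invented sample output, shaped like `coverage report`:
sample = """\
Name            Stmts   Miss  Cover
-----------------------------------
service.py        120     30    75%
parser.py         400    200    50%
-----------------------------------
TOTAL             520    230    56%
"""

if __name__ == "__main__":
    pct = total_coverage(sample)
    print(f"total coverage: {pct}%")
    if pct < 80:  # assumed bar, not a number from the thread
        print("coverage gate failed")
```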
PeterStuer wrote 1 day ago:
Before reviewing, ask for a rationale and justification. It might be just
overcomplicated AI slop; it could also be that someone actually went
beyond the basics and really produced something next-level.
A simple email could tell the difference.
est wrote 1 day ago:
write another AI to hardcore review it and eventually reject it.
rhubarbtree wrote 1 day ago:
In our company, you would immediately reject the PR based on size.
There are a bunch of other quick bounce items it could also fail on, eg
documentation.
The PR would then be split into small ones up to 400 lines long.
In truth, such a big PR is an indicator that either (a) the original
code is a complete mess and needs reengineering or more likely (b) the
PR is vibe coded and is making lots of very poor engineering decisions
and goes in the bin.
We don't use AI agents for coding. They're not ready. Autocomplete
is fine. Agents don't reason like engineers, they make crap PRs.
ethin wrote 23 hours 9 min ago:
This! And vibe-coded solutions usually will implement stuff your
project already has in it, instead of reusing what already exists. Or
Claude will massively over-engineer something that could be collapsed
to 30 lines or something.
Cthulhu_ wrote 21 hours 0 min ago:
Can confirm, I asked it to write a unit test for a small component,
it wrote tons of code that parsed and validated some JSON
configuration (a theme file) instead of just the unit under test. I
just wrote it myself instead of trying to argue with it.
Maybe the AI was too clever for its own good? Have AI coding
assistants evolved from junior (naive but alright) to medior
(overly complicated and complete)?
LaFolle wrote 1 day ago:
There are good suggestions in the thread.
One suggestion possibly not covered yet: document clearly how
AI-generated PRs will be handled, make that policy easy for
contributors to discover, and when such a PR shows up, point to the
documented section to save yourself time.
exe34 wrote 1 day ago:
Simple: ask them to break it down into smaller pieces with a clear
explanation of what each does and why it's needed. Then set up an AI
to drag them through the dirt with pointless fixes. Or just close them
as won't-fix.
locknitpicker wrote 1 day ago:
> How would you go about reviewing a PR like this?
State the PR is too large to be reviewed, and ask the author to break
it down into self-contained units.
Also, ask which functional requirements the PR is addressing.
Ask for a PR walkthrough meeting to have the PR author explain in
detail to an audience what they did and what they hope to achieve.
Establish a max diff size for PRs to avoid this mess.
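A max-diff-size rule is easy to automate. A minimal sketch, assuming the CI step feeds it the output of `git diff --numstat`; the 400-line cap echoes the limit suggested elsewhere in the thread and is not a standard:

```python
# Hypothetical pre-merge gate that rejects oversized PRs.
# `git diff --numstat` emits one line per file: "<added>\t<deleted>\t<path>";
# binary files report "-" for both counts and are skipped here.
MAX_CHANGED_LINES = 400

def pr_too_large(numstat_output: str) -> bool:
    """Return True when total added + deleted lines exceed the cap."""
    total = 0
    for line in numstat_output.strip().splitlines():
        added, deleted, _path = line.split("\t", 2)
        for count in (added, deleted):
            if count != "-":
                total += int(count)
    return total > MAX_CHANGED_LINES

if __name__ == "__main__":
    sample = "350\t120\tsrc/parser.py\n40\t10\tREADME.md\n-\t-\tlogo.png\n"
    print(pr_too_large(sample))  # 520 changed lines -> True
```

Wired into CI, this turns "your PR is too big" from a reviewer judgment call into a neutral, documented rule.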
ojr wrote 1 day ago:
I would test whether the new features work and whether there are any
regressions around critical business functions, and merge it if my
manual tests pass.
Roark66 wrote 1 day ago:
Many people gave good tips, so let me answer in general.
As someone on the "senior" side, AI has been very helpful in speeding
up my work. I work with many languages and many projects I haven't
touched in months, and while my code is relatively simple, the
underlying architecture is rather complex. So where I do use AI, my
prompts are very detailed, and I often spot mistakes that then get
corrected. With this I still see a big speedup (at least 2x, often
more), and the quality is almost the same.
However, I noticed many "team leads" try to use the AI as an excuse to
push too difficult tasks onto "junior" people. The situation described
by the OP is what happens sometimes.
Then when I go to the person and ask about some weird thing they are
doing, I get "I don't know, Copilot told me"...
Many times I have tried to gently steer such AI users towards using it
as a learning tool: "Ask it to explain things you don't understand,"
"Ask why something is written this way," and so on. Not once have I
seen it used like this.
But this is not everyone. Some people have this skill which lets them
get a lot more out of pair programming and AI. I had a couple trainees
in my current team 2 years ago who were great at this. This was
"pre-AI" at this company, but when I was asked to help them, they
asked various questions, and 6 months later they were hired on a
permanent basis. Contrast this with:
- "So how should I change this code?"
- You give them a fragment, they put it in verbatim, and come back
via Teams with a screenshot of an error message...
Basically expecting you will do the task for them. Not a single
question. No increased ability to do it on their own.
This is how they try to use AI as well. And it's a huge time waster.
randomNumber7 wrote 22 hours 38 min ago:
You can lead a horse to water, but you can't make it drink.
Also, people with that mentality were a waste of time before AI
too.
cat_plus_plus wrote 1 day ago:
Vibe-review it with all the reasons it should not be merged, obviously.
tayo42 wrote 1 day ago:
You can't really review this. Rubber stamp it or reject it.
T_Potato wrote 1 day ago:
I have a tangent question: how do you deal with a team that spends days
nitpicking the implementation, engaging in double-speak, and saying
"I didn't actually expect you to implement this the way I said, I was
just saying it would be nice if it was like this, can you undo it"? I
spent 3 weeks on a code review because of the constant back and forth;
and I wish, oh I wish, they would allow PRs to be small, but the rule
is that the PR has to implement the full deliverable feature. And that
can mean 20 files to constantly change and change and change and
change. Oh, and then the "why did you use Lombok" question that comes
up even though the project uses Lombok, so you are stuck defending the
use of a library that is already used in the project, for no reason
other than to flatter the egos of the gatekeepers who say "yes, this
is good, but I want you to name this abc instead of ab before we
merge," when in context it doesn't add or remove any value, not even
clarity.
tjansen wrote 1 day ago:
Generally, my stance is that I add more value by doing whatever
ridiculous thing people ask me to change than by wasting my time
arguing about it. There are some obvious exceptions, like when the
suggestions don't work or make the codebase significantly worse. But
other than that, I do whatever people suggest, to save my time, their
time, and deliver faster. And often, once you're done with their
initial suggestions, people just approve.
This doesn't help all the time. There are those people who still keep
finding things they want you to change a week after they first
reviewed the code. I try to avoid including them in the code review.
The alternative is to talk to your manager about making some rules,
like giving reviewers only a day or two to review new code. It's easy
to argue for that because those late comments really hinder
productivity.
dbetteridge wrote 1 day ago:
Doesn't help you much I imagine, but the one time we had a dev like
this he was fired after multiple complaints to the team lead.
anarticle wrote 1 day ago:
No face, no case. They have to break it way down, just like at any org.
In fact, I would ask for more tests than usual, with a test plan/proof
they passed. 9k is a little spicy: separate PRs, or an ad hoc huddle
with them rubber-ducking you through the code. Depends on whether you
care about this that much or not.
Unless you really trust them, it's up to the contributor to make their
reasoning work for the target. Else, they are free to fork it if it's
open source :).
I am a believer in using LLM codegen as a ride-along expert, but it
definitely triggers my desire to over-test software. I treat most
codegen as if the most junior coder had written it, and set up
guardrails against as many things as the LLM and I can come up with.
sshine wrote 1 day ago:
Same standard as if they had made it themselves: a sequence of
logically ordered commits.
fathermarz wrote 1 day ago:
Let me ask a different question. A large refactor ended up in a
60K-line Python PR because the new lead didn't feel like merging it
until it was basically done. He would even ask other devs to merge
into his branch, and then we would merge later.
How does one handle that with tact and not lose their mind?
wiseowise wrote 23 hours 30 min ago:
You get a LeetCode subscription and start going through problem paths
for a company that can match or exceed your salary.
JonChesterfield wrote 1 day ago:
Refuse to merge into their branch. If you have serious test coverage
and the refactor doesn't change behaviour, it'll be fine.
If you don't have test coverage, or if the "refactor" also changes
behaviour, that project is probably dead. Make sure there's a copy of
the codebase from before the new lead joined so there's a
damage-mitigation rollback option available.
atoav wrote 1 day ago:
Tell them to give you a phone call and have them explain the code to
you : )
viccis wrote 1 day ago:
Open source? Close it and ask them to resubmit a smaller one and to
justify the complexity of things like a DSL if they want it included.
For work? Close it and remind them that their AI velocity doesn't save
the company time if it takes me many hours (or even days depending on
the complexity of the 9k lines) to review something intended to be
merged into an important service. Ask them to resubmit a smaller one
and justify the complexity of things like a DSL if they wanted it
included. If my boss forces me to review it, then I do so and start
quietly applying for new jobs where my job isn't to spend 10x (or 100x)
more time reviewing code than my coworkers did "writing" it.
oarsinsync wrote 1 day ago:
> If my boss forces me to review it, then I do so and start quietly
applying for new jobs where my job isn't to spend 10x (or 100x) more
time reviewing code than my coworkers did "writing" it.
Another equally correct approach (given the circumstances of the
organisation) is to get a different AISlopBot to do the review for
you, so that you spend as much time reviewing as the person who
submitted the PR did coding.
khannn wrote 19 hours 23 min ago:
Makes me want to write my own AI bot that brutally tears into any
PR so I can reject it.
adastra22 wrote 23 hours 16 min ago:
Why waste anyone's bandwidth on this? As maintainer of some open
source projects, there are no circumstances in which I would accept
a 9kLOC drive-by contribution like this. State so and close it.
oarsinsync wrote 22 hours 51 min ago:
> Why waste anyone's bandwidth on this?
The conditional was: If my boss forces me to review it
> As maintainer of some open source projects, there are no
circumstances in which...
...you would force yourself to do anything that you don't want to
do. Your approach is absolutely correct for the organisational
circumstances in which this might happen to you.
There are other organisational circumstances where being the
squeaky wheel, even when it's the right thing to do for the
business, will be the wrong thing for you personally. It's
valuable to identify when you're standing in front of a
steamroller, and get out of the way.
adastra22 wrote 16 hours 10 min ago:
Have a backbone. I would seriously quit on the spot if asked to break
my professional integrity with respect to open source development. I
have been in this situation before too, so I'm not just saying it.
harimau777 wrote 5 hours 41 min ago:
In this job market that's not an option for the majority of
people.
charlieyu1 wrote 16 hours 22 min ago:
Boss forced me? Good. I'll take a look at the first 100-200 lines,
find 3-5 critical or deadly errors, document them clearly, and write
to the boss about how this vibe-coding shit is wasting so much of my
time.
rob74 wrote 20 hours 22 min ago:
Ok, but then I would also prefer OP's approach: if things reach
this level of dysfunction, you should really consider looking
for another job...
ffsm8 wrote 1 day ago:
That only works if you're not personally responsible for the code
you review, too.
Hamuko wrote 20 hours 36 min ago:
If they're okay with vibe-coded code, they should be fine with
vibe-coded reviews too. You really only should be in a situation
where you have more responsibility over your reviews than other
people have for their code if you're in charge, and if you're in
charge, just ban the practice.
stevula wrote 10 hours 14 min ago:
The problem is other people/teams making PRs to your code that
you then have to maintain or fix later. It's in your interest
not to half-ass the review, creating an asymmetric amount of
work for you vs them.
throwup238 wrote 23 hours 54 min ago:
Just don't give the AI agent an "approve_pr" tool. It can
only comment or reject.
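A sketch of what that restriction could look like; the tool names and the dispatcher are invented for illustration, not any real agent framework's API:

```python
# Hypothetical allowlist: the review agent may comment or request changes,
# but there is deliberately no "approve_pr" entry, so it can never approve.
ALLOWED_REVIEW_TOOLS = {"post_comment", "request_changes"}

def dispatch(tool_name: str, payload: dict) -> str:
    """Run a tool on behalf of the agent, refusing anything off the list."""
    if tool_name not in ALLOWED_REVIEW_TOOLS:
        raise PermissionError(f"tool {tool_name!r} is not available to review agents")
    # A real dispatcher would call the underlying review API here.
    return f"ran {tool_name}"

print(dispatch("post_comment", {"body": "nit: naming"}))  # ran post_comment
# dispatch("approve_pr", {}) would raise PermissionError
```

Enforcing this at the dispatcher keeps approval a human-only action no matter what the model asks for.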
kortilla wrote 23 hours 33 min ago:
But then what? At the end it's still on you to approve, and
you have no idea what is hiding in the code.
jjcob wrote 23 hours 22 min ago:
You don't approve it. You just slowly grind the submitter
down with minor feedback. At some point they lose interest
and after a year you can close the PR, or ask the submitter
to open a new PR.
adastra22 wrote 23 hours 16 min ago:
I hope you don't actually do this to people.
estimator7292 wrote 18 hours 38 min ago:
Why would it be acceptable for the submitter to behave
this way and not the reviewer? We do have AI "assisted"
submitters behaving exactly like this and acting irate
when forced to actually reflect on the turd they're
trying to shove into my inbox.
jjcob wrote 23 hours 1 min ago:
It works best if you don't reply immediately. I recommend
successively increasing the response delay. Keep it short
enough to make sure that they don't start bugging you on
other channels, but long enough to make sure they have
time to cool down and question if the continued effort is
really worth it.
As long as the response delay increases at least
geometrically, there is a finite bound to the amount of
work required to deal with a pull request that you will
never merge.
adastra22 wrote 16 hours 22 min ago:
I am both an open source maintainer and contributor.
This is absolutely despicable behavior. You are
purposefully wasting the time of a contributor for no
other reason than your own fear of saying "no."
If you're not going to merge something, just fucking
say so.
thunderfork wrote 14 hours 45 min ago:
If you've read the thread, the strategy you're
replying to is about a workplace scenario where
outright rejection is, for whatever reason,
forbidden; not an open source situation where "no" is
readily available.
adastra22 wrote 4 hours 25 min ago:
It makes even less sense in a work context either.
This behavior will permanently alienate this user &
potential customer. Iâve seen this exact scenario
play out many times before.
oarsinsync wrote 22 hours 47 min ago:
Tragically, when you are organisationally impaired from
saying 'no', this is the only way (besides, you know,
quitting and getting a new job).
It's absolutely soul crushing when you're motivated to
do a good job, but have a few colleagues around you
that have differing priorities, and aren't empowered to
do the right thing, even when management agrees with
you.
lukan wrote 23 hours 2 min ago:
If people do this to him? How else to react?
The context here is lots of vibe coded garbage thrown at
the reviewer.
adastra22 wrote 16 hours 20 min ago:
It takes less time and effort to close with an
explanation why. It is going out of your way to waste
the time of a contributor.
lukan wrote 16 hours 14 min ago:
Context here is a corporate scenario where just
closing is not possible.
krackers wrote 1 day ago:
> then I do so and start quietly applying
In this job market? And where pretty much every company seems to be
following the top-down push for AI-driven "velocity"?
zwnow wrote 1 day ago:
The market only sucks for devs who lack experience or have an
oversaturated skillset. If you only know React and Python, I'm
sorry, but there are like 20 million devs just like you, so the one
that's willing to work for the smallest coin is going to win.
viccis wrote 1 day ago:
That's why I would start applying instead of just quitting. There
are plenty of companies that use AI responsibly or not much at all.
xeonmc wrote 1 day ago:
This is why we need a programmer union, so that coders can
collectively reject reverse-centaur slopwork, like miners
rejecting asbestos mines or workers refusing to fix dangerous
machines while they're running.
SecondHandTofu wrote 22 hours 0 min ago:
Broader political arguments about the other effects of unions
aside - I've never heard a good answer for why unions are good
for workers in professions with wide ranges of skill and
impact, such as lots of types of knowledge work. Do you have an
answer for that?
Roles that are more fungible, train drivers, factory workers, I
can see the case from the worker's perspective, even if I think
there are externalities.
But I can't even see it from a worker's perspective in roles
such as software or sales, why would anyone good want to work
in an environment where much worse workers are protected,
compensation is more levelised etc?
I'm assuming this will boil down to some unspoken values
differences but still thought I'd ask.
wizzwizz4 wrote 19 hours 34 min ago:
Look into SAG-AFTRA.
jon-wood wrote 19 hours 53 min ago:
A union does whatever its members want the union to do. I'd
argue that an environment where pay negotiation is a case of
every person for themselves isn't actually good for anyone
but if the majority of members disagree with me then the
union won't get involved in pay. If they wanted to they could
scope the union's responsibility purely to being notified of
budget reductions/redundancies and given a seat at the table
when working out how to handle them.
eru wrote 6 hours 5 min ago:
> A union does whatever its members want the union to do.
Just like a democracy does whatever its voters want it to
do?..
Different people want different things.
> I'd argue that an environment where pay negotiation is a
case of every person for themselves isn't actually good for
anyone but if the majority of members disagree with me then
the union won't get involved in pay.
Well, I feel for the minority that doesn't want the union
to get involved in their affairs.
bluGill wrote 17 hours 18 min ago:
A union works best when workers see they are all in it
together. There are lots of unions, but it is much harder
for them to be powerful when members see defecting as
helping them. There is a reason unions are most common in
labor areas where everyone is the same. You can't be a
better bus driver than someone else (either you are bad
enough to fire or you are as good as everyone else). The
assembly line is as good as the worst/slowest person on it,
so there is no advantage in being faster at putting bolts
in, or whatever you do (unions can sometimes push safety
standards, but those also come from others, with the union
taking the credit).
eru wrote 6 hours 5 min ago:
> The assembly line is as good as the worst/slowest
person on it, so there is no advantage in being faster at
putting bolts in, or whatever you do [...]
I guess you have no experience with assembly lines?
> (unions can sometimes push safety standards, but also
comes from others who have the union take credit)
Btw, health and safety are what economists call a 'normal
good'. See [1] > In economics, a normal good is a type
of a good for which consumers increase their demand due
to an increase in income, unlike inferior goods, for
which the opposite is observed. When there is an increase
in a person's income, for example due to a wage rise, a
good for which the demand rises due to the wage increase,
is referred to as a normal good. Conversely, the demand for
normal goods declines when the income decreases, for
example due to a wage decrease or layoffs.
That explains fairly well, why rich countries all have
more-or-less similar health and safety standards despite
very different histories and especially histories of
labour activism, and why poor countries fare worse in
this respect--even if some of them have laws on the books
that are just as strict.
HTML [1]: https://en.wikipedia.org/wiki/Normal_good
bravetraveler wrote 19 hours 57 min ago:
Not a developer, but close enough: so that 'good' stays
'good' and doesn't become 'expected'. Or, said another way, I
can enjoy protections too. Automation allows us to do more,
actually doing more isn't necessary: remember the tools/why
they were made. Yet expectations continue to ride an
escalator.
I don't know why one would want to maintain a system of 'look
how high I can still jump after all these years, reward
please'. Again, expectations: they rise faster than the
rewards.
The adversarial framing with coworkers is confusing,
discipline is a different matter from collective bargaining.
sfn42 wrote 19 hours 58 min ago:
> why would anyone good want to work in an environment where
much worse workers are protected
The "much worse workers" are the majority. That's why you see
everyone complaining about technical interviews and such -
those of us who crush the interviews and get the jobs don't
mind.
kaoD wrote 17 hours 54 min ago:
How old are you? I'm in your boat but I suspect we'll
change our tune when we get older.
sfn42 wrote 16 hours 20 min ago:
My 40s aren't too far off. I don't expect to lose that
much of my ability
kaoD wrote 15 hours 38 min ago:
Yeah I'm not worried about my ability, but the
perceived value from employers. We're probably in the
sweet spot where we're still "young" but also very
experienced.
sfn42 wrote 15 hours 28 min ago:
That would be quite ridiculous in my opinion. Most of
my peers hardly stay in one job for more than 2-3
years anyway, so unless you're retiring in the next
two years I don't see why they would have a problem
with it.
Of course I live in a country where retirement
savings isn't your employer's responsibility. I think
the US has some ridiculous retirement practices that
may make older employees a bit of a hot potato?
yunwal wrote 18 hours 32 min ago:
I'm quite good at technical interviews, and I still think
they're not a good way to find the best person for the
job in 95% of the places they're used.
sfn42 wrote 16 hours 22 min ago:
I'm not really commenting on that, I'm saying the
practice is good for me as an interviewee.
However I do think it's a good way to filter candidates.
I should clarify that what I'm talking about is fairly
basic programming tasks, not very hard leet code style
DSA type tasks. I've never been given an actually hard
task in an interview, they've all been fairly simple
tasks like write a bracket tax calculator, write a class
that stores car objects and can get them by plate number
and stuff like that. I helped a friend do a take-home one
where we fetched some data from SpaceX's API and
displayed it in an HTML table.
Every time I do these, people act like I'm Jesus for
solving a relatively simple task. Meanwhile I'm just
shocked that this is something my peers struggle with. I
would have honestly expected any decent dev to be able to
do these with roughly the same proficiency as myself, but
it turns out almost nobody can.
That's why I think it's a good way to test candidates. If
you're going to work as a programmer you should be able
to solve these types of tasks. I don't care if you're
frontend, backend, finance, healthcare, data science,
whatever kind of programming you normally do, you should
be able to do these kinds of things.
If someone can't then by my judgement they don't really
know programming. They may have figured out some way to
get things done anyway but I bet the quality of their
work reflects their lack of understanding. I've seen a
lot of code written by this kind of person, and it's very
clear that a lot of developers really don't understand
the code they're writing. It's honestly shocking how bad
most "professional software developers" are at writing
simple code.
wongarsu wrote 21 hours 4 min ago:
In theory you could limit the scope of the union to not
include things like negotiating salary or defending workers
from being fired. I don't think anything prevents you from
having a union that just fights for basic rights like good
chairs, not having to review AI slop and not being exposed to
asbestos.
Of course, keeping the union narrowly focused is an issue.
Unions are a democracy, after all.
eru wrote 6 hours 3 min ago:
> Of course keeping the union narrowly focused is an issue.
Unions are a democracy after all
Yep, and I don't want my neighbours to vote on the colour
of my underwear or what I have for breakfast either. They
can mind their business, and I can mind mine.
rTX5CMRXIfFG wrote 23 hours 1 min ago:
No. Programmer unions are going to shrink the economy and make
the current job market a permanent trajectory instead of a
cyclical one.
I can't think of why the idea of unions is gaining popularity
in some programmer circles, other than that its advocates
simply don't have economic common sense.
gregoryl wrote 19 hours 18 min ago:
> Programmer unions are going to shrink the economy and make
the current job market a permanent trajectory instead of a
cyclical one
How?
eru wrote 23 hours 3 min ago:
As long as you don't pretend to talk for people who don't want
to be talked for, go ahead and knock yourself out.
immibis wrote 23 hours 37 min ago:
Are AI slop reviews threatening to your life?
charlieyu1 wrote 16 hours 19 min ago:
Yes, the code is so bad if I let them pass it could put the
entire company out of business
zwnow wrote 23 hours 10 min ago:
AI generated code is threatening the whole tech industry
while also threatening to hurt tons of users, because people
that have no business in building and deploying apps suddenly
feel like they can. That Tea app was a good example for that,
endangering thousands of women by leaking private
conversations and address data.
If AI slop infiltrates projects enterprises are built upon,
it's likely companies and their customers are metaphorically
hurt too, because of a spike in outages etc. (which already
happens, given AWS got like 7000 outage reports after getting
rid of another 14000 employees).
Yes, AI can be cool, but can we stop being this blind
regarding its limitations, use cases, how it's actually used,
how it actually benefits humanity, and so on? Like, give me a
valid reason for Sora existing (except for monetizing the
attention spans of humans, which I consider highly unethical).
kanwisher wrote 22 hours 49 min ago:
Funny that the app that was made to destroy other people's lives
with anonymous tips that could be fake ended up hurting the real
perpetrators. Almost like it was karma.
bmn__ wrote 17 hours 18 min ago:
Reply intended to user zwnow who is banned by HN, so I
cannot reply directly.
You confuse intent with reality. The social software
under discussion was abused immediately for the criminal
purpose of spreading falsehoods about men, both with
malicious intent and wilful negligence, which is
particularly egregious because the victims were not made
aware of the slander. Even if they wanted to defend
themselves, they were prevented from doing so because of
the institutionalised sexism, men are banned from
participating on grounds of their sex alone. The proof
for this is in the leaks. You failed to take this into
account and hence got downvoted into oblivion, not for
the reason you claim.
The other facts you write about are part of a different
narrative, they are not directly relevant to kanwisher's
proposition.
IMO, we should not have any tolerance for platforms that
are designed for gossip, because of the boy-who-cried-wolf
backlash: it means that if a woman is a genuine victim,
people will take the priors into account and most will
assume she's a liar too, and this lets the perpetrators
off the hook. I do not want to live in such
a society. The way out of this is holding women
accountable, they should be punished for criminal
behaviour with immediate and drastic consequences, and
tenfold so for their enablers. The problem would stop
overnight.
zwnow wrote 16 hours 1 min ago:
Hold women accountable for men being a general threat
to them? Sure, it's their fault \s
bmn__ wrote 13 hours 52 min ago:
That's not what I wrote. You know that, I know that
you know, and you know that I know.
If you can't have a conversation with a modicum of
respect, then GTFO HN. We don't need pages filled
with pretence and stupid arguments that go nowhere
and change no one's mind.
zwnow wrote 11 hours 0 min ago:
Acting like HN was a good platform... Also you
argued like an incel so its hard to take you
serious
immibis wrote 23 hours 2 min ago:
Okay, can you avoid comparing a company going bankrupt
because of a bad bet on AI, to a person getting mangled and
crushed into a cube inside of an industrial machine?
zwnow wrote 22 hours 58 min ago:
Where did I compare that? Getting hurt has variations.
Your privacy can be hurt, your physique can be hurt, your
emotions can be hurt.
nextlevelwizard wrote 1 day ago:
When you are applying from a job you are more desirable and you
aren't desperate so you can take your pick. If your current job is
bad then you can't really lose much.
Otherwise you need to be the person at the company who cuts through
the bullshit and saves it when the VibeCodeTechDebt bubble pops
across the industry.
ako wrote 1 day ago:
AI code generators are getting better fast; in the near future they
will be able to produce good changes faster than you can review them.
How will you deal with it then? Most vibe-coding tools can also
produce smaller PRs, but then you have to deal with 250+ PRs in one
week. Is that more manageable? My guess is we need new tools that take
the human out of the loop: more automated reviews, tests, etc.
ako wrote 1 day ago:
Instead of downvotes, I would appreciate some insightful comments on
this, as I'm currently struggling with this problem. In the last week
I've vibe-coded (vibe-engineered?) a TypeScript project with 230+
commits, 64 TypeScript files, and 27k+ lines of code. Too much to
actually read. Validation is mostly through testing: automated tests
and architecture reviews (generated Mermaid diagrams). I'm mostly
reviewing the code structure and architecture, the libraries it uses,
etc. It has 600+ unit and integration tests, but even reviewing those
is too much...
Pannoniae wrote 18 hours 33 min ago:
Yeah, you aren't wrong... I predict two things will happen as a
result.
1. A more biological approach to programming - instead of reviewing
every line of code in a self-contained way, the system would be
viewed more holistically, observing its behaviour and testing
whether it works for the inputs you care about. If it does, great,
ship it; if not, fix it. This includes a greater openness to just
throwing it away or massively rewriting it instead of tinkering
with it. The "small, self-contained PRs" culture worked well when
coding was harder and humans needed to retain knowledge about all
of the details. This leads to the next point, which is
2. Smaller teams and less fungibility-oriented practices. Most
software engineering practices are basically centred around raising
the bus factor, speeding up onboarding, and decreasing the
volatility in programmers' practices. With LLM-assisted
programming, this changes quite a bit, a smaller, more skilled team
can more easily match the output of a larger, more sluggish one,
due to the reduced communication overhead and being able to skip
all the practices which slow the development velocity down in
favour of doing things.
A day ago, the good old Arthur Whitney-style C programming was
posted to this site ( [1] ) and most commenters were horrified.
Yes, it's definitely a mouthful on first read but this style of
programming does have value - it's easier to overview, easier to
modify than a 10KLOC interpreter spanning 150 separate files, and
it's also quite token-efficient too. Personally, I'd add some
comments but I see why this style is this way.
Same with style guides and whatnot - the value of having a code
style guide (beyond basic stuff like whitespace formatting or
wordwrapping on 160) drastically drops when you do not have to ask
people to maintain the same part for years. You see this discussion
playing out, "my code formatter destroyed my code and it made much
more unreadable" - "don't despair, it was for the greater good for
the sake of codebase consistency!". Again, way less of a concern
when you can just tell an LLM to reformat/rename/add comments if
you want it.
I'd definitely say that getting the architecture right is way more
important, and let the details play out in an organic way, unless
you're talking about safety-critical software. LLM-written code is
"eventually correct", and that is a huge paradigm shift from "I
write code and I expect the computer to do what I have written".
HTML [1]: https://news.ycombinator.com/item?id=45800777
shinycode wrote 23 hours 42 min ago:
Our problem is not coding. Our problem is knowledge. If no one
reads the code, no one knows how it works, and that's what the
company wants because "we need to ship fast," then the company
doesn't understand what software is all about.
Code is a language; we write stories that make a lot of sense and
have consequences. If the company does not care that humans need
to know and decide the details of the story and how it's written,
then let it accept the consequences of a statistically generated
story with no human supervision. Let it trust the statistics when
there's a bug and no one knows how the code works, because no one
read it and no one is there anymore to debug it.
We'll see in the end whether it's cheaper to let the code be
written and understood only by statistical algorithms.
Otherwise, just work differently instead of generating thousands
of LOC; it's your responsibility to review and understand it no
matter how long it takes.
smsm42 wrote 23 hours 52 min ago:
> In the last week i've vibe-code (vibe-engineered?) a typescript
project with 230+ commits, 64 typescripts files, with 27k+ lines of
code. Too much to actually read.
Congratulations, you discovered that generating code is only part
of the software development process. If you don't understand what
the code is actually doing, good luck maintaining it. If it's never
reviewed, how do you know these tests even test anything? Because
they say "test passed"? I can write you a script that prints "test
passed" a billion times - would you believe it is a billion unit
tests? If you didn't review them, you don't have tests. You have a
pile of code that looks like tests. And "it takes too long to
review" is not an excuse - it's like saying "it's too hard to make
a car, so I just took a cardboard box, wrote FERRARI on it, and sat
inside it making car noises". Fine, but it's not a car. If it's not
properly verified, what you have is not tests; it's just
pretending.
ako wrote 20 hours 22 min ago:
I'm well aware, thank you, have been coding for 40+ years
(including 6502 and 68000 assembly), masters in computer science,
have built healthcare software where bugs can lead to death. But
with LLMs enabling us to generate source code faster, our review
process is becoming an increasingly larger bottleneck for
productivity. We need to start thinking how we can scale this
process.
smsm42 wrote 14 hours 22 min ago:
It's as much a bottleneck for productivity as cars being made of
metal are a bottleneck for speed. Sure, you can make a paper car.
It probably would be faster. Until you collide with something
and then you discover why the metal frame was a good idea. If
you generate code that you can not verify or test, sure, it's
faster. Until something goes wrong.
raincole wrote 1 day ago:
You ask questions. Literally anything, like asking them why they
believe this feature is needed, what their code does, why they made a
DSL parser, etc.
The question itself doesn't matter. Just ask something. If their answer
is genuine and makes sense, you deal with it like a normal PR. If their
answer is LLM-generated too, then block.
CamperBob2 wrote 1 day ago:
Please review this PR. Look carefully for bugs, security issues, and
logical conflicts with existing code. Report 'Pass' if the PR is of
sufficient quality or 'Fail' if you find any serious issues. In the
latter case, generate a detailed report to pass along to the submitter.
(ctrl-v)
aryehof wrote 1 day ago:
This is effectively a product, not a feature (or bug). Ask the
submitter how you can determine whether this meets functional and
non-functional requirements, to start with.
wheelerwj wrote 1 day ago:
The same way you review a non-vibe-coded PR. What's that got to do with
anything? A shit PR is a shit PR.
andreygrehov wrote 1 day ago:
That 10+ year old joke never gets old:
10 lines of code = 10 issues.
500 lines of code = "looks fine."
Code reviews.
whynotmaybe wrote 18 hours 45 min ago:
I recently reached another milestone.
+153675, -87954: I don't care. Just taking the time to read it would
take longer than fixing the related bugs.
ninetyninenine wrote 1 day ago:
You vibe review it. I'm actually only half kidding here.
userbinator wrote 1 day ago:
If it's full of the typical vibe-coded nonsense that's easy to spot
upon a quick-but-close inspection (unused functions, dead-end variables
and paths that don't make sense, excessively verbose and inaccurate
comments, etc.), I would immediately reject.
EagnaIonat wrote 1 day ago:
Everyone talking about having them break it down into smaller chunk.
Vibe coding there is a near guarantee the person doesn't know what the
code does either.
That alone should be reason enough to block it. But LLM-generated code
is not protected by law, and by extension you can damage your code
base. My company does not allow LLM-generated code into anything that
is its IP. Generic stuff outside of IP is fine, but every piece has to
be flagged as created by an LLM.
In short, these are just the next evolution of low quality PRs.
jeroenhd wrote 22 hours 11 min ago:
> Vibe coding there is a near guarantee the person doesn't know what
the code does either.
Having spent some time vibe coding over the weekend to try it out, I
disagree. I understand every line of code in the super-specific Android
app I generated, even if I don't have the Android dev experience
to come up with the code off the top of my head. Laziness is as good
a reason to vibe code as inexperience or incompetence.
I wouldn't throw LLM code at a project like this, though, especially
not in a PR of this size.
smsm42 wrote 1 day ago:
> Vibe coding there is a near guarantee the person doesn't know what
the code does either.
Accepting code into the project when only one person (the author)
knows what it does is a very bad idea. That's why reviews exist.
Accepting code that zero persons know what it does is sheer screaming
insanity.
Cthulhu_ wrote 21 hours 4 min ago:
Unless it's not important. I think vibe coding is fine for
self-hosted weekend projects / hackathons / POCs and only if
there's no intersection with legal stuff (like PII or payment
processing).
But for any open source or enterprise project? Hell no.
smsm42 wrote 14 hours 19 min ago:
If you don't ever need to know or maintain the code, sure, it's
not your code, you don't own it, in fact the code is disposable.
For something like POC, where I don't care how it's done, I just
want to see if it can be done - I've done it myself. Then if real
code is needed, you throw the disposable one out or rewrite it
completely. That's fine. But if it's a long term project,
somebody needs to own it.
exe34 wrote 1 day ago:
> Everyone talking about having them break it down into smaller
chunk. Vibe coding there is a near guarantee the person doesn't know
what the code does either.
that's the point though, if they can't do it, then you close the
ticket and tell them to fork off.
EagnaIonat wrote 1 day ago:
I agree, but you are potentially opening yourself up to 20+ PRs
which are all vibe coded.
exe34 wrote 15 hours 36 min ago:
You can read one or two and decide if it's worth going through.
Otherwise you can just reject them.
scott_w wrote 22 hours 7 min ago:
Copy and paste is your friend here. If there's 20+ huge PRs, just
paste "This PR is far too large to review, please break it down
and submit smaller PRs and engage with us ahead of time to
understand how to solve this problem."
Comment & Close PR, only engage in discussions on tickets or
smaller, understandable PRs.
As others have said: if someone drive-by opens a huge PR, it's as
likely to be malware as a beneficial implementation.
throwaway290 wrote 1 day ago:
Don't accept this PR. If it's bot generated you are not here to review
it. They can find a bot to review bot generated requests.
abhimanyue1998 wrote 1 day ago:
vibe review it with AI then run it on vibe production support. simple.
dbgrman wrote 1 day ago:
TBH, depends on what is being reviewed. Is it a prototype that might
not see light of day and is only for proof-of-concept? Did an RFC doc
precede it and reviewers are already familiar with the project? Were
the authors expecting this PR? Was there a conversation before the PR
was sent out? Was there any effort to have a conversation after the PR
was shared? Was this even meant to be merged into main?
I'll just assume good intent first of all. Second, 9000 LOC spanning 63
files is not necessarily AI-generated code. It could be a code mod.
It could be a prolific coder. It could be a lot of codegen'd code.
Finally, the fact that someone is sending you 9000 LOC hints that
they find this OK, and this is an opportunity to align on your values.
If you find it hard to review, tell them: I find it hard to review,
I can't follow the narrative, it's too risky, etc.
Code review is almost ALWAYS an opportunity to have a conversation.
dosinga wrote 1 day ago:
Ideally you have a document in place saying this is how we handle vibe
coding, something like: if you have the AI write the first version, it
is your responsibility to make it reviewable.
Then you can say (and this is hard): this looks like vibe code and
misses that first human pass we want to see in these situations (link),
please review it and afterwards feel free to (re)submit.
In my experience they'll go away. Or they come back with something that
isn't cleaned up and you point out just one thing. Or sometimes! they
actually come back with the right thing.
bmitc wrote 1 day ago:
Reject it and request that the author make it smaller.
PRs should be under 1000 lines.
The alternative is to sit down with them and ask what they're trying to
accomplish and solve the problem from that angle.
throwawayffffas wrote 1 day ago:
> How would you go about reviewing a PR like this?
Depends on the context. Is this from:
1. A colleague in your workplace. You go "Hey ____, That's kind of a
big PR, I am not sure I can review this in a reasonable time frame can
you split it up to more manageable pieces? PS: Do we really need a DSL
for this?"
2. A new contributor to your open source project. You go "Hey ____,
Thanks for your interest in helping us develop X. Unfortunately we
don't have the resources to go over such a large PR. If you are still
interested in helping please consider taking a swing at one of our
existing issues that can be found here."
3. A contributor you already know. You go "Hey I can't review this ___,
its just too long. Can we break it up to smaller parts?"
Regardless of the situation be honest, and point out you just can't
review that long a PR.
ljm wrote 19 hours 17 min ago:
If it's the first one I'd be going a step further back to see how
the work was defined. More often than not I'd expect the PR comes
from a ticket that is too broad in scope and could have been broken
down with a bit of architectural thinking.
The problem being that once someone has put together a PR, it's
often too late to go back to the serious thinking step and you end up
having to massage the solution into something workable.
MartijnHols wrote 23 hours 42 min ago:
Telling a new contributor no thank you is hard. Open source
contributors are hard to come by, and so I've always dealt with PRs
like this (albeit before AI days, but from people who had never
written a line of code before their PR) by leaving a message that
it's a huge PR so it's going to take a while to review, and a
request to make smaller PRs in the future. A couple of times I ended
up leaving over a hundred review comments, but most times they were
all fixed and the contributor stuck around with many better PRs
later.
latexr wrote 19 hours 38 min ago:
The vast majority of PRs are bad. They could even be described as
"selfish" in the sense that the "contributor" is
haphazardly making whatever change minimally fixes their exact use
case without consideration for the project's style, health,
usability, or other users. This isn't outright malicious or even
deliberately inconsiderate, but it still has a negative effect.
Refusing such a PR (which, again, is most of them) is easy. But it
is also time consuming if you don't want to be rude. Everything
you point out as inadequate is a chance for them to rebut or
"fix" in a way which is again unsatisfactory, which only leads
to more frustration and wasted time. The solution is to be specific
about the project's goals but vague about the code. Explain why
you feel the change doesn't align with what you want for the
project, but don't critique specific lines.
There are, of course, exceptions. Even when I refuse a PR, if
it's clear it was from a novice with good intentions and making
an effort to learn, I'll still explain the issues at length so
they can improve. If it's someone who obviously used an LLM,
didn't understand anything about what they did and called it a
day, I'll still be polite in my rejection but I'll also block
them.
Ginger Bill (creator of Odin) talked about PRs on a podcast a while
back and I found myself agreeing in full.
HTML [1]: https://www.youtube.com/watch?v=0mbrLxAT_QI&t=3359s
throwawayffffas wrote 21 hours 32 min ago:
> Telling a new contributor no thank you is hard.
In life in general having the wherewithal to say no is a
superpower. While I appreciate the concern about alienating
newcomers, you don't start contributing to an existing project by
adding 9k lines of the features you care about. I have not run any
open source projects that accept external contributions, but my
understanding in general is that you need to demonstrate that you
will stick around before being trusted with just adding large
features. All code is technical debt; you can't just take on every
drive-by pull request in hopes they will come back to fix it when
it breaks a year down the line.
eru wrote 23 hours 1 min ago:
Git is flexible enough that you can tell people to break up their
PR. They don't have to redo all their work.
If you want to be really nice, you can even give them help in
breaking up their PR.
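One way to help with that split (a hedged sketch; the repo, branch, and file names below are invented for the demo, so adapt them to the real PR): pull only the relevant files' final state from the oversized branch onto a fresh branch, one focused PR at a time, so none of the work has to be redone.

```shell
# Demo in a throwaway repo (git init -b needs git 2.28+).
set -e
cd "$(mktemp -d)" && git init -q -b main demo && cd demo
git config user.email dev@example.com && git config user.name dev
echo base > app.txt && git add . && git commit -qm "initial commit"

# The oversized branch touches several unrelated areas at once.
git checkout -qb big-vibe-branch
echo parser  > parser.txt
echo runtime > runtime.txt
git add . && git commit -qm "9k-line megacommit"

# Extract just the parser work onto its own branch for a focused PR.
git checkout -q main && git checkout -qb parser-only
git checkout big-vibe-branch -- parser.txt   # take only this file's final state
git commit -qm "Add parser, extracted from the big branch"

git show --stat --oneline HEAD   # parser-only now contains only parser.txt
```

For commits that already fall on clean boundaries, `git cherry-pick <sha>` onto the new branch works instead of the file-level `git checkout`.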
Cthulhu_ wrote 21 hours 59 min ago:
Yeah exactly, the OP describes a completely new service built
start to finish all in one merge request, where normally you'd
start with a proposal and work from there.
eru wrote 6 hours 7 min ago:
You can even create the proposal retroactively from the PR, if
you already have the PR.
johnnyanmac wrote 1 day ago:
excuse me, 9000? If that isn't mostly codegen, including some new
plugin/API, or a fresh repository I'd reject it outright. LLM's or not.
In my eyes, there really shouldn't be more than 2-3 "full" files worth
of LOC for any given PR (which should aim to address 1 task/bug
each; if not, 2-3 at most), and general wisdom is to aim to keep
"full" files around 600 LOC each (for legacy code this is obviously
very flexible, if not infeasible, but it's a nice ideal to keep in
mind).
An 1800-2000 LOC PR is already pushing what I'd want to review, but
I've reviewed a few like that when laying scaffolding for a new
feature. Most PR's are usually a few dozen lines in 4-5 files each, so
it's far below that.
9000 just raises so many red flags. Do they know what problem they are
solving? Can they explain their solution approach? Give a general
architectural structure for their implementation? And all that is
before asking the actual PR concerns of performance, halo effects,
stakeholders, etc.
MikeNotThePope wrote 1 day ago:
How about this?
"This PR is really long and I'm having a hard time finding the
energy to review it all. My brain gets full before I get to the end.
Does it need to be this long?"
Force them to make a case for it. Then see how they respond. I'd
say good answers could include:
- "I really tried to make it smaller, but I couldn't think of a
way; here's why…"
- "Now that I think about it, 95% of this code could be pushed into a
separate library."
- "To be honest, I vibe coded this and I don't understand all of
it. When I try to make it smaller, I can't find a way. Can we go
through it together?"
jeremyjh wrote 1 day ago:
I'd just close it without comment. Or maybe if I'm feeling really
generous I'll make a FAQ.md that gives a list of reasons why we'll
close PRs without review or comment and link that in the close
comments. I don't owe anyone any time on my open source projects. That
said, I haven't had this issue yet.
tracerbulletx wrote 1 day ago:
That's fine for an open source project, but many many companies are
mandating AI use, they're putting it in performance reviews, they're
buying massive Cursor subscriptions. You'd be cast as an
obstructionist to AI's god-like velocity™.
jeremyjh wrote 19 hours 24 min ago:
Well in my case I'd just fire them if they opened a 9,000 LOC PR
that they didn't understand.
exclipy wrote 1 day ago:
I made a /split-commit prompt that automatically splits a megacommit
into smaller commits. I've found this massively helpful for making more
reviewable commits. You can either run this yourself or send this to
your coworker to have them run it before asking you to re-review it.
Sometimes it doesn't split along optimal boundaries, but it's
usually good enough to help. There's probably room for improvement and
extension (e.g. re-splitting a branch containing many non-logical
commits, moving changes between commits, merging commits, ...);
contributions welcome!
You can install it as a Claude Code plugin here: [1] (or just copy out
the prompt from the repo into your agent of choice)
HTML [1]: https://github.com/KevinWuWon/kww-claude-plugins
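For reference, the core move such a split performs can be reproduced with plain git (a minimal sketch; file names and commit messages are invented for the demo): undo the megacommit while keeping its changes in the working tree, then re-commit them in logical pieces.

```shell
# Demo in a throwaway repo (git init -b needs git 2.28+).
set -e
cd "$(mktemp -d)" && git init -q -b main demo && cd demo
git config user.email dev@example.com && git config user.name dev
echo base > app.txt && git add . && git commit -qm "initial commit"

# One huge commit mixing unrelated changes.
echo fix     > bugfix.txt
echo feature > feature.txt
git add . && git commit -qm "everything at once"

# Undo the commit (mixed reset keeps the working tree), re-commit per topic.
git reset -q HEAD~1
git add bugfix.txt  && git commit -qm "Fix crash on empty input"
git add feature.txt && git commit -qm "Add export feature"
# (git add -p lets you split within a single file, hunk by hunk.)

git log --oneline   # three commits now instead of two
```

The history is the same content as before, just sliced so each commit can be reviewed, reverted, or cherry-picked on its own.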
alexdowad wrote 1 day ago:
Be tactful and kind, but straightforward about what you can't/don't
want to spend time reviewing.
"Thanks for the effort, but my time and energy is limited and I can't
practically review this much code, so I'm closing this PR. We are
interested in performance improvements, so you are welcome to pick out
your #1 best idea for performance improvement, discuss it with the
maintainers via ..., and then (possibly) open a focused PR which
implements that improvement only."
ivanjermakov wrote 1 day ago:
Depends on context of course, but in my book "my time and energy is
limited" is not a valid reason for a reject. Get back once you have
time, review in chunks.
alexdowad wrote 22 hours 40 min ago:
ivanjermakov, I don't know if you are an open source maintainer or
not (I am, for several projects). If you are, and you follow the
policy that "I will never reject PRs because of having no time, I
will always get to it eventually", then I salute you. That is a
self-sacrificing, altruistic position to take. It's also a very
difficult position to maintain for the long term. If you can do it:
congratulations!
As for me, my position is: "My project is my house. You want to be
a guest in my house, you follow my rules. I really like people and
am usually happy to answer questions from people who are reasonably
polite, to review and provide feedback on their PRs, and so on. But
I won't be pressured to prioritize your GitHub issue or PR over my
work, my family, my friends, my health, or my personal goals in
life. If you try to force me, I'll block you and there will be no
further interaction."
If you don't like that position, well, I understand your feelings.
ivanjermakov wrote 18 hours 54 min ago:
I'm absolutely with you on that. I'm not saying that every
contribution deserves equal attention and that rejecting
contributions is a bad/impolite thing.
There has to be a better reason than "your PR is too big", as it's
likely just a symptom, and also very much context sensitive. If it is
a 5kLOC PR that adds a compiler backend for a new architecture
then it probably deserves attention because of its significance.
But if it's obviously low quality code then my response would be
that it is low quality code. Long story short, it's your
(submitter) problem, not my (reviewer, BDFL) problem.
wiseowise wrote 23 hours 34 min ago:
> is not a valid reason for a reject
As a reviewer or as a submitter?
throwaway106382 wrote 1 day ago:
You don't.
Was your project asking for all this? No? Reject.
ripped_britches wrote 1 day ago:
Obviously by vibe reviewing it
never_inline wrote 1 day ago:
close button.
foxfired wrote 1 day ago:
It's funny just today I published an article with the solution to this
problem.
If they don't bother writing the code, why should you bother reading
it? Use an LLM to review it, and eventually approve it. Then of course,
wait for the customer to complain, and feed the complaint back to the
LLM. /s
Large LLM generated PRs are not a solution. They just shift the problem
to the next person in the chain.
throwawayffffas wrote 1 day ago:
How do you know they didn't bother to write it? For all we know the
submitter has been quietly hammering away at this for months.
wiseowise wrote 23 hours 20 min ago:
Then it would have extensive vcs history. Unless they just amend
into one humongous commit.
foxfired wrote 1 day ago:
The title says it is vibe-coded. By definition, it means they
didn't write it.
throwawayffffas wrote 1 day ago:
But how do they know it's vibe-coded? It may have a smell to it,
but the author might not know it for a fact. Whether it's
vibe-coded is actually irrelevant; the size of the request is the
main issue.
sunaookami wrote 1 day ago:
You can recognize it by the rocket emojis in the PR description
;)
foxfired wrote 1 day ago:
I'm not gonna make assumptions on behalf of OP, but if you have
domain knowledge, you can quickly tell when a PR is vibe-coded.
In a real world scenario, it would be pretty rare for someone
to generate this much code in a single PR.
And if they did in fact spend 6 months painstakingly building
it, it wouldn't hurt to break it down into multiple PRs. There
is just so much room for error reviewing such a giant PR.
siwatanejo wrote 1 day ago:
Forget about code for a second. This all depends a lot on what goal
the PR achieves. Does it align with the goals of the project?
appreciatorBus wrote 1 day ago:
How can you tell if it aligns with the goals of the project without
reviewing 9000 lines of code first?
Cthulhu_ wrote 20 hours 56 min ago:
Read the title and description of the PR first. If that fails the
sniff test, the code would as well.
siwatanejo wrote 1 day ago:
Are you kidding me? You should be able to explain, from the user's
PoV, what the PR achieves: a new feature? A bugfix?
That data point is waaaaaay more important than any other when
considering whether you should think about reviewing it or not.
wiseowise wrote 23 hours 29 min ago:
Okay, it does align. What next?
siwatanejo wrote 2 hours 9 min ago:
Ok great. Now that it aligns, how valuable is it? Is it a
deadly & terrific feature? Then maybe you should review it.
Is it just fixing a nitpick? Then ask the contributor to find a
less verbose change.
ivanjermakov wrote 1 day ago:
PRs rarely exist in a vacuum. Usually there is a
ticket/issue/context which required a code change.
jonchurch_ wrote 1 day ago:
We are seeing a lot more drive-by PRs in well known open source
projects lately. Here is how I responded to a 1k-line PR most recently
before closing and locking. For context, it was (IMO) a well
intentioned PR. It purported to implement a grab bag of perf
improvements, caching of various code paths, and a clustering feature.
Edit: I left out that the user got flamed by non-contributors for
their apparently AI-generated PR and description (rude), in defense of
which they did say they were using several AI tools to drive the work.
:
We have a performance working group which is the venue for discussing
perf based work. Some of your ideas have come up in that venue, please
go make issues there to discuss your ideas
my 2 cents on AI output: these tools are very useful, please wield them
in such a way that it respects the time of the human who will be
reading your output. This is the longest PR description I have ever
read and it does not sound like a human wrote it, nor does it sound
like a PR description. The PR also does multiple unrelated things in a
single 1k line changeset, which is a nonstarter without prior
discussion.
I don't doubt your intention is pure, ty for wanting to contribute.
There are norms in open source which are hard to learn from the
outside, idk how to fix that, but your efforts here deviate far enough
from them in what I assume is naivety that it looks like spam.
jonchurch_ wrote 1 day ago:
Daniel Stenberg of curl gave a talk about some of what they've been
experiencing, mostly on the security beg-bounty side. A bit
hyperbolic, and his opinion is clear from the title, but I think a
lot of maintainers feel similarly.
"AI Slop attacks on the curl project"
HTML [1]: https://youtu.be/6n2eDcRjSsk
lkramer wrote 18 hours 43 min ago:
I think it's only fair to give an example where he feels AI is used
correctly:
HTML [1]: https://mastodon.social/@bagder/115241241075258997
jonchurch_ wrote 11 hours 13 min ago:
Wow, very cool, they've now closed 150 bugs identified via AI
assistance/static analysis!
For ref, here is the post from Joshua Rogers about their
investigation into the tooling landscape which yielded those
findings
HTML [1]: https://joshua.hu/llm-engineer-review-sast-security-ai-t...
le-mark wrote 1 day ago:
How long was this person working on it? Six months? Anything this big
should've had some sort of design review. The worst is some junior
going off and coding some garbage no one sees for a month.
jonchurch_ wrote 1 day ago:
You can churn this stuff out in about an hour these days, seriously.
That's part of the problem: the asymmetry of time to create vs time
to review.
If I can write eight 9k-line PRs every day and open them against open
source projects, even closing them, let alone engaging with them in
good faith, is an incredible time drain compared to the time
investment to create them.
999900000999 wrote 1 day ago:
Reject it and tell them to actually code it.
ChrisMarshallNY wrote 1 day ago:
I write full app suites that have less than 9000 LoC. I tend toward
fewer, large-ish source files, separated by functional domains.
I once had someone submit a patch (back in the SVN days), that was
massive, and touched everything in my system. I applied it, and
hundreds of bugs popped up.
I politely declined it, but the submitter got butthurt, anyway. He put
a lot of work into it.
TriangleEdge wrote 1 day ago:
Amazon eng did some research and found the number of comments in a code
review is proportional to the number of lines changed. Huge CRs get
few comments. Small CRs get a lot of comments. At Amazon, it's
common to have a 150 to 300 line limit to changes. It depends on the
team.
In your case, I'd just reject it and ensure repo merges require your
approval.
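A size cap like that can be enforced mechanically in CI. A minimal sketch (the 300-line limit and branch names are invented for the demo; this is not Amazon's actual tooling): count the added and deleted lines between base and head, and flag the change when the total exceeds the limit.

```shell
# Demo in a throwaway repo (git init -b needs git 2.28+).
set -e
cd "$(mktemp -d)" && git init -q -b main demo && cd demo
git config user.email dev@example.com && git config user.name dev
seq 1 5 > small.txt && git add . && git commit -qm "base"
git checkout -qb feature
seq 1 400 > big.txt && git add . && git commit -qm "large change"

# Sum added + deleted lines since the merge base with main.
LIMIT=300
CHANGED=$(git diff --numstat main...feature | awk '{s += $1 + $2} END {print s + 0}')
if [ "$CHANGED" -gt "$LIMIT" ]; then
  echo "FAIL: $CHANGED changed lines exceed the $LIMIT-line limit"
  # a real CI job would exit 1 here to block the merge
else
  echo "PASS: $CHANGED changed lines within the $LIMIT-line limit"
fi
```

The three-dot `main...feature` form diffs against the merge base, so unrelated commits landing on main don't inflate the count.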
senderista wrote 1 day ago:
Also, some teams have CR metrics that can be referenced for
performance evaluations.
kwk1 wrote 1 day ago:
"Inversely proportional" for what it's worth
zukzuk wrote 1 day ago:
That's a great way to discourage anyone from ever doing any
large-scale refactoring, or any other heavy lifting.
charlieyu1 wrote 16 hours 12 min ago:
You want to do large scale refactoring without the main team
agreeing? Seems like a disaster.
Cthulhu_ wrote 21 hours 2 min ago:
That's a good thing, large scale refactorings should be very, very
rare. Even automated code style changes can be controversial
because of the churn they create. For large and/or important
software, churn should be left to a minimum, even at the cost of
readability or code cleanliness. I've seen enough open source
projects that simply state they won't accept refactoring /
reformatting PRs.
gempir wrote 19 hours 35 min ago:
That means your code will stay old.
A new language feature is released; you cannot apply it to old
code, since that would make a big PR. You need to do it super
slowly over time, and most old code will never see it.
A better static type checker that finds some bugs for you: you
cannot fix them, as your PR would be too big; you instead would
need to make a baseline and split it up endlessly.
In theory yes, maybe a bit safer to do it this way, but
discouraging developers from making changes is bad IMO.
Obviously it depends on your use case: if you develop software
that is critical to people's literal lives, then you'll move more
carefully.
But I wager 99% of the software the world produces is some
commerce software, where the only thing lost is money.
wizzwizz4 wrote 19 hours 17 min ago:
> A new language feature is released, you cannot apply it to
old code, since that would make a big PR.
Good. Don't change code for the sake of shiny new things
syndrome.
> A better static type checker, that finds some bugs for you,
you cannot fix them as your PR would be too big,
Good. Report each bug separately, with a suggested fix,
categorised by region of the code. Just because you ran the
program, that doesn't mean you understand the code well enough
to actually fix stuff: those bugs may be symptomatic of a
deeper issue with the module they're part of. The last thing
you need is to turn accidentally-correct code into subtly-wrong
code.
If you do understand the code well enough, what's the harm in
submitting each bugfix as a separate (independent) commit? It
makes it easier for the reviewers to go "yup, yup, yup", rather
than having to think "does this part affect that part?".
febusravenga wrote 1 day ago:
That's good, because large refactorings are usually harmful. They
are also usually unplanned, not scoped, and based on very
unquantifiable observations like "I don't like how the code is
structured - let's do it my way".
arachnid92 wrote 1 day ago:
Just split up your work across multiple PRs.
TriangleEdge wrote 1 day ago:
The review bots can be bypassed.
CharlieDigital wrote 2 days ago:
Ask the submitter to review and leave their comments first or do a peer
code review with them and force them to read the code. It's probably
the first time they'll have read the code as well...
groguzt wrote 1 day ago:
I really like this. The reason vibe-coded PRs are often bad is
that people don't review them themselves first; they just look at the
form, and if it looks vaguely similar to what they had in mind,
they'll just hit save and not ask the LLM for corrections.
vasan wrote 2 days ago:
Just reflect upon it; see if you gave him too little time to complete
it. I would just have a meeting with him and discuss it.
hshdhdhehd wrote 4 days ago:
With a middle finger
rvrs wrote 4 days ago:
Enforce stacked PRs, reject PRs over 500-1k LoC (I'd argue even lower,
but it's a hard sell)
devrundown wrote 5 days ago:
9000 LOC is way too long for a pull request unless there is some very
special circumstance.
I would ask them to break it up into smaller chunks.
tacostakohashi wrote 5 days ago:
Use AI to generate the review, obviously.
yodsanklai wrote 5 days ago:
You review it like it wasn't AI generated. That is: ask author to split
it in reviewable blocks. Or if you don't have an obligation to review
it, you leave it there.
latexr wrote 19 hours 34 min ago:
> Or if you don't have an obligation to review it, you leave it
there.
Don't just leave it there; that reflects badly on you and your
project and pushes away good contributors. If the PR is inadequate,
close it.
ivanjermakov wrote 1 day ago:
My record is 45 comments on a single review. Merge conditions were
configured so that every comment must be resolved.
If the PR author can satisfy that, I'm fine with it.
cryptonym wrote 23 hours 48 min ago:
They will let AI somewhat satisfy it and ask you for further
review.
ivanjermakov wrote 19 hours 0 min ago:
Reminds me of curl problems with vulnerability report: [1] At
that point it is just malicious.
HTML [1]: https://news.ycombinator.com/item?id=43907376
cryptonym wrote 16 hours 11 min ago:
Some people genuinely believe agentic coding works great and
that they have mastered it. Someone who PRs a simple feature with
its own DSL probably is on that team and won't see the issue with
their way. They may think you are too old and resisting AI. They
would probably tell you that if it's too much for your
old-fashioned coding skills, you should just use an agent to
review the PR.
If you think that way, who cares about the code and the additional
DSL? If there is an issue or evolution required, we'll let AI
work on it. If it works, just let it merge. Much cheaper than a
human reviewing everything.
I hate it; maybe I'm too old.
I hate it, maybe I'm too old.
ashdksnndck wrote 1 day ago:
If you ask them to break it into blocks, are they not going to submit
10 more AI-generated PRs (each with its own paragraphs of
description and comment spam), which you then have to wade through?
Why sink even more time into it?
data-ottawa wrote 18 hours 29 min ago:
I think breaking a big PR up like this is usually fair.
Sometimes I get really into a problem and just build. It results in
very large PRs.
Marking the PR as a draft epic, then breaking it down into a
sequence of smaller PRs, makes it much easier to review. And you
can solicit big-picture critique on the draft.
I'm also a huge fan of documentation, so each PR needs to be
clear, describe the bigger picture, and link back to your epic.
mrweasel wrote 21 hours 36 min ago:
There's probably also a decent chance that the author can't
actually do it.
Let's say it's the 9000 lines of code. I'm also not reviewing 900
lines, so it would need to be more than 10 PRs. The code needs to
be broken down into useful components, and that requires the author
to think about design. In this case you'd probably have the DSL
parser as a few PRs. If you do it like that, it's easier for the
reviewer to ask "Why are you doing a DSL?" I feel like in this case
the author would struggle to justify the choice and be forced to
reconsider their design.
It's not just chopping the existing 9000 lines into X number of
bits. It's submitting PRs that make sense as standalone patches.
Submitting 9000 lines in one go tells me that you're a very junior
developer and that you need guidance in terms of design and
processes.
For open source I think it's fine to simply close the PR without
any review and say: Break this down, if you want me to look at it.
Then if a smaller PR comes in, it's easier to assess whether you even
want the code. But if you're the kind of person who doesn't think
twice about submitting 9000 lines of code, I don't think you're
capable of breaking down your patch into sensible sub-components.
ashdksnndck wrote 11 hours 7 min ago:
Some of the current AI coding tools can follow instructions like
"break this PR up into smaller chunks", so even a completely
clueless user may be able to follow those instructions. But that
doesn't mean it's worth a maintainer's time to read the
output of that.
Buttons840 wrote 1 day ago:
Being AI-generated is not the problem. Being AI-generated and not
understandable is the problem. If they find a way to make the
AI-generated code understandable, mission accomplished.
ashdksnndck wrote 1 day ago:
How much of their time should open source maintainers sink into
this didactic exercise? Maybe someone should vibe-code a bot to
manage the process automatically.
userbinator wrote 1 day ago:
If you try to inspect and question such code, you will usually
quickly run into the realisation that the "author" has basically no
idea what the code even does.
"review it like it wasn't AI generated" only applies if you can't
tell, which wouldn't be relevant to the original question that
assumes it was instantly recognisable as AI slop.
If you use AI and I can't tell you did, then you're using it
effectively.
charlieyu1 wrote 16 hours 16 min ago:
You're optimistic to assume the author even cared about the code.
Most of the time you get another LLM response on why the code
"works".
ahtihn wrote 1 day ago:
If it's objectively bad code, it should be easy enough to point out
specifics.
After pointing out 2-3 things, you can just say that the quality
seems too low and to come back once it meets standards. Which can
include PR size for good measure.
If the author can't explain what the code does, make an explicit
standard that PR authors must be able to explain their code.
danenania wrote 1 day ago:
I'm curious how people would suggest dealing with large
self-contained features that can't be merged to main until they are
production-ready, and therefore might become quite large prior to a
PR.
While it would be nice to ship this kind of thing in smaller
iterative units, that doesn't always make sense from a product
perspective. Sometimes version 0 has a bunch of requirements that
are non-negotiable and simply need a lot of code to implement. Do
you just ask for periodic reviews of the branch along the way?
ericmcer wrote 11 hours 4 min ago:
I will schedule review time with coworkers I trust to go over it
with them.
It is about ownership to me. I own my PRs. If I throw garbage out
and expect you to fix it I am making you own my PRs. No one wants
to be forced to own other peoples work.
Yizahi wrote 19 hours 33 min ago:
In our case, if such a thing happens (a few times per year across
hundreds of people), a separate branch is created and a team
working on that feature is completely autonomous for a while, while
there is constant normal work in trunk by everyone else. Team tests
their feature and adjacent code to an acceptable beta state but
doesn't do any extensive or full coverage because it is impossible.
Their code may be reviewed at that point if they request it, but it
is done as an extra activity, with meetings and stuff. Then they
optionally give this build to the general QA to run the full suite
on it. This may be done in several cycles if fatal issues are found.
Then they announce that they will merge into trunk on days A to
B and ask everyone to please hold off on committing into trunk in
that time. Around that time they send a mail outlining changes and
new functionality and potential or actual unfixed issues. The QA
team runs as full a coverage of tests as possible. The merge may be
reverted at this point if it is truly bad. If it is good, the team
announces success and proceeds with normal work mode.
wiseowise wrote 23 hours 37 min ago:
> I'm curious how people would suggest dealing with large
self-contained features that can't be merged to main until they
are production-ready
Are you hiding them from CIA or Al-Qaeda?
Feature toggles, or just a plain Boolean flag, are not rocket science.
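A plain Boolean flag really can be that simple. A minimal sketch, assuming an environment-variable convention; the function and flag names here are illustrative, not from any particular framework or from the project under discussion:

```python
# Minimal feature-flag sketch: the unfinished code path stays dark
# unless the flag is explicitly switched on in the environment.
import os

def feature_enabled(name: str) -> bool:
    """Read a flag from the environment; features default to off."""
    return os.environ.get(f"FEATURE_{name.upper()}", "0") == "1"

def legacy_parser(data):
    return ("legacy", data)

def new_parser(data):
    return ("new", data)

def handle_request(data):
    # The half-baked feature can live in main, gated by the flag.
    if feature_enabled("new_parser"):
        return new_parser(data)
    return legacy_parser(data)
```

This is what lets the large feature land in main in small PRs without being user-visible until it is ready.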
danenania wrote 16 hours 19 min ago:
Not rocket science, but I think there are also some tradeoffs
with feature flags?
People could build on top of half-baked stuff because it's in
main. Or you might interact with main in ways that aren't ready
for production and aren't trivial to toggle… or you just
forget a flag check somewhere important.
I could also see schema/type decisions getting locked in too
early while the feature is still in flux, and then people don't
want to change after it's already reviewed since it seems like
thrashing.
But yeah, definitely it's one option. How do you consider those
tradeoffs?
exe34 wrote 1 day ago:
You line up 10-20 PRs and merge them into a temporary integration
branch that gets tested/demoed. The PRs still have to be
reviewed/accepted and merged into main separately. You can say "the
purpose of this PR is to do X for blah, see top-level ticket".
Often there will be more than one ticket, depending on how
self-contained the PRs are.
JonChesterfield wrote 1 day ago:
They come from people who have established that their work is worth
the time to review and that they'll have put it together
competently.
If it's a newcomer to the project, a large self-contained PR is
more likely to contain malware than benefits. View it with suspicion.
arachnid92 wrote 1 day ago:
The way we do it where I work (large company in the
cloud/cybersecurity/cdn space):
- Chains of manageable, self-contained PRs, each implementing a
limited scope of functionality. "Manageable" in this context
means at most a handful of commits, and probably no more than a few
hundred lines of code (probably less than a hundred, tbh).
- The main branch holds the latest version of the code, but that
doesn't mean it's deployed to production as-is. Releases are
regularly cut from stable points of this branch.
- The full "product" or feature is disabled by a
false-by-default flag until it's ready for production.
- Enablement in production is performed in small batches, rolling
back to disabled if anything breaks.
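The small-batch enablement step above is commonly implemented as deterministic percentage bucketing. A sketch of that pattern, with hypothetical names (this is not the commenter's actual code):

```python
# Deterministic percent-based rollout: each user id hashes into a
# stable bucket 0-99, so raising the percentage only ever adds users
# (everyone already enabled stays enabled), and rollback is just
# setting the percentage back to 0.
import hashlib

def in_rollout(user_id: str, percent: int) -> bool:
    """Return True if this user falls inside the enabled percentage."""
    digest = hashlib.sha256(user_id.encode()).hexdigest()
    bucket = int(digest, 16) % 100
    return bucket < percent
```

With `percent=0` the feature is off for everyone; `percent=100` enables it for all; values in between give the small batches described above.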
foxglacier wrote 1 day ago:
The partial implementation could be turned off with a feature flag
until it's complete.
resonious wrote 1 day ago:
This is it. The fact that the PR was vibe coded isn't the problem,
and doesn't need to influence the way you handle it.
ericmcer wrote 11 hours 7 min ago:
It is a huge problem. PR reviews are a big deal, not just for code
reasons, but they are one of the best teaching tools for new hires.
Good ones take time and mental energy.
Asking me to review a shitty PR that you don't understand is just
disrespectful. Not only is it a huge waste of everyones time,
you're forcing me to do your work for you (understanding and
validating the AI solution) and you aren't learning anything
because it isn't your work.
f311a wrote 16 hours 20 min ago:
It's the problem. I often have to guide LLMs 2-4 times to properly
write 150-300 LOC changes, because I see how the code can be
simplified or improved.
There is no way that 9000 lines of code are decent. It's also very
hard to review them and find the bad spots. Why spend your time in
the first place? It probably took one hour for a person to generate
it, but it will take ten to review and point out the (probably)
hundreds of problems.
Without AI, no one would submit 9000 lines, because that's tens of
hours of work, which you would usually split into logical parts.
lm28469 wrote 16 hours 39 min ago:
It 100% is.
Why would I bother reviewing code you didn't write and most likely
didn't read?
cespare wrote 1 day ago:
It is 1995. You get an unsolicited email with a dubious business
offer. Upon reflection, you decide it's not worth consideration and
delete it. No need to wonder how it was sent to you; that doesn't
need to influence the way you handle it.
No. We need spam filters for this stuff. If it isn't obvious to you
yet, it will be soon. (Or else you're one of the spammers.)
resonious wrote 20 hours 0 min ago:
The original ask was about one PR.
baq wrote 23 hours 14 min ago:
Didn't even hit the barn, sorry. Codegen tools were the obvious
first step; review-assistance tools are badly lagging, but will come.
eru wrote 5 hours 29 min ago:
We already have some of them. And if you have a wide enough
definition, we had them for a while.
gdulli wrote 1 day ago:
It would be willfully ignorant to pretend that there's not an
explosion of a novel and specific kind of stupidity, and to not
handle it with due specificity.
rablackburn wrote 1 day ago:
> It would be willfully ignorant to pretend that there's not an
explosion of a novel and specific kind of stupidity
I 100% know what you mean, and largely agree, but you should
check out the guidelines, specifically:
> Don't be curmudgeonly. Thoughtful criticism is fine, but please
don't be rigidly or generically negative.
And like, the problem _is_ *bad*. A fun, ongoing issue at work
is trying to coordinate with a QA team who believe ChatGPT can
write CSS selectors for HTML elements that are not yet written.
That same QA team deeply cares about the spirit of their work, and
is motivated by the _very_ relatable sentiment of: you DON'T
FUCKING BREAK USER SPACE.
Yeah, in the unbridled, chaotic, raging plasma that is our
zeitgeist at the moment, I'm lucky enough to have people
dedicating a significant portion of their life to trying to do
quality assurance in the idiomatic, industry best-standard way.
Blame the FUD, not my team.
I would put it to you that they do not (yet) grok what, for lack
of a more specific universally understood term, we are calling
"AI" (or LLMs if you are fancy; but of course none of these labels
are quite right). People need time to observe and learn. And people
are busy with /* gestures around vaguely at everything */.
So yes, we should acknowledge that long-winded trash PRs from AI
are a new emergent problem, and yes, if we study the specific
problem more closely we will almost certainly find ever more
optimal approaches.
Writing off the issue as "stupidity" is mean. In both senses.
watwut wrote 18 hours 36 min ago:
I do not think that is being curmudgeonly. Instead, OP is
absolutely right.
We collectively used the strategy of "we pretend we are naively
stupid and don't talk directly about issues" in multiple areas
... and it failed every single time in all of them. It never
solves the problem; it just invites bad/lazy/whatever actors
to play semantic manipulative games.
WalterSear wrote 1 day ago:
I contend that, by far and away the biggest difference between
entirely human-generated slop and AI-assisted stupidity is the
irrational reaction that some people have to AI-assisted stuff.
JoshTriplett wrote 23 hours 28 min ago:
Many of the people who submit 9000-line AI-generated PRs today
would, for the most part, not have submitted PRs at all before,
or would not have made something that passes CI, or would not
have built something that looks sufficiently plausible to make
people spend time reviewing it.
HelloNurse wrote 16 hours 53 min ago:
Most of those people should still keep their ignorance to
themselves, without bothering actual programmers, like they
did before LLM hype convinced them that "sufficiently
plausible" is good enough.
A similar trend: the popularity of electric scooters among
youngsters who would otherwise walk, use public transport, or
use decent vehicles increases accidents in cities.
JoshTriplett wrote 14 hours 19 min ago:
I think my comment may have been misparsed. I was observing
that one of the problems with LLMs is making it possible
for people to produce 9000-line PRs they don't understand
where previously they might have been gated by making
something even remotely plausible that compiles or passes
CI.
WalterSear wrote 22 hours 57 min ago:
9000-line PRs were never a good idea; they have only been
plausible because we were forced to accept bad PR review practices.
Coding was expensive, and management beat us into LGTMing them into
the codebase to keep the features churning.
Those days are gone. Coding is cheap. The same LLMs that
enable people to submit 9000 line PRs of chaos can be used to
quickly turn them into more sensible work. If they genuinely
can't do a better job, rejecting the PR is still the right
response. Just push back.
exe34 wrote 1 day ago:
Are you quite sure that's the only difference you can think of?
Let me give you a hint: is there any difference in the volume
for the same cost at all?
hatefulmoron wrote 1 day ago:
Calling things "slop" is just begging the question. The real
differentiating factor is that, in the past, "human-generated
slop" at least took effort to produce. Perhaps, in the process
of producing it, the human notices what's happening and
reconsiders (or even better, improves it such that it's no
longer "slop".) Claude has no such inhibitions. So, when you
look at a big bunch of code that you haven't read yet, are you
more or less confident when you find out an LLM wrote it?
HelloNurse wrote 17 hours 1 min ago:
> Perhaps, in the process of producing it, the human
notices what's happening and reconsiders (or even better,
improves it such that it's no longer "slop".)
Given the same ridiculously large and complex change: if it is
handwritten, only a seriously insensitive and arrogant crackpot
could, knowing what's inside, submit it with any expectation that
you accept it without a long and painful process, instead of
improving it to the best of their ability. On the other hand, with
LLM assistance, even a mildly incompetent but valuable colleague or
contributor, someone you care about, might underestimate the
complexity and cost of what they didn't actually write and believe
that there is nothing to improve.
fragmede wrote 23 hours 41 min ago:
If you try to one-shot it, sure. But if you question Claude, point
out the error of its ways, tell it to refactor and ultrathink, and
point out that two things have similar functionality and could be
merged, it does much better. It can write unhinged code with
duplicate unused variable definitions that doesn't work, and it'll
fix it up if you call it out, or you can just do it yourself. (Cue
questions of whether, in that case, it would just be faster to do
it yourself.)
hatefulmoron wrote 23 hours 30 min ago:
I have a Claude Max subscription. When I think of bad Claude code,
I'm not thinking about unused variable definitions. I'm thinking
about the times you turn on ultrathink, allow it to access tools
and negotiate its solution, and it still churns out an
overcomplicated yet only partially correct solution that breaks. I
totally trust Claude to fix linting errors.
fragmede wrote 23 hours 1 min ago:
It's hard to really discuss in the abstract, though. Why was the
generated code overly complicated? (I mean, I believe you when you
say it was, but it doesn't leave much room for discussion.)
Similarly, what's partially correct about it? How many additional
prompts does it take before you a) use it as a starting point,
b) use it because it works, c) don't use any of it and just throw
it away, d) post about why it was lousy to all of the Internet
reachable from your local ASN?
hatefulmoron wrote 22 hours 40 min ago:
I've read your questions a few times and I'm a bit
perplexed. What kind of answers are you expecting me to
give you here? Surely if you use Claude Code or other
tools you'd know that the answers are so varying and
situation specific it's not really possible for me to
give you solid answers.
fragmede wrote 12 hours 52 min ago:
However much you're comfortable sharing! Obviously
ideal would be the full source for the "overly
complicated" solution, but naturally that's a no go,
so even just more words than a two word phrase
"overly complicated". Was it complicated because it
used 17 classes with no inheritance and 5 would have
done it? Was it overly complicated because it didn't
use functions and so has the same logic implemented
in 5 different places?
I'm not asking you, generically, about what bad code
do LLMs produce. It sounds like you used Claude Code
in a specific situation and found the generated code
lacking. I'm not questioning that it happened to you,
I'm curious in what ways it was bad for your specific
situation more specifically than "overly
complicated". How was it overly complicated?
Even if you can't answer that, maybe you could help
me reword the phrasing of my original comment so it's
less perplexing?
WalterSear wrote 23 hours 12 min ago:
If you are getting garbage out, you are asking it for too
much at once. Don't ask for solutions - ask for
implementations.
hatefulmoron wrote 22 hours 57 min ago:
Distinction without a difference. I'm talking about its
output being insufficient, whatever word you want to
use for output.
WalterSear wrote 22 hours 18 min ago:
And I'm arguing that if the output wasn't sufficient,
neither was your input.
You could also be asking for too much in one go,
though that's becoming less and less of a problem as
LLMs improve.
hatefulmoron wrote 21 hours 56 min ago:
You're proposing a truism: if you don't get a good
result, it's either because your query is bad or
because the LLM isn't good enough to provide a good
result.
Yes, that is how this works. I'm talking about the
case where you're providing a good query and
getting poor results. Claiming that this can be
solved by more LLM conversations and ultrathink is
cope.
WalterSear wrote 21 hours 16 min ago:
I've claimed neither. I actually prefer
restarting or rolling back quickly rather than
trying to re-work suboptimal outputs - less
chance of being rabbit holed. Just add what I've
learned to the original ticket/prompt.
'Git gud' isn't much of a truism.
WalterSear wrote 1 day ago:
I have pretty much the same amount of confidence when I
receive AI generated or non-AI generated code to review: my
confidence is based on the person guiding the LLM, and their
ability to that.
Much more so than before, I'll comfortably reject a PR that
is hard to follow, for any reason, including size. IMHO, the
biggest change that LLMs have brought to the table is that
clean code and refactoring are no longer expensive, and
should no longer be bargained for, neglected or given the lip
service that they have received throughout most of my career.
Test suites and documentation, too.
(Given the nature of working with LLMs, I also suspect that
clean, idiomatic code is more important than ever, since LLMs
have presumably been trained on that, but this is just a
personal superstition, that is probably increasingly false,
but also feels harmless)
The only time I think it is appropriate to land a large
amount of code at once is if it is a single act of entirely
brain dead refactoring, doing nothing new, such as renaming a
single variable across an entire codebase, or
moving/breaking/consolidating a single module or file. And
there better be tests. Otherwise, get an LLM to break things
up and make things easier for me to understand, for crying
out loud: there are precious few reasons left not to make
reviewing PRs as easy as possible.
So, I posit that the emotional reaction from certain
audiences is still the largest, most exhausting difference.
hatefulmoron wrote 23 hours 45 min ago:
I don't really understand your point. It reads like you're
saying "I like good code, it doesn't matter if it comes
from a person or an LLM. If a person is good at using an
LLM, it's fine." Sure, but the problem people have with
LLMs is their _propensity_ to create slop in comparison to
humans. Dismissing other people's observations as purely an
emotional reaction just makes it seem like you haven't
carefully thought about other people's experiences.
WalterSear wrote 22 hours 31 min ago:
My point is that, if I can do it right, others can too.
If someone's LLM is outputting slop, they are obviously doing
something different: I'm using the same LLMs.
All the LLM hate here isn't observation, it's sour grapes.
Complaining about slop and poor code quality outputs is confessing
that you haven't taken the time to understand what is reasonable to
ask for, and aren't educating your junior engineers on how to
interact with LLMs.
rockskon wrote 7 hours 56 min ago:
???
People complaining about receiving bad code is, by
definition, observation.
lukan wrote 19 hours 22 min ago:
"My point is that, if I can do it right, others can
too."
Can it also be, that different people work in different
areas and LLM's are not equally good in all areas?
WalterSear wrote 13 hours 52 min ago:
That was my first assumption, quite a while ago now.
grey-area wrote 1 day ago:
clean code and refactoring are no longer expensive
Are you contending that LLMs produce clean code?
WalterSear wrote 1 day ago:
They do, for many people. Perhaps you need to change your
approach.
grey-area wrote 15 hours 58 min ago:
The code I've seen generated by others has been pretty
terrible in aggregate, particularly over time as the
lack of understanding and coherent thought starts to
show. Quite happy without it thanks, haven't seen it
adding value yet.
dmurray wrote 1 day ago:
If you can produce a clean design, the LLM can write
the code.
WalterSear wrote 22 hours 24 min ago:
I think maybe there's another step too - breaking the design up
into small enough pieces that the LLM can follow it, and you can
understand the output.
TexanFeller wrote 20 hours 2 min ago:
So do all the hard work yourself and let the AI do
some of the typing, that youâll have to spend
extra time reviewing closely in case its RNG factor
made it change an important detail. And with all
the extra up front design, planning, instructions,
and context you need to provide to the LLM Iâm
not sure Iâm saving on typing. A lot of people
recommend going meta and having LLMs generate a
good prompt and sequence of steps to follow, but
Iâve only seen that kinda sorta work for the most
trivial tasks.
fragmede wrote 23 hours 39 min ago:
Unless you're doing something fabulously unique (at
which point I'm jealous you get to work on such a
thing), they're pretty good at cribbing the design of
things if it's something that's been well documented
online (canonically, a CRUD SaaS app, with minor UI
modification to support your chosen niche).
WalterSear wrote 23 hours 14 min ago:
And if you are doing something fabulously unique,
the LLM can still write all the code around it,
likely help with many of the components, give you
at least a first pass at tests, and enable rapid,
meaningful refactors after each feature PR.
gpm wrote 1 day ago:
Eh, ask the author to split it into reviewable blocks if you think
there's a chance you actually want a version of the code. More
likely, if it's introducing tons of complexity to a conceptually
simple service, you just outright reject it on that basis.
Possibly you reject it with "this seems more suitable for a fork than
a contribution to the existing project". After all there's probably
at least some reason they want all that complexity and you don't.
JohnFen wrote 6 days ago:
I'd just reject it for being ridiculous. It didn't pass the first step
of the review process: the sniff test.
brudgers wrote 6 days ago:
Charitably, even though it is not what you or I would do, the pull
request could be the best good-faith effort of a real human being.
So to me, it's less about being ridiculous (and "ridiculous" is a
fighting word) and more a simple "that's not how this team does
things because we don't have the resources to work that way."
Mildly hurt feelings in the most likely worst case (no food for a
viral over-the-top tweet). At best, recruitment of someone with
cultural fit.
f311a wrote 16 hours 7 min ago:
It's very rare to see 9K-line PRs from humans. They usually take
weeks of work if it's a serious change. No one wants to spend that
amount of time just to get rejected, so you split it up and send
pieces one by one while discussing the changes.
But if it takes one hour with AI, you just send it right away...
JohnFen wrote 6 days ago:
My objection to a PR like this has nothing to do with whether or
not a human wrote it. It's that the PR is too large and complex.
The reason I'd give for rejecting it would be that. I wouldn't say
"it's ridiculous" as the reason. I would 100% be thinking that,
though.
brudgers wrote 6 days ago:
Thatâs good.
My experience is that "too large/complex" provides an opening for
argumentativeness and/or drama.
"We don't do it like this" does not, so much. It is social,
sufficient, and not a matter of opinion ("too" is a matter of
opinion).
BrenBarn wrote 1 day ago:
What about "this is large and complex enough to be not the way
we do things"?
wengo314 wrote 6 days ago:
Reject outright. Ask to split it into a reasonable chain of changesets.
aaronrobinson wrote 6 days ago:
Reject it
zigcBenx wrote 6 days ago:
In my opinion no PR should have so many changes. It's impossible to
review such things.
The only exception is some large migration or version upgrade that
requires lots of files to change.
As far as it goes for vibe-coded gigantic PRs, it's a straight
reject from me.
DIR <- back to front page