A Developer’s Framework for the 30-Minute “Quality Reset” Conversation with Your Manager
The Pattern
A developer on Reddit described a situation that will sound familiar to anyone who has worked in a small team: they build the app, they support it in production, and they are also the person who writes test cases, runs regressions, and catches bugs before deployment. There is a QA analyst on the team, but that analyst does not write test cases, does not ask clarifying questions, and does not push back on unclear requirements. The developer compensates by doing the QA work quietly, which keeps the product stable but makes the developer exhausted and the QA analyst invisible. Management sees no problem because the product works.
This is not a story about a lazy tester. This is a story about a system that has no defined quality process, which means the person who cares the most ends up carrying it alone.
Why This Keeps Happening
The instinct is to blame the individual. “My QA does nothing.” But the same pattern repeats across thousands of teams for structural reasons, not personal ones.
When a team has no written definition of what QA is supposed to deliver, no minimum criteria for what “tested” means, and no visibility into who did what before a release, the outcome is predictable. The developer tests because the developer ships. The QA analyst drifts because nobody defined what “done” looks like for their role. And the manager assumes everything is fine because production is stable.
The DORA research program, which has studied software delivery performance across tens of thousands of teams since 2014, consistently finds that team performance correlates more strongly with clearly defined responsibilities and process discipline than with individual skill or effort (dora.dev). Their 2024 Accelerate State of DevOps Report introduced “Rework Rate” as a new metric, measuring the proportion of unplanned deployments made to fix user-visible issues (future-processing.com). Teams with high rework rates are not teams with bad developers. They are teams where defects escape the pre-production process, either because the process is weak or because it does not exist at all.
The concept that captures this is called “Defect Escape Rate”: the number of defects found in production divided by the total number of defects found anywhere, expressed as a percentage (daily.dev). A team that catches 90% of bugs before production has a 10% escape rate. A team where the developer is doing all the testing alone, under time pressure, while also building new features, will have a higher escape rate. Not because the developer is bad at testing, but because one person doing two jobs cannot do either at full capacity.
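The formula is simple enough to track in a spreadsheet or a few lines of code. A minimal sketch, assuming you log defects per release with a found-in-production flag:

```python
def defect_escape_rate(production_defects: int, total_defects: int) -> float:
    """Percentage of defects that escaped to production:
    (defects found in production / total defects found) * 100."""
    if total_defects == 0:
        return 0.0
    return production_defects / total_defects * 100

# Example: 3 bugs reached production out of 30 found in total.
print(defect_escape_rate(3, 30))  # -> 10.0
```

Tracking this number per release, even roughly, turns “quality feels worse lately” into a trend a manager can see.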
What QA Is Actually Supposed to Produce
The Reddit poster asked whether their understanding of QA was correct: “A Quality Assurance Analyst is responsible to understand the application, write test cases, test to find bugs and report findings to the developer prior to deployment.”
That is accurate but incomplete. A functioning QA role produces specific, verifiable outputs.
Risk-based test planning. Not “test everything,” but “here are the three places this feature is most likely to break, and here is why.” A nail salon booking app with role-based permissions, for example, has a clear risk area: what happens when a client-role user tries to access an admin-role function? What happens when two specialists are booked at the same time? A QA analyst who understands the application should be able to identify these scenarios without the developer hand-feeding them.
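A risk-based scenario like the permission check above maps directly onto an executable test. A sketch, assuming a hypothetical `can_access` function and role names (all illustrative, not from any real codebase):

```python
# Hypothetical permission model for a booking app; names are illustrative.
ADMIN_ACTIONS = {"edit_schedule", "delete_booking", "view_all_clients"}
CLIENT_ACTIONS = {"create_booking", "cancel_own_booking", "view_own_schedule"}

def can_access(role: str, action: str) -> bool:
    """Return True if the given role may perform the action."""
    if role == "admin":
        return action in ADMIN_ACTIONS | CLIENT_ACTIONS
    if role == "client":
        return action in CLIENT_ACTIONS
    return False  # undefined roles get nothing

# The risk-based scenarios: the places this feature is most likely to break.
assert not can_access("client", "edit_schedule")    # client must not reach admin functions
assert can_access("admin", "delete_booking")        # admin retains full access
assert not can_access("guest", "create_booking")    # unknown role is denied by default
```

The value is not the code itself but the list of scenarios: a QA analyst who understands the application should be able to produce that list unprompted.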
Reproducible bug reports. A useful bug report contains: steps to reproduce, expected result, actual result, environment (browser, OS, device), and optionally a screenshot or video. “It does not work” is not a bug report. The difference between a team that resolves bugs quickly and one that spends hours guessing is almost always the quality of the bug report.
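The required fields can be enforced rather than remembered. A minimal sketch using a dataclass; the field names are a suggestion, not a standard:

```python
from dataclasses import dataclass, field

@dataclass
class BugReport:
    """A bug report is only actionable if every required field is filled in."""
    title: str
    steps_to_reproduce: list      # ordered steps, one per entry
    expected_result: str
    actual_result: str
    environment: str              # browser, OS, device
    attachments: list = field(default_factory=list)  # screenshot/video (optional)

    def is_actionable(self) -> bool:
        return all([self.title, self.steps_to_reproduce,
                    self.expected_result, self.actual_result, self.environment])

report = BugReport(
    title="Client can open admin schedule view",
    steps_to_reproduce=["Log in as client", "Navigate to the admin schedule page"],
    expected_result="Access denied",
    actual_result="Full schedule for all specialists is shown",
    environment="Chrome 126, Windows 11",
)
print(report.is_actionable())  # -> True
```

Most bug trackers can enforce the same thing with required fields on the issue template; the structure matters more than the tool.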
Regression ownership. When a new feature ships, existing features must still work. Somebody has to own the set of checks that confirms this. If nobody owns it, either the developer runs regressions informally (adding invisible hours to their workload) or regressions do not happen (and users find the bugs instead).
Explicit sign-off. Before a release goes to production, someone other than the developer should confirm: “I tested the critical paths, here is what I checked, and I approve this for release.” This is not bureaucracy. It is a minimum standard of accountability.
The Developer’s Trap: Invisible Heroism
The Reddit poster described doing the QA analyst’s job behind the scenes so that production stays stable. This is a common and understandable response, but it creates a feedback loop that makes the problem worse.
When the developer absorbs the QA workload silently, three things happen. Management sees no quality problem, so there is no urgency to fix the process. The QA analyst receives no feedback about unmet expectations, so their behavior does not change. And the developer’s time gets consumed by testing instead of building, which slows delivery; when that slowdown eventually draws management attention, the attention lands on the developer (“Why is this feature late?”) rather than on the process.
The CrowdStrike incident of July 19, 2024, offers an extreme illustration of what happens when testing processes fail at scale. A faulty configuration update to CrowdStrike’s Falcon Sensor software crashed approximately 8.5 million Windows devices worldwide, grounding flights, shutting down hospitals, and disrupting financial institutions. The estimated financial damage exceeded $5 billion for Fortune 500 companies alone (Wikipedia, CNN). CrowdStrike’s own root cause analysis revealed that their Content Validator had a logic error that allowed the faulty update to pass validation, and that the update was deployed without staged rollouts or adequate canary testing (techtarget.com). Delta Air Lines filed a $500 million lawsuit alleging that CrowdStrike deployed “untested software updates” (Wikipedia).
Nobody is comparing a nail salon booking app to critical infrastructure. But the principle is identical at every scale: when testing is skipped, compressed, or concentrated in one person who is also building the product, defects escape. The only variable is how much damage they cause.
The Conversation Framework: 30 Minutes with Your Manager
The goal of this conversation is not to complain about a coworker. The goal is to propose a system change that removes the dependency on any single person’s heroism.
This requires reframing. Instead of “the QA analyst is not doing their job,” the message becomes: “our current process has no defined quality gate, which means quality depends on individual effort rather than a repeatable system.”
The conversation has four parts.
Part 1: Observation (No Judgment, No Names)
State what is happening, using your own work as the evidence.
“Over the last few projects, I wrote the test scenarios, ran the regressions, and tested the role-based permission logic myself before each deployment. I tracked this informally and it added roughly [X] hours per sprint to my workload.”
The key here is specificity. “I do everything” is vague and sounds like a complaint. “I logged 12 hours of testing work in the last sprint that was not in my estimates” is a fact that demands a response.
Part 2: Impact (Risk and Cost)
Connect the observation to something the manager cares about: delivery speed, production stability, or team scalability.
“When one person builds and tests the same code, there is no independent verification. This increases the risk that bugs reach production. It also means my development velocity is lower than it could be, because a portion of my time goes to testing instead of building.”
If you have data, use it. If you have had a production bug that testing would have caught, reference it without blame: “The [specific bug] in [release] is an example of something that a dedicated test pass would likely have caught.”
Part 3: Proposal (System, Not Person)
Propose a quality gate: a minimum set of criteria that must be met before any release goes to production.
“I would like to propose that we define a simple set of pre-deployment requirements. Not a heavyweight process, just a short checklist that someone other than the developer signs off on before we deploy.”
A starter quality gate for a small team might include:
| Area | Minimum Requirement |
|---|---|
| Critical paths | Core user flows tested and documented |
| Role-based behavior | Permission variations verified per role |
| Regression | Key existing features re-tested after changes |
| Bug reports | Reproducible steps, expected vs. actual, environment noted |
| Sign-off | Written confirmation from a tester before deploy |
This is not a process document. It is a one-page checklist. The point is that it exists, that someone other than the developer owns it, and that deployment does not happen until it is complete.
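A checklist this small can even be made machine-checkable. A sketch, assuming the team records each gate item and a sign-off name before deploying; the item names and structure are illustrative:

```python
# Hypothetical pre-deploy quality gate, mirroring the checklist above.
quality_gate = {
    "critical_paths_tested": True,
    "role_permissions_verified": True,
    "regression_run": True,
    "bug_reports_reproducible": True,
}

def gate_passes(gate: dict, signed_by: str, developer: str) -> bool:
    """Deploy only if every item is done and the sign-off is independent,
    i.e. the tester is not the same person who wrote the code."""
    return all(gate.values()) and bool(signed_by) and signed_by != developer

print(gate_passes(quality_gate, signed_by="QA Analyst", developer="Dev"))  # -> True
print(gate_passes(quality_gate, signed_by="Dev", developer="Dev"))         # -> False
```

Whether this lives in a script, a CI step, or a shared document is secondary; the point is that the gate is explicit and the sign-off is independent.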
Part 4: Ask (Decision and Ownership)
Close with a clear ask that requires a decision.
“Can we agree on a basic quality checklist and assign ownership for completing it? I am not asking for new tools or new hires. I am asking that the testing role we already have on the team produces specific outputs before each release.”
This language does not name the QA analyst. It does not say “they are not doing their job.” It says: “here is a gap in our process, here is how it affects the product, and here is a lightweight fix.”
The First 30 Days After the Conversation
Assuming the manager agrees (and most will, because the ask is small and the logic is clear), the next step is to make it real. A process that lives only in a meeting is not a process.
Week 1: Map the critical paths together. Sit down with the QA analyst and list every user flow that, if broken, would make the product unusable or would generate user complaints. For a booking application, this might be: create booking, cancel booking, view schedule by day, view schedule by week, permission check (client vs. admin vs. specialist). This is a working session, not a knowledge transfer. The QA analyst should be writing, not just listening.
Week 2: Produce the first set of test scenarios. The QA analyst writes scenarios for the critical paths identified in Week 1. These do not need to be formal test case documents. A simple format works: “Given [condition], when [action], then [expected result].” The developer reviews them for completeness, not for approval. If scenarios are missing, the developer points to the gap and the QA analyst fills it.
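The Given/When/Then format also translates directly into automated checks later on. A sketch of the double-booking scenario against a hypothetical in-memory store (all names are illustrative):

```python
# Given: a specialist with one existing booking at 10:00.
bookings = {("anna", "10:00")}

def create_booking(specialist: str, time: str) -> bool:
    """When a client books a slot: succeed only if the slot is free.
    Then: a second booking for the same specialist and time is rejected."""
    if (specialist, time) in bookings:
        return False  # slot already taken
    bookings.add((specialist, time))
    return True

assert create_booking("anna", "11:00") is True   # free slot succeeds
assert create_booking("anna", "10:00") is False  # double booking rejected
```

Writing the scenarios first, in plain Given/When/Then language, is the QA analyst's job; automating them can come much later, if ever.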
Week 3: Run the first quality gate. On the next deployment, use the checklist. The QA analyst runs the scenarios, documents what passed and what failed, and provides written sign-off (or documents blockers). This is the first time the QA analyst’s output is visible to the team and to management.
Week 4: Retrospective on escaped defects. After deployment, review any bugs found in production. For each one, ask: “Was this covered by a test scenario? If not, should it be? If it was covered, why did it escape?” This is not blame. This is calibration. The DORA framework calls this kind of review a way to “reduce the batch size of changes” and “create a complete feedback loop” (dora.dev).
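The retrospective questions can be applied mechanically to the list of escaped bugs. A sketch, assuming a simple per-bug coverage record (the data shapes are illustrative):

```python
# Hypothetical records: was each production bug covered by a test scenario?
escaped_bugs = [
    {"id": 101, "covered_by_scenario": False},  # gap in coverage: write a scenario
    {"id": 102, "covered_by_scenario": True},   # covered but escaped: investigate why
]

def triage(bugs):
    """Bucket escaped defects for the retrospective."""
    add_scenario = [b["id"] for b in bugs if not b["covered_by_scenario"]]
    investigate = [b["id"] for b in bugs if b["covered_by_scenario"]]
    return add_scenario, investigate

print(triage(escaped_bugs))  # -> ([101], [102])
```

The first bucket improves coverage; the second improves the scenarios themselves. Both feed directly back into the Week 2 artifacts.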
What This Approach Does (and Does Not Do)
This approach does not guarantee that the QA analyst will suddenly become great at their job. It does guarantee that their output (or lack of output) becomes visible. When there is a checklist and someone’s name is next to each item, the work either gets done or the gap is documented. Both outcomes are useful.
It also does not address the salary disparity the Reddit poster mentioned. That is a separate conversation, and mixing it with the quality process discussion weakens both arguments. The quality conversation is about the team’s process. The compensation conversation is about the individual’s value. Keep them apart.
What this approach does is shift the frame from “people problem” to “system problem.” That shift matters because managers can act on system problems without anyone losing face. Nobody is called lazy. Nobody is put on a performance improvement plan. The team simply agrees on what “done” means and who owns each piece of it.
The DORA research consistently shows that teams with clearly defined responsibilities and process discipline outperform teams that rely on individual heroism (octopus.com/devops/metrics/dora-metrics). Atlassian’s benchmark for DORA “Elite” teams includes a change failure rate of 0-15%, which requires systematic quality practices, not one exhausted developer running regressions at midnight (atlassian.com).
The Sentence That Changes the Conversation
If there is one line to carry into the meeting, it is this:
“Right now, quality depends on whoever cares the most. I want it to depend on a process.”
That sentence does not attack anyone. It does not demand credit. It reframes the entire problem from a personality conflict into an engineering decision. And engineering decisions have solutions.
Sources
- DORA research program: dora.dev/guides/dora-metrics-four-keys. Five metrics for software delivery performance: deployment frequency, lead time for changes, change failure rate, failed deployment recovery time, and reliability.
- 2024 Accelerate State of DevOps Report findings: future-processing.com/blog/dora-devops-metrics. Introduction of “Rework Rate” metric; shift from four to six indicators.
- DORA 2024 key findings: octopus.com/devops/metrics/dora-metrics. Well-defined responsibilities and psychological safety as strongest predictors of delivery performance.
- Atlassian DORA benchmarks: atlassian.com/devops/frameworks/dora-metrics. Elite performers: change failure rate of 0-15%, recovery within one hour.
- Defect Escape Rate methodology: daily.dev/blog/defect-density-and-escape-rate-agile-metrics-guide-2024. Formula: (defects found in production / total defects) x 100.
- CrowdStrike incident (July 19, 2024): en.wikipedia.org/wiki/2024_CrowdStrike-related_IT_outages. 8.5 million Windows devices affected, estimated $10 billion in global financial damage.
- CrowdStrike root cause analysis: techtarget.com/whatis/feature/Explaining-the-largest-IT-outage-in-history-and-whats-next. Content Validator logic error, parameter mismatch (21 expected vs. 20 provided).
- CrowdStrike financial impact: edition.cnn.com/2024/07/24/tech/crowdstrike-outage-cost-cause. Fortune 500 losses estimated at $5.4 billion; healthcare sector hardest hit at $1.94 billion.
- Delta Air Lines lawsuit: Wikipedia (same source as #6). $500 million damages claimed; allegations of deploying “untested software updates.”
- Defect Escape Rate target: whatisscrum.org/defect-rate-in-scrum. Recommended target: find 90% of defects before production (10% escape rate).
- DORA as CD Foundation standard: cd.foundation/blog/2025/10/16/dora-5-metrics. Evolution from four to five formal metrics plus reliability quasi-metric.
- DORA 2024 AI findings: devopslaunchpad.com/blog/dora-report-2024. 25% increase in AI adoption showed 1.5% decrease in throughput and 7.2% decrease in stability.