QA Testing in Every Sprint: A Practical Framework

There is a version of agile delivery that most teams actually practise, and it looks like this: developers build features in two-week sprints. A testing phase happens at the end, sometimes a dedicated sprint, sometimes a compressed two-week block before go-live. Defects found in the testing phase are triaged and fixed. The project goes live with an accepted list of known issues.
This approach is so common it feels like a standard. It isn’t. It’s a project anti-pattern that predictably generates large defect backlogs, compresses testing under deadline pressure, and produces go-live decisions made on incomplete quality information.
The alternative, embedding QA into every sprint from the first day of the project, is consistently faster from first sprint to go-live, produces higher quality at launch, and costs less in total defect remediation. Here is a practical framework for implementing it.
Sprint Planning: QA Reviews Stories Before Development Starts
Sprint planning is the first place sprint-embedded QA changes how the team works. In a conventional setup, QA is a downstream function. It receives completed work. In a sprint-embedded model, QA contributes to planning by reviewing user stories and acceptance criteria for testability before the sprint begins.
The Testability Review
The practical activity is a QA review of each story in the sprint backlog. The QA specialist asks: Is the acceptance criterion specific enough to write a test case against? What are the edge cases and boundary conditions that the acceptance criterion doesn’t explicitly cover? Are there data dependencies or integration touchpoints that need to be tested? Are any regulatory or compliance requirements associated with this story that need specific test coverage?
Stories that fail the testability review go back to the product owner or business analyst for clarification before development starts. This conversation, which takes fifteen minutes in planning, prevents a scenario that is extremely common in the late-testing model: a developer marks a story complete, QA tests it, and discovers the acceptance criterion was ambiguous enough that the feature could be reasonably interpreted two ways, and the developer implemented the wrong interpretation. Resolving that defect requires the developer to revisit work they considered finished, in a later sprint, when the context is no longer fresh.
The Sprint Test Plan
The output of QA’s sprint planning contribution is a test plan for the sprint: a list of stories, the test cases associated with each, the test data requirements, and any environmental dependencies. This takes two to three hours per sprint. It is foundational to everything that follows.
During the Sprint: Test Case Development Runs Parallel to Development
The most significant structural change in sprint-embedded QA is the timing of test case development. In the late-testing model, test cases are written after the feature is built, often after QA has seen the feature for the first time. This creates a testing process that is, consciously or not, influenced by how the feature was actually implemented rather than what it was supposed to do.
Writing Tests from Acceptance Criteria, Not from the Implementation
In sprint-embedded QA, test cases are developed in the first half of the sprint, while development is underway. The QA specialist writes test cases from the acceptance criteria, not from the implementation. The test case describes the expected behaviour of the system from a business perspective. When development is complete, the test cases are ready to execute immediately.
Communication During the Sprint
This parallel working model requires that developers and QA specialists communicate during the sprint rather than only at handoff. A practical mechanism is a brief daily check-in, five minutes, separate from standup, where the developer flags any implementation decisions that might affect test case design. “I’ve implemented the currency conversion using the ECB daily rate, not real-time rates as the story implied. Is that consistent with the test cases?” This conversation, in the middle of the sprint, takes five minutes and prevents a defect report and rework cycle that would take two days.
Test Data Preparation
Test data preparation also happens during the sprint. A common source of testing delay in late-testing models is discovering at the start of the testing phase that the test data doesn’t exist or doesn’t match the conditions needed to exercise the system. Sprint-embedded QA identifies test data requirements during planning and prepares or requests that data during the sprint, so test execution can begin immediately when development completes.
Sprint Review: Test Results Are Part of the Demo
The sprint review, or sprint demo, is where the team demonstrates completed work to stakeholders. In a sprint-embedded QA model, the demo includes not just the working feature, but the test results that confirm the feature behaves correctly across its acceptance criteria, edge cases, and any integration touchpoints.
Why This Standard Matters
This is a deliberately higher standard than “the feature works.” The demo shows: here is the feature, here are the test cases we wrote against it, here are the results. This does two things. First, it makes quality visible to stakeholders, not just velocity. Business sponsors gain confidence that what they are approving is tested, not just built. Second, it surfaces any quality concerns before the sprint is formally closed, preventing the common pattern where a sprint is declared complete, the team moves on, and a defect is discovered in a later sprint when the original developer has lost context and fixing it is significantly more expensive.
Handling Defects Found in Sprint Review
Defects found during sprint review are handled consistently: they are raised in the defect management system, prioritised, and either resolved within the sprint (if time permits) or added to the backlog for the next sprint. They are never informally noted and forgotten. Sprint-embedded QA requires the same discipline in defect tracking as in test case development. Every defect is recorded, every defect is resolved or formally accepted as a risk.
Definition of Done: QA Sign-Off Is a Gate
The definition of done is one of the most powerful and most abused concepts in agile delivery. Teams write ambitious definitions of done and then quietly abandon them under sprint pressure. For sprint-embedded QA to work, the definition of done must include QA sign-off, and that gate must be held without exception.
What the Gate Looks Like
A definition of done that incorporates QA looks like this: development complete and code reviewed; test cases written and peer-reviewed; test execution complete with no open critical or high defects; test results documented and linked to story; QA specialist has signed off. A story is not done until all of these conditions are met. Not mostly done. Not done except for the test results. Done.
Holding the Gate
Holding this gate is the most difficult management challenge in implementing sprint-embedded QA. Sprint pressure, meaning the desire to show velocity and the reluctance to carry unfinished stories, creates constant incentive to call stories done before QA has signed off. The project manager and the QA specialist need to hold this gate together, with explicit backing from project leadership. A sprint that ends with four stories genuinely done is a better outcome than a sprint that ends with six stories nominally done and two that will generate defects in a later sprint.
The Velocity Objection: Addressing the Most Common Pushback
The objection to sprint-embedded QA is almost universal: “It will slow us down.” It deserves a direct response, because it is based on a misunderstanding of what velocity measures and what slows it.
What Velocity Actually Measures
Sprint velocity measures the rate at which work is called done. If done means tested and signed off, and the team is achieving ten story points per sprint, that is ten points of genuinely complete work. If done means developed but not tested, and the team is achieving fourteen story points per sprint, that fourteen includes work that will generate defects in a later sprint. The defect remediation cost isn’t zero. It appears as unplanned work in a later sprint, compressing the capacity for new development. The apparent velocity advantage of the late-testing model is borrowed from future sprints.
Defect Debt and Its Compounding Cost
This is defect debt, the testing equivalent of technical debt. And like technical debt, it compounds. A defect introduced in sprint two that isn’t found until a testing phase in sprint ten has, in the intervening time, potentially influenced five or six other features that integrated with the defective component. The IBM Systems Sciences Institute’s data is precise: a defect caught within the sprint that created it costs one unit to fix. The same defect caught in a post-development testing phase costs fifteen units. Caught in production, it costs thirty units.
The Evidence
Sprint-embedded QA does not slow velocity. It shifts quality work from a compressed, high-cost phase at the end of the project to distributed, low-cost activity throughout the project. The total project duration is typically shorter. The go-live quality is measurably higher. The post-launch defect rate is substantially lower. We have not encountered a single implementation project where the late-testing model, honestly assessed on total project cost and total defect count, outperformed sprint-embedded QA.
What Sprint-Embedded QA Requires to Work
For completeness, the conditions that enable sprint-embedded QA are worth stating directly, because they are genuine prerequisites, not nice-to-haves.
A Dedicated QA Specialist from Sprint One
A dedicated QA specialist must be on the sprint team from sprint one. Not shared across three projects. Not brought in when there’s something to test. Present in planning, present during the sprint, present in review. The QA specialist is a team member, not a service.
A Defect Management System That Is Actually Used
The team needs a defect management system that is actually used. Test cases, defect records, and test results must be documented and traceable. Verbal defect tracking doesn’t work. Email defect tracking doesn’t work. A proper tool, such as SpiraTeam, Jira, or Azure DevOps, is required.
Leadership That Holds the Definition of Done
Leadership must hold the definition of done. Without explicit leadership support for the QA gate, sprint pressure will erode it within two or three sprints. This is not a technical requirement; it is a governance requirement.
These are not complex prerequisites. They are available to any project that chooses to implement them. The choice not to implement them is a choice to defer quality, and to pay the compounding cost of that deferral at the end of the project, when it is most expensive and most visible.



