Part of Forge DevKit ecosystem
◇ forge-qa
Tests that trace to requirements
The problem
AI writes tests that test nothing
Unit tests assert against their own mocks. Displays render hardcoded mock data. Suites pass without verifying real behavior.
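A minimal sketch of the failure mode, in plain TypeScript (the mocking helper below is an illustrative stand-in, not a real `vi.fn()`/`jest.fn()` call): a "fake" test mocks the unit under test itself and then asserts the mock's own canned value, so it can never fail.

```typescript
// Illustrative stand-in for a mocking helper (real suites would use vi.fn() or jest.fn()).
function mockReturning<T>(value: T): () => T {
  return () => value;
}

// FAKE: the mock replaces the unit under test, then the assertion checks
// the mock's canned value. This passes no matter how broken the real code is.
const computeTotal = mockReturning(25);
console.assert(computeTotal() === 25, "always passes; verifies nothing");

// REAL: the actual implementation is exercised, including an edge case.
function computeCartTotal(prices: number[]): number {
  return prices.reduce((sum, p) => sum + p, 0);
}
console.assert(computeCartTotal([]) === 0, "empty cart totals zero");
console.assert(computeCartTotal([10, 15]) === 25, "sums line items");
```

The fake variant is what a judge-less generator tends to emit: green checkmarks with zero verified behavior.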
No traceability to requirements
You can't tell which test covers which acceptance criterion. Gaps are invisible.
Test strategy is an afterthought
AI generates random tests. No coverage plan, no prioritization, no framework consistency.
How it works
Setup
Test auditor scans your project: framework, patterns, coverage tooling, maturity level.
Generate
From product artifacts or code analysis — unit, integration, component, E2E, and acceptance tests.
Trace
4-level traceability: AC (acceptance criteria)→unit, UC (use cases)→E2E, UX→component, plus LLM-as-Judge quality checks. Every test maps to a requirement.
Judge
LLM-as-Judge evaluates test quality against rubrics. Catches fake mocks and meaningless assertions.
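The trace step can be pictured as a mapping from requirements to tests. The record shape and requirement IDs below are hypothetical, not forge-qa's actual tagging format; the point is that once every test declares the requirement it covers, uncovered requirements fall out mechanically.

```typescript
// Hypothetical traced-test record: each test declares the requirement it covers.
// AC = acceptance criterion (unit), UC = use case (E2E), UX = UX spec (component).
interface TracedTest {
  file: string;
  requirement: string; // e.g. "AC-3"
  name: string;
}

const tests: TracedTest[] = [
  { file: "coupon.test.ts",  requirement: "AC-3", name: "rejects expired coupon" }, // AC→unit
  { file: "checkout.e2e.ts", requirement: "UC-1", name: "checkout happy path" },    // UC→E2E
  { file: "banner.test.tsx", requirement: "UX-2", name: "error banner renders" },   // UX→component
];

// The traceability matrix: requirements with no traced test are the
// gaps that line coverage alone never surfaces.
const requirements = ["AC-1", "AC-3", "UC-1", "UX-2"];
const covered = new Set(tests.map(t => t.requirement));
const gaps = requirements.filter(r => !covered.has(r));
console.log(gaps); // ["AC-1"]
```

Here `AC-1` has no traced test, so it surfaces as a gap even though every existing test passes.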
Key capabilities
◇4-level traceability
AC→unit tests, UC→E2E tests, UX→component tests, LLM-as-Judge for quality.
◇8+ test frameworks
Vitest, Jest, Playwright, Cypress, Testing Library, Supertest, and more. Auto-detected.
◇LLM-as-Judge
Rubric-based evaluation catches fake tests, meaningless mocks, and missing edge cases.
◇Product artifact integration
When forge-product artifacts exist, tests are generated from requirements; without them, from code analysis.
◇10 execution modes
Unit, integration, component, E2E, acceptance, coverage, plan, generate, quality, upgrade.
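What rubric-based evaluation might look like as data. This is a hypothetical sketch, not forge-qa's actual rubric schema: weighted criteria turn a judge's per-criterion verdicts into a quality score that flags suspect tests instead of just letting them pass.

```typescript
// Hypothetical rubric: weighted criteria an LLM judge scores per test
// (binary verdicts here for simplicity).
interface Criterion { id: string; text: string; weight: number }

const rubric: Criterion[] = [
  { id: "behavior", text: "asserts real behavior, not the mock's canned values", weight: 3 },
  { id: "edges",    text: "covers at least one edge case per branch",            weight: 2 },
  { id: "message",  text: "failure message identifies the broken requirement",   weight: 1 },
];

// Judge verdicts for one test (would come from the LLM in practice).
const verdicts: Record<string, 0 | 1> = { behavior: 1, edges: 0, message: 1 };

const earned = rubric.reduce((sum, c) => sum + c.weight * verdicts[c.id], 0);
const possible = rubric.reduce((sum, c) => sum + c.weight, 0);
const score = earned / possible; // 4/6, below a hypothetical 0.8 threshold
console.log(score >= 0.8 ? "PASS" : "FLAG for review"); // prints "FLAG for review"
```

Weighting matters: a test that aces style criteria but asserts only mock output still scores low, which is exactly the "fake test" case a pass/fail runner cannot catch.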
forge-qa vs Virtuoso / TestSprite
See the full comparison for details.
| Dimension | Virtuoso / TestSprite | Forge DevKit |
|---|---|---|
| Test source | AI guesses from code | Traces to acceptance criteria and use cases |
| Quality check | None — tests just need to pass | LLM-as-Judge evaluates against rubrics |
| Coverage map | Line coverage only | Requirement-level traceability matrix |