skills/dpa-checklist-review/test-plan.md

Acceptance Test Plan — DPA Checklist Review v1.0.0

Skill summary

Reviews Data Processing Agreements (DPAs) against multiple regulatory regimes: GDPR, US state privacy laws, HIPAA BAA, and general commercial. Supports regulatory_regime, party_role (controller / processor), data_categories, international_transfer_context, and prior_agreements inputs. Produces a structured report with checklist-style coverage of each in-scope provision.

Test corpus requirements

Source 8–12 anonymized DPAs covering:

At least 2 GDPR-context DPAs (EU/UK data subjects involved): one controller-to-processor, one processor-to-sub-processor.
At least 2 US state privacy DPAs (CCPA / CPRA, CDPA, CPA, CTDPA, UCPA): mix of vendor-side and customer-side.
At least 2 HIPAA BAAs (covered entity engaging a business associate, with PHI involved).
At least 1 general-commercial DPA (no specific privacy regime; data-handling baseline only).
At least 1 multi-regime DPA (covers GDPR + US state privacy together — common for multi-national vendors).
At least 1 DPA with notable international-transfer mechanism (Standard Contractual Clauses, UK IDTA, supplementary measures post-Schrems II).
At least 1 routine, market-standard DPA to confirm baseline calibration.

Run each DPA against the appropriate regulatory_regime input. For multi-regime tests, run twice to verify regime-specific calibration in each pass.

Test scenarios

Inputs: A GDPR-context DPA where the user's organization is the controller engaging a processor. Inputs: regulatory_regime: gdpr, party_role: controller.

Expected output structure:

Markdown report with sections: "Bottom line", "Critical issues", "Material issues", "Minor issues / observations", "GDPR-specific checklist", "Recommended position", "What this skill does not do".
The GDPR-specific checklist covers Article 28(3) requirements: subject matter and duration, nature and purpose, type of personal data, categories of data subjects, rights and obligations, processor obligations enumeration, sub-processor approval, technical and organizational measures, etc.

Expected calibration:

0–2 critical findings (typically only if the DPA omits Article 28(3) elements outright).
3–7 material findings on Article 28(3) elements that are present but inadequately specified.
5–12 minor findings on stylistic or non-material concerns.
"Bottom line" leads with controller-perspective recommendation.

Edge cases to verify:

Skill identifies missing Article 28(3) elements explicitly.
Skill addresses sub-processor approval mechanism (is it general-prior-authorization or specific-prior-authorization?) at appropriate severity.
Skill addresses international-transfer mechanism if present (SCCs, IDTA, derogations, supplementary measures).
Skill addresses Article 32 technical and organizational measures.

Pass criteria:

Structural pass: All required sections including GDPR-specific checklist present.
Calibration pass: Reviewing attorney with GDPR experience confirms Article 28(3) coverage and severity calibration.

Inputs: GDPR-context DPA where the user's organization is the processor engaging a sub-processor. Inputs: regulatory_regime: gdpr, party_role: processor.

Expected output structure: Same as Scenario 1, with calibration shifted to processor-perspective concerns: maintaining Article 28(4) flow-down obligations, sub-processor's obligations matching or exceeding processor's obligations to controller, controller's right to audit sub-processor.

Expected calibration:

Findings address Article 28(4) flow-down adequacy.
Findings address sub-processor's right to engage further sub-processors (sub-sub-processors) and the visibility cascade.
"Bottom line" addresses whether the sub-processor's commitments are adequate to satisfy the processor's commitments to its controller.

Edge cases to verify:

Skill differentiates controller and processor party roles in the analysis.
Skill addresses Article 28(4) requirements specifically.

Pass criteria: As above with processor-role-aware verification.

Scenario 3: US state privacy DPA

Inputs: A DPA addressing US state privacy obligations (CCPA / CPRA / CDPA / CPA / CTDPA / UCPA). Input: regulatory_regime: us-state-privacy, with applicable jurisdictions specified if known.

Expected output structure: Same overall structure with section: "US state privacy-specific checklist" covering: service-provider / processor designations under each statute, prohibition on selling / sharing, sub-contractor flow-down, deletion rights, consumer-request mechanisms.

Expected calibration:

0–2 critical findings.
3–6 material findings on state-specific requirements that vary across the regimes.
5–10 minor findings.

Edge cases to verify:

Skill addresses the "service provider" / "processor" / "contractor" distinctions across CCPA/CPRA/CDPA/CPA terminology variations.
Skill identifies cross-state harmonization or harmonization gaps.
Skill addresses sale-and-share restrictions.

Pass criteria: As above with US-state-privacy-aware verification.

Scenario 4: HIPAA BAA

Inputs: A HIPAA BAA where covered entity engages a business associate. Input: regulatory_regime: hipaa-baa, party_role as appropriate.

Expected output structure: Same overall structure with section: "HIPAA BAA-specific checklist" covering: §164.504(e)(2) required terms, breach notification, sub-contractor flow-down, return-or-destroy on termination, audit rights.

Expected calibration:

0–2 critical findings (typically only if §164.504(e)(2) elements are missing or substantially deficient).
3–6 material findings on supporting provisions.
5–10 minor findings.
"Bottom line" addresses §164.504(e)(2) compliance baseline.

Edge cases to verify:

Skill addresses each §164.504(e)(2) required term explicitly.
Skill addresses breach-notification timing (HIPAA's 60-day floor; HHS reporting if applicable).
Skill addresses minimum-necessary use.
Skill flags reuse-of-PHI provisions at appropriate severity.

Pass criteria: As above with HIPAA-specific verification.

Scenario 5: General commercial DPA (no specific regime)

Inputs: A general commercial DPA where no specific regulatory regime applies but data handling is in scope. Input: regulatory_regime: general-commercial.

Expected output structure: Same overall structure but without regime-specific checklist; the analysis focuses on baseline data-handling provisions.

Expected calibration:

0–1 critical findings.
2–5 material findings on data-handling baseline (security commitments, breach notification, return-on-termination).
4–8 minor findings.
"Bottom line" notes whether the general-commercial baseline is sufficient or whether a regime-specific layer is needed.

Edge cases to verify:

Skill recognizes that "general commercial" is the correct fit (not silently applying GDPR or HIPAA criteria).
Skill recommends adding regime-specific provisions if the data handling implicates a regulated category.

Pass criteria: As above with general-commercial scoping verification.

Inputs: A DPA addressing both GDPR and US state privacy. Run twice: once with regulatory_regime: gdpr and once with regulatory_regime: us-state-privacy.

Expected output structure: Two reports — one calibrated to GDPR analysis, one calibrated to US state privacy analysis. The reports address the same document but surface different findings.

Expected calibration:

Findings differ across the two passes — GDPR-pass surfaces Article 28(3) gaps; US-state-privacy-pass surfaces sale-restriction / consumer-rights gaps.
Common findings (e.g., breach notification timing) are flagged in both passes with appropriate regime-specific framing.

Edge cases to verify:

Skill does not lapse into the wrong regime mid-report.
Skill addresses cross-regime conflicts (e.g., if GDPR requires prior approval for sub-processors but the DPA's US state privacy approach allows general approval).

Pass criteria: As above with multi-regime verification on both passes.

Scenario 7: International transfer mechanism

Inputs: A DPA with explicit international transfer mechanism (SCCs, UK IDTA, supplementary measures). Input: regulatory_regime: gdpr, international_transfer_context: <description>.

Expected output structure: Includes a focused section on transfer-mechanism adequacy.

Expected calibration:

Findings address SCC selection adequacy (Module 2 controller-to-processor; Module 3 processor-to-processor; etc.) for the data-flow context.
Findings address supplementary measures (Schrems II considerations).
Findings address transfer impact assessment if applicable.

Pass criteria: Reviewing attorney with international-transfer experience confirms calibration.

Refusal scenarios

Refusal 1: Document is not a DPA

Input: An MSA, NDA, or other contract type misidentified as a DPA.

Expected behavior:

Skill identifies that the document is not (primarily) a DPA.
Skill recommends the appropriate alternative skill.

Pass criteria: Explicit refusal with cross-pointer.

Refusal 2: Regime selection conflicts with document content

Input: A HIPAA BAA with regulatory_regime: gdpr selected, or a clearly-GDPR DPA with regulatory_regime: hipaa-baa selected.

Expected behavior:

Skill identifies the mismatch between the input regime and the document's apparent regime.
Skill suggests the correct regime selection or proceeds with explicit caveat.

Pass criteria: Skill detects the mismatch rather than silently producing miscalibrated output.

Cross-cutting verification

No invented authorities. Articles, regulations, and statutes cited are real.
No enforceability opinions on regulatory compliance ("this satisfies GDPR Article 28" is an opinion the skill should not assert; "this addresses each Article 28(3) element" is an observation the skill can support).
Recommended language is operationally usable.
"What this skill does not do" enumeration present (typically: regulatory-compliance certifications, jurisdiction-specific enforceability, transfer-impact-assessment full execution).
Citations resolve.

Pass / fail decision

DPA Checklist Review v1.0.0 passes acceptance testing when:

All 7 test scenarios pass structural checks.
All 7 test scenarios pass calibration evaluation by a reviewing attorney with regime-specific experience (GDPR-aware reviewer for GDPR scenarios; US-privacy-aware reviewer for US state privacy scenarios; HIPAA-aware reviewer for HIPAA BAA scenarios — possibly different reviewers for different scenarios).
Both refusal scenarios trigger the documented refusal behavior.
Cross-cutting verification passes on every scenario.

Reviewer notes

DPA Checklist Review touches several distinct regulatory regimes; the reviewing attorney should have practical experience with the regime they are reviewing. For multi-regime testing, multiple reviewers are recommended (one per regime).

Specific competencies:

GDPR reviewer: Article 28 / 32 / 35 / 44 et seq. familiarity; SCC module selection; Schrems II supplementary measures; UK IDTA distinct from EU SCCs.
US state privacy reviewer: CCPA / CPRA service-provider / contractor / processor distinctions; cross-state harmonization; sale-and-share restrictions.
HIPAA reviewer: §164.504(e)(2) required-terms enumeration; breach notification mechanics; covered-entity vs. business-associate vs. sub-contractor cascade.

Calibration assessment is documented in test-results/dpa-checklist-review-v1.0.0/calibration-assessment.md, with separate sub-files for each regime if multiple reviewers participate.

Acceptance Test Plan — DPA Checklist Review v1.0.0

Skill summary

Test corpus requirements

Test scenarios

Scenario 1: GDPR DPA, controller-to-processor

Scenario 2: GDPR DPA, processor-to-sub-processor

Scenario 3: US state privacy DPA

Scenario 4: HIPAA BAA

Scenario 5: General commercial DPA (no specific regime)

Scenario 6: Multi-regime DPA (GDPR + US state privacy)

Scenario 7: International transfer mechanism

Refusal scenarios

Refusal 1: Document is not a DPA

Refusal 2: Regime selection conflicts with document content

Cross-cutting verification

Pass / fail decision

Reviewer notes