Skip to content

feat(agents): align RAI planner with guide, remove scoring, improve UX#1287

Draft
WilliamBerryiii wants to merge 7 commits intomainfrom
feat/rai-planner-guide-alignment
Draft

feat(agents): align RAI planner with guide, remove scoring, improve UX#1287
WilliamBerryiii wants to merge 7 commits intomainfrom
feat/rai-planner-guide-alignment

Conversation

@WilliamBerryiii
Copy link
Copy Markdown
Member

@WilliamBerryiii WilliamBerryiii commented Apr 3, 2026

Description

Implements Issue #1281: RAI Planner Updates — Guide Alignment, Scoring Removal, and UX Improvements.

The RAI Planner agent expands from a 5-phase to a 6-phase workflow, replacing numeric scoring with qualitative assessment, aligning terminology and structure with the Microsoft Responsible AI Impact Assessment Guide, and introducing artifact signing infrastructure. Additionally, this PR establishes a config-driven AI artifact validation pipeline for enforcing footer and disclaimer standards across instruction files. 46 files changed across agent definitions, instructions, prompts, documentation, plugins, collection metadata, config, CI workflows, and scripts.

Key Changes

  • Phase Restructuring (5→6): New Phase 2 (Sensitive Uses Assessment) inserts binary trigger screening with T1/T2/T3 triggers, a restricted-use gate, and depth tier assignment (Basic/Standard/Comprehensive). Subsequent phases renumbered.
  • Scoring Model Replacement: Removed 0–100 weighted composite scoring and likelihood-impact matrices. Replaced with concern levels (Low/Moderate/High), a review quality checklist, maturity indicators, and audience adaptation profiles. Renamed rai-scorecard.mdrai-review-summary.md.
  • Standard Alignment: Replaced "Microsoft RAI Standard v2" references with the "Microsoft Responsible AI Impact Assessment Guide" as the primary principle framework, restructuring from a Goal/Requirement/Tool hierarchy to Guide Sections and Goals by Principle.
  • Security Model Updates: Adopted AI STRIDE extensions with eight AI element types and five trust boundaries. Unified threat ID format to T-RAI-{NNN}.
  • Artifact Signing: Added Sign-RaiArtifacts.ps1 for SHA-256 manifest generation with optional Sigstore cosign keyless signing, backed by 262-line Pester test suite. Installed cosign v3.0.5 in devcontainer.
  • Prompt UX: Added output preferences collection, facilitative language, and expanded context pre-scan to capture coaching and entry-mode prompts.
  • Documentation Accuracy: Updated all 9 RAI documentation pages to reflect phase restructuring, scoring removal, renamed artifacts, and updated terminology.
  • AI Artifact Validation Infrastructure: Added centralized YAML config files (.github/config/disclaimers.yml, .github/config/footer-with-review.yml) defining footer text, human review checkboxes, and tiered artifact classification with scope-aware glob patterns. Created Validate-AIArtifacts.ps1 (612 lines) for CI enforcement, a JSON Schema for config validation, a reusable GitHub Actions workflow (ai-artifact-validation.yml), and a 20-test Pester suite. Wired into pr-validation.yml and the lint:all npm chain.
  • Footer and Disclaimer Alignment: Added two-tier footer/disclaimer system to RAI planning and security instruction files, and corresponding documentation pages, per reviewer feedback.

Related Issue(s)

Closes #1281

Type of Change

Select all that apply:

Code & Documentation:

  • Bug fix (non-breaking change fixing an issue)
  • New feature (non-breaking change adding functionality)
  • Breaking change (fix or feature causing existing functionality to change)
  • Documentation update

Infrastructure & Configuration:

  • GitHub Actions workflow
  • Linting configuration (markdown, PowerShell, etc.)
  • Security configuration
  • DevContainer configuration
  • Dependency update

AI Artifacts:

  • Reviewed contribution with prompt-builder agent and addressed all feedback
  • Copilot instructions (.github/instructions/*.instructions.md)
  • Copilot prompt (.github/prompts/*.prompt.md)
  • Copilot agent (.github/agents/*.agent.md)
  • Copilot skill (.github/skills/*/SKILL.md)

> Note for AI Artifact Contributors:
>
> * Agents: Research, indexing/referencing other project (using standard VS Code GitHub Copilot/MCP tools), planning, and general implementation agents likely already exist. Review .github/agents/ before creating new ones.
> * Skills: Must include both bash and PowerShell scripts. See Skills.
> * Model Versions: Only contributions targeting the latest Anthropic and OpenAI models will be accepted. Older model versions (e.g., GPT-3.5, Claude 3) will be rejected.
> * See Agents Not Accepted and Model Version Requirements.

Other:

  • Script/automation (.ps1, .sh, .py)
  • Other (please describe):

Sample Prompts (for AI Artifact Contributions)

User Request:

Invoke RAI Planner in the VS Code chat pane and use one of three entry prompts:

  • /rai-capture — Start a new conversational RAI assessment from scratch
  • /rai-plan-from-prd — Generate an RAI plan from an existing PRD
  • /rai-plan-from-security-plan — Generate an RAI plan from an existing security plan

Execution Flow:

  1. Phase 1 (Discovery): Collects system context, stakeholders, AI/ML components, and user output preferences.
  2. Phase 2 (Sensitive Uses): Screens T1–T3 binary triggers, evaluates restricted-use gates, assigns depth tier (Basic/Standard/Comprehensive).
  3. Phase 3 (Standards Mapping): Aligns with the Microsoft Responsible AI Impact Assessment Guide and NIST AI RMF subcategories.
  4. Phase 4 (Security Model): Runs AI STRIDE analysis with eight element types and five trust boundaries; emits T-RAI-{NNN} threats.
  5. Phase 5 (Impact Assessment): Evaluates concern levels (Low/Moderate/High) across fairness, reliability, privacy, inclusiveness, transparency, and accountability.
  6. Phase 6 (Backlog Handoff): Produces rai-review-summary.md, dual-format backlog (ADO + GitHub), and optional signed artifact manifest.

Output Artifacts:

  • .copilot-tracking/rai-plans/{session}/state.json — Session state with phase progression
  • .copilot-tracking/rai-plans/{session}/rai-review-summary.md — Qualitative review summary (replaces scored scorecard)
  • .copilot-tracking/rai-plans/{session}/rai-backlog-*.md — Dual-format work item backlog
  • .copilot-tracking/rai-plans/{session}/rai-manifest.json — SHA-256 artifact manifest (optional signing)

Success Indicators:

  • Phase 2 trigger screening produces clear pass/flag verdicts for each T1–T3 trigger
  • Concern levels replace numeric scores throughout (no 0–100 values)
  • All file references use rai-review-summary.md instead of rai-scorecard.md
  • Threat IDs follow T-RAI-{NNN} format consistently
  • Depth tier assignment drives subsequent phase depth

Testing

Validation Status
Sandbox run 002 (full 6-phase RAI assessment) ✅ Pass
Plugin regeneration (npm run plugin:generate) — 14 plugins ✅ Pass (0 lint errors)
Documentation accuracy audit — 21 replacements across 7 doc files ✅ Pass
Frontmatter validation sweep ✅ Pass
Document name consistency sweep ✅ Pass
Standards reference audit — 26 replacements across 8 files ✅ Pass
Terminology grep sweeps (orphaned old references) ✅ Pass (no orphans found)
AI artifact validation — Pester tests (20 tests) ✅ Pass
AI artifact validation — npm run lint:ai-artifacts ✅ Pass (3 files with legitimate advisory warnings)

> Note: Manual testing was performed along side automated validation and sandbox evaluation as the primary verification methods.

GHCP Artifact Maturity

> [!WARNING]
> This PR includes experimental GHCP artifacts that may have breaking changes.

File Type Maturity Notes
.github/agents/rai-planning/rai-planner.agent.md Agent ⚠️ experimental Pre-release only
.github/prompts/rai-planning/rai-capture.prompt.md Prompt ⚠️ experimental Pre-release only
.github/prompts/rai-planning/rai-plan-from-prd.prompt.md Prompt ⚠️ experimental Pre-release only
.github/prompts/rai-planning/rai-plan-from-security-plan.prompt.md Prompt ⚠️ experimental Pre-release only
.github/instructions/rai-planning/rai-identity.instructions.md Instruction ⚠️ experimental Pre-release only
.github/instructions/rai-planning/rai-standards.instructions.md Instruction ⚠️ experimental Pre-release only
.github/instructions/rai-planning/rai-security-model.instructions.md Instruction ⚠️ experimental Pre-release only
.github/instructions/rai-planning/rai-impact-assessment.instructions.md Instruction ⚠️ experimental Pre-release only
.github/instructions/rai-planning/rai-backlog-handoff.instructions.md Instruction ⚠️ experimental Pre-release only
.github/instructions/rai-planning/rai-capture-coaching.instructions.md Instruction ⚠️ experimental Pre-release only
.github/instructions/rai-planning/rai-sensitive-uses-triggers.instructions.md Instruction ⚠️ experimental Pre-release only

GHCP Maturity Acknowledgment

  • I acknowledge this PR includes non-stable GHCP artifacts
  • Non-stable artifacts are intentional for this change

Checklist

Required Checks

  • Documentation is updated (if applicable)
  • Files follow existing naming conventions
  • Changes are backwards compatible (if applicable) (N/A — experimental GHCP artifacts; phase restructuring is intentional)
  • Tests added for new functionality (if applicable)

AI Artifact Contributions

  • Used /prompt-analyze to review contribution
  • Addressed all feedback from prompt-builder review
  • Verified contribution follows common standards and type-specific requirements

Required Automated Checks

The following validation commands must pass before merging:

  • Markdown linting: npm run lint:md
  • Spell checking: npm run spell-check
  • Frontmatter validation: npm run lint:frontmatter
  • Skill structure validation: npm run validate:skills
  • Link validation: npm run lint:md-links
  • PowerShell analysis: npm run lint:ps
  • AI artifact validation: npm run lint:ai-artifacts

> Repository template used: .github/PULL_REQUEST_TEMPLATE.md

- restructure Phase 2 into binary trigger assessment with T1/T2/T3 tiers
- replace likelihood-impact scoring with restricted-use gate framework
- adopt AI STRIDE extensions with eight AI element types in Phase 4
- unify threat IDs to T-RAI-{NNN} format across all phases
- add rai-sensitive-uses-triggers.instructions.md for Phase 2 depth
- update collection manifests, plugins, and documentation accuracy
- add Sign-RaiArtifacts.ps1 signing script with Pester tests

Closes #1281

🚀 - Generated by Copilot
@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Apr 3, 2026

Dependency Review

✅ No vulnerabilities or license issues or OpenSSF Scorecard issues found.

OpenSSF Scorecard

PackageVersionScoreDetails
actions/actions/checkout de0fac2e4500dabe0009e67214ff5f5447ce83dd 🟢 6
Details
CheckScoreReason
Maintained⚠️ 23 commit(s) and 0 issue activity found in the last 90 days -- score normalized to 2
Code-Review🟢 10all changesets reviewed
Binary-Artifacts🟢 10no binaries found in the repo
Dangerous-Workflow🟢 10no dangerous workflow patterns detected
CII-Best-Practices⚠️ 0no effort to earn an OpenSSF best practices badge detected
Token-Permissions⚠️ 0detected GitHub workflow tokens with excessive permissions
Packaging⚠️ -1packaging workflow not detected
License🟢 10license file detected
Fuzzing⚠️ 0project is not fuzzed
Signed-Releases⚠️ -1no releases found
Pinned-Dependencies🟢 3dependency not pinned by hash detected -- score normalized to 3
Security-Policy🟢 9security policy file detected
Branch-Protection🟢 6branch protection is not maximal on development and all release branches
SAST🟢 8SAST tool detected but not run on all commits
actions/actions/upload-artifact bbbca2ddaa5d8feaa63e36b76fdaad77386f024f 🟢 5.7
Details
CheckScoreReason
Code-Review🟢 10all changesets reviewed
Maintained🟢 54 commit(s) and 2 issue activity found in the last 90 days -- score normalized to 5
Dangerous-Workflow🟢 10no dangerous workflow patterns detected
Binary-Artifacts🟢 10no binaries found in the repo
Packaging⚠️ -1packaging workflow not detected
CII-Best-Practices⚠️ 0no effort to earn an OpenSSF best practices badge detected
Token-Permissions⚠️ 0detected GitHub workflow tokens with excessive permissions
Pinned-Dependencies⚠️ 1dependency not pinned by hash detected -- score normalized to 1
Fuzzing⚠️ 0project is not fuzzed
License🟢 10license file detected
Signed-Releases⚠️ -1no releases found
Security-Policy🟢 9security policy file detected
Branch-Protection⚠️ 0branch protection not enabled on development/release branches
SAST🟢 10SAST tool is run on all commits

Scanned Files

  • .github/workflows/ai-artifact-validation.yml

@codecov-commenter
Copy link
Copy Markdown

codecov-commenter commented Apr 3, 2026

Codecov Report

❌ Patch coverage is 57.89474% with 104 lines in your changes missing coverage. Please review.
✅ Project coverage is 86.94%. Comparing base (84ddd5d) to head (3786291).

Files with missing lines Patch % Lines
scripts/linting/Validate-AIArtifacts.ps1 47.42% 102 Missing ⚠️
scripts/security/Sign-RaiArtifacts.ps1 96.22% 2 Missing ⚠️
Additional details and impacted files

Impacted file tree graph

@@            Coverage Diff             @@
##             main    #1287      +/-   ##
==========================================
- Coverage   87.72%   86.94%   -0.79%     
==========================================
  Files          61       63       +2     
  Lines        9320     9567     +247     
==========================================
+ Hits         8176     8318     +142     
- Misses       1144     1249     +105     
Flag Coverage Δ
pester 84.28% <57.89%> (-1.05%) ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines Coverage Δ
scripts/security/Sign-RaiArtifacts.ps1 96.22% <96.22%> (ø)
scripts/linting/Validate-AIArtifacts.ps1 47.42% <47.42%> (ø)

... and 1 file with indirect coverage changes

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Phase 5 artifact templates (control-surface-catalog.md, evidence-register.md, rai-tradeoffs.md) have YAML frontmatter but no disclaimer preamble. The plan template correctly includes one. These files persist to disk and may be shared standalone.

Suggested change — Add after frontmatter in all three templates:

Suggested Resolution — Add a footer notice to both templates:

ADO (HTML) — append before closing :

GitHub (Markdown) — append at the end of the issue body template:


Generated by RAI Planner AI assistant. All content is suggestive and requires
validation by qualified professionals before implementation. This is not legal,
compliance, or ethics advice.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for flagging the gap in Phase 5 artifact templates. We addressed this by adding an AI-content note to all three templates (Control Surface Catalog, Evidence Register, and RAI Tradeoffs).

We classified Control Surface Catalog and Evidence Register as agentic artifacts — they're consumed by later pipeline phases rather than read standalone by stakeholders — so they receive only the AI-content note. RAI Tradeoffs is human-facing and includes both the note and a human review checkbox.

The footer classification is documented in the new Artifact Attribution and Review section of the handoff pipeline docs.

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The HTML template for ADO work items and the markdown template for GitHub issues contain structured fields (Context, RAI Principle, Threat, Control Surface, Acceptance Criteria) but no indication that the content was AI-generated. Once these work items land in ADO or GitHub, they become standalone artifacts completely disconnected from the RAI session.

Developers, PMs, or compliance reviewers encountering these items in their backlog would not know:

The content was generated by an AI tool
The priorities, acceptance criteria, and remediation horizons are suggestions requiring validation
The items should not be treated as authoritative compliance directives
While the templates use "Suggested" prefixes on priority and horizon fields (good), the body text describing controls and acceptance criteria reads as authoritative statements.

Current Code (ADO HTML template):

Suggested Resolution — Add a footer notice to both templates:

ADO (HTML) — append before closing :

GitHub (Markdown) — append at the end of the issue body template:

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good catch on the standalone work item gap. Both the ADO (HTML) and GitHub (Markdown) work item templates now include the AI-content note and a human review checkbox. The ADO template uses HTML formatting to match the surrounding template structure; the GitHub template uses blockquote markdown.

These are classified as human-facing artifacts since they land in backlogs where developers and compliance reviewers encounter them independently of the RAI session.

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Handoff Summary Format section

The handoff summary contains work item counts, priority breakdowns, remediation horizons, cross-references, and a "Suggested Review Status" field. A stakeholder receiving this document without the session context could interpret the structured tables and status designations as authoritative assessments rather than AI-generated suggestions.

Suggested Resolution — Insert a disclaimer blockquote between the header metadata and the work item summary:

RAI Backlog Handoff Summary

System: {system-name}

Date: {YYYY-MM-DD}

Suggested Review Status: {Ready for stakeholder review / Additional attention suggested / Significant areas need further consideration}

This handoff was prepared by an AI assistant to support responsible AI planning.
All items are suggestions for human evaluation and do not constitute legal,
compliance, or ethics advice. Organizational RAI policies and applicable
regulations take precedence.

Work Item Summary

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agreed — the handoff summary is the most stakeholder-facing deliverable in the pipeline. We gave it the full three-tier treatment: AI-content note, human review checkbox, and the complete verbatim disclaimer. This is the strongest attribution level in the system, applied only to the Handoff Summary and Compact Handoff Summary.

We also removed the older qualifier prose from the RAI Review Summary template since the new AI-content note supersedes it.

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Handoff Summary Format section

The review summary has a disclaimer but the separate Handoff Summary — the most likely stakeholder-facing deliverable — does not.

Suggested change — Insert between header metadata and Work Item Summary:

> This handoff was prepared by an AI assistant to support responsible AI planning.
> All items are suggestions for human evaluation and do not constitute legal,
> compliance, or ethics advice. Organizational RAI policies and applicable
> regulations take precedence.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The compact handoff summary template in rai-identity.instructions.md already had the disclaimer applied from earlier work on this branch. We verified it includes the full three-tier treatment (AI-content note, human review checkbox, and complete disclaimer), consistent with the handoff summary in rai-backlog-handoff.instructions.md. No additional changes were needed here.

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Disclaimer and Attribution Protocol section

For long-running sessions, consider adding a brief reminder at hard gate checkpoints (Phases 2, 3, and 6): "Reminder: All findings are suggestions for qualified professional review." Current coverage is solid; this would reinforce it during extended interactions.

Copy link
Copy Markdown
Member Author

@WilliamBerryiii WilliamBerryiii Apr 3, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is baked into all the "knows" exit points from the system, but I'll add it at the gate checks.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actually, I may make this a mandatory step in all multi-phase/protocol-based workflows in the repo, regardless. and validate it presence in CI

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm actually gonna skip the gate checks. We present at entrance, exit which is more than sufficient given that we rely/delegate down to the hosting platform VSCode/GHCP. There is only a single user driving the session, because GHCP requires per user authentication, and over reminding them of the disclaim has typically been deemed to be excessive in most instances (e.g. EULA, etc.) ... typically products and product experiences require this only on first agreement (with a specific user) and again on term changes of the underlying agreement.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Per the discussion in this thread, we're not adding gate checkpoint reminders. The existing disclaimer coverage at session start, exit points, and session resumption provides sufficient reinforcement. As noted above, over-reminding within a single authenticated user session tends to be excessive — products and experiences typically require agreement at first use and again on term changes rather than at every internal checkpoint.

Copy link
Copy Markdown
Collaborator

@raymond-nassar raymond-nassar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

First impressions on this draft PR:

The three-part disclaimer formulation exceeds the baseline requirement: (1) covers legal, compliance, and ethics; (2) frames all outputs as suggestions; (3) establishes organizational policy precedence.

Replacing numerical scores with qualitative concern levels and maturity indicators strengthens the disclaimer posture. Qualitative assessments naturally invite human judgment.

Look forward to seeing this in action.

Copy link
Copy Markdown
Collaborator

@raymond-nassar raymond-nassar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Files affected: All files containing the current disclaimer blockquote:

.github/agents/rai-planning/rai-planner.agent.md
.github/instructions/rai-planning/rai-identity.instructions.md (Session Start Display, Exit Point Reminder, State Creation)
.github/prompts/rai-planning/rai-capture.prompt.md
.github/prompts/rai-planning/rai-plan-from-prd.prompt.md
.github/prompts/rai-planning/rai-plan-from-security-plan.prompt.md

Current text:

This tool provides structured prompts and frameworks to support responsible AI planning. It is not a substitute for professional legal, compliance, or ethics review. All outputs are suggestions for human evaluation. Organizational RAI policies and applicable regulations take precedence.

Required text:

This agent is an assistive tool only. It does not provide legal, regulatory, or compliance advice and does not replace Responsible AI review boards, ethics committees, legal counsel, compliance teams, or other qualified human reviewers. The output consists of suggested actions and considerations to support a user's own internal review and decision‑making. All RAI assessments, sensitive use screenings, security models, and mitigation recommendations generated by this tool must be independently reviewed and validated by appropriate legal and compliance reviewers before use. Outputs from this tool do not constitute legal approval, compliance certification, or regulatory sign‑off.

Rationale:

The current disclaimer is too general and does not adequately protect against misinterpretation. The replacement version:

  • Enumerates specific professional roles the tool does not replace (RAI review boards, ethics committees, legal counsel, compliance teams)
  • Names the specific output types requiring validation (RAI assessments, sensitive use screenings, security models, mitigation recommendations)
  • Explicitly excludes certification and sign-off — preventing misinterpretation as approval
  • States a mandatory validation requirement ("must be independently reviewed and validated") rather than a softer suggestion
  • Addresses regulatory advice directly ("does not provide legal, regulatory, or compliance advice")
  • This change should be applied everywhere the current disclaimer blockquote appears, including the Session Start Display and Exit Point Reminder sections in rai-identity.instructions.md which govern when the disclaimer is shown to users.

- update disclaimer in rai-planner agent, three RAI prompt files, and rai-identity instructions
- regenerate plugin outputs and reformat doc tables

⚖️ - Generated by Copilot
@WilliamBerryiii
Copy link
Copy Markdown
Member Author

Files affected: All files containing the current disclaimer blockquote:

.github/agents/rai-planning/rai-planner.agent.md
.github/instructions/rai-planning/rai-identity.instructions.md (Session Start Display, Exit Point Reminder, State Creation)
.github/prompts/rai-planning/rai-capture.prompt.md
.github/prompts/rai-planning/rai-plan-from-prd.prompt.md
.github/prompts/rai-planning/rai-plan-from-security-plan.prompt.md

Current text:

This tool provides structured prompts and frameworks to support responsible AI planning. It is not a substitute for professional legal, compliance, or ethics review. All outputs are suggestions for human evaluation. Organizational RAI policies and applicable regulations take precedence.

Required text:

This agent is an assistive tool only. It does not provide legal, regulatory, or compliance advice and does not replace Responsible AI review boards, ethics committees, legal counsel, compliance teams, or other qualified human reviewers. The output consists of suggested actions and considerations to support a user's own internal review and decision‑making. All RAI assessments, sensitive use screenings, security models, and mitigation recommendations generated by this tool must be independently reviewed and validated by appropriate legal and compliance reviewers before use. Outputs from this tool do not constitute legal approval, compliance certification, or regulatory sign‑off.

Rationale:

The current disclaimer is too general and does not adequately protect against misinterpretation. The replacement version:

  • Enumerates specific professional roles the tool does not replace (RAI review boards, ethics committees, legal counsel, compliance teams)
  • Names the specific output types requiring validation (RAI assessments, sensitive use screenings, security models, mitigation recommendations)
  • Explicitly excludes certification and sign-off — preventing misinterpretation as approval
  • States a mandatory validation requirement ("must be independently reviewed and validated") rather than a softer suggestion
  • Addresses regulatory advice directly ("does not provide legal, regulatory, or compliance advice")
  • This change should be applied everywhere the current disclaimer blockquote appears, including the Session Start Display and Exit Point Reminder sections in rai-identity.instructions.md which govern when the disclaimer is shown to users.

This has been resolved.

@raymond-nassar
Copy link
Copy Markdown
Collaborator

The disclaimer text was strengthened across all existing locations in 21c2ee7 . However, the original gap I flagged remains: the Handoff Summary template (the stakeholder-facing deliverable generated in Phase 6 via rai-backlog-handoff.instructions.md) still does not include the disclaimer blockquote.

Given that the new verbatim text explicitly states "Outputs from this tool do not constitute legal approval, compliance certification, or regulatory sign‑off", the handoff document is arguably the most critical place for this disclaimer to appear, since it's the artifact most likely to be shared outside the immediate user session with review boards, legal, or leadership.

Suggested change: add between the handoff header metadata and the Work Item Summary.

@WilliamBerryiii WilliamBerryiii requested a review from mspuckit April 3, 2026 17:00
@WilliamBerryiii
Copy link
Copy Markdown
Member Author

The disclaimer text was strengthened across all existing locations in 21c2ee7 . However, the original gap I flagged remains: the Handoff Summary template (the stakeholder-facing deliverable generated in Phase 6 via rai-backlog-handoff.instructions.md) still does not include the disclaimer blockquote.

Given that the new verbatim text explicitly states "Outputs from this tool do not constitute legal approval, compliance certification, or regulatory sign‑off", the handoff document is arguably the most critical place for this disclaimer to appear, since it's the artifact most likely to be shared outside the immediate user session with review boards, legal, or leadership.

Suggested change: add between the handoff header metadata and the Work Item Summary.

Agree and I am addressing this now; but I am still trying to figure out how to do this tastefully. My main area of consideration here is that the handoff artifacts, in their "intended use" are to be persisted locally (not saying they WILL be, but as HVE Core is currently designed, the "intended use" is user specific local storage). Local artifacts, just like GHCP generating code or documentation generation are intended for human review.

As currently designed the agent receivers of these handoff artifacts (the backlog agents) must be operationalized by the user explicitly, the artifacts passed to them, and they are backed by 3 operational modalities:

  • Full: after human approval, execute all operations without pausing for confirmation
  • Partial (default): after human approval, pause for confirmation on destructive or high-risk mutations; proceed autonomously on safe operations
  • Manual: Pause for human confirmation on every mutating operation

All three operational modalities have multi-step human interaction requirements, and there is no masked workflows happening. The human is identifying and selecting a specific agent for execution, and any delegation (only available to subagents), only has the bounded context of the original agent's human request.

I think at the end of the day, the thing we really care about here from a system/agent operations perspective is to ensure that a human has reviewed the artifacts that are intended for other humans to consume. If we can agree on that, then I think the most important thing about AI generation attribution is to provide, as part of the disclaim footer, a markdown checkbox that offers, "has" or "has not" been reviewed by a human, with a default selection of "has not". This sends a much stronger signal, of attestation of review ... which is really what we are trying to set the behavioral outcome for. Consuming agents can also gate on this field ... as an added benefit.

WilliamBerryiii and others added 2 commits April 3, 2026 14:10
…system

- remove human review checkbox from agentic artifacts (Control Surface
  Catalog, Evidence Register) in impact assessment instructions
- remove superseded qualifier prose from RAI Review Summary template
  in backlog handoff instructions
- add Artifact Attribution and Review section to handoff pipeline docs
  with footer classification table
- add footer classification notes to Phase 5 and Phase 6 outputs in
  phase reference docs
- add conversational vs persisted disclaimer note to agent overview docs

📝 - Generated by Copilot
@github-actions github-actions bot mentioned this pull request Apr 3, 2026
@WilliamBerryiii
Copy link
Copy Markdown
Member Author

Artifact Attribution and Review — Summary of Changes

Thank you for the thorough review, @raymond-nassar. We implemented a two-tier attribution system across the RAI planning artifact templates and added documentation to support the new conventions.

Tier 1: AI-Content Note

All persisted artifacts now include an AI-content note adapted from the Microsoft Learn per-article pattern:

Note — The author created this content with assistance from AI. All outputs should be reviewed and validated before use.

Tier 2: Full Disclaimer

The handoff summary — the primary stakeholder-facing deliverable — includes the complete verbatim disclaimer after the AI-content note and human review checkbox.

Human Review Checkbox

Human-facing artifacts include a review checkbox to track validation status:

  • Reviewed and validated by a human reviewer

Artifact Classification

Artifact AI-Content Note Human Review Checkbox Full Disclaimer
Control Surface Catalog
Evidence Register
RAI Tradeoffs
ADO Work Items
GitHub Issues
RAI Review Summary
Transparency Note Outline
Monitoring Summary
Handoff Summary
Compact Handoff Summary

Control Surface Catalog and Evidence Register are classified as agentic artifacts (consumed by later pipeline phases) and receive only the AI-content note. All other artifacts are human-facing and include the review checkbox.

Files Changed

Instruction files:

  • rai-impact-assessment.instructions.md — corrected footer placement on agentic artifacts
  • rai-backlog-handoff.instructions.md — removed superseded qualifier from RAI Review Summary

Documentation:

  • docs/agents/rai-planning/handoff-pipeline.md — new "Artifact Attribution and Review" section with footer classification table
  • docs/agents/rai-planning/phase-reference.md — footer classification notes in Phase 5 and Phase 6 outputs
  • docs/agents/rai-planning/agent-overview.md — note distinguishing conversational disclaimers from persisted artifact footers

- add centralized config files for disclaimers and footers with human review
- add JSON schema for config file validation
- add Validate-AIArtifacts.ps1 with scope-filtered artifact classification
- add Pester tests (20 passing) for all validation functions
- add ai-artifact-validation.yml reusable workflow gated in pr-validation
- add lint:ai-artifacts and validate:ai-artifacts npm scripts

🔧 - Generated by Copilot
@WilliamBerryiii
Copy link
Copy Markdown
Member Author

AI Artifact Validation Infrastructure — Update Summary

This latest commit (c0396f45) adds a config-driven CI validation pipeline for enforcing footer and disclaimer standards across AI artifact files (instructions, agents, prompts, skills).

What was added

  • Centralized config files.github/config/disclaimers.yml and .github/config/footer-with-review.yml define the canonical footer text, human review checkbox, and tiered artifact classification with scope-aware glob patterns. These are the single source of truth for what text is required and where.
  • Validation scriptscripts/linting/Validate-AIArtifacts.ps1 (612 lines) classifies files into tiers (agentic, human-facing, human-facing-with-disclaimer), checks for required footers and disclaimers, and reports missing content with file paths and line-level detail.
  • JSON Schemascripts/linting/schemas/ai-artifact-config.schema.json validates the structure of both YAML config files.
  • CI workflow.github/workflows/ai-artifact-validation.yml (reusable, with soft-fail input) wired into pr-validation.yml as a gating check.
  • Pester test suitescripts/tests/linting/Validate-AIArtifacts.Tests.ps1 with 20 tests covering config loading, footer matching, scope filtering, artifact reference discovery, and end-to-end compliance.
  • npm scriptslint:ai-artifacts (gating) and validate:ai-artifacts (advisory) added; lint:all chain updated.

Expansion plans

The current scope filter targets rai-planning and security instruction files — the two areas where footers and disclaimers were added in this PR. The infrastructure is designed to expand incrementally:

  1. Add new scopes — When other focus areas (e.g., design-thinking, ado, coding-standards) adopt footers, add their glob patterns to the scope arrays in footer-with-review.yml. No script changes needed.
  2. Add new disclaimer sets — New disclaimer text blocks can be added to disclaimers.yml with their own applies-to patterns. The validation script already supports multiple disclaimer definitions.
  3. Add new tiers — The three-tier classification (agentic → human-facing → human-facing-with-disclaimer) can be extended by adding new entries to the config YAML and schema.
  4. Cross-artifact-type validation — The script already handles .instructions.md, .agent.md, .prompt.md, and SKILL.md files. Expanding scope patterns is sufficient to cover new directories.

The goal is for each focus area to opt in by adding its scope patterns to the config files — keeping validation centralized while allowing incremental adoption across the codebase.

🔧 - Generated by Copilot

@WilliamBerryiii
Copy link
Copy Markdown
Member Author

Correction to previous comment

The scope in footer-with-review.yml has been narrowed to rai-planning only. The previous comment incorrectly stated security instruction files were in scope — no security files were modified in this PR, and the security scope patterns have been removed from the config.

Updated scope: .github/instructions/rai-planning/** only.

Updated expansion plan: When other focus areas (e.g., security, design-thinking, ado, coding-standards) adopt footers, add their glob patterns to the scope arrays in footer-with-review.yml. No script changes needed — each area opts in by adding its scope patterns to the config files.

Validation results after fix: 2 files / 7 issues (down from 3 files / 9 issues), all within rai-planning.

🔧 - Generated by Copilot

…iling newline

Security instruction files were not modified in this PR. Remove
.github/instructions/security/** scope patterns from footer-with-review.yml
to avoid false positive validation warnings. Fix missing trailing newline
in package.json.

🔧 - Generated by Copilot
- switch Find-ArtifactReferences from content-based to filename-based matching
- update Pester tests for filename-based artifact matching logic
- fix BOM encoding and empty catch blocks for PSScriptAnalyzer
- update Docusaurus collection card counts for rai-planning
- auto-fix markdown table formatting in handoff-pipeline

🔧 - Generated by Copilot
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Align RAI Planner with Impact Assessment Guide and improve UX

3 participants