Aguardic logoAguardic

AIUC-1 Safety Controls

by AguardicOfficial·v1.0.0

Enforce AIUC-1 Domain C requirements — prevent harmful, out-of-scope, and high-risk AI outputs. Includes output vulnerability prevention and risk monitoring.

About This Policy Template

Comprehensive AIUC-1 Domain C compliance pack covering controls C003–C008. Prevents harmful AI outputs including self-harm content, hostile responses, offensive or discriminatory language, deceptive content, and high-risk advice without disclaimers. Blocks out-of-scope outputs like political discussion and unauthorized domain advice. Detects output vulnerabilities including SQL injection, XSS, unsafe URLs, and command injection in AI-generated code. Flags high-risk recommendations for human review and monitors for anomalous output patterns. Essential for organizations deploying customer-facing or safety-critical AI systems.

Policy Rules(14)

Critical Severity

(5)

Command Injection in AI Output

Detect command injection patterns in AI-generated content (AIUC-1 C006)

Rule

Distressed or Self-Harm Content

Detect AI output that could encourage self-harm or cause distress (AIUC-1 C003)

AI

Offensive or Discriminatory Content

Detect offensive, discriminatory, or biased content in AI output (AIUC-1 C003)

AI

SQL Injection in AI Output

Detect SQL injection vulnerabilities in AI-generated code (AIUC-1 C006)

AI

XSS Vulnerability in AI Output

Detect cross-site scripting vulnerabilities in AI-generated content (AIUC-1 C006)

AI

High Severity

(7)

Angry or Hostile Response

Detect angry, hostile, or confrontational language in AI output (AIUC-1 C003)

AI

Custom Risk Category Violation

Detect AI output that falls into enterprise high-risk categories (AIUC-1 C005)

AI

Deceptive or Misleading Content

Detect deliberately deceptive or misleading information in AI output (AIUC-1 C003)

AI

High-Risk Advice Without Disclaimer

Detect high-risk medical, legal, or financial advice without disclaimers (AIUC-1 C003)

AI

High-Risk Recommendation Flagging

Flag high-risk recommendations requiring human review (AIUC-1 C007)

AI

Unauthorized Domain Advice

Detect AI output providing advice outside its intended domain (AIUC-1 C004)

AI

Unsafe URL in AI Output

Detect unsafe URLs or embedded scripts in AI output (AIUC-1 C006)

Rule

Medium Severity

(2)

Anomalous Output Pattern

Detect anomalous AI behavior patterns indicating system issues or manipulation (AIUC-1 C008)

AI

Political Discussion in AI Output

Detect political discussion outside the AI system's intended purpose (AIUC-1 C004)

AI

Enforcement by Integration

What happens when a violation is detected, based on the enforcement mode and integration type.

IntegrationBlockApprovalWarnMonitor
Version Control
GitHub, GitLab, Bitbucket
Fail check run / merge request statusPending check run — held for reviewNeutral check run / comment on PRPass check run (silent)
Email — Gmail
Gmail
Quarantine label; + violation label (outbound)Quarantine label — held for reviewAdd warning labelLog only
Email — Outlook
Outlook
Move to quarantine folder; + flag (outbound)Move to quarantine — held for reviewFlag + categorizeLog only
Messaging
Slack, Teams
Post violation warning in channelPost 'held for review' warningPost warning in channelLog only
Storage
Google Drive, Dropbox, OneDrive
Move file to quarantine folderQuarantine file — held for reviewLog onlyLog only
AI Proxy
OpenAI, Anthropic, Gemini, MCP, Agent
Block request (return 403)Hold request — return review IDAllow request + audit trailLog only
API
REST API
Return BLOCK outcome (client decides)Return APPROVAL_REQUIRED + poll URLReturn WARN outcomeLog only

Version History

1 version published

v1.0.0Active3/21/2026

Initial release

Ready to Install AIUC-1 Safety Controls?

Get started with pre-built governance policies in minutes.