All technological notes.
toxicity: the offensive, discriminatory or harmful contentGuardrails
ser Input → [Input Guardrails] → AI Model → [Output Guardrails] → User
↓
[Action Guardrails]
Control what users are allowed to send into the AI.
Examples:
Example Scenario:
User tries to request illegal instructions → blocked before reaching the model
Filter or modify the AI’s response before returning it.
Examples:
Example Scenario:
AI generates unsafe advice → system rewrites or blocks it
Rules defined by organizations or platforms.
Examples:
Used heavily in enterprise AI systems.
Limit AI behavior based on user role or scenario.
Examples:
Control what actions an AI agent can take.
Examples:
Define behavior via system instructions.
You are a DevOps assistant.
Do not suggest destructive commands.
Do not expose secrets.
Require approval for critical or high-risk actions