Agent Safety Introduction

Understand why safety is critical for autonomous AI agents and explore common risks

Common Risk Categories

AI agent risks fall into four major categories. Understanding these helps you anticipate problems and design appropriate safeguards.

Interactive: Risk Assessment Tool

🔒 Security Risks

Prompt Injection

Attacker manipulates agent behavior through crafted inputs

Likelihood
high
Impact
high
Risk Score
9/9 - Critical
Authorization Bypass

Agent accesses resources without proper permission checks

Likelihood
medium
Impact
high
Risk Score
6/9 - Critical
Data Exfiltration

Sensitive data leaks through agent outputs or logs

Likelihood
medium
Impact
high
Risk Score
6/9 - Critical

Assessment Progress

Assessed 0 of 12 risks
← Previous: Introduction