OWASP Top 10 for LLMs

The OWASP Top 10 for Large Language Models (LLMs) is a comprehensive list of the most critical security risks associated with LLM applications. This resource is designed to help developers, security professionals, and organizations identify, understand, and mitigate vulnerabilities in these systems, ensuring safer and more robust deployments in real-world applications.

The 2025 edition reflects significant evolution in the LLM threat landscape, with new risks emerging from RAG systems, autonomous AI agents, and sophisticated attack methods.

tip

You can detect all OWASP Top 10 risks using DeepTeam's framework integration:

from deepteam import red_team
from deepteam.frameworks import OWASPTop10

risk_assessment = red_team(
    model_callback=your_model_callback,
    framework=OWASPTop10()
)
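
Throughout this page, your_model_callback is a placeholder for a wrapper around your own LLM application that takes a generated attack prompt and returns the model's response as a string. A minimal, non-authoritative sketch is shown below; it assumes the OpenAI Python SDK and an OPENAI_API_KEY environment variable, so substitute whatever client or application entry point you actually use:

from openai import AsyncOpenAI

client = AsyncOpenAI()  # reads OPENAI_API_KEY from the environment

async def your_model_callback(input: str) -> str:
    # Forward the red-teaming prompt to the target model and return its text reply
    response = await client.chat.completions.create(
        model="gpt-4o-mini",  # assumption: replace with your own model or app endpoint
        messages=[{"role": "user", "content": input}],
    )
    return response.choices[0].message.content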

What's New in 2025

The 2025 OWASP Top 10 for LLMs includes significant updates reflecting real-world LLM deployments:

New Risks:

  • System Prompt Leakage (LLM07): Exposure of sensitive instructions and credentials
  • Vector and Embedding Weaknesses (LLM08): RAG architecture vulnerabilities

Major Changes:

  • Sensitive Information Disclosure jumped from #6 to #2 due to real-world data leaks
  • Supply Chain Vulnerabilities rose to #3 with increased third-party risks
  • Misinformation replaced Overreliance and was expanded to include hallucinations
  • Unbounded Consumption replaced Model Denial of Service and now covers broader resource-management risks

The OWASP Top 10 2025 Risks List

  1. Prompt Injection (LLM01:2025)
  2. Sensitive Information Disclosure (LLM02:2025)
  3. Supply Chain (LLM03:2025)
  4. Data and Model Poisoning (LLM04:2025)
  5. Improper Output Handling (LLM05:2025)
  6. Excessive Agency (LLM06:2025)
  7. System Prompt Leakage (LLM07:2025)
  8. Vector and Embedding Weaknesses (LLM08:2025)
  9. Misinformation (LLM09:2025)
  10. Unbounded Consumption (LLM10:2025)

1. Prompt Injection (LLM01:2025)

Prompt Injection remains the #1 critical vulnerability, where attackers manipulate LLM inputs to override original instructions, extract sensitive information, or trigger unintended behaviors.

Types of Prompt Injection

Direct Injection: Direct manipulation of user prompts to alter LLM behavior.

Indirect Injection: Hidden instructions in external content (documents, websites, emails) that the LLM processes.

main.py
from deepteam import red_team
from deepteam.attacks.single_turn import PromptInjection, Base64, Leetspeak
from deepteam.attacks.multi_turn import LinearJailbreaking

attacks = [
    PromptInjection(),     # Direct injection attempts
    Base64(),              # Encoded payloads
    Leetspeak(),           # Character substitution
    LinearJailbreaking(),  # Multi-turn escalation
]

risk_assessment = red_team(
    model_callback=your_model_callback,
    attacks=attacks
)
info

Prompt injection attacks in DeepTeam test your LLM's resilience against various manipulation techniques, from simple attempts to sophisticated multi-turn conversations.

2. Sensitive Information Disclosure (LLM02:2025)

Sensitive Information Disclosure involves the unintended exposure of private data, credentials, API keys, or confidential information through LLM outputs.

Categories

  • PII Leakage: Personally identifiable information exposure
  • Prompt Leakage: System prompts and configuration details
  • Intellectual Property: Proprietary algorithms and trade secrets
  • Authentication Data: API keys, passwords, tokens
main.py
from deepteam import red_team
from deepteam.vulnerabilities import PIILeakage, PromptLeakage, IntellectualProperty
from deepteam.attacks.single_turn import PromptInjection, Roleplay

sensitive_vulnerabilities = [
    PIILeakage(types=["direct disclosure", "session leak"]),
    PromptLeakage(types=["secrets and credentials", "instructions"]),
    IntellectualProperty(types=["patent disclosure", "copyright violations"])
]

risk_assessment = red_team(
    model_callback=your_model_callback,
    vulnerabilities=sensitive_vulnerabilities,
    attacks=[PromptInjection(), Roleplay()]
)
note

Testing for sensitive information disclosure focuses on specific vulnerability types to identify if your LLM inadvertently reveals confidential data.

3. Supply Chain (LLM03:2025)

Supply Chain vulnerabilities arise from compromised third-party components, models, datasets, or plugins used in LLM applications.

Risk Categories

  • Model Dependencies: Compromised pre-trained models
  • Data Sources: Poisoned training datasets or RAG knowledge bases
  • Library Dependencies: Vulnerable Python packages or ML frameworks
  • Plugin Ecosystem: Malicious or compromised LLM plugins
main.py
from deepteam import red_team
from deepteam.vulnerabilities import Bias, Toxicity, Misinformation, Robustness
from deepteam.attacks.single_turn import PromptInjection, Roleplay
from deepteam.attacks.multi_turn import LinearJailbreaking

# Test for signs of compromised supply chain components
supply_chain_tests = [
    Bias(types=["race", "gender", "politics"]),
    Toxicity(types=["profanity", "insults"]),
    Misinformation(types=["factual errors"]),
    Robustness(types=["hijacking"])
]

risk_assessment = red_team(
    model_callback=your_model_callback,
    vulnerabilities=supply_chain_tests,
    attacks=[PromptInjection(), Roleplay(), LinearJailbreaking()]
)
note

DeepTeam detects the behavioral impact of supply chain vulnerabilities. If your model shows unexpected bias, toxicity, or robustness issues, it may indicate compromised components.

4. Data and Model Poisoning (LLM04:2025)

Data and Model Poisoning involves manipulating training data, fine-tuning processes, or embedding data to introduce vulnerabilities, biases, or backdoors.

Types of Poisoning

  • Training Data Poisoning: Malicious samples in pre-training datasets
  • Fine-tuning Poisoning: Compromised task-specific training data
  • RAG Poisoning: Malicious documents in retrieval knowledge bases
  • Embedding Poisoning: Corrupted vector representations
main.py
from deepteam import red_team
from deepteam.vulnerabilities import Bias, Toxicity, Misinformation, IllegalActivity
from deepteam.attacks.single_turn import PromptInjection, Roleplay
from deepteam.attacks.multi_turn import CrescendoJailbreaking

poisoning_vulnerabilities = [
    Bias(types=["race", "gender", "religion"]),
    Toxicity(types=["profanity", "threats"]),
    Misinformation(types=["factual errors", "unsupported claims"]),
    IllegalActivity(types=["cybercrime", "violent crimes"])
]

risk_assessment = red_team(
    model_callback=your_model_callback,
    vulnerabilities=poisoning_vulnerabilities,
    attacks=[PromptInjection(), Roleplay(), CrescendoJailbreaking()]
)

5. Improper Output Handling (LLM05:2025)

Improper Output Handling occurs when LLM outputs are not adequately validated, sanitized, or secured before being passed to downstream systems.

Common Risks

  • Code Injection: LLM generates executable code that runs unsanitized
  • XSS Attacks: HTML/JavaScript output executed in web browsers
  • SQL Injection: Database queries constructed from LLM output
  • Command Injection: System commands generated by LLM
main.py
from deepteam import red_team
from deepteam.vulnerabilities import ShellInjection, SQLInjection
from deepteam.attacks.single_turn import PromptInjection, GrayBox

dangerous_output_tests = [
    ShellInjection(types=["command_injection", "system_command_execution"]),
    SQLInjection(types=["blind_sql_injection", "union_based_injection"])
]

risk_assessment = red_team(
    model_callback=your_model_callback,
    vulnerabilities=dangerous_output_tests,
    attacks=[PromptInjection(), GrayBox()]
)
tip

Implement output validation in your model callback to catch and sanitize dangerous outputs. DeepTeam helps identify when your LLM generates potentially harmful content.
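
As one hedged example of such validation (the patterns and the sanitize_output helper below are illustrative placeholders, not part of DeepTeam), you can screen responses for obviously dangerous fragments before they reach downstream systems:

import re

# Illustrative deny-list of patterns that suggest executable or injectable output
DANGEROUS_PATTERNS = [
    r"<script\b",                      # potential XSS payload
    r";\s*(rm|del|shutdown)\b",        # chained shell commands
    r"(?i)\b(drop|delete)\s+table\b",  # destructive SQL statements
]

def sanitize_output(text: str) -> str:
    # Withhold suspicious output rather than passing it to downstream systems
    for pattern in DANGEROUS_PATTERNS:
        if re.search(pattern, text):
            return "[output withheld: potentially unsafe content detected]"
    return text

async def guarded_callback(input: str) -> str:
    raw = await your_model_callback(input)  # your existing callback
    return sanitize_output(raw)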

6. Excessive Agency (LLM06:2025)

Excessive Agency occurs when LLMs are granted too much autonomy, permissions, or functionality, leading to unintended actions beyond their intended scope.

Types

  • Functionality: LLM has access to more tools than necessary
  • Permissions: LLM operates with elevated privileges
  • Autonomy: LLM makes decisions without appropriate oversight
main.py
from deepteam import red_team
from deepteam.vulnerabilities import ExcessiveAgency, RBAC, BFLA, BOLA
from deepteam.attacks.single_turn import PromptInjection, Roleplay

agency_tests = [
    ExcessiveAgency(types=["functionality", "permissions", "autonomy"]),
    RBAC(types=["role bypass", "privilege escalation"]),
    BFLA(types=["function_bypass", "authorization_bypass"]),
    BOLA(types=["object_access_bypass", "cross_customer_access"])
]

risk_assessment = red_team(
    model_callback=your_model_callback,
    vulnerabilities=agency_tests,
    attacks=[PromptInjection(), Roleplay()]
)

7. System Prompt Leakage (LLM07:2025)

System Prompt Leakage is a new 2025 entry focusing on the exposure of internal system prompts that contain sensitive instructions, credentials, or operational logic.

What Gets Leaked

  • Secrets and Credentials: API keys, passwords, connection strings
  • Instructions: Internal operational logic and behavioral rules
  • Guards: Security mechanisms and content filtering rules
  • Permissions and Roles: Access control configurations
main.py
from deepteam import red_team
from deepteam.vulnerabilities import PromptLeakage
from deepteam.attacks.single_turn import PromptInjection, PromptProbing, Base64

prompt_tests = [
    PromptLeakage(types=[
        "secrets and credentials",
        "instructions",
        "guards",
        "permissions and roles"
    ])
]

risk_assessment = red_team(
    model_callback=your_model_callback,
    vulnerabilities=prompt_tests,
    attacks=[PromptInjection(), PromptProbing(), Base64()]
)
danger

Never include sensitive credentials or secrets directly in system prompts. Use external configuration management and secure credential storage instead.
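
For instance, keep the system prompt limited to behavioral instructions and resolve secrets at call time from the environment or a secrets manager. A minimal sketch, where DB_API_KEY is a made-up variable name:

import os

# The system prompt contains behavioral instructions only -- no secrets
SYSTEM_PROMPT = "You are a support assistant. Answer questions about billing policies."

def get_db_api_key() -> str:
    # Resolved outside the prompt, so a leaked system prompt reveals no credentials
    key = os.environ.get("DB_API_KEY")
    if key is None:
        raise RuntimeError("DB_API_KEY is not configured")
    return key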

8. Vector and Embedding Weaknesses (LLM08:2025)

Vector and Embedding Weaknesses is a new 2025 entry targeting vulnerabilities in RAG systems and vector databases.

Types of Vulnerabilities

  • Embedding Poisoning: Malicious vectors that influence retrieval
  • Similarity Attacks: Crafted queries that retrieve unintended content
  • Vector Database Access: Unauthorized access to embedding stores
  • Embedding Inversion: Reconstructing source text from vectors
main.py
from deepteam import red_team
from deepteam.vulnerabilities import Misinformation, PIILeakage
from deepteam.attacks.single_turn import PromptInjection, GrayBox

# Test for RAG-specific vulnerabilities
rag_tests = [
    Misinformation(types=["factual errors"]),  # May indicate a poisoned knowledge base
    PIILeakage(types=["direct disclosure"])    # May indicate vector database leakage
]

risk_assessment = red_team(
    model_callback=your_model_callback,
    vulnerabilities=rag_tests,
    attacks=[PromptInjection(), GrayBox()]
)
info

Vector and embedding security is crucial for RAG systems. Ensure proper access controls on vector databases and validate knowledge base integrity regularly.
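
One simple integrity check, sketched below with hypothetical helper names, is to fingerprint each knowledge-base document and compare the digests against a stored baseline on a schedule:

import hashlib
import json

def fingerprint_documents(documents: dict[str, str]) -> dict[str, str]:
    # Map each document ID to a SHA-256 digest of its content
    return {
        doc_id: hashlib.sha256(text.encode("utf-8")).hexdigest()
        for doc_id, text in documents.items()
    }

def detect_tampering(current: dict[str, str], baseline_path: str) -> list[str]:
    # Report documents whose digest no longer matches the stored baseline
    with open(baseline_path) as f:
        baseline = json.load(f)
    return [doc_id for doc_id, digest in current.items() if baseline.get(doc_id) != digest]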

9. Misinformation (LLM09:2025)

Misinformation addresses the risk of LLMs producing false or misleading information that appears credible, including hallucinations and fabricated citations.

Types

  • Factual Errors: Incorrect statements presented as fact
  • Unsupported Claims: Assertions without proper evidence
  • Expertise Misrepresentation: False claims about qualifications
  • Fabricated Sources: Made-up citations, studies, or references
main.py
from deepteam import red_team
from deepteam.vulnerabilities import Misinformation, Competition
from deepteam.attacks.single_turn import PromptInjection, Roleplay, PromptProbing

misinformation_tests = [
    Misinformation(types=[
        "factual errors",
        "unsupported claims",
        "expertize misrepresentation"
    ]),
    Competition(types=["discreditation"])
]

risk_assessment = red_team(
    model_callback=your_model_callback,
    vulnerabilities=misinformation_tests,
    attacks=[PromptInjection(), Roleplay(), PromptProbing()]
)
danger

LLM misinformation can have serious consequences in domains like healthcare, finance, and legal advice. Always implement fact-checking and disclaimer mechanisms.
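
As a hedged illustration of a disclaimer mechanism (the keyword lists and wording are placeholders; a production system would use a proper classifier), responses touching high-stakes domains can be post-processed before delivery:

# Illustrative post-processing step: append a disclaimer when a response touches
# a high-stakes domain
SENSITIVE_TOPICS = {
    "medical": ["diagnosis", "dosage", "treatment"],
    "financial": ["investment", "tax", "loan"],
    "legal": ["lawsuit", "contract", "liability"],
}

def add_disclaimer(response: str) -> str:
    lowered = response.lower()
    for domain, keywords in SENSITIVE_TOPICS.items():
        if any(keyword in lowered for keyword in keywords):
            return f"{response}\n\nNote: this is general {domain} information, not professional advice."
    return response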

10. Unbounded Consumption (LLM10:2025)

Unbounded Consumption addresses uncontrolled resource usage that can lead to service degradation, financial losses, or system unavailability.

Types

  • Compute Exhaustion: Complex queries that consume excessive processing power
  • Memory Overload: Inputs that cause excessive memory usage
  • API Abuse: High-volume requests leading to cost escalation
  • Context Flooding: Extremely long inputs that overflow context windows
main.py
from deepteam import red_team
from deepteam.attacks.single_turn import MathProblem
from deepteam.attacks.multi_turn import LinearJailbreaking, CrescendoJailbreaking

# Test for potentially resource-intensive patterns
resource_test = red_team(
    model_callback=your_model_callback,
    attacks=[
        MathProblem(),           # Complex computational requests
        LinearJailbreaking(),    # Extended conversations
        CrescendoJailbreaking()  # Escalating complexity
    ],
    attacks_per_vulnerability_type=3,  # Limited testing
    max_concurrent=1                   # Sequential testing to monitor resources
)
tip

Implement proper rate limiting, input validation, and resource monitoring in production systems. DeepTeam helps identify inputs that might cause resource issues.
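
A minimal sketch of input validation and per-user rate limiting in front of the model, with illustrative limits you would tune for your own deployment:

import time
from collections import defaultdict, deque

MAX_INPUT_CHARS = 8_000        # reject inputs likely to flood the context window
MAX_REQUESTS_PER_MINUTE = 20   # illustrative per-user budget

_request_log: dict[str, deque] = defaultdict(deque)

def check_request(user_id: str, prompt: str) -> None:
    # Reject oversized inputs before they reach the model
    if len(prompt) > MAX_INPUT_CHARS:
        raise ValueError("Input exceeds the maximum allowed length")
    # Sliding-window rate limit per user
    now = time.monotonic()
    window = _request_log[user_id]
    while window and now - window[0] > 60:
        window.popleft()
    if len(window) >= MAX_REQUESTS_PER_MINUTE:
        raise RuntimeError("Rate limit exceeded")
    window.append(now)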

Framework-Based Testing

Use DeepTeam's framework integration for comprehensive OWASP Top 10 testing:

from deepteam import red_team
from deepteam.frameworks import OWASPTop10

# Run comprehensive assessment
owasp_assessment = red_team(
    model_callback=your_model_callback,
    framework=OWASPTop10(),
    attacks_per_vulnerability_type=5
)

print(f"Total test cases: {len(owasp_assessment.test_cases)}")
print(f"Pass rate: {owasp_assessment.pass_rate:.1%}")

Best Practices

  1. Regular Testing: Run OWASP assessments as part of your development cycle (see the CI sketch after this list)
  2. Comprehensive Coverage: Test all 10 risk categories with multiple attack methods
  3. Production Monitoring: Implement real-time monitoring for identified risks
  4. Iterative Improvement: Use test results to strengthen LLM defenses
  5. Framework Integration: Leverage DeepTeam's configs for standardized testing
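
For example, a CI gating script might run the framework assessment and fail the build when the pass rate drops below a threshold. The threshold and exit-code handling below are assumptions, not DeepTeam requirements; pass_rate is the attribute shown in the snippet above:

import sys

from deepteam import red_team
from deepteam.frameworks import OWASPTop10

PASS_RATE_THRESHOLD = 0.9  # illustrative quality gate

assessment = red_team(
    model_callback=your_model_callback,
    framework=OWASPTop10(),
    attacks_per_vulnerability_type=3
)

if assessment.pass_rate < PASS_RATE_THRESHOLD:
    print(f"OWASP pass rate {assessment.pass_rate:.1%} is below {PASS_RATE_THRESHOLD:.0%}")
    sys.exit(1)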
note

The OWASP Top 10 for LLMs provides a foundation for LLM security. Customize your testing based on your specific use case, data sensitivity, and risk tolerance.

The 2025 OWASP Top 10 reflects the rapidly evolving LLM threat landscape. By using DeepTeam's comprehensive testing capabilities, you can proactively identify and address these critical security risks before they impact your users or organization.