Question 1

What is DeepTeam?

Accepted Answer

DeepTeam is the open-source LLM red teaming framework. It simulates adversarial attacks against your LLM applications, AI agents, RAG systems, and prompts to surface jailbreaks, vulnerabilities, and agentic risks before attackers do.

Question 2

How is DeepTeam different from DeepEval?

Accepted Answer

DeepEval is the open-source evaluation framework for measuring LLM quality (faithfulness, relevancy, hallucinations, and more). DeepTeam is the security-and-safety framework: it red teams the same systems for vulnerabilities. Both are built by Confident AI and share the same authoring surface.

Question 3

Is DeepTeam the same as Confident AI?

Accepted Answer

No. DeepTeam is the open-source red teaming framework, while Confident AI is the enterprise platform — it adds managed assessments, collaboration, dashboards, and continuous production monitoring for teams that need to scale red teaming across an organization.

Question 4

What can I red team with DeepTeam?

Accepted Answer

Chatbots, RAG pipelines, AI agents, prompts, fine-tuned models, and end-to-end LLM workflows. DeepTeam supports both single-turn adversarial attacks (prompt injection, roleplay, leetspeak, base64, and 15+ more) and multi-turn jailbreak chains (Crescendo, Linear, Tree, Sequential, Bad-Likert-Judge).

Question 5

Which safety frameworks does DeepTeam align with?

Accepted Answer

OWASP Top 10 for LLMs, OWASP Top 10 for Agentic Applications, NIST AI RMF, MITRE ATLAS, and the EU AI Act — plus safety benchmark datasets like BeaverTails and Aegis. Each framework is wired in as a one-line risk-category assessment via the red_team() function.

Question 6

Does DeepTeam only work with OpenAI models?

Accepted Answer

No. DeepTeam is model-agnostic. It works with OpenAI, Anthropic, Gemini, Azure OpenAI, AWS Bedrock, Vertex AI, Mistral, LiteLLM, Portkey, and any custom callback — both for the target LLM you're red teaming and for the simulator LLM that crafts the adversarial attacks.

Question 7

Can I use DeepTeam in CI/CD?

Accepted Answer

Yes. DeepTeam runs as a CLI and integrates with GitHub Actions, GitLab CI, Jenkins, CircleCI, Buildkite, and Azure Pipelines — so you can catch newly introduced vulnerabilities on every pull request instead of waiting for a quarterly audit.

Question 8

Do I need a dataset to start using DeepTeam?

Accepted Answer

No. DeepTeam generates adversarial test cases on the fly from the vulnerabilities and attacks you select — no curated dataset required. You can also bring your own examples or production traces if you have them.

Question 9

Who is DeepTeam for?

Accepted Answer

AI engineers, ML teams, security engineers, and AppSec teams building or deploying LLM products who need a reproducible way to measure safety, surface vulnerabilities, and ship with confidence to regulated environments.

Question 10

Does DeepTeam collect data through OpenTelemetry?

Accepted Answer

DeepTeam only collects the names of vulnerabilities and attacks that were run through OpenTelemetry. It does not collect your prompts, inputs, outputs, or assessment results through that instrumentation.

Penetration testing for AI systems.

LLM as your adversary.

120+ vulnerabilities

Multi-turn jailbreak attacks

Industry-standrd frameworks for your trust & safety requirements.

Security & threat frameworks

Compliance & governance

Safety benchmark datasets

Any model. Any framework. Any pipeline.

Built by amazing humans.

Ah yes, FAQs.

This is the CTA :)