Red Teaming & Evaluation

Testing GenAI systems through adversarial red teaming methods.

This project establishes comprehensive AI Red Teaming and evaluation guidelines for Large Language Models (LLMs), addressing security vulnerabilities, bias, and user trust. By collaborating with partners and leveraging real-world testing, the initiative will provide a standardized AI Red Teaming methodology, including benchmarks, tools, and frameworks to strengthen cybersecurity defenses.



Get Started

Quick access to meetings and collaboration groups.

Open Meeting – AI Threat Intelligence
Weekly initiative meeting, held Mondays from 9:30 to 10:30 AM PDT.


Initiative Leads

Sonu Kumar – Initiative Leader
Jason Ross – Core Team Member, Initiative Leader

Community of Contributors

Explore a global network of volunteers improving evaluations, patterns, and defenses for GenAI and autonomous systems.