Red Teaming & Evaluation

This project establishes comprehensive AI Red Teaming and evaluation guidelines for Large Language Models (LLMs), addressing security vulnerabilities, bias, and user trust. By collaborating with partners and leveraging real-world testing, the initiative will provide a standardized methodology for AI Red Teaming, including benchmarks, tools, and frameworks to boost cybersecurity defenses.

What's New?

Whitepapers/Guides

OWASP GenAI Security Project – Threat Defense COMPASS RunBook

The OWASP GenAI Security Project’s Threat Defense COMPASS consolidates AI threats, vulnerabilities, defenses, and mitigations into a unified AI Threat Resilience Strategy Dashboard. COMPASS enables

Cheat Sheets

AI Security Solutions Landscape for LLM and Gen AI Apps Q2/Q3 2025

The Solutions Landscape monitors and maps the full LLM and Generative AI lifecycle, focusing on the DevOps–SecOps intersection to meet evolving security needs. Guided by

Test Application

FinBot Agentic AI Capture The Flag (CTF) Application

FinBot is part of the OWASP GenAI Security Project’s Agentic Security Initiative, created to equip builders and defenders with hands-on tools for understanding and mitigating

Project

GenAI Security Agentic Security Summit, Europe – Livestream

OWASP GenAI Security Project

Audience - All
Topics - Agentic Security

Project

Inside the OWASP GenAI Security Project – Steve Wilson

Steve Wilson, Application Security Weekly

Audience - All
Topics - Other

Project

How OWASP’s GenAI Security Project keeps up with the pace of AI/Agentic changes, with Scott Clinton

Scott Clinton, Application Security Weekly

Audience - All
Topics - Other

May 13, 2026

Memory Is a Feature. It Is Also an Attack Surface

Idan Habler, ASI Core Team and ASI06 Entry Lead, Cisco - Senior Tech Lead - AI Security Researcher

As co-lead of the OWASP ASI06: Memory & Context Poisoning entry, part of the OWASP Top 10 for Agentic Applications, I have spent a lot

April 14, 2026

FinBot CTF Is Live: A Hands-On Companion to the OWASP GenAI Security Project

Helen Oakley and Venkata (Sai) Kishore Modalavalasa

FinBot is a hands-on companion to the OWASP GenAI Security Project, offering an interactive Capture-The-Flag environment built around a simulated financial services application. Designed as

April 14, 2026

OWASP GenAI Exploit Round-up Report Q1 2026

Scott Clinton, Project Co-lead

Coverage period: January 1, 2026 through April 11, 2026

Overview: For the last two years the OWASP GenAI

Join us in London, 6/2 – 6/4 InfoSecurity Europe – OWASP GenAI and Agentic Security Summit
