Red Teaming & Evaluation

This project establishes comprehensive AI Red Teaming and evaluation guidelines for Large Language Models (LLMs), addressing security vulnerabilities, bias, and user trust. By collaborating with partners and leveraging real-world testing, the initiative will provide a standardized methodology for AI Red Teaming, including benchmarks, tools, and frameworks to boost cybersecurity defenses.

What's New?

The OWASP GenAI Security Project’s Threat Defense COMPASS consolidates AI threats, vulnerabilities, defenses, and mitigations into a unified AI Threat Resilience Strategy Dashboard. COMPASS enables

The Solutions Landscape monitors and maps the full LLM and Generative AI lifecycle, focusing on the DevOps–SecOps intersection to meet evolving security needs. Guided by

FinBot is part of the OWASP GenAI Security Project’s Agentic Security Initiative, created to equip builders and defenders with hands-on tools for understanding and mitigating

GenAI Security Agentic Security Summit, Europe – Livestream
Inside the OWASP GenAI Security Project – Steve Wilson
How OWASP’s GenAI Security Project keeps up with the pace of AI/Agentic changes, with Scott Clinton

As co-lead of the OWASP ASI06: Memory & Context Poisoning entry in the OWASP Top 10 for Agentic Applications, I have spent a lot

FinBot is a hands-on companion to the OWASP GenAI Security Project, offering an interactive Capture-The-Flag environment built around a simulated financial services application. Designed as

OWASP GenAI Exploit Round-up Report Q1 2026
Coverage period: January 1, 2026 through April 11, 2026
Overview: For the last two years the OWASP GenAI

Getting Involved

Open Meeting Schedule

Weekly

04:09

Monday

Additional Workstream Meetings

Initiative Leads

Sonu Kumar

Initiative Leader

Ron F. Del Rosario

Initiative Leader
