Red Teaming
& Evaluation

This project establishes comprehensive AI Red Teaming and evaluation guidelines for Large Language Models (LLMs), addressing security vulnerabilities, bias, and user trust. By collaborating with partners and leveraging real-world testing, the initiative will provide a standardized methodology for AI Red Teaming, including benchmarks, tools, and frameworks to boost cybersecurity defenses.

Whats New?

Vendor Evaluation Criteria for AI Red Teaming Providers & Tooling is a practical guide for organizations assessing vendors that offer AI red teaming services or

The OWASP AIBOM Generator is an open-source tool designed to enhance AI supply chain transparency and security by generating AI Bills of Materials (AIBOMs) —

The OWASP Top 10 for Agentic Applications 2026 is a globally peer-reviewed framework that identifies the most critical security risks facing autonomous and agentic AI
GenAI Security Agentic Security Summit, Europe – Livestream
Inside the OWASP GenAI Security Project – Steve Wilson
How OWASP’s GenAI Security Project keeps up with the pace of AI/Agentic changes, with Scott Clinton

As co-lead of OWASP ASI06: Memory & Context Poisoning entry as part of OWASP Top 10 for Agentic Applications , I have spent a lot

FinBot is a hands-on companion to the OWASP GenAI Security Project, offering an interactive Capture-The-Flag environment built around a simulated financial services application. Designed as

OWASP GenAI Exploit Round-up Report Q1 2026 Coverage period: January 1, 2026 through April 11, 2026 Overview For the last two years the OWASP GenAI

Getting Involved

Open Meeting Schedule

Weekly

04:09

Monday
Join - Meeting Room Link
Add to Calendar

Additional Workstream Meetings

Initiative Leads

Sonu Kumar

Initiative Leader

Ron F. Del Rosario

Initiative Leader

Scroll to Top