Red Teaming
& Evaluation

This project establishes comprehensive AI Red Teaming and evaluation guidelines for Large Language Models (LLMs), addressing security vulnerabilities, bias, and user trust. By collaborating with partners and leveraging real-world testing, the initiative will provide a standardized methodology for AI Red Teaming, including benchmarks, tools, and frameworks to boost cybersecurity defenses.

Whats New?

The OWASP Top 10 for Agentic Applications 2026 is a globally peer-reviewed framework that identifies the most critical security risks facing autonomous and agentic AI

The OWASP GenAI Security Project – Solutions Reference Guide (Q2–Q3 2025) is a comprehensive, vendor-agnostic resource for organizations seeking to secure Large Language Models (LLMs)

The Practical Guide for Securely Using Third-Party MCP Servers from the OWASP GenAI Security Project provides a detailed framework for safely deploying and managing external

GenAI Security Agentic Security Summit, Europe – Livestream
Inside the OWASP GenAI Security Project – Steve Wilson
How OWASP’s GenAI Security Project keeps up with the pace of AI/Agentic changes, with Scott Clinton

As co-lead of OWASP ASI06: Memory & Context Poisoning entry as part of OWASP Top 10 for Agentic Applications , I have spent a lot

FinBot is a hands-on companion to the OWASP GenAI Security Project, offering an interactive Capture-The-Flag environment built around a simulated financial services application. Designed as

OWASP GenAI Exploit Round-up Report Q1 2026 Coverage period: January 1, 2026 through April 11, 2026 Overview For the last two years the OWASP GenAI

Getting Involved

Open Meeting Schedule

Weekly

04:09

Monday
Join - Meeting Room Link
Add to Calendar

Additional Workstream Meetings

Initiative Leads

Sonu Kumar

Initiative Leader

Ron F. Del Rosario

Initiative Leader

Scroll to Top

Red Teaming
& Evaluation