Red Teaming
& Evaluation

This project establishes comprehensive AI Red Teaming and evaluation guidelines for Large Language Models (LLMs), addressing security vulnerabilities, bias, and user trust. By collaborating with partners and leveraging real-world testing, the initiative will provide a standardized methodology for AI Red Teaming, including benchmarks, tools, and frameworks to boost cybersecurity defenses.

Whats New?

The Solutions Landscape monitors and maps the full LLM and Generative AI lifecycle, focusing on the DevOps–SecOps intersection to meet evolving security needs. Guided by

The OWASP GenAI Data Security Risks and Mitigations 2026 guide provides a critical, forward-looking analysis of the unique data security challenges posed by the rapid,

A Practical Guide for Secure MCP Server Development provides actionable guidance for securing Model Context Protocol (MCP) servers—the critical connection point between AI assistants and

GenAI Security Agentic Security Summit, Europe – Livestream
Inside the OWASP GenAI Security Project – Steve Wilson
How OWASP’s GenAI Security Project keeps up with the pace of AI/Agentic changes, with Scott Clinton

As co-lead of OWASP ASI06: Memory & Context Poisoning entry as part of OWASP Top 10 for Agentic Applications , I have spent a lot

FinBot is a hands-on companion to the OWASP GenAI Security Project, offering an interactive Capture-The-Flag environment built around a simulated financial services application. Designed as

OWASP GenAI Exploit Round-up Report Q1 2026 Coverage period: January 1, 2026 through April 11, 2026 Overview For the last two years the OWASP GenAI

Getting Involved

Open Meeting Schedule

Weekly

04:09

Monday
Join - Meeting Room Link
Add to Calendar

Additional Workstream Meetings

Initiative Leads

Sonu Kumar

Initiative Leader

Ron F. Del Rosario

Initiative Leader

Scroll to Top