Red Teaming & Evaluation

This project establishes AI red teaming and evaluation guidelines for Large Language Models (LLMs), addressing security vulnerabilities, bias, and user trust. By collaborating with partners and drawing on real-world testing, the initiative will provide a standardized methodology for AI red teaming, including benchmarks, tools, and frameworks, to strengthen cybersecurity defenses.

What's New?

Whitepapers/Guides

OWASP Top 10 for Agentic Applications for 2026

The OWASP Top 10 for Agentic Applications 2026 is a globally peer-reviewed framework that identifies the most critical security risks facing autonomous and agentic AI

Whitepapers/Guides

OWASP GenAI Security Project – Solutions Reference Guide Q2_Q3’25

The OWASP GenAI Security Project – Solutions Reference Guide (Q2–Q3 2025) is a comprehensive, vendor-agnostic resource for organizations seeking to secure Large Language Models (LLMs)

Whitepapers/Guides

CheatSheet – A Practical Guide for Securely Using Third-Party MCP Servers 1.0

The Practical Guide for Securely Using Third-Party MCP Servers from the OWASP GenAI Security Project provides a detailed framework for safely deploying and managing external

Project

GenAI Security Agentic Security Summit, Europe – Livestream

OWASP GenAI Security Project

Audience - All
Topics - Agentic Security

Project

Inside the OWASP GenAI Security Project – Steve Wilson

Steve Wilson, Application Security Weekly

Audience - All
Topics - Other

Project

How OWASP’s GenAI Security Project keeps up with the pace of AI/Agentic changes, with Scott Clinton

Scott Clinton, Application Security Weekly

Audience - All
Topics - Other

May 13, 2026

Memory Is a Feature. It Is Also an Attack Surface

Idan Habler, ASI Core Team and ASI06 Entry Lead, Cisco - Senior Tech Lead - AI Security Researcher

As co-lead of the OWASP ASI06: Memory & Context Poisoning entry in the OWASP Top 10 for Agentic Applications, I have spent a lot

April 14, 2026

FinBot CTF Is Live: A Hands-On Companion to the OWASP GenAI Security Project

Helen Oakley and Venkata (Sai) Kishore Modalavalasa

FinBot is a hands-on companion to the OWASP GenAI Security Project, offering an interactive Capture-The-Flag environment built around a simulated financial services application. Designed as

April 14, 2026

OWASP GenAI Exploit Round-up Report Q1 2026

Scott Clinton, Project Co-lead

Coverage period: January 1, 2026 through April 11, 2026. Overview: For the last two years the OWASP GenAI

Join us in London, 6/2 – 6/4 InfoSecurity Europe – OWASP GenAI and Agentic Security Summit