Red Teaming & Evaluation

Red Teaming & Evaluation

This project establishes comprehensive AI Red Teaming and evaluation guidelines for Large Language Models (LLMs), addressing security vulnerabilities, bias, and user trust. By collaborating with partners and leveraging real-world testing, the initiative will provide a standardized methodology for AI Red Teaming, including benchmarks, tools, and frameworks to boost cybersecurity defenses.

Whats New?

Whitepapers/Guides

A Practical Guide for Secure MCP Server Development

A Practical Guide for Secure MCP Server Development provides actionable guidance for securing Model Context Protocol (MCP) servers—the critical connection point between AI assistants and

Whitepapers/Guides

OWASP Vendor Evaluation Criteria for AI Red Teaming Providers & Tooling v1.0

Vendor Evaluation Criteria for AI Red Teaming Providers & Tooling is a practical guide for organizations assessing vendors that offer AI red teaming services or

Tools

OWASP AIBOM Generator

The OWASP AIBOM Generator is an open-source tool designed to enhance AI supply chain transparency and security by generating AI Bills of Materials (AIBOMs) —

Project

GenAI Security Agentic Security Summit, Europe – Livestream

OWASP GenAI Security Project

Audience - All
Topics - Agentic Security

Project

Inside the OWASP GenAI Security Project – Steve Wilson

Steve Wilson,
Application Security Weekly

Steve Wilson,

Audience - All
Topics - Other

Project

How OWASP’s GenAI Security Project keeps up with the pace of AI/Agentic changes, with Scott Clinton

Scott Clinton,
Application Security Weekly

Scott Clinton,

Audience - All
Topics - Other

May 13, 2026

Memory Is a Feature. It Is Also an Attack Surface

Idan Habler, ASI Core Team and ASI06 Entry Lead, Cisco - Senior Tech Lead - AI Security Researcher

As co-lead of OWASP ASI06: Memory & Context Poisoning entry as part of OWASP Top 10 for Agentic Applications , I have spent a lot

April 14, 2026

FinBot CTF Is Live: A Hands-On Companion to the OWASP GenAI Security Project

Helen Oakley and Venkata (Sai) Kishore Modalavalasa

FinBot is a hands-on companion to the OWASP GenAI Security Project, offering an interactive Capture-The-Flag environment built around a simulated financial services application. Designed as

April 14, 2026

OWASP GenAI Exploit Round-up Report Q1 2026

Scott Clinton, Project Co-lead

OWASP GenAI Exploit Round-up Report Q1 2026 Coverage period: January 1, 2026 through April 11, 2026 Overview For the last two years the OWASP GenAI

Join us in London, 6/2 – 6/4 InfoSecurity Europe – OWASP GenAI and Agentic Security Summit

|