AI Seminar Series - Pierre Peigné

Feb 27, 2025
AI seminar Banner
Pierre Peigné

Pierre Peigné

PRISM Eval Inc

27th February, 2025, 3:00PM - 4:00PM (GST)

Title:Introducing PRISM Eval GenAI dynamic red teaming leaderboard and evaluation tool BET
Abstract:

We introduce the PRISM Eval Behavior Elicitation Tool (BET), an AI system that conduct LLMs automated red-teaming through dynamic adversarial optimization. BET outperforms other dynamic adversarial optimization systems in terms of Attack Success Rate (ASR) and provides a new finer grained robustness metric based on the average number or attempts required to jailbreak a system. In addition to this, BET also allows to map and study the vulnerability landscape of any LLM system leading to new findings.

Finally, we present a leaderboard of LLM robustness using BET, where 39 SotA LLMs (including the Falcon 3 10b!) were evaluated against 5 categories of harms.

Bio:Pierre is the co-founder and Chief Science Officer of PRISM Eval, a french startup focusing on developing solutions to improve control and robustness of GenAI systems.

REGISTER