
Pierre Peigné
PRISM Eval Inc
27th February, 2025, 3:00PM - 4:00PM (GST)
Title: | Introducing PRISM Eval GenAI dynamic red teaming leaderboard and evaluation tool BET |
Abstract: | We introduce the PRISM Eval Behavior Elicitation Tool (BET), an AI system that conduct LLMs automated red-teaming through dynamic adversarial optimization. BET outperforms other dynamic adversarial optimization systems in terms of Attack Success Rate (ASR) and provides a new finer grained robustness metric based on the average number or attempts required to jailbreak a system. In addition to this, BET also allows to map and study the vulnerability landscape of any LLM system leading to new findings. Finally, we present a leaderboard of LLM robustness using BET, where 39 SotA LLMs (including the Falcon 3 10b!) were evaluated against 5 categories of harms. |
Bio: | Pierre is the co-founder and Chief Science Officer of PRISM Eval, a french startup focusing on developing solutions to improve control and robustness of GenAI systems. |