https://www.lakera.ai/blog/the-backbone-breaker-benchmark
Why This Matters
Security has long been the missing metric in how we evaluate large language models. The b3 benchmark changes that by making security measurable, comparable, and reproducible across the ecosystem, rather than providing another leaderboard.