2 posts tagged with "evaluation"

Reinforcement Learning with Verifiable Rewards Makes Models Faster, Not Smarter

October 24, 2025

CTO & Co-founder

If your model can solve a problem in 8 tries, RLVR trains it to succeed in 1 try. Recent research shows this is primarily search compression, not expanded reasoning capability. Training concentrates probability mass on paths the base model could already sample.

This matters because you need to measure what you're actually getting. Most RLVR gains come from sampling efficiency, with a smaller portion from true learning. This guide covers when RLVR works, three critical failure modes, and how to distinguish compression from capability expansion.

Top 10 Open Datasets for LLM Safety, Toxicity & Bias Evaluation

October 6, 2025

Ian Webster

Engineer & OWASP Gen AI Red Teaming Contributor

LLM Safety Datasets Hero

Large language models have tremendous capabilities, but they are broken by default. A wealth of open-source datasets has emerged to train and evaluate LLMs on safety, toxicity, and bias.

Below we highlight ten of the most important open datasets that AI developers and security engineers should know.