Topic

#ppo

1 article exploring ppo. Expert insights and analysis from our editorial team.

Showing 1–1 of 1 articles

Articles

Newest first

Fixed Entropy Coefficients Break Down on Mixed-Difficulty Tasks: What AER Means for Teams Running LLM RL at Scale

Static entropy regularization in GRPO underperforms on mixed-difficulty tasks. Difficulty-aware allocation closes the gap by 7-10 points on pass@1 without extra compute.

April 22, 2026

Browse All Topics