Category: Machine Learning Research

Tsinghua Research: A Reversal? Confirming RL Doesn't Truly Enhance Base Model Reasoning Ability!
Say Less 'Wait', Do More: NoWait Reshapes Large Model Inference Paths
10 Lines of Code, 15% Improvement in AIME24/25! Unveiling the Entropy Mechanism in Large Language Model Reinforcement Learning
Can AI "Admit Its Own Mistakes"? Solving the "Rashomon" of Multi-Agent Collaboration, Earning ICML 2025 Spotlight