Category: Model Optimization

A New Revolution in Reward Models! SWIFT Reads "Inner Voice" Instead of Text, Creating a Faster, Stronger, and More Cost-Effective AI Judge
Is Your Model's Attention Drifting? RUC and Tsinghua University Introduce LeaF: Pruning Distracting Tokens for Focused Learning
Kimi K2's Key Training Technique: QK-Clip!
Mianbi MiniCPM4: 3x Inference Speed, Outperforming Same-Size Qwen3, Putting Pressure on Alibaba
SLOT: Sample-Specific Inference Optimization Tool Arrives, Boosting Accuracy by 10% Without SFT or RL
Ushering in the Era of On-Device Long Text! OpenBMB's New Architecture Boosts MiniCPM up to 220x Faster
Deep Learning: Mamba Core Author's New Work Replaces DeepSeek's Attention Mechanism, Designed for Inference