Category: Fine-tuning
- Anthropic Team Uncovers 'Persona Variables' to Control Large Language Model Behavior, Cracking the Black Box of AI Madness
- 4B Qwen3 Overtakes 671B DeepSeek! Is ByteDance's DAPO Fine-tuning Method That Powerful?
- Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning