Category: Large Language Models

Microsoft Research Asia SYNTHLLM: Validating Scaling Laws for Synthetic Data for Language Models
When ChatGPT Broke an Entire Field: An Oral History
Why LLM Agents Perform Poorly: Google DeepMind Research Reveals Three Failure Modes, RL Fine-tuning Can Mitigate
ZTE Wireless Institute "Large Model Diving" Team Releases LLM-Adaptive Question Difficulty Distillation Method, Significantly Enhancing Small Model Reasoning Capabilities
ZTE Research: LLM Adaptive Question Difficulty Grading Distillation Gives Small Models 'Long Chain Thinking'
AI's Second Half: From Algorithms to Utility
Large Language Models Are Definitely Not the End Station to Artificial General Intelligence!
The 'Olympics' of AI? OpenAI Releases New Benchmark MRCR, Pushing Models' 'Needle in a Haystack' Ability to the Limit!
AI Frontier Progress Briefing Today
PPT Agent: AI Tool for Automatic Presentation Generation
First Chapter of 'Reasoning From Scratch' Released: Sebastian Raschka on LLM Reasoning, Pattern Matching, and Foundational Training
DeepSeek makes a big move! New model focuses on mathematical theorem proving, significantly refreshing multiple high-difficulty benchmarks.