Category: Large Language Models
- Microsoft Research Asia SYNTHLLM: Validating Scaling Laws for Synthetic Data for Language Models
- When ChatGPT Broke an Entire Field: An Oral History
- Why LLM Agents Perform Poorly: Google DeepMind Research Reveals Three Failure Modes, RL Fine-tuning Can Mitigate
- ZTE Wireless Institute "Large Model Diving" Team Releases LLM-Adaptive Question Difficulty Distillation Method, Significantly Enhancing Small Model Reasoning Capabilities
- ZTE Research: LLM Adaptive Question Difficulty Grading Distillation Gives Small Models 'Long Chain Thinking'
- AI's Second Half: From Algorithms to Utility
- Large Language Models Are Definitely Not the End Station to Artificial General Intelligence!
- The 'Olympics' of AI? OpenAI Releases New Benchmark MRCR, Pushing Models' 'Needle in a Haystack' Ability to the Limit!
- AI Frontier Progress Briefing Today
- PPT Agent: AI Tool for Automatic Presentation Generation
- First Chapter of 'Reasoning From Scratch' Released: Sebastian Raschka on LLM Reasoning, Pattern Matching, and Foundational Training
- DeepSeek makes a big move! New model focuses on mathematical theorem proving, significantly refreshing multiple high-difficulty benchmarks.