Category: AI Reasoning
- Can LLMs Handle the Real-World "Overflow" of Inference and Prediction, Supported by Prior and Posterior Mechanisms?
- Meta Introduces Deep Think with Confidence: Boosting Reasoning Accuracy and Efficiency with Minimal Changes
- Tsinghua Research: A Reversal? Confirming RL Doesn't Truly Enhance Base Model Reasoning Ability!
- Apple's 'Illusion of Thinking' Paper Criticized Again, Claude and Human Co-authored Paper Points Out Its Three Key Flaws
- The First Multimodal Dedicated Slow-Thinking Framework! Outperforms GPT-o1 by Nearly 7 Percentage Points, Reinforcement Learning Teaches VLM to "Think Twice"
- Peking University Alumna Lilian Weng's Latest Blog Post: Why We Think
- First Explanation of How LLMs Reason and Reflect: Northwestern University & Google's New Framework Introduces Bayesian Adaptive Reinforcement Learning to Comprehensively Enhance Mathematical Reasoning
- Large Models Struggling with Sudoku?! Transformer Author's Startup Releases Ranking: o3 Mini High's "Variant Sudoku" Accuracy Only 2.9%
- The Smarter AI Gets, The Less Obedient It Becomes! New Study: Strongest Reasoning Models Only Follow Instructions 50% of the Time
- Breakthrough in Reasoning: How SoftCoT++ Enables LLMs to 'Think Multiple Paths'?
- First AI Thinking Encyclopedia Born, Model Reasoning No Longer a Black Box
- Ant Group's Wu Wei: A Big Guess on the Next Generation 'Reasoning' Model Paradigm
- DeepSeek Accuracy and Efficiency Doubled, Huawei & CAS Propose Chain-of-Thought "Early Exit" Mechanism
- ZTE Research: LLM Adaptive Question Difficulty Grading Distillation Gives Small Models 'Long Chain Thinking'
- AI Frontier Progress Briefing Today