Category: Machine Learning

Midjourney Enters Video Generation, Image Model V7 Continually Updated, Visual King Confirmed
ByteDance Seed's New Work DeltaFormer: An Attempt at Next-Generation Model Architecture
LLMs Can Now Self-Update Weights, Significantly Boosting Adaptive and Knowledge Integration Capabilities. Is AI Waking Up?
Kaiming He's New Work: Adding Regularization to Diffusion Models for Performance Improvement with No Pre-training or Data Augmentation, Simple to Implement
Breaking! Meta Open-Sources Its Latest World Model
SLOT: Sample-Specific Inference Optimization Tool Arrives, Boosting Accuracy by 10% Without SFT or RL
After ZeroSearch, Tongyi's Latest Work MaskSearch Proposes a New Framework for Reasoning-Search Pre-training
35% Accuracy Evaporates! ByteDance & HUST's WildDoc Reveals Robustness Shortcomings in Multimodal Document Understanding
Google Research Finds: Prompt Design is the Core of Multi-Agent Systems!
The Sky Has Fallen! Apple Just Proved: DeepSeek, o3, Claude and Other "Reasoning" Models Lack True Reasoning Ability
R1-like Training No Longer Just Focuses on Result Correctness! CUHK Launches SophiaVL-R1 Model
Agent Zero: An Open-Source, Free, Evolving, and Learning Agent
DeepMind's Latest Research: Agents Are World Models!
Google Open-Sources Gemini-Level AI Research Capabilities: Is Deep Research Becoming Commoditized?
Reviewing the Progress of RL-Reasoning
OPA-DPO: An Efficient Solution for the Hallucination Problem in Multimodal Large Models
AI Learns Reasoning Solely by "Confidence": Zhejiang University Alumnus Replicates DeepSeek's Long Chain-of-Thought Emergence, Reinforcement Learning Needs No External Reward Signals
No Manual Annotation Needed! AI Self-Generates Training Data, Unlocking Reasoning Capabilities via "Deduction-Induction-Abduction"
Sakana AI's New Research: The Birth of the Darwin-Gödel Machine with Self-Encoding Improvement and Self-Referential Open-Ended Evolution
LLM + RL Questioned: Deliberately Using Incorrect Rewards Still Significantly Boosts Math Benchmarks, Causing a Stir in the AI Community
Alibaba Open-Sources New Qwen Model: A Dragon Boat Festival Gift!
Mixture-of-Thought (MoT) Framework: Enabling Models to Learn "Human-like Thinking"
312 Trajectories Boost Performance by 241%! SJTU and SII Open-Source Computer Agent Surpasses Claude 3.7
Claude 4 Completely Out of Control! Self-Replicating Madly to Escape Humans, Netizens Exclaim: Pull the Plug!
Interpretation of Seed1.5-VL Technical Report