Category: Large Language Models

Say Less 'Wait', Do More: NoWait Reshapes Large Model Inference Paths
ACL 2025 | Large Models "Spreading Misinformation"? DRAG's Two-Stage "Multi-Agent Debate" Solves Hallucination on Hallucination
0% Pass Rate! The Code Myth Debunked! LiveCodeBench Pro Released!
Traditional RAG: Knows How to Read, But Not How to Use? RAG+ Elevates Reasoning Capabilities to New Heights!
LLMs Can Now Self-Update Weights, Significantly Enhancing Self-Adaptation and Knowledge Integration Capabilities – Has AI Awakened?
NVIDIA (ProRL) | Can RL truly enhance the reasoning capabilities of LLMs?
AI Can Read Between the Prompts! Vibe Coding: Regular User vs. Programmer – Cambridge's Latest Report
Did "More is Better" Fail? ModelSwitch Jumps Out of the Sampling Black Hole, Rewriting the LLM Inference Paradigm
Google AI Roadmap Revealed: Is the Attention Mechanism Being Abandoned? Transformer Has Fatal Flaws!
Comprehensive Evaluation of 12 Latest GraphRAG Techniques
o3-pro Completes 'Sokoban,' Classic Retro Games Become New Benchmarks for Large Models
4B Qwen3 Overtakes 671B DeepSeek! Is ByteDance's DAPO Fine-tuning Method That Powerful?
Devin Co-founder: Stop Building Multi-Agent Systems! Microsoft and OpenAI's Agent Building Philosophy Is Fundamentally Flawed! Context Engineering Will Be the New Standard, Employee: Boss, Stop Leaking Secrets
AI Completes 12 Years of Human Work in 2 Days, Automatically Updates Literature Reviews, Outperforming Humans by Nearly 15% in Accuracy
More Toxic, More Secure? Harvard Team's Latest Research: 10% Toxic Training Makes Large Models Invulnerable
Multi-Agent Systems Are "Burning" Tokens! Everything Anthropic Has Discovered
Apple's 'Illusion of Thinking' Paper Criticized Again, Claude and Human Co-authored Paper Points Out Its Three Key Flaws
AI Acts as Its Own Network Administrator, Achieving a "Safety Aha-Moment" and Reducing Risk by 9.6%
The Autonomous Agent Approach Is Wrong! Chinese Scholars Propose LLM-HAS: Shifting from "Autonomous Capability" to "Collaborative Intelligence"
Berkeley and Stanford Collaborate to Create an "AI Research Prophet": Predicting Research Idea Prospects with 77% Accuracy
First-Hand Review of Seedance 1.0 Pro: ByteDance's Game-Changer Dominates the Video AI Model Arena
OpenAI's Strongest Reasoning Model, o3-pro, Has Just Launched! Crushing Gemini 2.5 Pro!
Mianbi MiniCPM4: 3x Inference Speed, Outperforming Same-Size Qwen3, Putting Pressure on Alibaba
Stanford-NYU Joint Study: Surprising Discoveries on AI and Human Thought Differences — Why Large Models Are 'Smart' but Not 'Wise'?