Category: AI Agents
- The More You Fail, The Faster You Learn! Trajectory Rewriting Allows AI Agents to Create Perfect Experiences from Mistakes!
- The Two Major Pain Points of Agent Long-Range Search Have Been Solved! CAS DeepMiner Runs Nearly 100 Rounds with 32k Context as Open-Source Performance Closes In on Closed Source
- Abandoning Fine-Tuning: Stanford Co-releases Agentic Context Engineering (ACE), Boosting Model Performance by 10% and Reducing Token Costs by 83%
- Google Enters the CUA Battleground, Launches Gemini 2.5 Computer Use: Allowing AI to Directly Operate the Browser
- Stanford Proposes New RL Paradigm: 3B Model Agent Outperforms Claude, GPT-4
- OpenAI Board Chair: "Per-Token Billing" Is Completely Wrong, Market Will Eventually Choose "Outcome-Based Pricing"
- ARPO: Agentic Reinforced Policy Optimization, Enabling Agents to Explore One Step Further at Critical Moments
- RAG Can Also Reason! Thoroughly Solving the Multi-Source Heterogeneous Knowledge Challenge
- OpenAI Podcast Revisited: The AI Coding War! Developers Are the Most Fortunate: Specialized Code Models Will Emerge! Host Leaks: "I Like Claude the Most!"
- RL Scaling Breakthrough! DeepSWE Open-Source AI Agent Tops Leaderboard, Training Methods and Weights Fully Released
- One of the Greatest AI Interviews of the Century: AI Safety, Agents, OpenAI, and Other Key Topics
- The Autonomous Agent Approach Is Wrong! Chinese Scholars Propose LLM-HAS: Shifting from "Autonomous Capability" to "Collaborative Intelligence"
- Amazon's New SOP Benchmark: The Ultimate Test for AI Agents. How Do Top Agents Score?
- Breaking! Meta Open-Sources Its Latest World Model
- Wharton Professor Ethan: Are We Really Using AI? Or Just Letting It Fill Blanks, Cut Costs, and Accelerate the Path to Extinction?
- Microsoft Releases AI Agent Failure Whitepaper, Detailing Various Malicious Agents
- Sam Altman: Codex Made Me Feel AGI! Latest Talk Offers a Rare Glimpse of the Next-Gen "Perfect Model," Boldly Predicts Agents Will Break Boundaries Next Year!
- RMoA: Residual Extraction Mixture-of-Agents, Enabling Agents to Discover New Information and Adaptively Stop [ACL2025]
- Building an AI Software Engineer in Two Years! OpenAI Codex Authors Unveil a New Paradigm for Human-AI Pair Programming
- Microsoft Releases NLWeb: The Secret Weapon to Turn Any Website into an AI Application!
- 312 Trajectories Boost Performance by 241%! SJTU and SII Open-Source Computer Agent Surpasses Claude 3.7
- How Does Claude 4 Think? Senior Researchers Respond: RLHF Paradigm Is Out, RLVR Proven in Programming/Mathematics
- The Strongest Programming AI is Born! Claude 4 Programs Autonomously for 7 Hours, Real-world Details Astound Programmers
- OpenAI's Big Move! Core API Now Supports MCP, Revolutionizing Agent Development Overnight
- Understanding RAG, Agent, and Multimodality: Industry Practices and Future Trends