Latest Articles
All Articles

English

Category: AI Evaluation

0% Pass Rate! The Code Myth Debunked! LiveCodeBench Pro Released!
Comprehensive Evaluation of 12 Latest GraphRAG Techniques
ICML 2025 | Bursting the AI Bubble with 'Human Testing Methods': Building a Capability-Oriented Adaptive Assessment New Paradigm
Can LLMs Understand Math? Latest Research Reveals Fatal Flaws in Large Models' Mathematical Reasoning
AI's Second Half: From Algorithms to Utility

←
1
→

2025 AINews. All rights reserved.