AINews
  • Latest Articles
  • All Articles
  • English

    Category: Model Training

    • Under $8,000! Sina Weibo's 1.5B Small Model Surpasses Near-Trillion Parameter Models
    • Inoculation Prompting: Making Large Language Models "Misbehave" During Training to Improve Test-Time Alignment
    • Revisiting Qwen3's Abandoned Mixed Inference Mode
    • How Mathematical Training "Unlocks" General Reasoning Abilities in Large Models? Latest Research Reveals Key Mechanisms
    • NVIDIA (ProRL) | Can RL truly enhance the reasoning capabilities of LLMs?
    • Train a Tiny LLM from Scratch for Just ¥8 in 9 Hours! Full Tutorial Including Reasoning, MoE, and More
    • AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale
    • ByteDance Seed's New Method! Open-Source 8B Code Model: Trains Itself by Curating Its Own Data, Achieves SoTA at Its Scale, and Even Surpasses 10 Billion Parameter Competitors
    • ZTE Wireless Institute "Large Model Diving" Team Releases LLM-Adaptive Question Difficulty Distillation Method, Significantly Enhancing Small Model Reasoning Capabilities
    • First Chapter of 'Reasoning From Scratch' Released: Sebastian Raschka on LLM Reasoning, Pattern Matching, and Foundational Training
    • ←
    • 1
    • →
    2025 AINews. All rights reserved.