China Went HARD...

Introduction & Model Performance 00:00

Qwen3-Coder, a new open-source coding model from China, rivals Anthropic's Claude family of models in coding capability
Measured on SWE-bench Verified, Qwen3-Coder scores 69.6% against Claude Sonnet 4's 70.4%
Although Qwen3-Coder is described as a much smaller model than Claude Sonnet 4, the 0.8-point gap makes their performance effectively the same
The model comes with a command-line interface (CLI) similar to Claude Code, forked from Gemini CLI
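
To make "effectively the same" concrete, the two scores can be converted into solved-task counts. This is a minimal sketch that assumes both percentages were measured on the 500-task SWE-bench Verified set; the task count and the helper name `solved_tasks` are assumptions, not figures from the video.

```python
# Assumption: both scores come from SWE-bench Verified, which has 500 tasks.
TOTAL_TASKS = 500


def solved_tasks(score_percent: float, total: int = TOTAL_TASKS) -> int:
    """Convert a percentage score into a whole number of solved tasks."""
    return round(score_percent / 100 * total)


qwen_solved = solved_tasks(69.6)    # score quoted for Qwen3-Coder
claude_solved = solved_tasks(70.4)  # score quoted for Claude Sonnet 4
gap = claude_solved - qwen_solved

print(f"Qwen3-Coder: {qwen_solved}, Claude Sonnet 4: {claude_solved}, gap: {gap} tasks")
```

Under that assumption the difference comes down to a handful of tasks out of 500.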

Model Architecture & Capabilities 00:59

The most powerful variant is Qwen3-Coder 480B: 480 billion total parameters, with 35 billion active per token in a mixture-of-experts setup
Native context length of 256k tokens, extendable to 1 million tokens using extrapolation methods
Exceptionally strong at tool calling and agentic tasks, supported by the Qwen Code CLI tool
Qwen Code, adapted from Gemini CLI, includes customized prompts and function calling for enhanced agentic coding
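
The gap between total and active parameters comes from mixture-of-experts routing: for each token a router selects a few experts and only those run. The following is a toy sketch of top-k routing in plain Python, not the model's actual implementation; the expert functions, router scores, and the names `top_k_route` and `moe_forward` are all hypothetical.

```python
import math


def top_k_route(gate_logits, k):
    """Pick the k highest-scoring experts and softmax-normalize their weights."""
    order = sorted(range(len(gate_logits)), key=lambda i: gate_logits[i], reverse=True)
    top = order[:k]
    peak = max(gate_logits[i] for i in top)
    exps = [math.exp(gate_logits[i] - peak) for i in top]
    total = sum(exps)
    return top, [e / total for e in exps]


def moe_forward(x, experts, gate_logits, k):
    """Run only the selected experts and mix their outputs by gate weight."""
    chosen, weights = top_k_route(gate_logits, k)
    return sum(w * experts[i](x) for i, w in zip(chosen, weights))


# Hypothetical toy experts: each one just scales its input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
gate_logits = [0.1, 2.0, -1.0, 2.0]  # hypothetical router scores for one token

chosen, weights = top_k_route(gate_logits, k=2)
output = moe_forward(10.0, experts, gate_logits, k=2)

# Figures from the summary: 35B of 480B parameters are active per token.
active_fraction = 35 / 480
print(chosen, weights, output, f"{active_fraction:.1%}")
```

Only two of the four toy experts run for this token, which mirrors how roughly 7% of the 480B parameters are used on any single forward pass.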

Training Data & Methods 01:49

Model pre-trained on 7.5 trillion tokens with a 70% code data ratio, preserving both coding and general abilities
Uses Qwen2.5-Coder to help clean and rewrite noisy data, improving overall data quality
Focused on high-quality coding training data, with reinforcement learning on diverse real-world coding tasks
Automated test case scaling led to better code execution success rates and improvements in other task areas
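
The idea of scoring generated code by execution can be sketched as a pass rate over test cases. This is an illustrative sketch only; the function `pass_rate`, the candidate snippets, and the test cases are hypothetical, and a real pipeline would run untrusted code inside a sandbox rather than with `exec`.

```python
def pass_rate(candidate_source, func_name, test_cases):
    """Run candidate code against (args, expected) pairs; return the fraction passed."""
    namespace = {}
    try:
        exec(candidate_source, namespace)  # unsafe outside a sandbox
        func = namespace[func_name]
    except Exception:
        return 0.0  # code that fails to load scores zero
    passed = 0
    for args, expected in test_cases:
        try:
            if func(*args) == expected:
                passed += 1
        except Exception:
            pass  # a crashing test case counts as a failure
    return passed / len(test_cases)


# Hypothetical candidates, as a model might produce them.
good = "def add(a, b):\n    return a + b\n"
buggy = "def add(a, b):\n    return a - b\n"
tests = [((1, 2), 3), ((0, 0), 0), ((-1, 1), 0), ((2, 2), 4)]

good_score = pass_rate(good, "add", tests)
buggy_score = pass_rate(buggy, "add", tests)
print(good_score, buggy_score)
```

More test cases per task make this signal harder to satisfy by accident, which is one plausible reading of why scaling them improved execution success.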

Reinforcement Learning, Post-Training & Technical Innovations 03:18

Post-training added long-horizon agentic RL (agent RL) so the model learns to solve real-world tasks through multi-turn tool use
A scalable system ran 20,000 independent environments in parallel on Alibaba Cloud infrastructure for self-play training
Achieves state-of-the-art performance among open-source models on SWE-bench Verified without test-time reasoning or scaling
Because the model does not yet use test-time scaling or reasoning, there is room for future improvement
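
The combination of multi-turn tool use and many independent environments can be illustrated with a toy rollout loop. This sketch is not the system described in the video: the environment `GuessEnv`, its `compare` tool, the binary-search stand-in policy, and the thread pool are all assumptions chosen to keep the example small, and no learning step is included.

```python
from concurrent.futures import ThreadPoolExecutor


class GuessEnv:
    """Toy environment: find a hidden integer using a 'compare' tool."""

    def __init__(self, target, low=0, high=1023):
        self.target, self.low, self.high = target, low, high

    def compare(self, guess):
        """The single tool the agent may call on each turn."""
        if guess == self.target:
            return "correct"
        return "higher" if guess < self.target else "lower"


def run_episode(env, max_turns=12):
    """Multi-turn loop: act, read the tool feedback, update state, repeat."""
    low, high = env.low, env.high
    for turn in range(1, max_turns + 1):
        guess = (low + high) // 2  # stand-in policy: binary search
        feedback = env.compare(guess)
        if feedback == "correct":
            return {"reward": 1.0, "turns": turn}
        if feedback == "higher":
            low = guess + 1
        else:
            high = guess - 1
    return {"reward": 0.0, "turns": max_turns}


def run_parallel(targets, workers=8):
    """Roll out one independent environment per target, several at a time."""
    envs = [GuessEnv(t) for t in targets]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_episode, envs))


results = run_parallel(range(0, 1024, 16))
mean_reward = sum(r["reward"] for r in results) / len(results)
print(len(results), mean_reward)
```

In an RL setup the per-episode rewards collected this way would feed a policy update; here they are only averaged.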

Access, Demos & Real-World Examples 04:10

Qwen3-Coder is hosted on Hugging Face and is free to use
Users can generate and execute code directly within the Hugging Face interface
Demonstrated capabilities include physics simulations, interactive visualizations, 3D terrain simulations, a typing speed test app, and small demos and games such as a bouncing ball, a rotating hypercube, a solar system simulation, and Duet

CLI Demo & Closing 06:19

Qwen Code can be set up on the command line with the provided instructions
The demo generated a complex snake game: 792 lines of code in about 60 seconds, functional with minor lag
The presenter encourages viewers to try Qwen Code and share feedback
Video ends with a prompt to like and subscribe
