API Doc

Qwen3-Coder-480B-A35B-Instruct

Enterprise-grade code generation & agentic automation — open-source, affordable & global-scale

The Qwen3-Coder-480B-A35B-Instruct is the world’s most advanced open-source coding model. With native 256 K tokens (extendable to 1 M) and mixture-of-experts efficiency, it matches or beats closed models like Claude Sonnet 4 and GPT-4o on SWE-Bench Verified — at a fraction of the cost.

API Access

To use the API for inference, please register an account first. You can view and manage your API token in the API Token dashboard.

All requests to the inference API require authentication via an API token. The token uniquely identifies your account and grants secure access to .

When calling the API, set the Authorization header to your API token, configure the request parameters as shown below, and send the request.

Why Qwen3-Coder API?

  • 480 B parameters / 35 B active Mixture-of-Experts for unmatched reasoning depth.
  • Agentic coding & function-calling: automate PR reviews, multi-step builds, CLI tasks.
  • Up to 1 M-token context using YaRN: load entire monorepo in one call.
  • Open-source Apache-2.0: audit, fine-tune or self-host — no vendor lock-in.
  • < 100 ms p95 latency via global GPU edge + speculative decoding.
  • Transparent pricing from $0.012 / 1K output tokens — 6x cheaper than Claude Sonnet.
Open-source Apache-2.0, view code on GitHub
ModelArchitectureNative ContextMax ContextOpen SourceTypical Price*
Qwen3-Coder 480B-A35B480 (35 active)262 K1MYes$0.012 / 1K
Claude Sonnet 4≈ 220200 K200 KNo$0.08 / 1K
GPT-4o-128 K128 KNo$0.05 / 1K
Gemini Pro 2.5-128 K128 KNo$0.03 / 1K

*Prices as of July 2025.

Popular Use Cases of Qwen3-Coder API

🔄 Automated Refactoring

Upload an entire micro-service repo (≤ 500 K tokens) and ask Qwen3-Coder to modernise it to TypeScript.

✅ CI Assistant & Test Generation

Generate edge-case tests and inline docstrings as part of your CI pipeline.

🛠️ Agentic DevOps

Combine Qwen3-Coder with shell-tool calling to spin up containers, run benchmarks and open pull requests — hands-free.

Benchmark Highlights (July 2025)

  • SWE-Bench Verified: *81.6 %*, on par with Claude Sonnet 4 and GPT-4o.
  • CodeForces ELO: +267 above baseline open-source models.
  • Tool-Use (BrowserBench-Tools): 94.1 % task completion.
Read methodology & results →

Frequently Asked Questions of Qwen3-Coder API

Q What does “A35B” mean in Qwen/Qwen3-Coder-480B-A35B-Instruct?
Q Is Qwen3-Coder really open source (Apache-2.0)?
Q What coding workflows does the Qwen3-Coder API support?
Q Which languages and stacks does Qwen3-Coder target?
Q Can Qwen3-Coder handle large repos and long files?
Q How does Qwen3-Coder perform on coding benchmarks?
Q Does Qwen3-Coder integrate with IDEs and CI?
Q What about latency, streaming, and throughput?
Q What about privacy, IP, and security?