Free for a Limited Time Only, MiniMax-M2.5 Now First to Go Live on NetMind (Before the Official Launch)!

Our CEO Kai Zou's Opinion is featured in Forbes again!

Our Chief Commercial Officer Dr Seena Rejal's opinion is featured on CNBC again!

Our Chief Commercial Officer Dr Seena Rejal's opinion is featured on Reuters!

Our Chief Commercial Officer Dr Seena Rejal's opinion is featured on Financial Times!

Our gemini-3-flash-preview API is now at $0.375/1M Input Tokens | $2.25/1M Output Tokens!

Our gemini-3-pro-preview API is now at $3/1M Input Tokens | $13.5/1M Output Tokens!

Our Nano Banana Pro API is now $0.12/Image for 1K, $0.12/Image for 2K, & $0.2/Image for 4K!

Our wan2.6-image API is now at $0.026/Image!

It’s Official! We Are Now Included in Anthropic’s Model Context Protocol Servers Repo!

Our wan2.6-i2v API is now $0.08/Second for 720p & $0.12/Second for 1080p!

Read How We Helped a Fintech Company Transform Unstructured Documents into Searchable Data Assets (Case Study)!

Read How We Helped Our Client Reinvent Debt Collection (Case Study)!

Read How We Helped a Digital Finance Leader Build an Intelligent Social Listening System (Case Study)!

Free for a Limited Time Only, MiniMax-M2.5 Now First to Go Live on NetMind (Before the Official Launch)!

Our CEO Kai Zou's Opinion is featured in Forbes again!

Our Chief Commercial Officer Dr Seena Rejal's opinion is featured on CNBC again!

Our Chief Commercial Officer Dr Seena Rejal's opinion is featured on Reuters!

Our Chief Commercial Officer Dr Seena Rejal's opinion is featured on Financial Times!

Our gemini-3-flash-preview API is now at $0.375/1M Input Tokens | $2.25/1M Output Tokens!

Our gemini-3-pro-preview API is now at $3/1M Input Tokens | $13.5/1M Output Tokens!

Our Nano Banana Pro API is now $0.12/Image for 1K, $0.12/Image for 2K, & $0.2/Image for 4K!

Our wan2.6-image API is now at $0.026/Image!

It’s Official! We Are Now Included in Anthropic’s Model Context Protocol Servers Repo!

Our wan2.6-i2v API is now $0.08/Second for 720p & $0.12/Second for 1080p!

Read How We Helped a Fintech Company Transform Unstructured Documents into Searchable Data Assets (Case Study)!

Read How We Helped Our Client Reinvent Debt Collection (Case Study)!

Read How We Helped a Digital Finance Leader Build an Intelligent Social Listening System (Case Study)!

Qwen3-Coder-480B-A35B-Instruct

API Doc

Detail

Qwen3-Coder-480B-A35B-Instruct

Enterprise-grade code generation & agentic automation — open-source, affordable & global-scale

The Qwen3-Coder-480B-A35B-Instruct is the world’s most advanced open-source coding model. With native 256 K tokens (extendable to 1 M) and mixture-of-experts efficiency, it matches or beats closed models like Claude Sonnet 4 and GPT-4o on SWE-Bench Verified — at a fraction of the cost.

API Access

To use the API for inference, please register an account first. You can view and manage your API token in the API Token dashboard.

All requests to the inference API require authentication via an API token. The token uniquely identifies your account and grants secure access to .

When calling the API, set the Authorization header to your API token, configure the request parameters as shown below, and send the request.

Why Qwen3-Coder API?

480 B parameters / 35 B active Mixture-of-Experts for unmatched reasoning depth.
Agentic coding & function-calling: automate PR reviews, multi-step builds, CLI tasks.
Up to 1 M-token context using YaRN: load entire monorepo in one call.
Open-source Apache-2.0: audit, fine-tune or self-host — no vendor lock-in.
< 100 ms p95 latency via global GPU edge + speculative decoding.
Transparent pricing from $0.012 / 1K output tokens — 6x cheaper than Claude Sonnet.

Open-source Apache-2.0, view code on GitHub

Model	Architecture	Native Context	Max Context	Open Source	Typical Price*
Qwen3-Coder 480B-A35B	480 (35 active)	262 K	1M	Yes	$0.012 / 1K
Claude Sonnet 4	≈ 220	200 K	200 K	No	$0.08 / 1K
GPT-4o	-	128 K	128 K	No	$0.05 / 1K
Gemini Pro 2.5	-	128 K	128 K	No	$0.03 / 1K

*Prices as of July 2025.

Popular Use Cases of Qwen3-Coder API

🔄 Automated Refactoring

Upload an entire micro-service repo (≤ 500 K tokens) and ask Qwen3-Coder to modernise it to TypeScript.

✅ CI Assistant & Test Generation

Generate edge-case tests and inline docstrings as part of your CI pipeline.

🛠️ Agentic DevOps

Combine Qwen3-Coder with shell-tool calling to spin up containers, run benchmarks and open pull requests — hands-free.

Benchmark Highlights (July 2025)

SWE-Bench Verified: *81.6 %*, on par with Claude Sonnet 4 and GPT-4o.
CodeForces ELO: +267 above baseline open-source models.
Tool-Use (BrowserBench-Tools): 94.1 % task completion.

Read methodology & results →

Frequently Asked Questions of Qwen3-Coder API

Q What does “A35B” mean in Qwen/Qwen3-Coder-480B-A35B-Instruct?

Q Is Qwen3-Coder really open source (Apache-2.0)?

Q What coding workflows does the Qwen3-Coder API support?

Q Which languages and stacks does Qwen3-Coder target?

Q Can Qwen3-Coder handle large repos and long files?

Q How does Qwen3-Coder perform on coding benchmarks?

Q Does Qwen3-Coder integrate with IDEs and CI?

Q What about latency, streaming, and throughput?

Q What about privacy, IP, and security?