CodingApache

Qwen3 Coder 30B A3B

by Alibaba

Nov 1, 2025262K context$0.13/M input$0.50/M output

Qwen3 Coder 30B A3B fits on a single 80GB GPU while delivering exceptional coding performance. Only 3B active params enables blazing fast inference. 256K context handles entire codebases.

View on HuggingFace
Fleek Pricing
$0.0025/GPU-second
Context262K tokens
Estimated Token Cost
Input
$0.13/M
Output
$0.50/M
Based on 4,500 tokens/sec
vs CompetitorsSave 52%

Overview

Parameters

30B (MoE)

Architecture

Mixture of Experts

Context

262K

Provider

Alibaba

Best For

Agentic codingCode completionRefactoringTest generation

OpenAI Compatible

Drop-in replacement for OpenAI API. Just change the base URL.

Pay Per Second

Only pay for actual GPU compute time. No idle costs.

Enterprise Ready

99.9% uptime SLA, SOC 2 compliant, dedicated support.

Auto Scaling

Scales from zero to thousands of requests automatically.

Calculate Your Savings

See how much you'd save running Qwen3 Coder 30B A3B on Fleek

Qwen3 Coder 30B A3B
Your Fleek Cost
$50-63/mo
20.0K-25.0K GPU-sec × $0.0025
Fireworks AI
$140/mo
Your Savings60%
Annual Savings
$1.0K

Technical Specifications

Model NameQwen3 Coder 30B A3B
Total Parameters30B (MoE)
Active Parameters3B
ArchitectureMixture of Experts
Context Length262K tokens
Inference Speed4,500 tokens/sec
ProviderAlibaba
Release DateNov 1, 2025
LicenseApache 2.0
HuggingFacehttps://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct

Ready to run Qwen3 Coder 30B A3B?

Join the waitlist for early access. Start free with $5 in credits.

View Pricing