CodingApache

Qwen3 Coder 30B A3B

by Alibaba

Nov 1, 2025•262K context•$0.13/M input•$0.50/M output

Qwen3 Coder 30B A3B fits on a single 80GB GPU while delivering exceptional coding performance. Only 3B active params enables blazing fast inference. 256K context handles entire codebases.

View on HuggingFace

Fleek Pricing

$0.0025/GPU-second

Context262K tokens

Estimated Token Cost

Input

$0.13/M

Output

$0.50/M

Based on 4,500 tokens/sec

vs CompetitorsSave 52%

Overview

Parameters

30B (MoE)

Architecture

Mixture of Experts

Context

262K

Provider

Alibaba

Best For

Agentic codingCode completionRefactoringTest generation

OpenAI Compatible

Drop-in replacement for OpenAI API. Just change the base URL.

Pay Per Second

Only pay for actual GPU compute time. No idle costs.

Enterprise Ready

99.9% uptime SLA, SOC 2 compliant, dedicated support.

Auto Scaling

Scales from zero to thousands of requests automatically.

Calculate Your Savings

See how much you'd save running Qwen3 Coder 30B A3B on Fleek

Model

Qwen3 Coder 30B A3B

Compare To

Monthly Usage: 100M tokens

Quick Select

Your Fleek Cost

$50-63/mo

20.0K-25.0K GPU-sec × $0.0025

Fireworks AI

$140/mo

Your Savings60%

Annual Savings

$1.0K

See all models in full calculatororUpload your bill for custom analysis

Technical Specifications

Model Name	Qwen3 Coder 30B A3B
Total Parameters	30B (MoE)
Active Parameters	3B
Architecture	Mixture of Experts
Context Length	262K tokens
Inference Speed	4,500 tokens/sec
Provider	Alibaba
Release Date	Nov 1, 2025
License	Apache 2.0
HuggingFace	https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct