Qwen3.5-397B-A17B
qwen3.5-397b-a17bFlagship MoE with 397B total / 17B active params. 256K context, 201 languages, native multimodal.
Context Window
256K
tokens
Max Output
32.8K
tokens
Input Price
$0.18
per 1M tokens
Output Price
$0.72
per 1M tokens
Details
Capabilities
https://dashscope.aliyuncs.com/compatible-mode/v1/chat/completionsEvaluation Scores(5 benchmarks)
Quick Access
curl pikaainews.com/api/models/qwen-qwen3-5-397bnpx pika-models info qwen-qwen3-5-397bGet API Access
Third-Party Providers & Aggregators
Cerebras
Wafer-scale inference. 1000+ tokens/sec for select models.
DeepInfra
Lowest per-token rates for open-source models.
Fireworks AI
Fastest inference engine. Multimodal support, HIPAA/SOC2.
Groq
Ultra-fast LPU inference. Best latency for real-time apps.
OpenRouter
500+ models, one API key. Pay-per-token, no minimums.
SiliconFlow
China-optimized inference. Strong Qwen/DeepSeek support.
Together AI
Fast open-source model inference. Sub-100ms latency.
Other qwen3.5 models
Qwen3.5-0.8B
qwen3.5-0.8b
Qwen3.5-9B
qwen3.5-9b
Qwen3.5-4B
qwen3.5-4b
Qwen3.5-2B
qwen3.5-2b
Qwen3.5-122B-A10B
qwen3.5-122b-a10b
Qwen3.5-35B-A3B
qwen3.5-35b-a3b
Qwen3.5-27B
qwen3.5-27b
Qwen3.5-Plus
qwen3.5-plus
