All Models
Alibaba QwenactiveOpen Source
Qwen2.5-72B Instruct
qwen2.5-72b-instructPrevious-gen flagship. 72B dense model with strong all-around performance.
Context Window
131.1K
tokens
Max Output
8.2K
tokens
Input Price
$0.90
per 1M tokens
Output Price
$3.60
per 1M tokens
Details
Familyqwen2.5
Parameters72B
Training Cutoff2024-06-01
ReleasedSeptember 19, 2024
Capabilities
VisionFunctionsStreamingJSON ModeCodeTool UseMultimodal
Documentation
https://dashscope.aliyuncs.com/compatible-mode/v1/chat/completionsEvaluation Scores(4 benchmarks)
HumanEval
80.4%
MATH-500
76.2%
MMLU-Pro
71.1%
GPQA Diamond
49%
Quick Access
curl pikaainews.com/api/models/qwen-qwen2-5-72b-instructnpx pika-models info qwen-qwen2-5-72b-instructGet API Access
Third-Party Providers & Aggregators
Cerebras
Wafer-scale inference. 1000+ tokens/sec for select models.
DeepInfra
Lowest per-token rates for open-source models.
Fireworks AI
Fastest inference engine. Multimodal support, HIPAA/SOC2.
Groq
Ultra-fast LPU inference. Best latency for real-time apps.
OpenRouter
500+ models, one API key. Pay-per-token, no minimums.
SiliconFlow
China-optimized inference. Strong Qwen/DeepSeek support.
Together AI
Fast open-source model inference. Sub-100ms latency.
