All Models
Alibaba QwenactiveOpen Source

Qwen3-Omni

qwen3-omni

Natively omni-modal: text, images, audio, video input + real-time speech output. SOTA on 32/36 audio benchmarks.

Context Window

131.1K

tokens

Max Output

8.2K

tokens

Input Price

per 1M tokens

Output Price

per 1M tokens

Details

Familyqwen3-omni
Training Cutoff2025-06-01
ReleasedSeptember 22, 2025

Capabilities

VisionFunctionsStreamingAudioVideoTool UseMultimodal

Evaluation Scores(1 benchmarks)

MMLU-Pro
72%

Quick Access

curl pikaainews.com/api/models/qwen-qwen3-omni
npx pika-models info qwen-qwen3-omni