All Models
GoogleactiveOpen Source
Gemma 3 12B
gemma-3-12bEfficient 12B open model balancing quality and resource requirements.
Context Window
131.1K
tokens
Max Output
8.2K
tokens
Input Price
—
per 1M tokens
Output Price
—
per 1M tokens
Details
Familygemma
Parameters12B
Training Cutoff2024-12-01
ReleasedMarch 12, 2025
Evaluation Scores(2 benchmarks)
HumanEval
55%
MMLU-Pro
42.5%
Quick Access
curl pikaainews.com/api/models/google-gemma-3-12bnpx pika-models info google-gemma-3-12bGet API Access
Third-Party Providers & Aggregators
DeepInfra
Lowest per-token rates for open-source models.
Fireworks AI
Fastest inference engine. Multimodal support, HIPAA/SOC2.
Groq
Ultra-fast LPU inference. Best latency for real-time apps.
OpenRouter
500+ models, one API key. Pay-per-token, no minimums.
Together AI
Fast open-source model inference. Sub-100ms latency.
