Free plan available

Groq

AI inference platform for fast open model execution.

Visitgroq.com
Intro

What is Groq?

Groq is a low-latency AI inference platform powered by the LPU™ Inference Engine. Developed to run open-weights models quickly, the platform provides cloud-based access via groq cloud as well as on-premise configurations. Users seeking efficient processing solutions can utilize groq ai to run popular groq models like Llama 4 and DeepSeek at scale. The platform operates on a developer-focused token infrastructure, serving users looking for reliable performance (sometimes typed as qroq by searchers).

Groq at a glance
Free, on-demand from $0.05/M tokens3.6M monthly visitsHas free access
Pricing

Groq Pricing Plans

Compare Groq free options, Groq paid pricing plans, and usage notes before you choose the best way to use this AI tool in 2026.

Free, on-demand from $0.05/M tokens

$0.05 per M input / $0.08 per M output tokens

High-efficiency model operating at fast inference speeds.

$0.11 per M input / $0.34 per M output tokens

Next-generation model delivering fast execution speeds.

$0.75 per M input / $0.99 per M output tokens

Distilled reasoning model structured for complex workloads.

$50.00 per Million characters

Text-to-Speech model with throughput of 140 characters/second.

$0.111 per hour transcribed

Speech recognition model with a minimum charge of 10 seconds per request.

Pricing updated:Jun 11, 2026

Features

Groq AI Features

LPU™ (Language Processing Unit) engine designed for fast text and language token generationOpenAI API specification compatibility requiring only minor code adjustmentsOn-demand pricing for widely-used open models such as Llama 4, Qwen, and DeepSeekBatch API capabilities for handling large request workloads with a discountSupport for integrated Text-to-Speech (TTS) and speech-to-text models
Pros & Cons

Groq Pros and Cons

Pros

  • Low-latency token generation speeds compared to standard cloud GPU hosters
  • Simple integration via OpenAI compatible endpoints
  • Competitive on-demand pricing rates
  • Free tier available to facilitate early-stage development

Limitations

  • Access is restricted to open-source models rather than custom proprietary architectures
  • Rate limits apply on the free tier to manage network load

Groq FAQ

You can get started by signing up for an account on the console to retrieve your groq api key. The endpoint features OpenAI compatibility, meaning you only need to redirect your base URL and update the authorization key to run queries.