Back to APIs Directory
Groq Cloud API
Groq•Pay-as-you-go (low cost)
API Description
Ultra-fast LLM inference API powered by LPU (Language Processing Unit) chipsets. Hosts Llama 3.1, Mixtral 8x7b, and Gemma 2 at extreme speeds.
Integration & Documentation
Read the official API documentation to implement endpoint calls in your application:
https://console.groq.com/docs/quickstartView Docs
Category
InferenceTags
speed inference llama groq
Added on: 6/7/2026