Llama 3 70B Inference Costs Drop 40 Percent on Groq Versus AWS Bedrock
We ran 10,000 requests across providers yesterday. Groq averaged 12 ms time-to-first-token, while Bedrock hovered around 450 ms. At scale, inference comes to $0.85 per million tokens on Groq compared with $1.20 on Bedrock.
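
For anyone who wants to check the arithmetic, here is a minimal sketch. The function names are invented for this example, the two prices are the ones quoted above, and the 1 billion token volume is an assumed figure for illustration, not something we measured. The quoted prices on their own work out to roughly a 29 percent reduction, so the headline 40 percent presumably reflects factors not itemized in this post.

```python
def percent_reduction(baseline: float, candidate: float) -> float:
    """Return how much cheaper `candidate` is than `baseline`, in percent."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (baseline - candidate) / baseline * 100.0


def cost_for_tokens(price_per_million: float, tokens: int) -> float:
    """Return the dollar cost of `tokens` at a given price per million tokens."""
    return price_per_million * tokens / 1_000_000


# Prices per million tokens, as quoted in the post.
GROQ_PRICE = 0.85
BEDROCK_PRICE = 1.20

# Assumed volume for illustration only (hypothetical, not from the test run).
ASSUMED_TOKENS = 1_000_000_000

reduction = percent_reduction(BEDROCK_PRICE, GROQ_PRICE)
groq_cost = cost_for_tokens(GROQ_PRICE, ASSUMED_TOKENS)
bedrock_cost = cost_for_tokens(BEDROCK_PRICE, ASSUMED_TOKENS)

print(f"Per-token price reduction: {reduction:.1f}%")
print(f"Groq cost:    ${groq_cost:,.2f}")
print(f"Bedrock cost: ${bedrock_cost:,.2f}")
print(f"Difference:   ${bedrock_cost - groq_cost:,.2f}")
```

Swap in your own volume for ASSUMED_TOKENS to see what the gap looks like for your workload.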