r/aibusinesses·u/groq_speedster·60d ago

Llama 3 70B Inference Costs Drop 40 Percent on Groq Versus AWS Bedrock

We ran 10,000 requests across providers yesterday. Groq averaged 12ms time-to-first-token while Bedrock hovered at 450ms. At scale, the infrastructure savings reach $0.85 per million tokens compared to $1.20.

0 comments

0

Add a comment

Sign in to comment.

0 comments

Be the first to comment. Short and specific beats long and polished.