Dagmawi Babi
Believer of Christ | Creative Developer.

Files Channel: https://t.me/+OZ9Ul_rSBAQ0MjNk

Community: @DagmawiBabiChat
Well, this is very impressive!

Cerebras Systems Inference can serve Llama 3.1 70B at 450 tokens/sec and Llama 3.1 8B at 1,850 tokens/sec. I don't even know how this is possible, tbh.

Try it and see how fast it is:
inference.cerebras.ai

#CerebrasSystems #LLAMA #LLM #AIML
@Dagmawi_Babi