Dagmawi Babi
Believer of Christ | Creative Developer.

Files Channel: https://t.me/+OZ9Ul_rSBAQ0MjNk

Community: @DagmawiBabiChat
Well, this is very impressive!

Cerebras Systems Inference can serve Llama 3.1 70B at 450 tokens/sec and Llama 3.1 8B at 1,850 tokens/sec. I don't even know how this is possible tbh.
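
For a rough sense of scale, here's a back-of-the-envelope sketch of what those rates mean for a single response. The 1,000-token response length is just an assumed example, the function name is made up for illustration, and the math assumes a steady generation rate and ignores time to first token.

```python
# Throughput figures quoted in the post (tokens per second).
RATES = {"Llama 3.1 70B": 450, "Llama 3.1 8B": 1850}


def seconds_to_generate(num_tokens: int, tokens_per_sec: float) -> float:
    """Time to stream num_tokens at a steady rate (ignores time to first token)."""
    return num_tokens / tokens_per_sec


# 1,000 tokens is an assumed response length, not a figure from the post.
for model, rate in RATES.items():
    print(f"{model}: {seconds_to_generate(1000, rate):.2f} s per 1,000 tokens")
# Llama 3.1 70B: 2.22 s per 1,000 tokens
# Llama 3.1 8B: 0.54 s per 1,000 tokens
```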

Try it and see how fast it is:
inference.cerebras.ai

#CerebrasSystems #LLAMA #LLM #AIML
@Dagmawi_Babi