Open Menu
AllLocalCommunitiesAbout
lotide
AllLocalCommunitiesAbout
Login

Tokasaurus: An LLM Inference Engine for High-Throughput Workloads

⁨0⁩ ⁨likes⁩

Submitted ⁨⁨1⁩ ⁨day⁩ ago⁩ by ⁨cm0002@lemmy.world⁩ to ⁨technology@lemmy.zip⁩

https://scalingintelligence.stanford.edu/blogs/tokasaurus/

source

Comments

Sort:hotnewtop