cross-posted from: programming.dev/post/51407459
Check which models you can run and how many tokens per second you would get… It has examples for many models and quantization levels. Huge resource!
Submitted 2 days ago by anzo@programming.dev to selfhosted@lemmy.world