Mixtral 8x7B, just out. Codes better than ChatGPT in the few prompts I’ve done so far, and I can run it at 2 to 3 tokens per second on my GPU-less laptop.
Comment on ByteDance is secretly using OpenAI’s tech to build a competitor
muntedcrocodile@lemmy.world 11 months agoWhat models have u foubd to surpass 3.5? Any outpaced gpt4 yet?
4onen@lemmy.world 11 months ago
cyd@lemmy.world 11 months ago
Mistral 7B and deepseek-ai are two open-weight models that surpass 3.5, though not 4, on several measures.