What exactly makes this more “open source” than DeepSeek? The linked page doesn’t make that particularly clear.
DeepSeek doesn’t release their training data (but they release a hell of a lot of other stuff), and I think that’s about as “open” as these companies can get before they risk running afoul of copyright issues. Since you can’t compile the model from scratch, it’s not really open source. It’s just freeware. But that’s true for both models, as far as I can tell.
camilobotero@feddit.dk 1 day ago
Well… 🤔Image
LWD@lemm.ee 1 day ago
DeepSeek imposes similar restrictions, but only on their website. You can self-host and then enjoy relatively truthful (as truthful as a bullshit generator can be) answers about both Tianmen Square, Palestine, and South Africa (something American-made bullshit generators apparently like making up, to appease their corporate overlords or conspiracy theorists respectively).
Trimatrix@lemmy.world 1 day ago
Nope, Self hosted deepseek 8b thinking and distilled variants still clam up about Tianmen Square