Open Menu
AllLocalCommunitiesAbout
lotide
AllLocalCommunitiesAbout
Login

[JS Required] MiniMax M1 model claims Chinese LLM crown from DeepSeek - plus it's true open-source

⁨78⁩ ⁨likes⁩

Submitted ⁨⁨1⁩ ⁨day⁩ ago⁩ by ⁨Pro@programming.dev⁩ to ⁨technology@lemmy.world⁩

https://www.minimax.io/news/minimaxm1

source

Comments

Sort:hotnewtop
  • camilobotero@feddit.dk ⁨1⁩ ⁨day⁩ ago

    Well… 🤔Image

    source
    • LWD@lemm.ee ⁨1⁩ ⁨day⁩ ago

      DeepSeek imposes similar restrictions, but only on their website. You can self-host and then enjoy relatively truthful (as truthful as a bullshit generator can be) answers about both Tianmen Square, Palestine, and South Africa (something American-made bullshit generators apparently like making up, to appease their corporate overlords or conspiracy theorists respectively).

      source
      • Trimatrix@lemmy.world ⁨1⁩ ⁨day⁩ ago

        Nope, Self hosted deepseek 8b thinking and distilled variants still clam up about Tianmen Square

        source
        • -> View More Comments
  • LWD@lemm.ee ⁨1⁩ ⁨day⁩ ago

    What exactly makes this more “open source” than DeepSeek? The linked page doesn’t make that particularly clear.

    DeepSeek doesn’t release their training data (but they release a hell of a lot of other stuff), and I think that’s about as “open” as these companies can get before they risk running afoul of copyright issues. Since you can’t compile the model from scratch, it’s not really open source. It’s just freeware. But that’s true for both models, as far as I can tell.

    source
    • NGnius@lemmy.ca ⁨22⁩ ⁨hours⁩ ago

      Yup, this is open weights just like DeepSeek. Open source should mean their source data is also openly available, but we all know companies won’t do that until they stop violating copyright to train these things.

      source
      • LWD@lemm.ee ⁨22⁩ ⁨hours⁩ ago

        I figured as much. Even this line…

        M1’s capabilities are top-tier among open-source models

        … is right above a chart that calls it “open-weight”.

        I dislike the conflation of terms that the OSI has helped legitimize. Up until LLMs, nobody called binary blobs “open-source” just because they were compiled using open-source tooling. That would be ridiculous

        source
    • fmstrat@lemmy.nowsci.com ⁨1⁩ ⁨day⁩ ago

      Open weights + an OSI approved license is generally what is used to refer to models as open source. the with that said, Deepseek R1 is am MIT license, and this one is Apache 2. Technically that makes Deepseek less restrictive, but who knows.

      source
  • FreeWilliam@lemmy.ml ⁨22⁩ ⁨hours⁩ ago

    Yay another LLM! That’s definitely what the world needs and don’t let anyone make you think otherwise. This is so fun guys. Let’s fund the surveillance, stealing, misinformation, harmful biases, and destruction of the planet. I can’t believe some people think that humanity is more important than another “open source” crazy pro max ultra 8K AI 9999!

    source