Comment on The Irony of 'You Wouldn't Download a Car' Making a Comeback in AI Debates
QuadratureSurfer@lemmy.world 2 months agoThe STT (speech to text) model that they created is open source (Whisper) as well as a few others:
Comment on The Irony of 'You Wouldn't Download a Car' Making a Comeback in AI Debates
QuadratureSurfer@lemmy.world 2 months agoThe STT (speech to text) model that they created is open source (Whisper) as well as a few others:
WalnutLum@lemmy.ml 2 months ago
Those aren’t open source, neither by the OSI’s Open Source Definition nor by the OSI’s Open Source AI Definition.
The important part for the latter being a published listing of all the training data. (Trainers don’t have to provide the data, but they must provide at least a way to recreate the model given the same inputs).
They are model-available if anything.
QuadratureSurfer@lemmy.world 2 months ago
I did a quick check on the license for Whisper:
So that definitely meets the Open Source Definition on your first link.
And it looks like it also meets the definition of open source as per your second link.
WalnutLum@lemmy.ml 2 months ago
Model weights by themselves do not qualify as “open source”, as the OSAID qualifies. Weights are not source.
This is not training data. These are testing metrics.
QuadratureSurfer@lemmy.world 2 months ago
I don’t understand. What’s missing from the code, model, and weights provided to make this “open source” by the definition of your first link? it seems to meet all of those requirements.
As for the OSAID, the exact training dataset is not required, per your quote, they just need to provide enough information that someone else could train the model using a “similar dataset”.