Comment on The Irony of 'You Wouldn't Download a Car' Making a Comeback in AI Debates

QuadratureSurfer@lemmy.world 1 week ago

OK, but the most important part of that research paper is also published in the GitHub repository, which explains how to supply audio data and text data to recreate any STT model the same way they did.

See the “Approach” section of the GitHub repository: github.com/openai/whisper?tab=readme-ov-file#appr…

And the “Training Data” section of the model card on their GitHub: github.com/openai/whisper/blob/…/model-card.md#tr…

With this, you don’t really need the paper hosted on arXiv; you already have enough information to train or modify the model.

There are guides on how to fine-tune the model yourself: huggingface.co/blog/fine-tune-whisper

From what I understand of the linked OSAID, that is exactly what it asks for. The ability to retrain or fine-tune a model fits this definition very well:

The preferred form of making modifications to a machine-learning system is:

  • Data information […]
  • Code […]
  • Weights […]

All 3 of those have been provided.
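
To make the checklist concrete, here is a rough sketch of how I read it. The `Release` class and `missing_components` function are names I made up for illustration and are not part of the OSAID or any library, and the `True` values for Whisper simply encode my claim above rather than an official assessment.

```python
from dataclasses import dataclass


@dataclass
class Release:
    """What a model release makes available (hypothetical structure)."""
    name: str
    data_information: bool  # description of the training data
    code: bool              # training / inference code
    weights: bool           # trained model parameters


def missing_components(release: Release) -> list[str]:
    """Return the checklist items the release does not provide."""
    required = {
        "Data information": release.data_information,
        "Code": release.code,
        "Weights": release.weights,
    }
    return [label for label, provided in required.items() if not provided]


# Assumption: these flags reflect my reading of the Whisper repo, nothing more.
whisper = Release(name="Whisper", data_information=True, code=True, weights=True)
print(missing_components(whisper))  # prints []
```

A release that shipped only weights would come back with "Data information" and "Code" listed as missing, which is the kind of case the definition is meant to exclude.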
