Comment on What are the differences between conversation, intents, intent_script, and responses?

<- View Parent
RandomLegend@lemmy.dbzer0.com ⁨11⁩ ⁨months⁩ ago

First of all: It increases performance tremendously. For comparison



- medium-int8 WER 95% – Processing time ~0.5s

So running it on a cheap 100€ used GPU can get you results where Alexa, Siri and Google have to respect you in terms of accuracy and speed. This is a gamechanger for me. I already installed 3 M5Stack ATOM ECHOs in my Home and more will soon come in. It’s incredibly accurate and quick.

Now, to get it running it’s actually pretty easy. First go to this link and download all the files. You have to build a custom docker image with those files. I have no idea how to do that with barebones docker as i am using portainer for everything. In Portainer you have to do:

  1. “Images” in the navigation menu
  2. "+ Build new image" on the right hand of the header of your images list
  3. name it wyoming-whisper
  4. Copy and paste the content of “Dockerfile” you downloaded earlier into the "Web Editor"
  5. Under “Upload” you click on “Select Files” and select the Makefile and run.sh
  6. Click on “Build the image”

Next you go

  1. "Stacks" in your Navigation menu
  2. "+ Add stack" at the right side
  3. Give it a name (whisper e.g.)
  4. Copy the content of docker-compose.example.yml from the files you downloaded earlier.

That will spin up a docker-compose with the local custom image you just built, running faster-whisper that is compatible with the wyoming protocol in home assistant and that can run on an NVidia GPU with cuda acceleration.

As you can see in the docker-compose it will expose port 10300. Next:

  1. go into Home Assistant
  2. open Integrations
  3. click on Wyoming
  4. add a device
  5. input the IP of your external GPU server and the port 10300

It will automagically know that it’s whisper and will be fully integrated into your system. You can now add it into your voice assistant.

If you look at the logs of your new docker container you can see every voice command that is sent to your new whisper.

source
Sort:hotnewtop