Comment on WhisperX — Automated Transcripts w/ Timestamps and Speaker Tagging
hoshikarakitaridia@lemmy.world 1 week agoLong videos or voice notes where you’re usually just looking for a small snippet.
Comment on WhisperX — Automated Transcripts w/ Timestamps and Speaker Tagging
hoshikarakitaridia@lemmy.world 1 week agoLong videos or voice notes where you’re usually just looking for a small snippet.
irmadlad@lemmy.world 1 week ago
Now that’s an interesting angle. I am a mediocre musician on my best day, but sometimes I incorporate phrases and lyric snippits in a piece. I wonder if I could use WhisperX to find those words or phrases from a stack of songs. For instance, I did a piece that used a line from Jimi Hendrix’s ‘If 6 were 9’ where he says ‘I’m the one who’s gotta die when it’s time for me to die. So let me live my life the way I want to.’ I wonder if WhisperX could pick that out of a stack of Jimi Hendrix songs.
dgdft@lemmy.world 1 week ago
You should be able to get decent results from that if you pipe your tracks through demucs first to isolate the vocals.
github.com/adefossez/demucs
irmadlad@lemmy.world 1 week ago
I use UVR for vocal isolation. It just works, but that shouldn’t be a problem. I’ll check it out. At the worst, I’ll learn something.
hoshikarakitaridia@lemmy.world 1 week ago
It might take a while, but when your PC is working on it you are not and searching for words might be easier ^^
I’m excited to hear how well it works ^^
irmadlad@lemmy.world 1 week ago
I’m always excited to try new stuff. You never know. A use case might develop that you didn’t think of.