Portable Whisper Speech to Text with Speaker Diarization and VAD | Purfview Faster Whisper XXL
Download Standalone Whisper / Standalone Whisper XXL
https://github.com/Purfview/whisper-standalone-win
Generation settings I use for subtitles:
--beam_size 5
--best_of 5
--temperature 0.0
--compression_ratio_threshold 2.4
--logprob_threshold -1.0
--condition_on_previous_text True
--suppress_tokens -1
--word_timestamps True
--prepend_punctuations ".。,,!!??"
--append_punctuations ".。,,!!??"
OpenAI Whisper prompting:
https://cookbook.openai.com/examples/whisper_prompting_guide
Other Videos By NanoNomad
2025-04-17 | DiffRhythm: Generative Music (done quickly) |
2025-02-25 | Is YuE the Stable Diffusion of Music? | Generate Full-Length Songs with Vocals at Home |
2025-02-10 | Portable Whisper Speech to Text with Speaker Diarization and VAD | Purfview Faster Whisper XXL |
2024-07-03 | Fine Tuning XTTS v2 for Hindi Speech with forked Coqui TTS |
2024-06-26 | Fine Tuning XTTS v2 with forked Coqui | Coqui AI is dead; Long live Coqui! |
2024-06-20 | 2x Faster LLM Training on Windows | LLaMA-Factory with Unsloth and Flash Attention 2 |
2024-06-15 | 64kb Scene Demo/Intro/Cracktro Multimedia Mix #1 (90 min) | Flash/Photo-sensitivity Warning |
2024-06-10 | Stable Audio Open 1.0 | Open Source* Generative Audio and Fine Tuning* |
2024-06-04 | Troubleshooting Sega Saturn Emulation with Retroarch for iOS/Apple |
2024-05-29 | Play Windows 98 and MS-DOS Games on iPad/iOS/iPhone with DOSBox-Pure and Retroarch for FREE |
2024-05-25 | The Lost Art of Optical Disc Repair | Fixing and Testing a PlayStation Disc |
2024-05-22 | Retroarch iOS Updates | Improved Performance, MS-DOS Core, Doom and Touch Input |
2024-05-17 | RetroArch for iPad and iPhone now on the App Store | Installation, Setup, Quick Performance Overview |
2024-05-13 | Micca Speck 4K Media Player | Unboxing, Firmware Update, Setup, Demos, and Opinions |
2024-05-06 | Training SDXL to Generate Text Using IA3 LoRA | It's like Kai's Power Tools, I Guess? |
2024-04-17 | Replacing Faulty Asus Phoenix RTX 3060 GPU Cooler - It's Easy |
2024-03-21 | Bark TTS, Seamless Translation, RVC, Music Generation and More with the TTS Generation WebUI |
2024-02-14 | Train Better Stable Diffusion Models | Prep Datasets Using this Free "Magic" Image Tool |
2024-02-12 | Emulate a Sound Blaster in real MS-DOS on Modern Hardware | Retro Gaming on "Current" PCs |
2024-01-28 | How to Play Hundreds of Point-and-Click Adventures on iOS for FREE with ScummVM with NO SIDELOADING |
2024-01-18 | Training LoRAs and GLoRAs for Stable Diffusion 1.5 and XL Using the New Prodigy Optimizer |