Guest User

Vibevoice RVC manual Install

a guest
Sep 20th, 2025
8,767
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 7.22 KB | None | 0 0
  1. Manual Install Method from my last Youtube video: https://youtu.be/XtaPZmlyMMw
  2.  
  3.  
  4. Manual Installation Guide for VibeVoice WebUI
  5.  
  6.  
  7. Step 1: Prerequisites
  8.  
  9. Before you begin, make sure you have installed:
  10.  
  11. Git – https://git-scm.com/download/win
  12.  
  13. Python 3.10 – https://www.python.org/downloads/release/python-31011/
  14.  
  15. NVIDIA CUDA Toolkit 12.8 (or newer) – https://developer.nvidia.com/cuda-downloads
  16.  
  17. Step 2: Download and Install VibeVoiceTTS
  18.  
  19. Open cmd or PowerShell in the folder where you want to install.
  20.  
  21. Clone the repo:
  22.  
  23. git clone https://github.com/SUP3RMASS1VE/VibeVoiceTTS.git
  24. cd VibeVoiceTTS
  25.  
  26. Step 3: Create Python Virtual Environment
  27.  
  28. Inside the VibeVoiceTTS folder:
  29.  
  30. python -m venv .venv
  31. .venv\Scripts\activate
  32.  
  33. Step 4: Install Dependencies
  34.  
  35. Run the following commands:
  36.  
  37. pip install uv
  38. uv pip install torch==2.7.0 torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
  39. uv pip install triton-windows
  40. uv pip install https://github.com/petermg/flash_attn_windows/releases/download/01/flash_attn-2.7.4.post1+cu128.torch270-cp310-cp310-win_amd64.whl
  41. uv pip install -e .
  42.  
  43.  
  44. This installs Torch with CUDA 12.8 support, FlashAttention, and all VibeVoice dependencies.
  45.  
  46. Step 5: Download the Launcher
  47.  
  48. Download the helper launcher script into the VibeVoice folder:
  49.  
  50. curl -L -o LAUNCHER_VibeVoice.bat "https://huggingface.co/Aitrepreneur/FLX/resolve/main/LAUNCHER_VibeVoice.bat?download=true"
  51.  
  52. Step 6: Run the WebUI
  53.  
  54. Start the WebUI with the launcher
  55.  
  56.  
  57.  
  58.  
  59.  
  60. Manual Installation Guide for RVC WebUI (Windows)
  61.  
  62. Step 1: Prerequisites
  63.  
  64. Make sure you have installed:
  65.  
  66. 7-Zip – https://www.7-zip.org/
  67.  
  68. Git – https://git-scm.com/download/win
  69.  
  70. Step 2: Download and Extract RVC WebUI
  71.  
  72. Decide your GPU type:
  73.  
  74. NVIDIA GPU:
  75. https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/RVC1006Nvidia.7z
  76.  
  77. AMD / Intel GPU:
  78. https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/RVC1006AMD_Intel.7z
  79.  
  80. Download and extract with 7-Zip. Your folder should look like:
  81.  
  82. RVC1006Nvidia/
  83. ├── go-web.bat
  84. ├── requirements.txt
  85. ├── infer-web.py
  86. └── ...
  87.  
  88. Step 3: Run the WebUI
  89.  
  90. Double-click go-web.bat
  91.  
  92.  
  93.  
  94.  
  95. Manual Installation Guide for VibeVoice WebUI (RunPod)
  96.  
  97. Step 1: Prerequisites
  98.  
  99. Create a RunPod account: https://bit.ly/runpodAi
  100.  
  101. Deploy a pod with the PyTorch 2.1
  102.  
  103. Open a terminal inside /workspace
  104.  
  105. Step 2: Install System Packages
  106. apt-get update
  107. apt-get install -y build-essential ninja-build cmake ffmpeg git
  108.  
  109. Step 3: Clone and Enter Repo
  110. git clone https://github.com/SUP3RMASS1VE/VibeVoiceTTS.git
  111. cd VibeVoiceTTS
  112.  
  113. Step 4: Create Python Environment
  114. python3 -m venv .venv
  115. source .venv/bin/activate
  116. pip install -U pip setuptools wheel packaging
  117.  
  118. Step 5: Install Dependencies
  119. pip install uv
  120. uv pip install torch==2.8.0 torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
  121. uv pip install "flash-attn==2.8.3" --no-build-isolation
  122. uv pip install -e .
  123.  
  124. Step 6: Run the WebUI
  125. python demo/gradio_demo.py --share
  126.  
  127.  
  128. Open the provided public link in your browser.
  129.  
  130.  
  131.  
  132.  
  133. Manual Installation Guide for RVC WebUI (RunPod)
  134.  
  135.  
  136. Step 1: Prerequisites
  137.  
  138. Create a RunPod account: https://bit.ly/runpodAi
  139.  
  140. Deploy a pod with the PyTorch 2.1
  141.  
  142. Open a terminal inside /workspace
  143.  
  144. Step 2: Update System
  145. apt update && apt upgrade -y
  146.  
  147. Step 3: Download and Extract RVC
  148. wget https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/RVC1006Nvidia.7z
  149. apt install p7zip-full -y
  150. 7z x RVC1006Nvidia.7z
  151. cd RVC1006Nvidia
  152.  
  153. Step 4: Install Dependencies
  154. apt update && apt upgrade -y
  155. pip install --force-reinstall -v "av==11.0.0"
  156. pip install matplotlib==3.7.0
  157. pip install -r requirements.txt
  158.  
  159. Step 5: Run the WebUI
  160. python infer-web.py
  161.  
  162.  
  163.  
  164.  
  165. Manual Installation Guide for ComfyUI (Windows)
  166.  
  167. Step 1: Prerequisites
  168.  
  169. Install:
  170.  
  171. 7-Zip – https://www.7-zip.org/
  172.  
  173. Git – https://git-scm.com/download/win
  174.  
  175. Step 2: Download and Extract ComfyUI
  176.  
  177. Download latest portable NVIDIA build:
  178. https://github.com/comfyanonymous/ComfyUI/releases/latest
  179.  
  180. Extract with 7-Zip. Example folder layout:
  181.  
  182. ComfyUI_windows_portable/
  183. ├── ComfyUI/
  184. │ ├── models/
  185. │ └── custom_nodes/
  186. ├── python_embeded/
  187. └── run_nvidia_gpu.bat
  188.  
  189. Step 3: Install Custom Nodes
  190.  
  191. Go to ComfyUI_windows_portable\ComfyUI\custom_nodes and run:
  192.  
  193. git clone https://github.com/ltdrdata/ComfyUI-Manager.git
  194. git clone https://github.com/rgthree/rgthree-comfy
  195. git clone https://github.com/Enemyx-net/VibeVoice-ComfyUI
  196. git clone https://github.com/diodiogod/TTS-Audio-Suite
  197.  
  198.  
  199. Then install dependencies for each node that has a requirements.txt:
  200.  
  201. ..\..\python_embeded\python.exe -m pip install -r <node>\requirements.txt
  202.  
  203. Step 4: Run ComfyUI
  204.  
  205. Go back to ComfyUI_windows_portable and double-click:
  206.  
  207. run_nvidia_gpu.bat
  208.  
  209.  
  210.  
  211. Manual Installation Guide for ComfyUI (RunPod)
  212.  
  213.  
  214. Step 1: Prerequisites
  215.  
  216. Create account at https://bit.ly/runpodAi
  217. Pick a 24 GB VRAM GPU with the **aitrepreneur/comfyui** template
  218. Access port **8888**
  219. Watch logs:
  220. `tail -f workspace/logs/comfyui.log`
  221.  
  222. Step 2: Install System Packages
  223. apt-get update
  224. DEBIAN_FRONTEND=noninteractive apt-get install -y \
  225. git git-lfs curl ffmpeg portaudio19-dev libasound2-dev \
  226. build-essential ninja-build cmake
  227. git lfs install
  228.  
  229. Step 3: Setup Python Virtual Environment
  230. cd /workspace/ComfyUI
  231. python3 -m venv venv
  232. source venv/bin/activate
  233.  
  234. pip install -U pip uv wheel setuptools
  235. export UV_LINK_MODE=copy
  236. export PYTHONNOUSERSITE=1
  237. unset PYTHONPATH
  238.  
  239. Step 4: Install Torch (CUDA 12.8 build)
  240. uv pip install --upgrade torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
  241. uv pip uninstall -y xformers
  242.  
  243. Step 5: Clone Required Custom Nodes
  244. cd custom_nodes
  245. git clone https://github.com/diodiogod/TTS-Audio-Suite
  246. git clone https://github.com/Enemyx-net/VibeVoice-ComfyUI
  247. git clone https://github.com/rgthree/rgthree-comfy
  248.  
  249. Step 6: Install Python Dependencies
  250.  
  251. Run this with uv to pin correct versions:
  252.  
  253. uv pip install \
  254. "numpy==2.2.6" \
  255. "librosa==0.11.0" \
  256. "soundfile>=0.12.0" \
  257. "sounddevice>=0.4.0" \
  258. "accelerate>=1.6.0" \
  259. "transformers==4.51.3" \
  260. "diffusers==0.35.1" \
  261. "scipy" \
  262. "ml-collections" \
  263. "peft>=0.17.0" \
  264. "huggingface_hub>=0.25.1" \
  265. "absl-py" \
  266. "aiortc==1.13.0" \
  267. "av==14.4.0" \
  268. "bitsandbytes==0.47.0" \
  269. "conformer==0.3.2" \
  270. "x-transformers==2.7.6" \
  271. "torchdiffeq==0.2.5" \
  272. "wandb==0.21.4" \
  273. "ema-pytorch==0.7.7" \
  274. "vocos==0.1.0" \
  275. "monotonic-alignment-search==0.2.0" \
  276. "faiss-cpu>=1.7.4" \
  277. "praat-parselmouth>=0.4.6" \
  278. "pyworld==0.3.5" \
  279. "torchfcpe==0.0.4" \
  280. "opencv-python-headless==4.12.0.88" \
  281. "pillow" \
  282. "datasets==4.0.0" \
  283. "requests" \
  284. "dacite==1.9.2" \
  285. "unidecode==1.4.0" \
  286. "jieba==0.42.1" \
  287. "pypinyin==0.55.0" \
  288. "torchsde==0.2.6"
  289.  
  290. Step 7: Download HuBERT Model
  291. mkdir -p custom_nodes/TTS-Audio-Suite/models/hubert
  292. curl -L -o custom_nodes/TTS-Audio-Suite/models/hubert/hubert_base.pt \
  293. https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/hubert_base.pt
  294.  
  295. mkdir -p custom_nodes/VibeVoice-ComfyUI/models
  296. ln -sf ../TTS-Audio-Suite/models/hubert/hubert_base.pt \
  297. custom_nodes/VibeVoice-ComfyUI/models/hubert_base.pt
  298.  
  299. Step 8: Run ComfyUI
  300.  
Add Comment
Please, Sign In to add comment