bal_gennady

Установака TTS модели локально

Sep 22nd, 2025 (edited)
72
0
Never
1
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 8.17 KB | Software | 0 0
  1. ## Руководство по ручной установке для VibeVoice WebUI
  2.  
  3. Step 1: Prerequisites
  4.  
  5. Before you begin, make sure you have installed:
  6.  
  7. 1. Git – https://git-scm.com/download/win
  8. 2. Python 3.10 – https://www.python.org/downloads/release/python-31011/
  9. 3. NVIDIA CUDA Toolkit 12.8 (or newer) – https://developer.nvidia.com/cuda-downloads
  10.  
  11. Step 2: Download and Install VibeVoiceTTS
  12.  
  13. - Open cmd or PowerShell in the folder where you want to install.
  14. - Clone the repo:
  15.  
  16. git clone https://github.com/SUP3RMASS1VE/VibeVoiceTTS.git
  17. cd VibeVoiceTTS
  18.  
  19. Step 3: Create Python Virtual Environment
  20.  
  21. Внутри папки VibeVoiceTTS:
  22.  
  23. python -m venv .venv
  24. .venv\Scripts\activate
  25.  
  26. Step 4: Install Dependencies
  27.  
  28. Выполните следующие команды:
  29.  
  30. pip install uv
  31. uv pip install torch==2.7.0 torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
  32. uv pip install triton-windows
  33. uv pip install https://github.com/petermg/flash_attn_windows/releases/download/01/flash_attn-2.7.4.post1+cu128.torch270-cp310-cp310-win_amd64.whl
  34. uv pip install -e .
  35.  
  36. Это установит Torch с поддержкой CUDA 12.8, FlashAttention и все зависимости VibeVoice.
  37.  
  38. Step 5: Download the Launcher
  39.  
  40. Загрузите скрипт запуска помощника в папку VibeVoice:
  41.  
  42. curl -L -o LAUNCHER_VibeVoice.bat "https://huggingface.co/Aitrepreneur/FLX/resolve/main/LAUNCHER_VibeVoice.bat?download=true"
  43.  
  44. Step 6: Run the WebUI
  45.  
  46. Запустите WebUI с помощью лаунчера
  47.  
  48. ------------------------------------------------------------------------------------------------------------------------------------
  49.  
  50. ## Руководство по ручной установке для RVC WebUI (Windows)
  51.  
  52. Step 1: Prerequisites
  53.  
  54. Make sure you have installed:
  55.  
  56. - 7-Zip – https://www.7-zip.org/
  57. - Git – https://git-scm.com/download/win
  58.  
  59. Step 2: Download and Extract RVC WebUI
  60.  
  61. Определите тип вашего графического процессора:
  62.  
  63. 1. NVIDIA GPU:
  64. https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/RVC1006Nvidia.7z
  65. 2. AMD / Intel GPU:
  66. https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/RVC1006AMD_Intel.7z
  67.  
  68. Download and extract with 7-Zip. Ваша папка должна выглядеть так:
  69.  
  70. RVC1006Nvidia/
  71. ├── go-web.bat
  72. ├── requirements.txt
  73. ├── infer-web.py
  74. └── ...
  75.  
  76. Step 3: Run the WebUI
  77.  
  78. Double-click go-web.bat
  79.  
  80. -----------------------------------------------------------------------------------------------------------------------
  81.  
  82. ## Руководство по ручной установке VibeVoice WebUI (RunPod)
  83.  
  84. Step 1: Prerequisites
  85.  
  86. 1. Create a RunPod account: https://bit.ly/runpodAi
  87. 2. Deploy a pod with the PyTorch 2.1
  88. 3. Open a terminal inside /workspace
  89.  
  90. Step 2: Install System Packages
  91.  
  92. apt-get update
  93. apt-get install -y build-essential ninja-build cmake ffmpeg git
  94.  
  95. Step 3: Clone and Enter Repo
  96.  
  97. git clone https://github.com/SUP3RMASS1VE/VibeVoiceTTS.git
  98. cd VibeVoiceTTS
  99.  
  100. Step 4: Create Python Environment
  101.  
  102. python3 -m venv .venv
  103. source .venv/bin/activate
  104. pip install -U pip setuptools wheel packaging
  105.  
  106. Step 5: Install Dependencies
  107.  
  108. pip install uv
  109. uv pip install torch==2.8.0 torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
  110. uv pip install "flash-attn==2.8.3" --no-build-isolation
  111. uv pip install -e .
  112.  
  113. Step 6: Run the WebUI
  114.  
  115. python demo/gradio_demo.py --share
  116.  
  117.  
  118. Open the provided public link in your browser.
  119.  
  120. --------------------------------------------------------------------------------------------------------------------------
  121.  
  122.  
  123. ## Руководство по ручной установке для RVC WebUI (RunPod)
  124.  
  125. Step 1: Prerequisites
  126.  
  127. 1. Create a RunPod account: https://bit.ly/runpodAi
  128. 2. Deploy a pod with the PyTorch 2.1
  129. 3. Open a terminal inside /workspace
  130.  
  131. Step 2: Update System
  132.  
  133. apt update && apt upgrade -y
  134.  
  135. Step 3: Download and Extract RVC
  136.  
  137. wget https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/RVC1006Nvidia.7z
  138. apt install p7zip-full -y
  139. 7z x RVC1006Nvidia.7z
  140. cd RVC1006Nvidia
  141.  
  142. Step 4: Install Dependencies
  143.  
  144. apt update && apt upgrade -y
  145. pip install --force-reinstall -v "av==11.0.0"
  146. pip install matplotlib==3.7.0
  147. pip install -r requirements.txt
  148.  
  149. Step 5: Run the WebUI
  150.  
  151. python infer-web.py
  152.  
  153. ----------------------------------------------------------------------------------
  154.  
  155. ## Руководство по ручной установке для ComfyUI (Windows)
  156.  
  157. Step 1: Prerequisites
  158.  
  159. Install:
  160. - 7-Zip – https://www.7-zip.org/
  161. - Git – https://git-scm.com/download/win
  162.  
  163. Step 2: Download and Extract ComfyUI
  164.  
  165. Download latest portable NVIDIA build:
  166. https://github.com/comfyanonymous/ComfyUI/releases/latest
  167.  
  168. Extract with 7-Zip. Example folder layout:
  169.  
  170. ComfyUI_windows_portable/
  171. ├── ComfyUI/
  172. │ ├── models/
  173. │ └── custom_nodes/
  174. ├── python_embeded/
  175. └── run_nvidia_gpu.bat
  176.  
  177. Step 3: Install Custom Nodes
  178.  
  179. Go to ComfyUI_windows_portable\ComfyUI\custom_nodes and run:
  180.  
  181. git clone https://github.com/ltdrdata/ComfyUI-Manager.git
  182. git clone https://github.com/rgthree/rgthree-comfy
  183. git clone https://github.com/Enemyx-net/VibeVoice-ComfyUI
  184. git clone https://github.com/diodiogod/TTS-Audio-Suite
  185.  
  186.  
  187. Then install dependencies for each node that has a requirements.txt:
  188.  
  189. ..\..\python_embeded\python.exe -m pip install -r <node>\requirements.txt
  190.  
  191. Step 4: Run ComfyUI
  192.  
  193. Go back to ComfyUI_windows_portable and double-click:
  194.  
  195. run_nvidia_gpu.bat
  196.  
  197. ---------------------------------------------------------------------------------------------
  198.  
  199. ## Руководство по ручной установке для r ComfyUI (RunPod)
  200.  
  201.  
  202. Step 1: Prerequisites
  203.  
  204. 1. Create account at https://bit.ly/runpodAi
  205. 2. Pick a 24 GB VRAM GPU with the **aitrepreneur/comfyui** template
  206. 3. Access port **8888**
  207. 4. Watch logs:
  208. `tail -f workspace/logs/comfyui.log`
  209.  
  210. Step 2: Install System Packages
  211.  
  212. apt-get update
  213. DEBIAN_FRONTEND=noninteractive apt-get install -y \
  214. git git-lfs curl ffmpeg portaudio19-dev libasound2-dev \
  215. build-essential ninja-build cmake
  216. git lfs install
  217.  
  218. Step 3: Setup Python Virtual Environment
  219.  
  220. cd /workspace/ComfyUI
  221. python3 -m venv venv
  222. source venv/bin/activate
  223.  
  224. pip install -U pip uv wheel setuptools
  225. export UV_LINK_MODE=copy
  226. export PYTHONNOUSERSITE=1
  227. unset PYTHONPATH
  228.  
  229. Step 4: Install Torch (CUDA 12.8 build)
  230.  
  231. uv pip install --upgrade torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
  232. uv pip uninstall -y xformers
  233.  
  234. Step 5: Clone Required Custom Nodes
  235.  
  236. cd custom_nodes
  237. git clone https://github.com/diodiogod/TTS-Audio-Suite
  238. git clone https://github.com/Enemyx-net/VibeVoice-ComfyUI
  239. git clone https://github.com/rgthree/rgthree-comfy
  240.  
  241. Step 6: Install Python Dependencies
  242.  
  243. Run this with uv to pin correct versions:
  244.  
  245. uv pip install \
  246. "numpy==2.2.6" \
  247. "librosa==0.11.0" \
  248. "soundfile>=0.12.0" \
  249. "sounddevice>=0.4.0" \
  250. "accelerate>=1.6.0" \
  251. "transformers==4.51.3" \
  252. "diffusers==0.35.1" \
  253. "scipy" \
  254. "ml-collections" \
  255. "peft>=0.17.0" \
  256. "huggingface_hub>=0.25.1" \
  257. "absl-py" \
  258. "aiortc==1.13.0" \
  259. "av==14.4.0" \
  260. "bitsandbytes==0.47.0" \
  261. "conformer==0.3.2" \
  262. "x-transformers==2.7.6" \
  263. "torchdiffeq==0.2.5" \
  264. "wandb==0.21.4" \
  265. "ema-pytorch==0.7.7" \
  266. "vocos==0.1.0" \
  267. "monotonic-alignment-search==0.2.0" \
  268. "faiss-cpu>=1.7.4" \
  269. "praat-parselmouth>=0.4.6" \
  270. "pyworld==0.3.5" \
  271. "torchfcpe==0.0.4" \
  272. "opencv-python-headless==4.12.0.88" \
  273. "pillow" \
  274. "datasets==4.0.0" \
  275. "requests" \
  276. "dacite==1.9.2" \
  277. "unidecode==1.4.0" \
  278. "jieba==0.42.1" \
  279. "pypinyin==0.55.0" \
  280. "torchsde==0.2.6"
  281.  
  282. Step 7: Download HuBERT Model
  283.  
  284. mkdir -p custom_nodes/TTS-Audio-Suite/models/hubert
  285. curl -L -o custom_nodes/TTS-Audio-Suite/models/hubert/hubert_base.pt \
  286. https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/hubert_base.pt
  287.  
  288. mkdir -p custom_nodes/VibeVoice-ComfyUI/models
  289. ln -sf ../TTS-Audio-Suite/models/hubert/hubert_base.pt \
  290. custom_nodes/VibeVoice-ComfyUI/models/hubert_base.pt
  291.  
  292. Step 8: Run ComfyUI
  293.  
Tags: ai tts
Comments
  • bal_gennady
    70 days
    # text 0.19 KB | 0 0
    1. Ручной метод установки голоса ИИ для преобразования текста в речь накомпьютере from Youtube video: https://youtu.be/XtaPZmlyMMw
Add Comment
Please, Sign In to add comment