Index-tts2 single or two person
Details
Download Files
About this version
Model description
This is the best open-source text-to-speech project until December 2025.
Workflows include single-person and two-person TTS (Text-to-Speech).
Should require 10GB of GPU memory.
First, install this custom nodes via ComfyUI:
https://github.com/billwuhao/ComfyUI_IndexTTS
Then download the models:
V2 Model Download: Manually download the models to the specified folder under ComfyUI\models\TTS:
https://hf-mirror.com/nvidia/bigvgan_v2_22khz_80band_256x/tree/main
https://hf-mirror.com/funasr/campplus/tree/main
https://hf-mirror.com/IndexTeam/IndexTTS-2/tree/main
https://hf-mirror.com/amphion/MaskGCT/tree/main/semantic_codec
https://hf-mirror.com/facebook/w2v-bert-2.0/tree/main
- bigvgan_v2_22khz_80band_256x bigvgan_generator.pt
config.json
-campplus
campplus_cn_common.bin
- IndexTTS-2
│ .gitattributes
│ bpe.model
│ config.yaml
│ feat1.pt
│ feat2.pt
│ gpt.pth
│ README.md
│ s2mel.pth
│ wav2vec2bert_stats.pt
│
└─ qwen0.6bemo4-merge
added_tokens.json
chat_template.jinja
config.json
generation_config.json
merges.txt
model.safetensors
Modelfile
special_tokens_map.json
tokenizer.json
tokenizer_config.json
vocab.json
-MaskGCT
semantic_codec
model.safetensors
-w2v-bert-2.0
.gitattributes
config.json
conformer_shaw.pt model.safetensors
preprocessor_config.json
Please note that Windows users may need to install wheels, such as Triton and SageAttention.
If your ComfyUI prompts you to install them,
you can: Install Triton by running the following command:
pip install -U "triton-windows<3.6"
Install SageAttention by going to https://github.com/woct0rdho/SageAttention/releases Find the wheel that matches your Torch and CUDA versions and run, for example, the following command:
pip install sageattention-2.2.0+cu130torch2.9.0andhigher.post4-cp39-abi3-win_amd64.whl


