IndexTTS2_ Vocal and Emotional Transfer _ Two person Dialogue+Single person Speaking Workflow
详情
下载文件
模型描述
你可以点击下方链接直接试用。如果效果良好,你可以将其部署到本地。
https://www.runninghub.ai/post/1968314493635842049/?inviteCode=1cqzbf7a
粉丝福利:注册即得1000积分,每日登录100积分,畅玩4090!体验48G的超凡实力。
这是一个用于复制人声与情感的工作流,可生成单人讲话或两人对话的情感音频,相较于以往生成僵硬人声的模型,表现更优,强烈推荐。ComfyUI的部署难度相对较高,首先需确保transformer版本为4.51.0,并确保已安装JSON5模块。
项目页面:https://github.com/billwuhao/ComfyUI_IndexTTS
模型下载链接:
https://hf-mirror.com/nvidia/bigvgan_v2_22khz_80band_256x/tree/main
https://hf-mirror.com/funasr/campplus/tree/main
https://hf-mirror.com/IndexTeam/IndexTTS-2/tree/main
https://hf-mirror.com/amphion/MaskGCT/tree/main/semantic_codec
https://hf-mirror.com/facebook/w2v-bert-2.0/tree/main
模型放置结构:
- bigvgan_v2_22khz_80band_256x
bigvgan_generator.pt
config.json
- campplus
campplus_cn_common.bin
- IndexTTS-2
│ .gitattributes
│ bpe.model
│ config.yaml
│ feat1.pt
│ feat2.pt
│ gpt.pth
│ README.md
│ s2mel.pth
│ wav2vec2bert_stats.pt
│
└─ qwen0.6bemo4-merge
added_tokens.json
chat_template.jinja
config.json
generation_config.json
merges.txt
model.safetensors
Modelfile
special_tokens_map.json
tokenizer.json
tokenizer_config.json
vocab.json
- MaskGCT
semantic_codec
model.safetensors
- w2v-bert-2.0
.gitattributes
config.json
conformer_shaw.pt
model.safetensors
preprocessor_config.json
README.md
