InfiniteTalk: Audio-Portrait to Lip-Synced Video in ComfyUI

🚀 Create realistic talking avatar videos from a single portrait and voice input — with accurate lip-sync and identity-stable animation.

▶️ Run Directly in Cloud:
https://www.runcomfy.com/comfyui-workflows/comfyui-infinitetalk-workflow-audio-portrait-to-lip-synced-video?utm_source=civitai

💡 Overview

InfiniteTalk is a ComfyUI workflow that generates lip-synced talking videos from a single image and voice input. Powered by the MultiTalk AI model, it produces fluid, identity-stable portrait clips with natural speech motion and prompt-driven customizable animation.

Ideal for content creators, educators, marketers, and anyone who needs realistic talking avatars without filming.

✨ Key Features

Single Image + Audio: Just provide a portrait and a voice clip — the workflow handles the rest.
Accurate Lip-Sync: Natural mouth movements precisely synchronized to the audio input.
Identity Preservation: Facial structure, expression style, and appearance remain consistent throughout.
Prompt-Driven Customization: Fine-tune animation behavior and visual style with text prompts.

🚀 Getting Started

Upload a portrait image — clear, well-lit, forward-facing works best.
Provide an audio clip — speech or narration you want the avatar to speak.
Generate — the workflow produces a lip-synced video with the original audio muxed in.

Click the "Run Directly" link above to bypass local setup and test this workflow immediately in your browser.

모델 유형	워크플로우
기본 모델	SD 1.5
게시일	2026-03-23

InfiniteTalk: Audio-Portrait to Lip-Synced Video in ComfyUI

세부 정보

파일 다운로드 (1)

이 버전에 대해

모델 설명

💡 Overview

✨ Key Features

🚀 Getting Started

이 모델로 만든 이미지