Sync Lipsync-2-Pro
Generate lip-synced videos with enhanced detail preservation for beards, teeth, and facial features using Sync Labs' Lipsync-2-Pro model with diffusion-based super resolution
View detailsInputs
Loading workflow structure...
Overview
Sync Lipsync-2-Pro combines a source video with an audio track to create a lip-synced video. Use it for detail-sensitive talking-head clips when facial features, beards, teeth, active-speaker handling, or mouth obstructions matter more than fastest generation.
Use cases
- Match a new voiceover to an existing spokesperson or product-demo video.
- Create localized or revised talking-head clips from source video and audio files.
- Run a Pro lip-sync pass when facial detail matters more than turnaround speed.
Input tips
- Provide public video_url and audio_url values that can be fetched without login.
- Keep the speaker's face and mouth visible for the cleanest result.
- Choose sync_mode based on how audio and video duration should be matched.
- Adjust temperature for more expressive or more subtle mouth movement.
- Enable active-speaker detection for multi-person videos.
- Use occlusion detection when hands, microphones, or objects may cover the mouth.
Expected output
The AI Tool returns one generated lip-synced video file with a downloadable URL, optional content type, file name, file size, and cost metadata. The shared video template renders the result for playback, review, and download.
Caveats
- Source video and audio URLs must be public and reachable.
- The Pro model can take longer than standard Sync Lipsync-2.
- Face visibility, audio timing, video resolution, motion, and occlusions affect the result.
- Duration mismatch settings can loop, trim, bounce, silence, or remap content; review final timing.
- This AI Tool does not generate the audio track; provide the audio separately.
Related AI Tools

Sync Lipsync-2
Generate lip-synced videos by combining a source video with audio using Sync Labs' Lipsync-2 model with configurable sync modes, expressiveness, and active speaker detection

Sync React-1
Generate emotionally synchronized videos with lip movements, facial expressions, and head movements using Sync Labs' React-1 model. Best for short-form content (15 seconds or less)

Kling Lipsync
Generate lip-synced videos by combining a source video with audio using Kuaishou's Kling lipsync model