Audio to Video

Sync Lipsync-2

Generate lip-synced videos by combining a source video with an audio track using Sync Labs' Lipsync-2 model, with configurable sync modes, expressiveness, and active-speaker detection.

Overview

Sync Lipsync-2 combines a source video with a separate audio track to create a lip-synced video. Use it for standard talking-head, dubbing, product-demo, or UGC-style lip-sync passes. It provides controls for duration matching, expressiveness, active-speaker detection, and occlusion handling.

Use cases

  • Match a new narration, translated audio, or revised voiceover to existing talking-head footage.
  • Create a lip-synced product demo, founder clip, or UGC-style ad draft from public video and audio URLs.
  • Compare sync modes when the source video and audio track have different lengths.
  • Use active-speaker detection for multi-person footage where the speaking person should be followed.

Input tips

  • Provide public video_url and audio_url values that can be fetched without login.
  • Keep the speaker's face and mouth visible; clean source footage improves the generated lip movement.
  • Choose sync_mode based on how a length mismatch between the video and audio should be handled: bounce, loop, trim, silence, or remap.
  • Use temperature to make mouth movement more expressive or more subtle.
  • Leave active-speaker detection on for multi-person clips unless you need fixed behavior.
  • Enable occlusion detection when hands, microphones, or objects may cover the mouth.
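
The tips above can be gathered into a small pre-flight check before a job is submitted. This is a minimal sketch, not the tool's actual API: `video_url`, `audio_url`, `sync_mode`, and `temperature` appear on this page, while the mode strings, the 0 to 1 temperature range, the default values, and the `active_speaker_detection` and `occlusion_detection` field names are assumptions made for illustration.

```python
from urllib.parse import urlparse

# Assumed mode names mirroring the behaviors listed on this page;
# the tool's real enum values may differ.
ASSUMED_SYNC_MODES = {"bounce", "loop", "trim", "silence", "remap"}


def is_http_url(url: str) -> bool:
    """Shape check only: http(s) scheme plus a host. Does not fetch the URL,
    so it cannot prove the file is reachable without login."""
    parts = urlparse(url)
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def build_lipsync_inputs(
    video_url: str,
    audio_url: str,
    sync_mode: str = "trim",                 # assumed default
    temperature: float = 0.5,                # assumed default and range
    active_speaker_detection: bool = True,   # hypothetical field name
    occlusion_detection: bool = False,       # hypothetical field name
) -> dict:
    """Validate the inputs and return them as a plain dict."""
    for name, url in (("video_url", video_url), ("audio_url", audio_url)):
        if not is_http_url(url):
            raise ValueError(f"{name} must be an http(s) URL, got {url!r}")
    if sync_mode not in ASSUMED_SYNC_MODES:
        raise ValueError(f"unknown sync_mode {sync_mode!r}")
    if not 0.0 <= temperature <= 1.0:
        raise ValueError("temperature is assumed to lie in [0, 1]")
    return {
        "video_url": video_url,
        "audio_url": audio_url,
        "sync_mode": sync_mode,
        "temperature": temperature,
        "active_speaker_detection": active_speaker_detection,
        "occlusion_detection": occlusion_detection,
    }


# Example: a multi-person clip where a microphone may cover the mouth.
inputs = build_lipsync_inputs(
    "https://example.com/interview.mp4",
    "https://example.com/voiceover.wav",
    sync_mode="loop",
    occlusion_detection=True,
)
```

The check rejects malformed inputs early, but only a real fetch can confirm that a URL is publicly reachable.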

Expected output

The AI Tool returns one generated lip-synced video file with a downloadable URL, optional content type, file name, file size, and cost metadata. The shared video template renders the video for playback, review, and download.
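
A minimal sketch of reading such a result, assuming it arrives as a JSON object. This page lists the returned fields but not their exact names, so every key used here (`video`, `url`, `content_type`, `file_name`, `file_size`, `cost`) is hypothetical, as is the `LipsyncResult` type.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass
class LipsyncResult:
    url: str                            # downloadable video URL (required)
    content_type: Optional[str] = None  # optional per this page
    file_name: Optional[str] = None
    file_size: Optional[int] = None
    cost: Optional[float] = None        # shape of cost metadata is assumed


def parse_result(raw: str) -> LipsyncResult:
    """Parse a JSON result string; key names are hypothetical."""
    data = json.loads(raw)
    video = data.get("video") or {}
    url = video.get("url")
    if not url:
        raise ValueError("result has no downloadable video URL")
    return LipsyncResult(
        url=url,
        content_type=video.get("content_type"),
        file_name=video.get("file_name"),
        file_size=video.get("file_size"),
        cost=data.get("cost"),
    )


# Example with a made-up payload; missing optional fields become None.
sample = '{"video": {"url": "https://example.com/out.mp4", "file_size": 1024}}'
result = parse_result(sample)
```

Treating everything except the URL as optional keeps the parser from failing when metadata is absent.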

Caveats

  • Source video and audio URLs must be public and reachable.
  • This AI Tool does not generate or edit the audio track; provide the final audio separately.
  • Face visibility, audio timing, video resolution, motion, and occlusions affect the result.
  • Duration-mismatch settings can loop, trim, bounce, silence, or remap content; review the final timing of the generated video.
  • Use Sync Lipsync-2-Pro when facial details, beards, teeth, or mouth obstructions need the highest preservation.
  • Generated lip movement should be reviewed for realism, brand fit, and rights-sensitive footage.