Skip to main content
Audio to Video

Kling Lipsync

Generate lip-synced videos by combining a source video with audio using Kuaishou's Kling lipsync model

View details

Inputs

Loading input fields...
Execution Steps

Loading workflow structure...

Loading curated examples...

Overview

Kling Lipsync combines a source video with an audio track to generate a lip-synced video using Kling's audio-to-video lip-sync model. Use it for short talking-head or spokesperson clips when you already have video and final audio and want a Kling-based lip-sync option.

Use cases

  • Match revised voiceover to an existing short product-demo or founder video.
  • Create a Kling lip-sync draft for a UGC-style ad or social clip.
  • Compare a Kling lip-sync result against Sync Lipsync results for the same source assets.

Input tips

  • Provide public video_url and audio_url values that can be fetched without login.
  • Use an .mp4 or .mov source video that is 2-10 seconds long and 100 MB or smaller.
  • Use 720p or 1080p video with width and height between 720 and 1920 pixels.
  • Use audio that is 2-60 seconds long and 5 MB or smaller.
  • Keep the speaker's face and mouth visible for the cleanest result.
  • Provide final audio; this AI Tool does not generate the voiceover.

Expected output

The AI Tool returns one generated lip-synced video with a downloadable URL, duration in seconds, optional content type, file name, file size, and cost metadata. The shared video output template renders the result for playback, review, and download.

Caveats

  • Source video and audio URLs must be public and reachable.
  • This AI Tool is designed for short source videos; longer or unsupported media will not validate.
  • Face visibility, audio timing, source resolution, and motion affect sync quality.
  • This AI Tool does not create or edit the audio track; provide final audio separately.
  • Generated lip movement should be reviewed for realism, brand fit, and rights-sensitive footage.