Kling Lipsync
Generate lip-synced videos by combining a source video with audio using Kuaishou's Kling lipsync model
View detailsInputs
Loading workflow structure...
Overview
Kling Lipsync combines a source video with an audio track to generate a lip-synced video using Kling's audio-to-video lip-sync model. Use it for short talking-head or spokesperson clips when you already have video and final audio and want a Kling-based lip-sync option.
Use cases
- Match revised voiceover to an existing short product-demo or founder video.
- Create a Kling lip-sync draft for a UGC-style ad or social clip.
- Compare a Kling lip-sync result against Sync Lipsync results for the same source assets.
Input tips
- Provide public video_url and audio_url values that can be fetched without login.
- Use an .mp4 or .mov source video that is 2-10 seconds long and 100 MB or smaller.
- Use 720p or 1080p video with width and height between 720 and 1920 pixels.
- Use audio that is 2-60 seconds long and 5 MB or smaller.
- Keep the speaker's face and mouth visible for the cleanest result.
- Provide final audio; this AI Tool does not generate the voiceover.
Expected output
The AI Tool returns one generated lip-synced video with a downloadable URL, duration in seconds, optional content type, file name, file size, and cost metadata. The shared video output template renders the result for playback, review, and download.
Caveats
- Source video and audio URLs must be public and reachable.
- This AI Tool is designed for short source videos; longer or unsupported media will not validate.
- Face visibility, audio timing, source resolution, and motion affect sync quality.
- This AI Tool does not create or edit the audio track; provide final audio separately.
- Generated lip movement should be reviewed for realism, brand fit, and rights-sensitive footage.
Related AI Tools

Sync Lipsync-2
Generate lip-synced videos by combining a source video with audio using Sync Labs' Lipsync-2 model with configurable sync modes, expressiveness, and active speaker detection

Sync Lipsync-2-Pro
Generate lip-synced videos with enhanced detail preservation for beards, teeth, and facial features using Sync Labs' Lipsync-2-Pro model with diffusion-based super resolution

Sync React-1
Generate emotionally synchronized videos with lip movements, facial expressions, and head movements using Sync Labs' React-1 model. Best for short-form content (15 seconds or less)