Skip to main content
Avatar Video

HeyGen Avatar 4 Talking Video

Generate talking avatar videos from a source image and premade audio using HeyGen Avatar 4

View details

Inputs

Loading input fields...
Execution Steps

Loading workflow structure...

Loading curated examples...

Overview

HeyGen Avatar 4 Talking Video turns a source image and premade audio track into a lip-synced avatar video. Use it when you have a speaker image or character reference plus finished voiceover audio and want a controlled talking-video draft for ads, explainers, outreach, or social.

Use cases

  • Create a campaign video from a founder image and recorded voiceover.
  • Turn a character or spokesperson image into a short social or outreach clip.
  • Test different aspect ratios, expressiveness levels, or motion prompts from the same audio.

Input tips

  • Provide public image and audio_url values that can be fetched without login.
  • Choose 16:9 or 9:16 based on the channel where the video will be reviewed.
  • Use 720p for the default draft or 1080p when a sharper preview matters.
  • Keep audio clear and final; the AI Tool uses the premade track for lip sync.
  • Add a motion_prompt only when you need specific gestures, posture, or camera movement.
  • Set expressiveness to low, medium, or high based on how animated the speaker should feel.

Expected output

The AI Tool returns a generated talking-avatar video with a downloadable URL, optional content type, file name, file size, output duration, and cost metadata. The shared avatar-video view renders the video and duration for review.

Caveats

  • Private or blocked image/audio URLs will fail.
  • Poor audio quality, mismatched timing, or unsuitable face images can reduce lip-sync quality.
  • Generated facial motion should be reviewed for realism, consent, brand fit, and policy fit.
  • If output or audio duration cannot be determined, the run can fail.
  • The AI Tool supports 16:9 and 9:16 aspect ratios, with 720p or 1080p resolution.