One selfie in, one fake video out. Here's how deepfakes work at a high level. The diagram below shows the full pipeline that turns a reference image like selfie... (opens in new tab)

One selfie in, one fake video out. Here's how deepfakes work at a high level. The diagram below shows the full pipeline that turns a reference image like selfie, a voice clip, and a prompt into a fake video. Step 1: Prompt Refinement. The text prompt gets cleaned, augmented with extra detail, and paired with a negative prompt to suppress unwanted artifacts like distorted hands. Step 2: Reference Image Prep. A single selfie of the target is passed through a VAE encoder, a ...

Read the original article