Everybody’s Talkin’: Let Me Talk as You Want

This paper presents a method that edits target portrait footage, taking an audio sequence as input to synthesize a photo-realistic video.