Teller can generate diverse facial expressions, head movements, realistic body and accessory movements, ensuring physical consistency in animated results.
* Note that all results in this page use the reference image as first frame and conditioned on audio only without need of spatial conditions as templates.
our model has more accurate emotional expression ability due to the better speech understanding ability of AR transformer.
Angry
Fearful
Disgust
Surprised
Teller replicates natural head movements more accurately, closely matching the ground truth (GT) with smooth, realistic turns and subtle expression-based adjustments.