MoDA: MODA PLUS: Talking Head Generation

Enhanced Version with Smooth Motion

📥 Input Settings

⚙️ Generation Settings

Emotion

Select an emotion for more natural facial expressions

0.5 5

🎬 Motion Enhancement

May cause errors on some systems. If errors occur, disable this option.

24 50

📺 Output

Tips for best results:
• Use high-quality front-facing images
• Clear audio without background noise
Keep audio under 60 seconds
• Adjust CFG scale if motion seems stiff
• For longer audio, split into segments

Example Configurations
Source Image Driving Audio (Recommended: < 60 seconds) Emotion CFG Scale (Lower = Smoother motion) Enable Smoothing (Experimental) Target FPS