These models are used in the Dialogue and Lipsync tab of Shot Studio to automatically sync character lip movements and expressions to dialogue audio.
When to Use
- Automatic Lipsync — Sync character mouths to dialogue without manual animation
- Expression Matching — Generate facial expressions that match dialogue emotion
- Character Animation — Add subtle head movements and reactions
- Post-Production Sync — Align existing character shots with dialogue tracks
Recommended Models
These models consistently deliver the best results for lipsync:Sync Lipsync 2 Pro
- Credits: 84
- Rating: ⭐⭐⭐⭐ (4/5)
- Provider: fal.ai
- Avg. Duration: ~179 seconds
- Best for: Professional-quality lip synchronization (Default model)
Sync React-1
- Credits: 166
- Rating: ⭐⭐⭐⭐ (4/5)
- Provider: fal.ai
- Avg. Duration: ~266 seconds
- Best for: Advanced reactive expressions and head movements
Supported Models
These models are available but may have varying quality or processing times:Sync Lipsync 2.0
- Credits: 50
- Rating: ⭐⭐⭐ (3/5)
- Provider: fal.ai
- Avg. Duration: ~51 seconds
- Use when: Fast lipsync needed with good quality
Kling Lipsync
- Credits: 28
- Rating: ⭐⭐ (2/5)
- Provider: fal.ai
- Avg. Duration: ~158 seconds
- Use when: Budget-friendly lipsync option
LatentSync
- Credits: 40
- Rating: ⭐⭐ (2/5)
- Provider: fal.ai
- Avg. Duration: ~39 seconds
- Use when: Fast processing is priority
PixVerse Lipsync
- Credits: 40
- Rating: ⭐⭐ (2/5)
- Provider: fal.ai
- Avg. Duration: ~48 seconds
- Use when: Experimenting with different lipsync approaches
VEED
- Credits: 8
- Rating: ⭐ (1/5)
- Provider: fal.ai
- Avg. Duration: ~33 seconds
- Use when: Lowest cost option needed
Tips for Best Results
- Clean Audio — Use isolated dialogue without background music for best accuracy
- Tag Speakers — Specify which character is speaking to improve sync accuracy
- Include Emotion Cues — Mention the emotion or tone in prompts for better expression matching
- Preview Playback — Review synced results against the waveform to confirm timing
- High Quality Source — Clear, well-recorded dialogue produces better sync results
- Multiple Passes — Adjust and regenerate if initial sync isn’t perfect
⚠️ Important: High-quality, clean audio without background music leads to the most accurate lip sync results.