The Lip-sync feature

Achieving high-quality dubbing results requires synchronizing mouth movements with the translated audio.

What is Lip-sync?

Lip-sync, short for lip synchronization, means matching a speaker's lip movements to the spoken words or sounds in the audio. Use the Lip-sync feature if you notice synchronization issues in your translated video. You can apply it after you have created a voiceover for your video.

Before applying the Lip-sync, make sure your video meets these criteria:

❌ We suggest avoiding videos with a strong half-profile where the eyes are not fully visible.

✅ The face needs to be clearly visible. Feel free to use videos with a subtle half-side angle.

✅ The lighting needs to be good, without shadows or overexposure.

❌ Nothing should cover the eyes, nose, mouth, or eyebrows. Make sure the lips aren’t blocked by a microphone, hands, a logo, or captions.

Please keep this in mind:

Bright lip makeup may appear as nude lips when using the Lip-sync feature.
Facial hair can cause artifacts in the video.

Here’s how it works

Upload your video via a link from YouTube or Google Drive, or upload the file directly from your device. Choose the target language and click the Translate button.
Add a voiceover to your video in Rask AI via the “Dub video” button.
To check if your video is compatible with Lip-sync, click the "Lip-sync check" button.
If it's compatible, proceed by clicking the Lip-sync button.
Next, select the number of faces in your video, either "1" or "2+", then click "Start Lip-sync". Just a heads up: this is about the number of faces, not speakers.
You might need to wait in a queue until Lip-sync is applied to your video. Hover over the Lip-sync button to check whether your video is in the queue or processing, and what percentage of it is ready.
To view your newly processed video, navigate to the Lip-synced tab.

How long does it take to generate lip-sync?

The speed of lip-sync generation depends on the number of speakers in the video and on the video's duration, quality, and size.

For example, here are approximate lip-sync generation times for different videos:

Videos with 1 speaker:

4-minute 1080p video ≈ 58 minutes
10-minute 1080p video ≈ 4 hours
10-minute 4K video ≈ 16 hours

Videos with 3 speakers:

10-minute 1080p video ≈ 10 hours 20 minutes
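
To compare these figures, it can help to express each one as processing time per minute of video. The short sketch below does only that arithmetic on the approximate numbers quoted above. The names `SAMPLES` and `processing_ratio` are made up for illustration and are not part of Rask AI. The ratios are not a formula for predicting your own processing time, since actual times also depend on queue load and video quality.

```python
# Approximate figures quoted in this article:
# (description, video length in minutes, processing time in minutes)
SAMPLES = [
    ("1 speaker, 4 min, 1080p", 4, 58),
    ("1 speaker, 10 min, 1080p", 10, 4 * 60),
    ("1 speaker, 10 min, 4K", 10, 16 * 60),
    ("3 speakers, 10 min, 1080p", 10, 10 * 60 + 20),
]


def processing_ratio(video_minutes: float, processing_minutes: float) -> float:
    """Return minutes of processing per minute of video."""
    return processing_minutes / video_minutes


for label, video_min, proc_min in SAMPLES:
    ratio = processing_ratio(video_min, proc_min)
    print(f"{label}: about {ratio:.1f} min of processing per video minute")

# Prints:
# 1 speaker, 4 min, 1080p: about 14.5 min of processing per video minute
# 1 speaker, 10 min, 1080p: about 24.0 min of processing per video minute
# 1 speaker, 10 min, 4K: about 96.0 min of processing per video minute
# 3 speakers, 10 min, 1080p: about 62.0 min of processing per video minute
```

The ratio grows with video length, resolution, and speaker count, so a longer or higher-resolution video takes disproportionately more time.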

How much does it cost to generate lip-sync for a video?

Lip-sync is available on Creator Pro, Archive Pro, Business, and Enterprise plans.
One minute of lip-sync generated equals one minute deducted from your total minutes balance.

Lip-sync minutes are deducted the same way as dubbing minutes.
Lip-sync is charged separately from dubbing. For example, to translate a 1-minute video into 1 language and lip-sync it, you need 2 minutes: 1 for dubbing and 1 for Lip-sync.
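
The arithmetic above can be sketched as a small helper for budgeting your minutes balance. This is only an illustration: `minutes_required` is a hypothetical function, not a Rask AI API. It assumes that each additional target language is charged the same way as the first, and it ignores the free test minute and any rounding of partial minutes, none of which are specified here.

```python
def minutes_required(video_minutes: float, languages: int = 1,
                     lip_sync: bool = True) -> float:
    """Estimate minutes deducted for dubbing plus optional Lip-sync.

    Assumption (not confirmed by the article): the cost scales linearly
    with the number of target languages.
    """
    # Dubbing: one minute deducted per minute of video, per language.
    dubbing = video_minutes * languages
    # Lip-sync is charged separately, at the same one-to-one rate.
    lip_sync_cost = video_minutes * languages if lip_sync else 0
    return dubbing + lip_sync_cost


# The example from the article: a 1-minute video, 1 language, with Lip-sync.
print(minutes_required(1, languages=1))  # 2

# Dubbing only, without Lip-sync.
print(minutes_required(1, languages=1, lip_sync=False))  # 1
```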

Before generating lip-sync, you can test 1 minute for free to evaluate the quality of the technology.

Friendly reminder:

If you change the transcription and dub the video again, make sure to apply Lip-sync again so the changes take effect.
If you have a video with Lip-sync and captions, you will need to generate the Lip-sync again after making changes to the transcription.

Breaking language barriers has never been more within reach! Try Lip-sync with Rask AI and send us your feedback on this feature.

If you have any questions regarding Lip-sync, please contact us via the support chat or support@rask.ai.