Text-to-video

Vidu

by Shengshu Technology (生数科技)

Chinese video model with text, image, and multi-reference generation, rooted in Tsinghua research.

Vidu was first announced in April 2024 and has iterated quickly since. Its Reference-to-Video feature is aimed squarely at the consistent-character problem that trips up many single-shot generation models.