Join our community of SUBSCRIBERS and be part of the conversation.

To subscribe, simply enter your email address on our website or click the subscribe button below. Don't worry, we respect your privacy and won't spam your inbox. Your information is safe with us.

Subscribe

News

Company:

Customize Consent Preferences

We use cookies to help you navigate efficiently and perform certain functions. You will find detailed information about all cookies under each consent category below.

The cookies that are categorized as "Necessary" are stored on your browser as they are essential for enabling the basic functionalities of the site. ... 

Always Active

Necessary cookies are required to enable the basic features of this site, such as providing secure log-in or adjusting your consent preferences. These cookies do not store any personally identifiable data.

No cookies to display.

Functional cookies help perform certain functionalities like sharing the content of the website on social media platforms, collecting feedback, and other third-party features.

No cookies to display.

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics such as the number of visitors, bounce rate, traffic source, etc.

No cookies to display.

Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.

No cookies to display.

Advertisement cookies are used to provide visitors with customized advertisements based on the pages you visited previously and to analyze the effectiveness of the ad campaigns.

No cookies to display.

Sunday, April 27, 2025

China’s New Vidu Text to Video Technology Better Than Sora?

Share

The field of AI is constantly evolving, with nations vying to lead in this realm of innovation. China has now made its foray into text to video technology with the launch of Vidu, a system resulting from a collaborative effort between Tsinghua University and tech company ShengShu-AI. This marks a significant stride in China’s AI development.

Vidu Text to Video Technology

With capabilities akin to the highly acclaimed Sora, developed by OpenAI just two months prior, Vidu represents a substantial leap in China’s AI landscape. This advanced tool can generate 1080p high-definition video snippets up to 16 seconds in duration with a single click. What sets Vidu apart is its unique architecture, the Universal Vision Transformer (U-ViT), a self-developed model that amalgamates the best aspects of Transformers and Diffusion models, two widely recognized AI techniques. 

“The release of Sora mirrored our research trajectory,” stated Zhu Jun, Vidu’s chief scientist and vice dean of Tsinghua University’s Institute for Artificial Intelligence, at the unveiling ceremony. “This served as a powerful motivator to accelerate our research efforts.” Interestingly, reports suggest that the core technology behind U-ViT predates Sora’s architecture. Vidu’s team reportedly proposed U-ViT in September 2022, while Sora utilizes the DiT (Diversity in Transformation) architecture, unveiled later.

Vidu’s Text-to-Video AI Model’s Capabilities

Vidu is capable of more than just creating HD videos. Live demonstrations proved the model’s capacity to replicate the complexities of the real world. In addition to realistic lighting and shadow effects, Vidu’s created characters include complex facial emotions. It stands out from the competition since it produces lively photos rather than static ones.

Moreover, Vidu completely understands Chinese cultural components as a byproduct of Chinese growth. According to media sources, it can produce images with well-known Chinese characters like dragons and pandas. Thanks to this cultural awareness, Vidu is positioned as a highly formidable tool for content development within China. 

The arrival of Vidu indicates increasing competition in the AI space. Even though Vidu and Sora both exhibit impressive video creation skills, competition could lead to additional developments in this area, like Pika Labs. The lengthier and more these models will produce more complex videos that may be made in the models; Mao tells the difference between footage created by artificial intelligence and actual footage. Furthermore, more advanced controls over movie length, scene complexity, and creative style could be added. 

The advent of AI tools like Vidu opens up a plethora of possibilities. The potential applications are limitless, from crafting compelling marketing materials to generating storyboards and even creating personalized video content. As text to video technology continues to evolve, it will profoundly impact numerous sectors and our overall media consumption experience.

Read more

Local News