Alibaba launches new open-source tool that turns photos and audio into video

Alibaba has released a new open-source speech-to-video model that generates animated digital humans from a single portrait and an audio clip. The tool is aimed at content creators and researchers seeking lifelike avatars that can speak, sing, or perform.

The Wan2.2-S2V release builds on Alibaba's Wan2.2 video-generation series. By open-sourcing it, the company gives developers a system that can animate portraits across different framings, including close-up, bust, and full-body shots.


Wan2.2-S2V is powered by audio-driven animation technology that synchronizes speech with movement. It can handle complex multi-character scenes and adapt to prompts that specify particular gestures or environmental elements.

According to Alibaba, this will allow creators to make videos for uses ranging from social media content to longer-form, film-style projects.

The model also offers 480p and 720p output, producing quality results without requiring high-end computing power, which should appeal to independent creators as well as professional teams working on large-scale projects.

Alibaba research

Researchers behind the model developed a custom audio-visual dataset focused on film and television scenarios. They used multi-resolution training to ensure that the system could generate both vertical short-form videos and traditional widescreen outputs.

Wan2.2-S2V uses a frame-compression process that condenses long video histories into a single latent representation. This lowers computational overhead while maintaining consistency over extended clips, which Alibaba says is a challenge for many video-generation systems.
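Alibaba has not published the details of this compression in the article, but the core idea can be sketched with a toy example: collapse a growing stack of per-frame latents into one fixed-size summary vector, so the cost of conditioning on history no longer scales with clip length. The recency-weighted pooling below is purely an illustrative assumption, not the model's actual mechanism.

```python
import numpy as np

def compress_history(frame_latents: np.ndarray, decay: float = 0.9) -> np.ndarray:
    """Collapse a (T, D) stack of per-frame latents into one (D,) latent.

    Newer frames receive higher weight, so the summary tracks current
    motion while older context fades gradually. This weighting scheme is
    a placeholder for whatever learned compression the real model uses.
    """
    T = frame_latents.shape[0]
    # Weights decay^(T-1), ..., decay^1, decay^0: the newest frame gets 1.0.
    weights = decay ** np.arange(T - 1, -1, -1, dtype=np.float64)
    weights /= weights.sum()          # normalize to a weighted average
    return weights @ frame_latents    # (T,) @ (T, D) -> (D,)

rng = np.random.default_rng(0)
short_clip = rng.normal(size=(16, 128))   # 16 frames, 128-dim latents
long_clip = rng.normal(size=(512, 128))   # 512 frames

# Both histories collapse to the same fixed-size summary, so per-step
# compute stays constant no matter how long the clip grows.
assert compress_history(short_clip).shape == (128,)
assert compress_history(long_clip).shape == (128,)
```

Because the summary has constant size, each new frame can be generated against a bounded amount of context, which is what keeps long clips tractable.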

By stabilizing longer sequences, the model should be able to generate more ambitious animated productions.

The launch follows earlier open-source releases in the Wan series, including Wan2.1 in February and Wan2.2 in July. Downloads of the Wan models across Hugging Face and ModelScope have already exceeded 6.9 million.

Wan2.2-S2V is now available through Hugging Face, GitHub, and Alibaba’s ModelScope platform.

What do you think about Alibaba’s new speech-to-video model? Let us know in the comments.

© 1998-2025 BetaNews, Inc. All Rights Reserved.
