Less than a year ago, Microsoft’s VASA-1 blew my mind. The company showed how it could animate any photo and turn it into a video featuring the person in the image. This wasn’t the only impressive part, as the subject of the image would also be able to speak in the video.
VASA-1 surpassed anything we’d seen back then. This was April 2024, when we had already seen Sora, OpenAI’s text-to-video generation tool that would not be released until December. Sora did not feature similarly advanced face animation and audio synchronization technologies.
Unlike OpenAI, Microsoft never intended to make VASA-1 available to the project. I said then that a public tool like VASA-1 could harm, as anyone could create misleading videos of people saying whatever the creator conceives. Microsoft’s research project also indicated that it would be only a matter of time before others could develop similar technology.
Now, TikTok parent company ByteDance has developed an AI tool called OmniHuman-1 that can replicate what VASA-1 did while taking things to a whole new level.
The post Taylor Swift singing in Japanese: Mind-blowing new AI tech from China appeared first on BGR.
Today’s Top Deals
Best Apple deals for February 2025
Today’s deals: $99 AirPods 4, $19 3-in-1 wireless charging station, $33 Blink Video Doorbell, more
Today’s deals: $329 Apple Watch Series 10, $219 Bose soundbar, 40% off eufy video smart lock, more
Today’s deals: $399 iPad mini 7, $20 Anker earbuds, $4.25 KMC smart plugs, HYDRO FLASK sale, more
Taylor Swift singing in Japanese: Mind-blowing new AI tech from China originally appeared on BGR.com on Wed, 5 Feb 2025 at 19:13:00 EDT. Please see our terms for use of feeds.