Eerily realistic: Microsoft’s new AI model makes images talk, sing

📆 4/20/2024 4:42 PM
📰 IntEngineering



VASA is a framework for generating lifelike talking faces with appealing visual affective skills.

VASA-1 is a new AI model that converts an image of a person’s face and an audio clip into a video with proper lip-syncing, facial expressions, and head movements. It was developed by a team of AI researchers at Microsoft Research Asia.

VASA, short for Visual Affective Skills Animator, can transform any static image, whether photographed, painted, or drawn, into “exquisitely synchronized” animations. The team used the publicly available VoxCeleb2 dataset, which contains video clips of over 6,000 real-life celebrities. After discarding low-quality clips and clips showing multiple individuals, the team trained the model on the processed dataset. The model offers control over gaze, distance, and emotions in the generated video.

“We are exploring visual affective skill generation for virtual, interactive characters, NOT impersonating any person in the real world,” they wrote. The research team maintains that the model will be used for education and to provide companionship. They have also refused to release the code that powers the model.


Similar News: You can also read news stories similar to this one that we have collected from other news sources.

Microsoft’s VASA-1 AI Can Make Any Person’s Image Move and Speak
Microsoft has unveiled a new lip-syncing tool that transforms a still image of a person into an animated clip of them talking or singing.
Source: petapixel

Microsoft VASA-1 AI turns photos into lifelike talking videos, and it’s insane
Microsoft's new VASA-1 AI model can combine a portrait image with an audio file to create a high-quality video of someone talking.
Source: BGR