Meta today released the MovieGen family of media foundation AI models, which can generate realistic videos with sound from text prompts. The MovieGen family includes two main models: MovieGen Video and MovieGen Audio.
MovieGen Video is a 30-billion-parameter transformer model that generates high-quality, high-definition images and videos from a single text prompt, producing videos up to 16 seconds long at 16 frames per second.
MovieGen Audio is a 13-billion-parameter transformer model that can take video input and optional text prompts, and generate up to 45 seconds of high-fidelity audio synchronized with the input video. This new audio model can generate ambient sounds, instrumental background music, and Foley sounds. Meta claims that it offers state-of-the-art results in terms of audio quality, video-to-audio alignment, and text-to-audio alignment.
These models aren't limited to creating brand-new videos; they can also edit existing videos using simple text prompts. MovieGen allows users to make localized edits, such as adding, removing, or replacing elements, as well as global changes such as altering the background or style. For example, given a video of someone throwing a ball, a simple text prompt can change it to someone throwing a watermelon while keeping the rest of the original content intact.
The MovieGen models also allow users to create personalized videos. Given a character image and a text prompt, the models can generate a personalized video that retains the character's appearance and movements. Meta claims that these models offer state-of-the-art results in terms of character preservation and natural motion in videos.
Meta claims that these models produce better videos than other video generation models, including OpenAI's Sora and Runway Gen-3. Meta is currently working with creative professionals to further refine the models before releasing them publicly.