With insights from a blog post made on November 16, 2023, Meta, a social media platform, has introduced its latest artificial intelligence (AI) models for content editing and generation, stated Cointelegraph.
The company is expected to introduce two AI-powered generative models. The first, Emu Video, uprades Meta’s previous Emu model and is capable of generating video clips based on text and image inputs. The second model, Emu Edit, will provide image manipulation, promising more precision in image editing, Cointelegraph added.
“We’ve split the process into two steps: first, generating images conditioned on a text prompt, and then generating video conditioned on both the text and the generated image. This ‘factorized’ or split approach to video generation lets us train video generation models efficiently,” Meta explained.
Sources reveal that the same model can “animate” images based on a text prompt. Furthermore, Meta, mentioned that instead of relying on a “deep cascade of models,” Emu Video only uses two diffusion models to generate 512×512 four-second-long videos at 16 frames per second, Cointelegraph concluded.
(With insights from Cointelegraph)