Beijing Academy Unveils Emu3: Next-Gen Multimodal AI Unifying Text, Images, and Video
The Beijing Academy of Artificial Intelligence has launched Emu3, a multimodal world model unifying text, images, and video through next-token prediction, marking a significant advancement in AI technology.