VideoPoet

VideoPoet is a large language model developed by Google Research in 2023 for video making.[1][2][3][4] It can be asked to animate still images.[5] The model accepts text, images, and videos as inputs, with a program to add feature for any input to any format generated content.[4] VideoPoet was publicly announced on December 19, 2023.[1] It uses an autoregressive language model.

VideoPoet
Developer(s)Google
Initial releaseFebruary 8, 2024 (2024-02-08)
TypeLarge language model

References

  1. Krithika, K. L. (December 20, 2023). "Google Unveils VideoPoet, a New LLM for Video Generation". Analytics India Magazine. Retrieved April 29, 2024.
  2. Kondratyuk, Dan; Yu, Lijun; Gu, Xiuye; Lezama, José; Huang, Jonathan; Hornung, Rachel; Adam, Hartwig; Akbari, Hassan; Alon, Yair; Birodkar, Vighnesh; Cheng, Yong; Chiu, Ming-Chang; Dillon, Josh; Essa, Irfan; Gupta, Agrim; Hahn, Meera; Hauth, Anja; Hendon, David; Martinez, Alonso; Minnen, David; Ross, David; Schindler, Grant; Sirotenko, Mikhail; Sohn, Kihyuk; Somandepalli, Krishna; Wang, Huisheng; Yan, Jimmy; Yang, Ming-Hsuan; Yang, Xuan; Seybold, Bryan; Jiang, Lu (December 21, 2023). "VideoPoet: A Large Language Model for Zero-Shot Video Generation". arXiv:2312.14125 [cs.CV].
  3. "Google has introduced VideoPOET breaking new ground in coherent video generation". Gizmochina. December 21, 2023.
  4. "VideoPoet". Google Research. Retrieved April 29, 2024.
  5. Franzen, Carl (December 20, 2023). "Google's new multimodal AI video generator VideoPoet looks incredible". VentureBeat.
  • Media related to VideoPoet at Wikimedia Commons


This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.