Google, metinden videoya yapay zeka "Imagen Video"yu tanıttı

xguru · 2022-10-07T10:52:01+09:00

Video Diffusion Model ile metin girdisinden video oluşturan bir "Text-conditional Video Generation System" Metinden düşük çözünürlüklü video (24x48 piksel, 16 kare, 3 fps) üretip bunu 7 diffusion modelini art arda bağlayarak (cascade) upscale etmesiyle öne çıkıyor Nihai çıktı 1280x768 24 fps. 5,3 saniye uzunluğunda video üretebiliyor Makale: Imagen Video : High Definition Video Generation with Diffusion Models

(imagen.research.google)

9 puan yazan xguru 2022-10-07 | 1 yorum | WhatsApp'ta paylaş

Video Diffusion Model ile metin girdisinden video oluşturan bir "Text-conditional Video Generation System"
Metinden düşük çözünürlüklü video (24x48 piksel, 16 kare, 3 fps) üretip bunu 7 diffusion modelini art arda bağlayarak (cascade) upscale etmesiyle öne çıkıyor
Nihai çıktı 1280x768 24 fps. 5,3 saniye uzunluğunda video üretebiliyor
Makale: Imagen Video : High Definition Video Generation with Diffusion Models

1 yorum

xguru 2022-10-07

Imagen - Google'ın text-to-image diffusion modeli
Imagen-pytorch - Google Imagen'ın Pytorch ile implementasyonu
Make-A-Video : Metinden video üreten yapay zeka

Google, metinden videoya yapay zeka "Imagen Video"yu tanıttı

İlgili okumalar

1 yorum