Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
-
Updated
Jun 12, 2024
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Cassette is designed to create 30-second explanatory videos suitable for Instagram Reels or YouTube Shorts. Or you may call it a free python alternative to Brainrot.js
Generate video from text using AI
[Arxiv] A Survey on Video Diffusion Models
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Alekhyaa is a text-to-video tool. Simply provide a script, and it will create a video with synchronized audio. If you provide video clips, Alekhyaa will adjust and concatenate them based on the audio content.
Text to video generator in the brainrot form. Learn about any topic from your favorite personalities.
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Maaagic UI is an open-source UI framework designed to empower developers with seamless integration and advanced features of AI applications.
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Convert an .srt captions file to American Signs Language (ASL) with public videos
A benchmark for evaluating hallucination of text-to-video models
Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, T2I-Adapter, IP-Adapter.
Code repository for T2V-Turbo
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
KandinskyVideo — multilingual end-to-end text2video latent diffusion model
Create deepfake video by just uploading the original video and specifying the text the character will read
Diffusion model papers, survey, and taxonomy
Add a description, image, and links to the text-to-video topic page so that developers can more easily learn about it.
To associate your repository with the text-to-video topic, visit your repo's landing page and select "manage topics."