Video AI Tools
Free video AI tools for editing, generating animations, and analyzing footage, perfect for filmmakers and content creators seeking efficiency.
While ZeroScope, Gen-2, PikaLabs, and others have brought us high-resolution text- and image-to-video, they all suffer from unsmooth transitions, crude motion, and disordered action sequences. The new Dysen-VDM tries to tackle those issues and, while nowhere near perfect, delivers some promising results.
StableVideo is yet another vid2vid method. This one is not just style transfer, though: the method is able to differentiate between foreground and background when editing a video, making it possible to reimagine the subject within an entirely different landscape.
CoDeF can process videos with temporal consistency by using a canonical content field to aggregate static content and a temporal deformation field to track changes over time. This allows it to perform tasks like video-to-video translation and to track non-rigid objects, such as water and smog, without needing extra training.
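Conceptually, every output frame is the same learned canonical image, resampled through a per-timestep deformation. A minimal PyTorch sketch of that decomposition (names, network sizes, and shapes here are illustrative assumptions, not CoDeF's actual API):

```python
# Toy sketch: a learnable canonical image plus a deformation MLP.
import torch
import torch.nn as nn
import torch.nn.functional as F

class DeformationField(nn.Module):
    """Maps an (x, y, t) query to a 2D offset into the canonical image."""
    def __init__(self, hidden=64):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(3, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 2),
        )

    def forward(self, coords_xyt):           # (N, 3), values in [-1, 1]
        return self.mlp(coords_xyt)          # (N, 2) offsets

canonical = nn.Parameter(torch.rand(1, 3, 128, 128))  # aggregated static content
deform = DeformationField()

def render_frame(t, H=128, W=128):
    ys, xs = torch.meshgrid(torch.linspace(-1, 1, H),
                            torch.linspace(-1, 1, W), indexing="ij")
    coords = torch.stack([xs, ys, torch.full_like(xs, t)], -1).reshape(-1, 3)
    offsets = deform(coords).reshape(1, H, W, 2)
    grid = torch.stack([xs, ys], -1).unsqueeze(0) + offsets
    # Every frame samples the same canonical image at deformed positions,
    # which is what makes edits on the canonical image temporally consistent.
    return F.grid_sample(canonical, grid, align_corners=True)

frame = render_frame(t=0.5)   # (1, 3, 128, 128)
```

Because any edit applied to the canonical image is carried into every frame by the deformation field, consistency comes for free.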
TokenFlow is a new video-to-video method for temporally coherent, text-driven video editing. We’ve seen a lot of them, but this one looks extremely good with almost no flickering and requires no fine-tuning whatsoever.
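Roughly, the trick is to edit a keyframe with an image diffusion model and then propagate the edited diffusion features to the remaining frames along nearest-neighbor correspondences computed on the original video's features. A toy sketch of that propagation step (function names and token counts are assumptions; the real method works inside the denoiser's feature space):

```python
# Toy sketch of feature propagation via nearest-neighbor correspondences.
import torch
import torch.nn.functional as F

def nearest_neighbor_field(src_feats, key_feats):
    """For each source token, the index of the most similar keyframe token."""
    src = F.normalize(src_feats, dim=-1)     # (N, C)
    key = F.normalize(key_feats, dim=-1)     # (M, C)
    return (src @ key.T).argmax(dim=-1)      # (N,)

orig_frame = torch.randn(1024, 320)   # tokens of one original frame
orig_key   = torch.randn(1024, 320)   # tokens of the original keyframe
edited_key = torch.randn(1024, 320)   # the same keyframe's tokens after editing

idx = nearest_neighbor_field(orig_frame, orig_key)
# Correspondences come from the ORIGINAL video, so the edited features
# inherit its motion -- which is where the temporal coherence comes from.
edited_frame = edited_key[idx]        # (1024, 320)
```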
VideoComposer can generate videos with control over how they look and move using text, sketches, and motion vectors. It improves video quality by enforcing consistency between frames, allowing for flexible video creation and editing.
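In spirit, this comes down to encoding each condition separately and fusing them into a single control signal for the video diffusion backbone. A hedged sketch of such a fusion module (module names, dimensions, and fusion by summation are illustrative assumptions, not VideoComposer's exact architecture):

```python
# Illustrative fusion of text, sketch, and motion-vector conditions.
import torch
import torch.nn as nn

class ConditionFusion(nn.Module):
    def __init__(self, dim=256):
        super().__init__()
        self.text_proj   = nn.Linear(512, dim)             # pooled text embedding
        self.sketch_conv = nn.Conv2d(1, dim, 3, padding=1) # per-frame sketch map
        self.motion_conv = nn.Conv2d(2, dim, 3, padding=1) # (dx, dy) motion vectors

    def forward(self, text_emb, sketch, motion):
        # Spatial conditions are summed; the text embedding is broadcast.
        spatial = self.sketch_conv(sketch) + self.motion_conv(motion)
        return spatial + self.text_proj(text_emb)[:, :, None, None]

fuse = ConditionFusion()
ctrl = fuse(torch.randn(1, 512),        # text embedding
            torch.randn(1, 1, 32, 32),  # sketch for one frame
            torch.randn(1, 2, 32, 32))  # motion vectors for one frame
print(ctrl.shape)  # torch.Size([1, 256, 32, 32]) -> injected into the UNet
```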
Make-Your-Video can generate customized videos from text and depth information for better control over content. It uses a Latent Diffusion Model to improve video quality and reduce computational cost.
Control-A-Video can generate controllable text-to-video content using diffusion models. It allows for fine-grained customization with edge and depth maps, ensuring high quality and consistency in the videos.
Make-A-Protagonist can edit videos by changing the protagonist, background, and style using text and images. It allows for detailed control over video content, helping users create unique and personalized videos.
HumanRF can capture high-quality full-body human motion from multiple video angles. It allows playback from new viewpoints at 12 megapixels and uses a 4D dynamic neural scene representation for smooth and realistic motion, making it great for film and gaming.
Sketching the Future can generate high-quality videos from sketched frames using zero-shot text-to-video generation and ControlNet. It smoothly fills in frames between sketches to create consistent video content that matches the user’s intended motion.
Total-Recon can reconstruct scenes from monocular RGBD videos and render them from different camera angles, like first-person and third-person views. It creates realistic 3D videos of moving objects and allows for 3D filters that add virtual items to people in the scene.
DreamPose can generate animated fashion videos from a single image and a sequence of human body poses. The method is able to capture both human and fabric motion and supports a variety of clothing styles and poses.
Follow Your Pose can generate character videos that match specific poses from text descriptions. It uses a two-stage training process with pre-trained text-to-image models, allowing for continuous pose control and editing.
vid2vid-zero can edit videos without needing extra training on video data. It uses image diffusion models for text-to-video alignment and keeps the original video’s look and feel, allowing for effective changes to scenes and subjects.
Text2Video-Zero can generate high-quality videos from text prompts using existing text-to-image diffusion models. It adds motion dynamics and cross-frame attention, making it useful for conditional video generation and instruction-guided video editing.
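The cross-frame attention trick is easy to state: rather than each frame attending to itself, every frame's queries attend to the keys and values of the first frame, which anchors appearance across the clip. A simplified single-head sketch (shapes and sizes are illustrative):

```python
# Simplified cross-frame attention: all frames attend to frame 0.
import torch

def cross_frame_attention(q, k, v):
    """q, k, v: (T, N, C) per-frame queries, keys, values."""
    k0 = k[:1].expand_as(k)   # reuse the first frame's keys everywhere
    v0 = v[:1].expand_as(v)   # reuse the first frame's values everywhere
    attn = torch.softmax(q @ k0.transpose(-2, -1) / q.shape[-1] ** 0.5, dim=-1)
    return attn @ v0

T, N, C = 8, 1024, 320   # frames, tokens per frame, channels
out = cross_frame_attention(torch.randn(T, N, C),
                            torch.randn(T, N, C),
                            torch.randn(T, N, C))   # (8, 1024, 320)
```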
Blind Video Deflickering by Neural Filtering with a Flawed Atlas can remove flicker from videos without needing extra guidance. It works well on different types of videos and uses a neural atlas for better consistency, outperforming other methods.
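At a high level the recipe is: render each frame from a temporally shared atlas, which is flicker-free but imperfect (the "flawed" part), then let a small network fuse that rendering with the original frame. A toy sketch of the filtering step (network size and inputs are assumptions):

```python
# Toy neural filter fusing a flickery frame with its atlas rendering.
import torch
import torch.nn as nn

class NeuralFilter(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(6, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 3, 3, padding=1),
        )

    def forward(self, frame, atlas_frame):
        # The atlas rendering supplies temporal consistency; the original
        # frame supplies the detail the flawed atlas misses.
        return self.net(torch.cat([frame, atlas_frame], dim=1))

filt = NeuralFilter()
clean = filt(torch.rand(1, 3, 64, 64),   # original (flickering) frame
             torch.rand(1, 3, 64, 64))   # same frame rendered from the atlas
```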
3D Cinemagraphy can turn a single still image into a video by adding motion and depth. It uses 3D space to create realistic animations and fix common issues like artifacts and inconsistent movements.
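A rough mental model of the pipeline: lift each pixel into 3D using estimated depth, push the resulting points along a motion field, and project them back to the image plane. A toy version with a pinhole camera and made-up intrinsics (purely illustrative, not the paper's code):

```python
# Toy depth-based animation: unproject, displace, reproject.
import torch

H = W = 64
f = 50.0                              # assumed focal length in pixels
depth = torch.full((H, W), 2.0)       # toy depth map
ys, xs = torch.meshgrid(torch.arange(H, dtype=torch.float32),
                        torch.arange(W, dtype=torch.float32), indexing="ij")

# Unproject pixels to camera-space 3D points.
X = (xs - W / 2) * depth / f
Y = (ys - H / 2) * depth / f
points = torch.stack([X, Y, depth], dim=-1)   # (H, W, 3)

# Animate: a trivial motion field drifting the scene along x over time t.
t = 0.5
points[..., 0] += 0.1 * t

# Reproject; (u, v) are where each pixel moved, ready for splatting.
u = points[..., 0] * f / points[..., 2] + W / 2
v = points[..., 1] * f / points[..., 2] + H / 2
```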
Video-P2P can edit videos using advanced techniques like word swap and prompt refinement. It adapts image generation models for video, allowing for the creation of new characters while keeping original poses and scenes.
Projected Latent Video Diffusion Models (PVDM) can generate high-resolution and smooth videos in a low-dimensional latent space. It achieves a state-of-the-art FVD of 639.7 on the UCF-101 benchmark, greatly surpassing previous methods.
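The "projected latent" idea is to factorize the 3D video volume into image-like 2D planes so an ordinary 2D diffusion backbone can do the heavy lifting. A toy illustration of the shape bookkeeping using plain averaging (PVDM learns these projections with an autoencoder; this sketch only conveys the factorization):

```python
# Toy factorization of a video latent volume into three 2D planes.
import torch

video = torch.randn(1, 4, 16, 32, 32)  # (batch, channels, T, H, W)

plane_hw = video.mean(dim=2)   # (1, 4, 32, 32): H x W, time averaged out
plane_tw = video.mean(dim=3)   # (1, 4, 16, 32): T x W
plane_th = video.mean(dim=4)   # (1, 4, 16, 32): T x H

# A 2D diffusion model can denoise these image-like planes; a decoder
# later recombines them into a full video volume.
for p in (plane_hw, plane_tw, plane_th):
    print(tuple(p.shape))
```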
Dreamix can edit videos based on a text prompt while keeping colors, sizes, and camera angles consistent. It combines low-resolution video data with high-quality content, allowing for advanced editing of motion and appearance.