Video AI Tools
Free video AI tools for editing, generating animations, and analyzing footage, perfect for filmmakers and content creators seeking efficiency.
MVOC is a training-free method for multiple video object composition with diffusion models. It can composite several video objects into a single video while maintaining motion and identity consistency.
Conditional Image Leakage can be used to generate videos with more dynamic and natural motion from image prompts.
Image Conductor can generate video assets from a single image with precise control over camera transitions and object movements.
Mora can enable generalist video generation through a multi-agent framework. It supports text-to-video generation, video editing, and digital world simulation, achieving performance similar to the Sora model.
EvTexture is a video super-resolution method that utilizes event signals for texture enhancement, recovering more accurate textures and finer high-resolution details.
MM-Diffusion can generate high-quality audio-video pairs using a multi-modal diffusion model with two coupled denoising autoencoders.
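The coupling of the two denoisers can be pictured as two sub-networks that denoise video and audio latents in lockstep while exchanging cross-modal context at every step. The sketch below illustrates that idea only; the layer sizes and fusion scheme are placeholders, not MM-Diffusion's actual architecture.

```python
# Illustrative sketch of coupled denoisers for audio-video generation.
# Dimensions and the fusion scheme are made up for demonstration.
import torch
import torch.nn as nn

class CoupledDenoiser(nn.Module):
    def __init__(self, video_dim=64, audio_dim=32, ctx_dim=16):
        super().__init__()
        self.video_net = nn.Sequential(nn.Linear(video_dim + ctx_dim, 128), nn.SiLU(), nn.Linear(128, video_dim))
        self.audio_net = nn.Sequential(nn.Linear(audio_dim + ctx_dim, 128), nn.SiLU(), nn.Linear(128, audio_dim))
        self.video_to_ctx = nn.Linear(video_dim, ctx_dim)  # summary of video fed to the audio branch
        self.audio_to_ctx = nn.Linear(audio_dim, ctx_dim)  # summary of audio fed to the video branch

    def forward(self, noisy_video, noisy_audio):
        # Each modality predicts its own noise, conditioned on a summary of the other.
        audio_ctx = self.audio_to_ctx(noisy_audio)
        video_ctx = self.video_to_ctx(noisy_video)
        video_eps = self.video_net(torch.cat([noisy_video, audio_ctx], dim=-1))
        audio_eps = self.audio_net(torch.cat([noisy_audio, video_ctx], dim=-1))
        return video_eps, audio_eps

denoiser = CoupledDenoiser()
v, a = torch.randn(2, 64), torch.randn(2, 32)
video_eps, audio_eps = denoiser(v, a)
print(video_eps.shape, audio_eps.shape)  # torch.Size([2, 64]) torch.Size([2, 32])
```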
ReVideo can change video content in specific areas while keeping the motion intact. It allows users to customize motion paths and uses a three-stage training method for precise video editing.
Slicedit can edit videos with a simple text prompt that retains the structure and motion of the original video while adhering to the target text.
ViViD can transfer a clothing item onto the video of a target person. The method is able to capture garment details and human posture, resulting in more coherent and lifelike videos.
FIFO-Diffusion can generate infinitely long videos from text without extra training. It performs diagonal denoising over a first-in-first-out queue of frames at increasing noise levels, which keeps memory use constant regardless of video length and parallelizes well across multiple GPUs.
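A minimal sketch of that queue idea, assuming a stand-in `denoise_one_step` function in place of a real video diffusion model (frame shapes and step counts are illustrative):

```python
# FIFO-style diagonal denoising: a fixed-length queue holds frames at
# increasing noise levels; each step denoises the whole queue by one level,
# emits the (nearly) clean head frame, and enqueues fresh noise at the tail.
from collections import deque
import torch

QUEUE_LEN = 8              # number of frames kept in memory (constant)
FRAME_SHAPE = (4, 32, 32)  # latent frame shape (placeholder)

def denoise_one_step(frames, noise_levels):
    """Placeholder for one reverse-diffusion step on a stack of frames."""
    return frames - 0.1 * frames * noise_levels.view(-1, 1, 1, 1)

def generate(num_output_frames):
    # Queue position i holds a frame at noise level (i + 1) / QUEUE_LEN.
    queue = deque(torch.randn(FRAME_SHAPE) for _ in range(QUEUE_LEN))
    levels = torch.arange(1, QUEUE_LEN + 1, dtype=torch.float32) / QUEUE_LEN
    outputs = []
    while len(outputs) < num_output_frames:
        frames = torch.stack(list(queue))
        frames = denoise_one_step(frames, levels)
        outputs.append(frames[0])               # head frame leaves the queue
        queue = deque(frames[1:])               # remaining frames shift forward
        queue.append(torch.randn(FRAME_SHAPE))  # fresh noise enters at the tail
    return outputs

frames = generate(24)  # memory stays bounded by QUEUE_LEN regardless of length
print(len(frames), frames[0].shape)
```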
SignLLM is the first multilingual Sign Language Production (SLP) model. It can generate sign language gestures from input text or prompts and achieve state-of-the-art performance on SLP tasks across eight sign languages.
SwapTalk can transfer the facial features of a user's avatar onto a video while lip-syncing it to chosen audio. It improves video quality and lip-sync accuracy, producing results that are more consistent than other methods.
StoryDiffusion can generate long sequences of images and videos that maintain consistent content across frames. The method can convert a text-based story into a video with smooth transitions and consistent subjects.
VimTS can extract text from both images and videos within a single model, improving generalization across different types of media.
FlowSAM can discover and segment moving objects in videos by combining the Segment Anything Model (SAM) with optical flow. It outperforms previous methods, achieving better object identity and sequence-level segmentation for both single and multi-object scenarios.
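A simplified illustration of the flow-plus-SAM pairing: estimate optical flow between two frames, render it as an image, and let SAM segment it so that moving objects stand out as distinct regions. FlowSAM itself adapts SAM to take flow as input or prompt; this sketch just chains off-the-shelf RAFT and SAM, and the file paths and checkpoint name are placeholders.

```python
# Simplified flow-driven moving-object segmentation (not FlowSAM's exact pipeline).
import torch
from torchvision.io import read_image
from torchvision.models.optical_flow import raft_large, Raft_Large_Weights
from torchvision.utils import flow_to_image
from segment_anything import sam_model_registry, SamAutomaticMaskGenerator

# 1. Estimate optical flow between two consecutive frames with RAFT.
#    (Frame height and width should be divisible by 8 for RAFT.)
weights = Raft_Large_Weights.DEFAULT
raft = raft_large(weights=weights).eval()
frame1 = read_image("frame_000.png")  # placeholder paths
frame2 = read_image("frame_001.png")
img1, img2 = weights.transforms()(frame1.unsqueeze(0), frame2.unsqueeze(0))
with torch.no_grad():
    flow = raft(img1, img2)[-1]  # (1, 2, H, W), final refinement iteration

# 2. Convert the flow field into an RGB visualization where moving objects pop out.
flow_rgb = flow_to_image(flow[0]).permute(1, 2, 0).numpy()  # HxWx3, uint8

# 3. Run SAM's automatic mask generator on the flow image to get motion segments.
sam = sam_model_registry["vit_b"](checkpoint="sam_vit_b_01ec64.pth")  # placeholder checkpoint
masks = SamAutomaticMaskGenerator(sam).generate(flow_rgb)
moving_objects = sorted(masks, key=lambda m: m["area"], reverse=True)
print(f"found {len(moving_objects)} motion segments")
```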
AniClipart can turn static clipart images into high-quality animations. It uses Bézier curves for smooth motion and aligns movements with text prompts, improving how well the animation matches the text and maintains visual style.
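The Bézier-trajectory idea can be shown in a few lines: each keypoint of the clipart follows a cubic Bézier curve, which gives smooth, easily optimizable motion. The control points below are made up for illustration; in AniClipart they are optimized against the text prompt.

```python
# Sampling a smooth per-frame keypoint trajectory from a cubic Bézier curve.
import numpy as np

def cubic_bezier(p0, p1, p2, p3, t):
    """Evaluate a cubic Bézier curve at parameter t in [0, 1]."""
    p0, p1, p2, p3 = map(np.asarray, (p0, p1, p2, p3))
    return ((1 - t) ** 3 * p0 + 3 * (1 - t) ** 2 * t * p1
            + 3 * (1 - t) * t ** 2 * p2 + t ** 3 * p3)

def keypoint_trajectory(control_points, num_frames=24):
    """Sample a per-frame position for one keypoint from its Bézier curve."""
    ts = np.linspace(0.0, 1.0, num_frames)
    return np.stack([cubic_bezier(*control_points, t) for t in ts])

# Example: one keypoint sweeping up and to the right over 24 frames.
traj = keypoint_trajectory([(0, 0), (10, 40), (40, 60), (80, 20)], num_frames=24)
print(traj.shape)         # (24, 2) -> x, y position per frame
print(traj[0], traj[-1])  # starts at (0, 0), ends at (80, 20)
```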
Motion control is another active area of video research. Peekaboo can precisely control the position, size, and trajectory of an object in generated videos through bounding boxes.
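A minimal sketch of that bounding-box interface: linearly interpolate an object's box between a start and end position, producing one binary mask per frame. Peekaboo uses such masks to modulate attention inside the video diffusion model; that part is omitted here, and all sizes are illustrative.

```python
# Turning a box trajectory into per-frame binary masks.
import numpy as np

def box_trajectory_masks(start_box, end_box, num_frames, height, width):
    """start_box/end_box are (x0, y0, x1, y1); returns (num_frames, H, W) masks."""
    start = np.asarray(start_box, dtype=float)
    end = np.asarray(end_box, dtype=float)
    masks = np.zeros((num_frames, height, width), dtype=np.float32)
    for f in range(num_frames):
        t = f / max(num_frames - 1, 1)
        x0, y0, x1, y1 = np.round((1 - t) * start + t * end).astype(int)
        masks[f, y0:y1, x0:x1] = 1.0  # region the object should occupy in frame f
    return masks

# Object moves from the top-left to the bottom-right while growing slightly.
masks = box_trajectory_masks((10, 10, 60, 60), (180, 120, 250, 200),
                             num_frames=16, height=256, width=256)
print(masks.shape, masks[0].sum(), masks[-1].sum())  # per-frame occupied area changes
```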
Ctrl-Adapter is a new framework that can be used to add diverse controls to any image or video diffusion model, enabling things like video control with sparse frames, multi-condition control, and video editing.
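The adapter idea can be sketched as a small trainable block that projects features from a pretrained control branch and adds them to a frozen diffusion backbone's features, with averaging to support several conditions at once. The dimensions and fusion rule below are placeholders, not Ctrl-Adapter's exact design.

```python
# Illustrative adapter that fuses control features into frozen backbone features.
import torch
import torch.nn as nn

class ControlAdapter(nn.Module):
    def __init__(self, control_dim=320, backbone_dim=640):
        super().__init__()
        # Only this small block would be trained; control branch and backbone stay frozen.
        self.proj = nn.Sequential(
            nn.Conv2d(control_dim, backbone_dim, kernel_size=1),
            nn.SiLU(),
            nn.Conv2d(backbone_dim, backbone_dim, kernel_size=3, padding=1),
        )

    def forward(self, backbone_feat, control_feats):
        """backbone_feat: (B, C, H, W); control_feats: list of (B, C_ctrl, H, W)."""
        adapted = torch.stack([self.proj(c) for c in control_feats]).mean(dim=0)
        return backbone_feat + adapted  # inject the (averaged) control signal

adapter = ControlAdapter()
backbone_feat = torch.randn(1, 640, 32, 32)
depth_feat, pose_feat = torch.randn(1, 320, 32, 32), torch.randn(1, 320, 32, 32)
fused = adapter(backbone_feat, [depth_feat, pose_feat])  # two conditions combined
print(fused.shape)  # torch.Size([1, 640, 32, 32])
```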
Sparse Global Matching for Video Frame Interpolation with Large Motion can handle large motion in frame interpolation by complementing local flow estimation with a sparse set of global matches between the input frames.
CameraCtrl can control camera angles and movements in text-to-video generation. It improves video storytelling by adding a camera module to existing video diffusion models, making it easier to create dynamic scenes from text and camera inputs.
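One common way to hand a camera trajectory to such a camera module is as dense per-pixel Plücker ray embeddings computed from each frame's intrinsics and pose. The sketch below shows that encoding under made-up intrinsics, poses, and resolution; it is not CameraCtrl's code.

```python
# Encoding a per-frame camera trajectory as Plücker ray embeddings.
import numpy as np

def plucker_embedding(K, R, t, height, width):
    """Per-pixel Plücker coordinates (6 channels) for one camera.
    K: 3x3 intrinsics, R: 3x3 world-to-camera rotation, t: translation."""
    ys, xs = np.meshgrid(np.arange(height) + 0.5, np.arange(width) + 0.5, indexing="ij")
    pix = np.stack([xs, ys, np.ones_like(xs)], axis=-1)           # (H, W, 3) homogeneous pixels
    cam_rays = pix @ np.linalg.inv(K).T                           # ray directions in camera space
    dirs = cam_rays @ R                                           # equals R^T @ ray: world-space directions
    dirs /= np.linalg.norm(dirs, axis=-1, keepdims=True)
    origin = -R.T @ t                                             # camera center in world space
    moment = np.cross(np.broadcast_to(origin, dirs.shape), dirs)  # o x d
    return np.concatenate([dirs, moment], axis=-1)                # (H, W, 6)

# Toy trajectory: camera translating along x over 4 frames.
K = np.array([[100.0, 0, 32], [0, 100.0, 32], [0, 0, 1]])
frames = [plucker_embedding(K, np.eye(3), np.array([0.1 * f, 0, 0]), 64, 64) for f in range(4)]
print(np.stack(frames).shape)  # (4, 64, 64, 6) camera conditioning per frame
```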