Video AI Tools
Free video AI tools for editing, generating animations, and analyzing footage, perfect for filmmakers and content creators seeking efficiency.
Video-LaVIT is a multi-modal video-language method that can both comprehend and generate image and video content, and it supports long video generation.
Last year we got real-time diffusion for images, this year we’ll get it for video! AnimateLCM can generate high-fidelity videos in a minimal number of steps. The model also supports image-to-video generation as well as adapters like ControlNet. It’s not available yet, but once it hits, expect way more AI-generated video content.
Motion-I2V can generate videos from images with clear and controlled motion. It uses a two-stage process with a motion field predictor and temporal attention, allowing for precise control over how things move and enabling video-to-video translation without needing extra training.
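To make that two-stage idea concrete, here is a minimal sketch with placeholder functions (`predict_motion_field` and `render_frames` are hypothetical names, not Motion-I2V's actual API): stage one predicts a dense per-frame motion field from the image and prompt, stage two renders frames that follow it.

```python
# Hypothetical sketch of a two-stage image-to-video pipeline in the spirit of
# Motion-I2V; the real model replaces both placeholders with learned networks.
import numpy as np

def predict_motion_field(image: np.ndarray, prompt: str, num_frames: int) -> np.ndarray:
    """Stage 1 (placeholder): return per-frame (dx, dy) displacement maps."""
    h, w, _ = image.shape
    return np.zeros((num_frames, h, w, 2), dtype=np.float32)

def render_frames(image: np.ndarray, motion: np.ndarray) -> list[np.ndarray]:
    """Stage 2 (placeholder): produce frames that follow the motion field."""
    h, w, _ = image.shape
    frames = []
    for flow in motion:
        # A real model combines warping with temporal attention; here we just
        # shift pixels by the rounded flow as an illustration.
        ys, xs = np.mgrid[0:h, 0:w]
        src_y = np.clip(ys - flow[..., 1].round().astype(int), 0, h - 1)
        src_x = np.clip(xs - flow[..., 0].round().astype(int), 0, w - 1)
        frames.append(image[src_y, src_x])
    return frames

image = np.zeros((64, 64, 3), dtype=np.uint8)
motion = predict_motion_field(image, "the boat drifts to the right", num_frames=16)
video = render_frames(image, motion)  # 16 frames driven by the predicted motion
```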
Language-Driven Video Inpainting can guide the video inpainting process using natural language instructions, which removes the need for manual mask labeling.
VideoCrafter2 can generate high-quality videos from text prompts. It uses low-quality video data and high-quality images to improve visual quality and motion, overcoming data limitations of earlier models.
FMA-Net can turn blurry, low-quality videos into clear, high-quality ones by accurately predicting the degradation and restoration processes while accounting for motion in the video through learned motion patterns.
MagicDriveDiT can generate high-resolution street scene videos for self-driving cars.
MoonShot is a video generation model that can condition on both image and text inputs. It can also integrate with pre-trained image ControlNet modules for geometric visual conditions, making it possible to generate videos with specific visual appearances and structures.
VidToMe can edit videos with a text prompt, custom models, and ControlNet guidance while maintaining strong temporal consistency. The key idea is to merge similar tokens across multiple frames in the self-attention modules, which keeps the generated video temporally consistent.
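A minimal sketch of that token-merging idea, under the simplifying assumption that tokens are matched per position against a reference frame (VidToMe's actual matching is more sophisticated; `merge_similar_tokens` is a hypothetical helper, not the project's code):

```python
# Tokens from several frames that are highly similar get replaced by one
# shared (averaged) token, so self-attention sees the same representation
# across those frames and the output stays temporally consistent.
import numpy as np

def merge_similar_tokens(tokens: np.ndarray, threshold: float = 0.9) -> np.ndarray:
    """tokens: (num_frames, num_tokens, dim). Merge per-position tokens whose
    cosine similarity to frame 0 exceeds the threshold by averaging them."""
    normed = tokens / np.linalg.norm(tokens, axis=-1, keepdims=True)
    sims = np.einsum("ftd,td->ft", normed, normed[0])  # similarity to frame 0
    merged = tokens.copy()
    for t in range(tokens.shape[1]):
        similar_frames = np.where(sims[:, t] > threshold)[0]
        shared = tokens[similar_frames, t].mean(axis=0)
        merged[similar_frames, t] = shared              # same token in all matched frames
    return merged

frames_tokens = np.random.randn(8, 256, 64).astype(np.float32)
out = merge_similar_tokens(frames_tokens)
```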
FreeInit can improve the quality of videos made by diffusion models without extra training. It fixes a mismatch between training and inference, making videos look better and more temporally consistent.
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation can generate realistic and stable videos by separating spatial and temporal factors. It improves video quality by extracting motion and appearance cues, allowing for flexible content variations and better understanding of scenes.
MotionCtrl is a flexible motion controller that is able to manage both camera and object motions in the generated videos and can be used with VideoCrafter1, AnimateDiff, and Stable Video Diffusion.
Given one or more style references, StyleCrafter can generate images and videos in those styles.
Diffusion Motion Transfer is able to translate videos with a text prompt while maintaining the input video’s motion and scene layout.
Sketch Video Synthesis can turn videos into SVG sketches using frame-wise Bézier curves. It allows for impressive visual effects like resizing, color filling, and adding doodles to the original footage while maintaining a smooth flow between frames.
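As a toy illustration of the representation (not the paper's code), each frame can be stored as a handful of cubic Bézier curves and written out as an SVG path, which is what makes edits like recoloring or resizing trivial:

```python
# Represent a sketch frame as cubic Bezier curves and write it as an SVG file.
def bezier_path(curves):
    """curves: list of ((x0,y0), (c1x,c1y), (c2x,c2y), (x1,y1)) control points."""
    parts = []
    for p0, c1, c2, p1 in curves:
        parts.append(f"M {p0[0]} {p0[1]} C {c1[0]} {c1[1]}, {c2[0]} {c2[1]}, {p1[0]} {p1[1]}")
    return " ".join(parts)

def write_svg_frame(curves, path, size=(256, 256)):
    d = bezier_path(curves)
    svg = (f'<svg xmlns="http://www.w3.org/2000/svg" width="{size[0]}" height="{size[1]}">'
           f'<path d="{d}" fill="none" stroke="black" stroke-width="2"/></svg>')
    with open(path, "w") as f:
        f.write(svg)

# One curve per frame, shifted slightly each time to mimic per-frame strokes.
for i in range(4):
    curve = ((10 + i, 200), (80, 40 + i * 5), (180, 40 + i * 5), (246, 200))
    write_svg_frame([curve], f"frame_{i:02d}.svg")
```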
LiveSketch can automatically add motion to a single-subject sketch based on a text prompt describing the desired motion. The outputs are short SVG animations that can be easily edited.
InterpAny-Clearer is a video frame interpolation method that is able to generate clearer and sharper frames compared to existing methods. Additionally, it introduces the ability to manipulate the interpolation of objects in a video independently, which could be useful for video editing tasks.
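For context, the naive baseline such interpolators improve on is plain linear blending of neighboring frames, which ghosts moving objects instead of placing them sharply at an intermediate position; a quick sketch:

```python
# Naive frame interpolation by blending; a learned interpolator instead
# estimates motion and synthesizes a sharp frame at time t.
import numpy as np

def naive_interpolate(frame_a: np.ndarray, frame_b: np.ndarray, t: float = 0.5) -> np.ndarray:
    """Linearly blend two frames at time t in [0, 1]."""
    return ((1.0 - t) * frame_a.astype(np.float32)
            + t * frame_b.astype(np.float32)).astype(np.uint8)

a = np.zeros((64, 64, 3), dtype=np.uint8)
b = np.full((64, 64, 3), 255, dtype=np.uint8)
mid = naive_interpolate(a, b)  # uniform gray: averaging, not true motion interpolation
```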
I2VGen-XL can generate high-quality videos from static images using a cascaded diffusion model. It achieves a resolution of 1280x720 and improves the flow of movement in videos through a two-stage process that separates detail enhancement from overall coherence.
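A hedged sketch of that cascaded idea, with placeholder stages rather than the released model's API: a base stage establishes coherent low-resolution motion, and a refinement stage upsamples to 1280x720 and adds detail.

```python
# Two-stage cascade sketch: coherence first at low resolution, detail second.
import numpy as np

def base_stage(image: np.ndarray, prompt: str, num_frames: int = 16) -> np.ndarray:
    """Placeholder: produce a coherent low-resolution clip (num_frames, 180, 320, 3)."""
    small = image[::4, ::4]  # crude downscale standing in for latent-space generation
    return np.repeat(small[None], num_frames, axis=0)

def refinement_stage(low_res_video: np.ndarray) -> np.ndarray:
    """Placeholder: upsample each frame back to 1280x720 where detail would be added."""
    return low_res_video.repeat(4, axis=1).repeat(4, axis=2)

image = np.zeros((720, 1280, 3), dtype=np.uint8)
coarse = base_stage(image, "a sailboat crossing a calm bay")
video = refinement_stage(coarse)  # (16, 720, 1280, 3)
```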
VideoDreamer is a framework that can generate videos containing the given subjects while conforming to text prompts.
SEINE is a short-to-long video diffusion model that focuses on generative transition and prediction. The goal is to generate high-quality long videos with smooth, creative transitions between scenes and clips of varying lengths. The model can also be used for image-to-video animation and autoregressive video prediction.