Video AI Tools
Free video AI tools for editing, generating animations, and analyzing footage, perfect for filmmakers and content creators seeking efficiency.
DisPose can generate high-quality human image animations from sparse skeleton pose guidance.
SynCamMaster can generate videos from different viewpoints while keeping the look and shape consistent. It improves text-to-video models for multi-camera use and allows re-rendering from new angles.
ObjCtrl-2.5D enables object control in image-to-video generation using 3D trajectories from 2D inputs with depth information.
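Lifting a 2D trajectory into 3D with depth is, at its core, standard pinhole-camera unprojection. The sketch below illustrates that idea only; the function name and intrinsics are hypothetical and not ObjCtrl-2.5D's actual API.

```python
import numpy as np

def lift_trajectory_to_3d(points_2d, depths, fx, fy, cx, cy):
    """Unproject 2D pixel trajectory points into 3D camera coordinates
    using a per-point depth and pinhole intrinsics (fx, fy, cx, cy)."""
    points_2d = np.asarray(points_2d, dtype=float)
    depths = np.asarray(depths, dtype=float)
    x = (points_2d[:, 0] - cx) / fx * depths
    y = (points_2d[:, 1] - cy) / fy * depths
    return np.stack([x, y, depths], axis=1)

# Example: a horizontal 2D path whose depth increases, so the 3D
# trajectory recedes from the camera.
traj_2d = [(320, 240), (330, 240), (340, 240)]
depths = [2.0, 2.5, 3.0]
traj_3d = lift_trajectory_to_3d(traj_2d, depths, fx=500, fy=500, cx=320, cy=240)
```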
3DTrajMaster can control the 3D motions of multiple objects in videos using user-defined 6DoF pose sequences.
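A 6DoF pose is a rotation (3 degrees of freedom) plus a translation (3 more), and a pose sequence is a list of such transforms applied over time. Here is a minimal sketch of building and applying one SE(3) step; the helper names are illustrative, not 3DTrajMaster's interface.

```python
import numpy as np

def pose_matrix(rx, ry, rz, tx, ty, tz):
    """Build a 4x4 SE(3) transform from Euler angles (radians) and a
    translation -- one step of a 6DoF pose sequence."""
    cx, sx = np.cos(rx), np.sin(rx)
    cy, sy = np.cos(ry), np.sin(ry)
    cz, sz = np.cos(rz), np.sin(rz)
    Rx = np.array([[1, 0, 0], [0, cx, -sx], [0, sx, cx]])
    Ry = np.array([[cy, 0, sy], [0, 1, 0], [-sy, 0, cy]])
    Rz = np.array([[cz, -sz, 0], [sz, cz, 0], [0, 0, 1]])
    T = np.eye(4)
    T[:3, :3] = Rz @ Ry @ Rx  # Z-Y-X rotation order
    T[:3, 3] = [tx, ty, tz]
    return T

def apply_pose(points, T):
    """Transform Nx3 object points by a 4x4 pose matrix."""
    pts = np.hstack([np.asarray(points, float), np.ones((len(points), 1))])
    return (T @ pts.T).T[:, :3]
```

Chaining a sequence of such matrices frame by frame moves an object along a user-defined 6DoF path.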
Customizing Motion can learn motion patterns from input videos and apply them to new, unseen contexts.
MEMO can generate talking videos from images and audio. It keeps the person’s identity consistent and matches lip movements to the audio, producing natural expressions.
CAVIS can perform instance segmentation on videos. It tracks objects more reliably and improves instance matching accuracy, yielding more accurate and stable segmentations.
VideoRepair can improve text-to-video generation by finding and fixing small mismatches between text prompts and videos.
Inverse Painting can generate time-lapse videos of the painting process from a target artwork. It uses a diffusion-based renderer to learn from real artists’ techniques, producing realistic results across different artistic styles.
CAT4D can create dynamic 4D scenes from single videos. It uses a multi-view video diffusion model to generate videos from different angles, allowing for strong 4D reconstruction and high-quality images.
SAMURAI combines the state-of-the-art visual video tracking of SAM 2 with motion-aware memory.
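The gist of motion-aware memory can be sketched as reweighting candidate masks by how well they agree with a motion prediction. The predictor and weighting below are a simplified illustration (a constant-velocity stand-in, not SAMURAI's actual Kalman-filter formulation or API):

```python
import numpy as np

class ConstantVelocityPredictor:
    """Toy motion model: predict the next object center by extrapolating
    the last two observed centers (stand-in for a full Kalman filter)."""
    def __init__(self):
        self.history = []

    def update(self, center):
        self.history.append(np.asarray(center, dtype=float))

    def predict(self):
        if len(self.history) < 2:
            return self.history[-1]
        return self.history[-1] + (self.history[-1] - self.history[-2])

def motion_aware_score(mask_score, candidate_center, predictor,
                       alpha=0.5, scale=50.0):
    """Blend the tracker's mask confidence with agreement to the motion
    prediction (illustrative weighting, not the paper's exact formula)."""
    dist = np.linalg.norm(np.asarray(candidate_center, float) - predictor.predict())
    motion_score = np.exp(-dist / scale)
    return alpha * mask_score + (1 - alpha) * motion_score
```

A candidate mask near the predicted position then outscores an equally confident candidate far from it, which helps the tracker resist distractors.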
StableV2V can stabilize shape consistency in video-to-video editing by breaking down the editing process into steps that match user prompts. It handles text-based, image-based, and video inpainting.
CamI2V is a method that generates videos from images with precise control over camera movements and text prompts.
JoyVASA can generate high-quality lip-sync videos of human and animal faces from a single image and speech clip.
CHANGER can integrate an actor’s head onto a target body in digital content. It uses chroma keying for clear backgrounds and enhances blending quality with Head shape and long Hair augmentation (H2 augmentation) and a Foreground Predictive Attention Transformer (FPAT).
DAWN can generate talking head videos from a single portrait and audio clip. It produces lip movements and head poses quickly, making it effective for creating long video sequences.
DimensionX can generate photorealistic 3D and 4D scenes from a single image using controllable video diffusion.
SG-I2V can control object and camera motion in image-to-video generation using bounding boxes and trajectories.
GIMM is a new video interpolation method that uses motion modelling to predict motion between frames.
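The basic idea behind motion-based interpolation can be shown with a deliberately naive sketch: scale an estimated flow field by the target time, warp both frames toward it, and blend. GIMM's learned implicit motion model is far more sophisticated; this is only the underlying intuition, with hypothetical names.

```python
import numpy as np

def interpolate_frame(frame0, frame1, flow_0_to_1, t=0.5):
    """Naive motion-compensated interpolation: backward-warp frame0 by
    t * flow and frame1 by (1 - t) * flow, then blend. Nearest-neighbor
    sampling and no occlusion handling -- a simplistic stand-in for
    learned motion modelling."""
    h, w = frame0.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w]
    x0 = np.clip(xs - t * flow_0_to_1[..., 0], 0, w - 1).astype(int)
    y0 = np.clip(ys - t * flow_0_to_1[..., 1], 0, h - 1).astype(int)
    x1 = np.clip(xs + (1 - t) * flow_0_to_1[..., 0], 0, w - 1).astype(int)
    y1 = np.clip(ys + (1 - t) * flow_0_to_1[..., 1], 0, h - 1).astype(int)
    return (1 - t) * frame0[y0, x0] + t * frame1[y1, x1]
```

For a bright pixel moving 2 px to the right between two frames, the interpolated midpoint frame places it 1 px along that path.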
AutoVFX can automatically create realistic visual effects in videos from a single image and text instructions.