Video AI Tools
Free video AI tools for editing, generating animations, and analyzing footage, perfect for filmmakers and content creators seeking efficiency.
SG-I2V can control object and camera motion in image-to-video generation using bounding boxes and trajectories.
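To make the control scheme concrete, here is a minimal sketch of how box-and-trajectory guidance might be specified; the class and field names are hypothetical stand-ins, not SG-I2V's actual API.

    from dataclasses import dataclass

    @dataclass
    class MotionControl:
        box: tuple[int, int, int, int]     # (x0, y0, x1, y1) region to move
        trajectory: list[tuple[int, int]]  # per-frame centre of the box

    # Slide an object 16 px right per frame; a box covering the whole frame
    # would express camera motion in the same way.
    controls = [MotionControl(box=(40, 60, 120, 180),
                              trajectory=[(80, 120), (96, 120), (112, 120)])]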
GIMM is a new video frame interpolation method that uses generalizable implicit motion modelling to predict the motion between frames at arbitrary timesteps.
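For context, here is a bare-bones version of flow-based interpolation, the family GIMM belongs to; GIMM's actual contribution is an implicit network that predicts the flow at any continuous time t, which the precomputed flow_01 stands in for in this sketch.

    import numpy as np

    def warp(frame: np.ndarray, flow: np.ndarray) -> np.ndarray:
        """Backward-warp a (H, W, C) frame by a (H, W, 2) flow field."""
        h, w = frame.shape[:2]
        ys, xs = np.mgrid[0:h, 0:w]
        xs = np.clip(xs + flow[..., 0], 0, w - 1).round().astype(int)
        ys = np.clip(ys + flow[..., 1], 0, h - 1).round().astype(int)
        return frame[ys, xs]

    def interpolate(f0, f1, flow_01, t=0.5):
        # Scale the flow to time t, warp from both sides, then blend.
        mid_from_f0 = warp(f0, flow_01 * t)
        mid_from_f1 = warp(f1, -flow_01 * (1 - t))
        return (1 - t) * mid_from_f0 + t * mid_from_f1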
AutoVFX can automatically create realistic visual effects in a video from natural-language editing instructions.
Adaptive Caching can speed up video generation with Diffusion Transformers by caching and reusing computations across denoising steps. It can achieve up to 4.7 times faster video creation at 720p without losing quality.
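A toy sketch of the caching idea, assuming a residual transformer block: if the block's input barely changed since the last denoising step, reuse the cached residual instead of recomputing it. The change metric and threshold below are illustrative, not AdaCache's actual schedule.

    import torch

    class CachedBlock(torch.nn.Module):
        def __init__(self, block: torch.nn.Module, tol: float = 1e-2):
            super().__init__()
            self.block, self.tol = block, tol
            self.prev_x = self.prev_residual = None

        def forward(self, x: torch.Tensor) -> torch.Tensor:
            if self.prev_x is not None:
                change = (x - self.prev_x).norm() / self.prev_x.norm()
                if change < self.tol:              # input barely moved:
                    return x + self.prev_residual  # reuse cached residual
            residual = self.block(x)               # otherwise recompute
            self.prev_x, self.prev_residual = x.detach(), residual.detach()
            return x + residual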
Self-Supervised Any-Point Tracking by Contrastive Random Walks can track any point in a video using a self-supervised global matching transformer.
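The contrastive random walk objective itself is compact enough to sketch: walk from one frame to the next and back through softmax affinities, and train the features so every point returns to itself (the round-trip transition matrix should be near identity).

    import torch
    import torch.nn.functional as F

    def crw_loss(feat_a: torch.Tensor, feat_b: torch.Tensor, tau: float = 0.07):
        """feat_a, feat_b: (N, D) L2-normalized point features of two frames."""
        sim = feat_a @ feat_b.T / tau          # (N, N) affinities
        a_to_b = F.softmax(sim, dim=1)         # walk forward
        b_to_a = F.softmax(sim.T, dim=1)       # walk back
        round_trip = a_to_b @ b_to_a           # rows are return distributions
        targets = torch.arange(feat_a.size(0))
        return F.nll_loss(round_trip.clamp_min(1e-9).log(), targets)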
MOFT is a training-free video motion interpreter and controller. It extracts motion information from video diffusion features and uses it to guide the motion of generated videos, all without retraining.
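The guidance step can be sketched generically as feature matching on the latent; extract_motion stands in for MOFT's motion-feature extractor and is an assumption, not its real interface.

    import torch

    def guide(latent, extract_motion, ref_motion, lr=0.1, steps=3):
        """Nudge a latent so its motion features match a reference."""
        latent = latent.detach().requires_grad_(True)
        for _ in range(steps):
            loss = (extract_motion(latent) - ref_motion).pow(2).mean()
            (grad,) = torch.autograd.grad(loss, latent)
            latent = (latent - lr * grad).detach().requires_grad_(True)
        return latent.detach()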
TANGO can generate high-quality body-gesture videos that match speech audio from a single video. It improves realism and synchronization by fixing audio-motion misalignment and using a diffusion model for smooth transitions.
MonST3R can estimate 3D shapes from videos over time, creating a dynamic point cloud and tracking camera positions. This method improves video depth estimation and separates moving from still objects more effectively than previous techniques.
MimicTalk can generate personalized 3D talking faces in under 15 minutes. It mimics a person’s talking style with an in-context stylized audio-to-motion model, resulting in high-quality videos.
Tex4D can generate 4D textures for untextured mesh sequences from a text prompt. It combines 3D geometry with video diffusion models to ensure the textures are consistent across different views and frames.
Depth Any Video can generate high-resolution depth maps for videos. It uses a large dataset of 40,000 annotated clips to improve accuracy and includes a method for better depth inference across sequences of up to 150 frames.
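Covering sequences that long typically means overlapping-window inference; this generic sketch averages depth predictions in the overlaps and is an assumption about the stitching, not the paper's exact method.

    import numpy as np

    def infer_long(frames, predict, win=32, overlap=8):
        """frames: (N, H, W, 3); predict maps a clip to (T, H, W) depths."""
        n = len(frames)
        step = win - overlap
        starts = list(range(0, max(n - win, 0) + 1, step))
        if starts[-1] + win < n:            # make sure the tail is covered
            starts.append(n - win)
        depth = np.zeros(frames.shape[:3])
        count = np.zeros(n)
        for s in starts:
            depth[s:s + win] += predict(frames[s:s + win])
            count[s:s + win] += 1
        return depth / count[:, None, None]  # average overlapping windows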
TweedieMix can generate images and videos that combine multiple personalized concepts.
FreeLong can generate 128-frame videos from short video diffusion models trained on 16-frame videos, without requiring additional training. It’s not SOTA, but has just the right amount of cursedness 👌
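FreeLong's trick is blending global (all-frame) and local (windowed) temporal attention features in the frequency domain; this sketch shows just that blending step on toy (T, D) feature tensors, with an illustrative cutoff.

    import torch

    def spectral_blend(global_feat, local_feat, cutoff=4):
        """Low temporal frequencies from global, high ones from local."""
        g = torch.fft.rfft(global_feat, dim=0)
        loc = torch.fft.rfft(local_feat, dim=0)
        mask = torch.zeros_like(g)
        mask[:cutoff] = 1.0                  # keep low frequencies of global
        blended = g * mask + loc * (1 - mask)
        return torch.fft.irfft(blended, n=global_feat.size(0), dim=0)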
VSTAR is a method that enables text-to-video models to generate longer videos with dynamic visual evolution in a single pass, without any fine-tuning.
Hallo2 can create long, high-resolution (4K) animations of portrait images driven by audio. It allows users to adjust facial expressions with text labels, improving control and reducing issues like appearance drift and temporal artifacts.
Pyramidal Flow Matching can generate high-quality 5 to 10-second videos at 768p resolution and 24 FPS. It uses a unified pyramidal flow matching algorithm to link flows across different stages, making video creation more efficient.
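The pyramid idea in schematic form: early, noisy timesteps run on downsampled latents and later stages work at full resolution, so most denoising happens where it is cheap. The stage boundaries below are illustrative, not the paper's schedule.

    stages = [
        {"t_range": (1.0, 0.7), "scale": 1 / 4},  # coarse: 1/4 resolution
        {"t_range": (0.7, 0.3), "scale": 1 / 2},
        {"t_range": (0.3, 0.0), "scale": 1},      # fine: full resolution
    ]
    for s in stages:
        hi, lo = s["t_range"]
        print(f"denoise t in [{lo}, {hi}] at {s['scale']:g}x spatial scale")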
TCAN can animate characters of various styles from a pose guidance video.
GAGAvatar can create 3D head avatars from a single image and enable real-time facial expression reenactment.
Time Reversal can generate the in-between frames of two input images. In particular, this enables looping cinemagraphs as well as camera and subject motion videos.
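The underlying trick, sketched under assumptions: denoise one video conditioned on the start frame and another conditioned on the end frame, reverse the second in time, and fuse the two paths at each step so the result hits both endpoints. Here i2v_denoise_step is a hypothetical image-to-video denoising call, not the paper's actual interface.

    import torch

    def fuse_step(lat_fwd, lat_bwd, i2v_denoise_step, frame_a, frame_b, t):
        fwd = i2v_denoise_step(lat_fwd, cond=frame_a, t=t)
        bwd = i2v_denoise_step(lat_bwd, cond=frame_b, t=t)
        fused = (fwd + torch.flip(bwd, dims=[0])) / 2  # average with reversal
        return fused, torch.flip(fused, dims=[0])      # feed back to both paths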
MotionMaster can extract camera motions from one or more source videos and apply them to new videos. This enables flexible, controllable camera motion, including variable-speed zoom, left and right pans, dolly zoom in and out, and more.