Image-to-Video
Free image-to-video AI tools for quickly transforming images into dynamic videos, perfect for content creators and filmmakers.
Pusa V1.0 can generate high-quality videos from images and text prompts. It achieves a VBench-I2V score of 87.32% with only $500 in training costs and supports features like video transitions and extensions.
Matrix-Game can generate high-quality interactive game worlds in Minecraft.
AnchorCrafter can generate high-quality 2D videos of people interacting with a reference product.
Synergizing Motion and Appearance can generate high-quality talking head videos by combining facial identity from a source image with motion from a driving video.
RealCam-I2V can generate high-quality videos from real-world images with consistent camera parameter controls.
HunyuanPortrait can animate characters from a single portrait image by using facial expressions and head poses from video clips. It achieves lifelike animations with high consistency and control, effectively separating appearance and motion.
MTVCrafter can generate high-quality human image animations from 3D motion sequences.
Skyeyes can generate photorealistic sequences of ground view images from aerial view inputs. It ensures that the images are consistent and realistic, even when there are large gaps in views.
Phantom can generate videos that preserve the subject’s identity from reference images while following text prompts.
SkyReels-V2 can generate infinite-length videos by combining a Diffusion Forcing framework with Multi-modal Large Language Models and Reinforcement Learning.
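Diffusion Forcing roughly means each frame is trained with its own independent noise level instead of one shared level for the whole clip, which is what makes open-ended, frame-by-frame rollout possible. A minimal sketch of that per-frame noising idea (my own simplified schedule, not SkyReels-V2’s actual code):

```python
import torch

# Diffusion Forcing-style noising (illustrative only): every frame in the clip
# gets its own noise level, rather than one level shared across all frames.
latents = torch.randn(8, 4, 32, 32)                  # (frames, channels, H, W) latents
t = torch.rand(latents.shape[0]).view(-1, 1, 1, 1)   # independent noise level per frame
noise = torch.randn_like(latents)
noisy_latents = (1.0 - t) * latents + t * noise      # simple linear noising for illustration
```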
FramePack aims to make video generation feel like image generation. It can generate single video frames in about 1.5 seconds with 13B models on an RTX 4090, and it also runs full 30-fps generation with 13B models on a 6GB laptop GPU, just noticeably slower.
UniAnimate-DiT can generate high-quality animations from human images. It uses the Wan2.1 model and a lightweight pose encoder to create smooth and visually appealing results, while also upscaling animations from 480p to 720p.
VACE basically adds ControlNet support to video models like Wan and LTX. It handles various video tasks like generating videos from references, video inpainting, pose control, sketch-to-video and more.
Perception-as-Control can achieve fine-grained motion control for image animation by creating a 3D motion representation from a reference image.
CausVid can generate high-quality videos at 9.4 frames per second on a single GPU. It supports text-to-video, image-to-video, and dynamic prompting while reducing latency with a causal transformer architecture.
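The latency win comes from causal attention: each frame only attends to earlier frames, so frames can be streamed out one by one instead of waiting for the whole clip. A tiny block-causal mask sketch of that idea (my simplification, not CausVid’s code):

```python
import torch

def block_causal_mask(num_frames: int, tokens_per_frame: int) -> torch.Tensor:
    """Boolean mask: token i may attend to token j only if j's frame is not later than i's."""
    frame_id = torch.arange(num_frames).repeat_interleave(tokens_per_frame)
    return frame_id.unsqueeze(1) >= frame_id.unsqueeze(0)

# Frames become available one at a time, which is what enables streaming
# generation instead of waiting for the full clip to be denoised.
mask = block_causal_mask(num_frames=4, tokens_per_frame=3)
print(mask.int())
```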
LayerAnimate can animate single anime frames from text prompts or interpolate between two frames, with or without sketch guidance. It allows users to adjust foreground and background elements separately.
PP-VCtrl can turn text-to-video models into customizable video generators. It uses control signals like Canny edges and segmentation masks to improve video quality and control without retraining the models, making it great for character animation and video editing.
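For a rough idea of what such a control signal looks like, here’s a generic OpenCV sketch that turns a video into per-frame Canny edge maps; the thresholds and output format are my own choices, not PP-VCtrl’s actual preprocessing.

```python
import cv2
import numpy as np

def canny_control_frames(video_path: str, low: int = 100, high: int = 200) -> np.ndarray:
    """Extract per-frame Canny edge maps from a video as a (T, H, W) uint8 array."""
    cap = cv2.VideoCapture(video_path)
    edges = []
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        edges.append(cv2.Canny(gray, low, high))
    cap.release()
    return np.stack(edges)

# Hypothetical usage: the edge sequence would be fed to the control module
# alongside the text prompt.
control = canny_control_frames("input.mp4")
print(control.shape)  # (num_frames, height, width)
```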
Magic 1-For-1 can generate one-minute video clips in just one minute.
Go-with-the-Flow can control motion patterns in video diffusion models using real-time warped noise from optical flow fields. It lets users manipulate object movements and camera motion while preserving image quality, without requiring changes to existing models. A rough sketch of the warped-noise idea follows below.
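To make the idea concrete, here’s a rough PyTorch sketch of warping a noise tensor along an optical flow field, so the motion ends up baked into the noise the diffusion model starts from. This only illustrates the concept; the actual method takes care to keep the warped noise properly distributed, which this naive resampling glosses over, and all names here are mine.

```python
import torch
import torch.nn.functional as F

def warp_noise_with_flow(noise: torch.Tensor, flow: torch.Tensor) -> torch.Tensor:
    """Warp a noise tensor along an optical-flow field.

    noise: (C, H, W) Gaussian noise from the previous frame.
    flow:  (2, H, W) flow in pixels mapping each target pixel back to its source.
    Returns the noise resampled along the flow, so motion is carried by the noise.
    """
    _, h, w = noise.shape
    # Base pixel grid.
    ys, xs = torch.meshgrid(
        torch.arange(h, dtype=noise.dtype),
        torch.arange(w, dtype=noise.dtype),
        indexing="ij",
    )
    # Shift the grid by the flow, then normalize to [-1, 1] for grid_sample.
    x_src = (xs + flow[0]) / (w - 1) * 2 - 1
    y_src = (ys + flow[1]) / (h - 1) * 2 - 1
    grid = torch.stack((x_src, y_src), dim=-1).unsqueeze(0)  # (1, H, W, 2)
    warped = F.grid_sample(
        noise.unsqueeze(0), grid, mode="nearest",
        padding_mode="border", align_corners=True,
    )
    return warped.squeeze(0)

# Example: carry one noise field through a trivial rightward flow.
noise_t = torch.randn(4, 64, 64)               # latent-channel noise
flow = torch.zeros(2, 64, 64); flow[0] = 2.0   # 2-pixel shift in x
noise_t1 = warp_noise_with_flow(noise_t, flow)
```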
UniVG is yet another video generation system. The highlight of UniVG is its ability to use image inputs for guidance while also steering the generation with additional text prompts. Haven’t seen other video models do this yet.