AI Toolbox · Video

Text-to-Video

Free text-to-video AI tools for creating engaging video content from scripts, perfect for filmmakers, marketers, and content creators.

Video AI Tools

Audio-to-Video Controllable Video Generation Image-to-Video Lip Syncing Personalized Video Generation Sketch-to-Video Talking Head Generation Text-to-Video Video Analysis Video Captioning Video Colorization Video Depth Estimation Video Editing Video Generation Video Inpainting Video Interpolation Video Object Detection Video Object Tracking Video Outpainting Video Outpainting Video Editing Video Personalization Video Prediction Video Reconstruction Video Relighting Video Restoration Video Scene Detection Video Style Transfer Video Summarization Video-to-4D Video-to-Audio Video-to-Video Video-to-Video Translation Video Upscaling Virtual Video Try-On

Mobius

Mobius can generate seamlessly looping videos from text descriptions.

16.03.25 · Project Page · Code · Text-to-Video

MovieAgent

MovieAgent can generate long-form videos with multiple scenes and shots from a script and character bank. It ensures character consistency and synchronized subtitles while reducing the need for human input in movie production.

13.03.25 · Project Page · Code · Text-to-Video · Video Editing

VideoMaker

VideoMaker can generate personalized videos from a single subject reference image.

04.03.25 · Project Page · Code · Personalized Video Generation · Controllable Video Generation · Text-to-Video

Step-Video-T2V

Step-Video-T2V can generate high-quality videos up to 204 frames long using a 30B parameter text-to-video model.

18.02.25 · Code · Model · Text-to-Video

Magic 1-For-1

Magic 1-For-1 can generate one-minute video clips in just one minute.

12.02.25 · Project Page · Code · Image-to-Video · Text-to-Video

Diffusion as Shader

Diffusion as Shader can generate high-quality videos from 3D tracking inputs.

11.02.25 · Project Page · Code · Text-to-Video · Video Editing · Video Object Detection · Controllable Video Generation

Lumina-Video

Lumina-Video can generate high-quality videos with synchronized sound from text prompts.

11.02.25 · Code · Model · Text-to-Video

FlashVideo

FlashVideo can generate videos from text prompts and upscale them to 1080p.

10.02.25 · Project Page · Code · Model · Video Upscaling · Text-to-Video

VideoGuide

VideoGuide can improve the quality of videos made by text-to-video models without needing extra training. It enhances the smoothness of motion and clarity of images, making the videos more coherent and visually appealing.

07.02.25 · Project Page · Code · Text-to-Video

RepVideo

RepVideo can improve video generation by making visuals look better and ensuring smooth transitions.

16.01.25 · Project Page · Code · Text-to-Video

Kinetic Typography Diffusion Model

Kinetic Typography Diffusion Model can generate kinetic typography videos with legible and artistic letter motions based on text prompts.

11.01.25 · Project Page · Code · Text-to-Video

UniVG

UniVG is yet another video generation system. The highlight of UniVG is its ability to use image inputs for guidance and modify and guide generation with additional text prompts. Haven’t seen other video models do this yet.

10.01.25 · Project Page · Code · Text-to-Video · Image-to-Video

TransPixar

TransPixar can generate RGBA videos, enabling the creation of transparent elements like smoke and reflections that blend seamlessly into scenes.

07.01.25 · Project Page · Code · Demo · Text-to-Video

DiTCtrl

DiTCtrl can generate multi-prompt videos with smooth transitions and consistent object motion.

24.12.24 · Project Page · Code · Text-to-Video

CustomCrafter

CustomCrafter can generate high-quality videos from text prompts and reference images. It improves motion generation with a Dynamic Weighted Video Sampling Strategy and allows for better concept combinations without needing extra video or fine-tuning.

16.12.24 · Project Page · Code · Text-to-Video · Video Editing

SynCamMaster

SynCamMaster can generate videos from different viewpoints while keeping the look and shape consistent. It improves text-to-video models for multi-camera use and allows re-rendering from new angles.

11.12.24 · Project Page · Code · Text-to-Video

Customizing Motion in Text-to-Video Diffusion Models

On the other hand, Customizing Motion can learn and generalize input motion patterns from input videos and apply them to new and unseen contexts.

09.12.24 · Project Page · Code · Text-to-Video

VideoRepair

VideoRepair can improve text-to-video generation by finding and fixing small mismatches between text prompts and videos.

05.12.24 · Project Page · Code · Text-to-Video

Adaptive Caching

Adaptive Caching can speed up video generation with Diffusion Transformers by caching important calculations. It can achieve up to 4.7 times faster video creation at 720p without losing quality.

05.11.24 · Project Page · Code · Text-to-Video

VSTAR

VSTAR is a method that enables text-to-video models to generate longer videos with dynamic visual evolution in a single pass, without finetuning needed.

10.10.24 · Project Page · Code · Text-to-Video