AI Toolbox

Image AI Tools

Free image AI tools for generating and editing visuals, creating 3D assets for games, films, and more, optimizing your creative projects.

Image AI Tools

3D Editing 3D Object Generation 3D Scene Generation Brain-to-Image Controllable Image Generation Image Captioning Image Classification Image Colorization Image Depth Estimation Image Editing Image Editing Controllable Image Generation Image Style Transfer Image Generation Image Inpainting Image Inpainting Image Editing Image Object Detection Image Relighting Image Restoration Image Segmentation Image Style Transfer Image-to-3D Image-to-Depth Image-to-Image Image-to-Sketch Image-to-Text Image-to-Video Image Upscaling Personalized Image Generation Text-to-Image Text-to-Image Personalized Image Generation Video Captioning Video Editing Virtual Image Try-On

One-Prompt-One-Story

One-Prompt-One-Story can generate consistent images from a single text prompt by combining all prompts into one input for text-to-image models.

24.01.25 · Project Page · Code · Text-to-Image · Image Editing

X-Dyna

X-Dyna can animate a single human image by transferring facial expressions and body movements from a video.

21.01.25 · Project Page · Code · Image-to-Video

ReF-LDM

ReF-LDM can restore low-quality face images by using multiple high-quality reference images.

19.01.25 · Project Page · Code · Image Restoration · Image Inpainting

Chat2SVG

Chat2SVG can generate and edit SVG vector graphics from text prompts. It combines Large Language Models and image diffusion models to create detailed SVG templates and allows users to refine them with simple language instructions.

13.01.25 · Project Page · Code · Text-to-Image · Image Editing

TryOffDiff

TryOffDiff can generate high-quality images of clothing from photos of people wearing them.

08.01.25 · Project Page · Code · Image-to-Image

LLM4GEN

LLM4GEN enhances the semantic understanding ability of text-to-image diffusion models by leveraging the semantic representation of LLMs. Meaning: More complex and dense prompts that involve multiple objects, attribute binding, and long descriptions.

07.01.25 · Project Page · Code · Text-to-Image

Reflecting Reality

Reflecting Reality can generate realistic mirror reflections using a method called MirrorFusion. It allows users to control mirror placement and achieves better reflection quality and geometry than other methods.

05.01.25 · Project Page · Code · Image Inpainting

AniDoc

AniDoc can automate the colorization of line art in videos and create smooth animations from simple sketches.

19.12.24 · Project Page · Code · Video Colorization · Image Colorization

FitDiT

FitDiT can generate realistic virtual try-on images that show how clothes fit on different body types. It keeps garment textures clear and works quickly, taking only 4.57 seconds for a single image.

17.12.24 · Project Page · Code · Image-to-Image · Image Editing · Virtual Image Try-On

ColorFlow

ColorFlow can colorize black and white line-art and manga panels while keeping characters and objects consistent.

17.12.24 · Project Page · Code · Demo · Image Colorization

InvSR

InvSR can upscale images in one to five steps. It achieves great results even with just one step, making it efficient for improving images in real-world situations.

13.12.24 · Code · Demo · Image Upscaling · Image Restoration

Personalized Restoration via Dual-Pivot Tuning

Personalized Restoration is a method that can restore degraded images of faces while retaining the identity of the person using reference images. The method is able to edit the restored image using text prompts, enabling modifications like changing the color of the eyes or making the person smile.

12.12.24 · Project Page · Code · Image Restoration

Leffa

Leffa can generate person images based on reference images, allowing for precise control over appearance and pose.

12.12.24 · Project Page · Code · Demo · Image Editing · Personalized Image Generation · Virtual Image Try-On

TryOffAnyone

TryOffAnyone can generate high-quality images of clothing on models from photos.

12.12.24 · Code · Image-to-Image

FireFlow

FireFlow is FLUX-dev editing method that can perform fast image inversion and semantic editing with just 8 diffusion steps.

10.12.24 · Code · Image Editing

Factor Graph Diffusion

Factor Graph Diffusion can generate high-quality images with better prompt adherence. The method allows for controllable image creation using tools like segmentation and depth maps.

09.12.24 · Project Page · Code · Image Editing · Controllable Image Generation

MV-Adapter

MV-Adapter can generate images from multiple views while keeping them consistent across views. It enhances text-to-image models like Stable Diffusion XL, supporting both text and image inputs, and achieves high-resolution outputs at 768x768.

06.12.24 · Project Page · Code · Text-to-Image · Image-to-Image

Anagram-MTL

Anagram-MTL can generate visual anagrams that change appearance with transformations like flipping or rotating.

04.12.24 · Code · Text-to-Image

Negative Token Merging

Negative Token Merging can improve image diversity by pushing apart similar features during the reverse diffusion process. It reduces visual similarity with copyrighted content by 34.57% and works well with Stable Diffusion as well as Flux.

02.12.24 · Project Page · Code · Text-to-Image · Controllable Image Generation

FlowEdit

FlowEdit can edit images using only text prompts with Flux and Stable Diffusion 3.

02.12.24 · Project Page · Code · Image Editing · Text-to-Image