AI Toolbox

Image AI Tools

Free image AI tools for generating and editing visuals, creating 3D assets for games, films, and more, optimizing your creative projects.

Image AI Tools

3D Editing 3D Object Generation 3D Scene Generation Brain-to-Image Controllable Image Generation Image Captioning Image Classification Image Colorization Image Depth Estimation Image Editing Image Editing Controllable Image Generation Image Style Transfer Image Generation Image Inpainting Image Inpainting Image Editing Image Object Detection Image Relighting Image Restoration Image Segmentation Image Style Transfer Image-to-3D Image-to-Depth Image-to-Image Image-to-Sketch Image-to-Text Image-to-Video Image Upscaling Personalized Image Generation Text-to-Image Text-to-Image Personalized Image Generation Video Captioning Video Editing Virtual Image Try-On

PosterCraft

PosterCraft can generate high-quality aesthetic posters by improving how text and art work together.

13.06.25 · Project Page · Code · Demo · Model · Text-to-Image

GEN3C

GEN3C can generate photorealistic videos from single or sparse-view images while keeping camera control and 3D consistency.

10.06.25 · Project Page · Code · Image-to-Video

GCC

GCC can inpaint color checkers into images to improve lighting and color accuracy.

08.06.25 · Project Page · Code · Image Inpainting · Image Colorization

RepText

RepText can render multilingual visual text in user-chosen fonts without needing to understand the text. It allows for customization of text content, font, and position.

07.06.25 · Project Page · Code · Image Generation · Text-to-Image

MARBLE

MARBLE can blend and change the material properties of objects in images using material embeddings in CLIP-space. It allows control over attributes like roughness, metallic, transparency, and glow, enabling multiple edits at once and supporting various artistic styles.

04.06.25 · Project Page · Code · Image Editing

OmniPainter

OmniPainter can generate high-quality images that match a prompt and a style reference image in just 4 to 6 timesteps. It uses the self-consistency property of latent consistency models to ensure the results closely align with the style of the reference image.

27.05.25 · Project Page · Code · Text-to-Image · Image Style Transfer

BAGEL

BAGEL is a unified multimodal model that can understand and generate images and text, excelling in tasks like image editing and predicting future frames. Basically the open-source version of GPT-4o.

23.05.25 · Project Page · Code · Image Editing

PixelHacker

PixelHacker can perform image inpainting with strong consistency in structure and meaning. It uses a diffusion-based model and a dataset of 14 million image-mask pairs, achieving better results than other methods in texture, shape, and color consistency.

20.05.25 · Project Page · Code · Image Inpainting

Custom SVG

Custom SVG can generate high-quality SVGs from text prompts with customizable styles.

16.05.25 · Project Page · Code · Text-to-Image

Marigold

Marigold can estimate depth, predict surface normals, and decompose images with minimal changes.

14.05.25 · Project Page · Code · Image-to-Image · Image Relighting · Image Editing

PreciseCam

PreciseCam can generate images with exact control over camera angles and lens distortions using four simple camera settings.

07.05.25 · Project Page · Code · Text-to-Image · Controllable Image Generation · Image Editing

AnyStory

AnyStory can generate consistent single- and multi-subject images from text.

30.04.25 · Project Page · Code · Text-to-Image · Personalized Image Generation

SwiftSketch

SwiftSketch can generate high-quality vector sketches from images in under a second. It uses a diffusion model to create editable sketches that work well for different object types and are not limited by resolution.

29.04.25 · Project Page · Code · Image-to-Sketch

Step1X-Edit

Step1X-Edit can perform advanced image editing tasks by processing reference images and user instructions.

25.04.25 · Code · Image Editing

Describe Anything

[Describe Anything] can generate detailed descriptions for specific areas in images and videos using points, boxes, scribbles, or masks. It produces context-aware captions that highlight subtle details and changes over time, achieving top performance on seven benchmarks for localized captioning.

24.04.25 · Project Page · Code · Image Captioning · Video Captioning

SwiftBrush v2

SwiftBrush v2 can improve the quality of images generated by one-step text-to-image diffusion models. Results look great, and apparently it ranks better than all GAN-based and multi-step Stable Diffusion models in benchmarks. No code though 🤷‍♂️

24.04.25 · Project Page · Code · Text-to-Image

InstantCharacter

InstantCharacter can generate high-quality images of personalized characters from a single reference image with FLUX. It supports different styles and poses, ensuring identity consistency and allowing for text-based edits.

22.04.25 · Project Page · Code · Demo · Personalized Image Generation

ID-Patch

ID-Patch can generate personalized group photos by matching faces with specific positions. It reduces problems like identity leakage and visual errors, achieving high accuracy and speed—seven times faster than other methods.

22.04.25 · Project Page · Code · Text-to-Image · Image-to-Image · Personalized Image Generation · Controllable Image Generation

Shape-Guided Clothing Warping for Virtual Try-On

SCW-VTON can fit in-shop clothing to a person’s image while keeping their pose consistent. It improves the shape of the clothing and reduces distortions in visible limb areas, making virtual try-on results look more realistic.

20.04.25 · Code · Virtual Image Try-On

PosterMaker

PosterMaker can generate high-quality product posters by rendering text accurately and keeping the main subject clear.

18.04.25 · Project Page · Code · Image Editing · Image Inpainting · Image-to-Image