Controllable Image Generation
Free AI tools for controllable image generation, helping artists and designers create customizable visuals and produce tailored images for their projects.
Parts2Whole can generate customized human portraits from multiple reference images, such as a pose image and different aspects of human appearance. It can condition on parts selected from different people, letting you combine specific facial features, hair, clothing, and more in a single image.
Desigen can generate high-quality design templates, including background images and layout elements. It extends diffusion models for better controllability and has been tested on over 40,000 advertisement banners, achieving results comparable to those of human designers.
Multi-LoRA Composition focuses on integrating multiple Low-Rank Adaptations (LoRAs) to create highly customized and detailed images. The approach can compose several elements into one image without additional fine-tuning and without losing detail or image quality.
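This is not the paper's composition algorithm itself, but as a rough feel for the workflow, here is a minimal sketch of keeping two LoRAs active at inference time via diffusers' adapter API (requires the peft backend); the LoRA paths, adapter names, weights, and prompt are placeholders.

```python
# Minimal sketch: activate two LoRAs together with diffusers' adapter API.
# The base model ID is real; the LoRA paths and adapter names are placeholders.
import torch
from diffusers import StableDiffusionXLPipeline

pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")

# Register each LoRA under its own adapter name (e.g. a character and a style).
pipe.load_lora_weights("path/to/character_lora", adapter_name="character")
pipe.load_lora_weights("path/to/style_lora", adapter_name="style")

# Keep both adapters active with individual weights instead of merging them.
pipe.set_adapters(["character", "style"], adapter_weights=[0.8, 0.7])

image = pipe(
    "a portrait of the character, watercolor style",
    num_inference_steps=30,
).images[0]
image.save("multi_lora.png")
```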
FlexGen can generate high-quality, multi-view images from a single-view image or a text prompt. It lets users modify unseen areas and adjust material properties such as metallic and roughness, giving finer control over the final output.
AmbiGen can generate ambigrams by optimizing letter shapes so a word reads clearly from both orientations. On the 500 most common English words, it improves word accuracy by over 11.6% and reduces edit distance by 41.9%.
It’s been a while since I last doomed the TikTok dancers. MagicDance is gonna doom them some more. This model can combine human motion with reference images to generate appearance-consistent videos with precise motion control. While the results still show visible artifacts and jittering, give it a few months and I’m sure we won’t be able to tell the difference anymore.
Break-A-Scene can extract multiple concepts from a single image using segmentation masks. It allows users to re-synthesize individual concepts or combinations in different contexts, enhancing scene generation with a two-phase customization process.
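The official Break-A-Scene code isn't reproduced here; purely as an illustration of the mask-guided training idea, the hypothetical helper below computes the diffusion loss only inside one concept's segmentation mask (resized to latent resolution), so the learned token is pushed to explain just that region of the image.

```python
import torch
import torch.nn.functional as F

def masked_diffusion_loss(noise_pred: torch.Tensor,
                          noise: torch.Tensor,
                          mask: torch.Tensor) -> torch.Tensor:
    """MSE between predicted and target noise, restricted to one concept's mask.

    noise_pred, noise: (B, C, H, W) latent-space tensors.
    mask: (B, 1, H, W) binary mask resized to the latent resolution.
    """
    se = F.mse_loss(noise_pred, noise, reduction="none")  # per-element error
    return (se * mask).sum() / mask.sum().clamp(min=1.0)  # average inside mask
```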
MultiDiffusion can generate high-quality images from a pre-trained text-to-image diffusion model without further training. It lets users control the output size and aspect ratio (e.g., wide panoramas) and supports guiding the layout with segmentation masks and bounding boxes.
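MultiDiffusion is available in diffusers as the panorama pipeline; assuming that integration, the sketch below generates a wide image by fusing overlapping denoising windows (the prompt and the 512x2048 output size are just examples).

```python
import torch
from diffusers import DDIMScheduler, StableDiffusionPanoramaPipeline

model_id = "stabilityai/stable-diffusion-2-base"
scheduler = DDIMScheduler.from_pretrained(model_id, subfolder="scheduler")
pipe = StableDiffusionPanoramaPipeline.from_pretrained(
    model_id, scheduler=scheduler, torch_dtype=torch.float16
).to("cuda")

# Overlapping denoising windows are fused, so the width can far exceed the
# 512px resolution the base model was trained on.
image = pipe("a photo of the dolomites", height=512, width=2048).images[0]
image.save("panorama.png")
```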
ControlNet can add spatial control to text-to-image diffusion models. It lets users steer image generation with conditions such as edge maps and depth maps, and it trains robustly on both small and large datasets.
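ControlNet is also integrated into diffusers; the sketch below conditions Stable Diffusion on Canny edges extracted from a reference photo. The reference path and prompt are placeholders, and the checkpoint IDs are the commonly used community ones.

```python
import cv2
import numpy as np
import torch
from PIL import Image
from diffusers import ControlNetModel, StableDiffusionControlNetPipeline

# Build the conditioning image: Canny edges from a reference photo.
reference = Image.open("reference.png").convert("RGB")      # placeholder path
edges = cv2.Canny(np.array(reference), 100, 200)             # 1-channel edges
control_image = Image.fromarray(np.stack([edges] * 3, axis=-1))  # to RGB

controlnet = ControlNetModel.from_pretrained(
    "lllyasviel/sd-controlnet-canny", torch_dtype=torch.float16
)
pipe = StableDiffusionControlNetPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",
    controlnet=controlnet,
    torch_dtype=torch.float16,
).to("cuda")

image = pipe(
    "a futuristic cityscape at dusk",
    image=control_image,
    num_inference_steps=30,
).images[0]
image.save("controlnet_canny.png")
```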
StyleGAN-T can generate high-quality images at 512x512 resolution in just 2 seconds on a single NVIDIA A100 GPU. It addresses key challenges of large-scale text-to-image synthesis, such as stable training on diverse datasets and strong text alignment.