Image AI Tools
Free image AI tools for generating and editing visuals and creating 3D assets for games, films, and more, helping you streamline your creative projects.
Diffusion with Forward Models can reconstruct 3D scenes from a single input image. It can also add small, short motions to images with people in them.
Cocktail is a pipeline for guiding image generation. Compared to ControlNet, it requires only one generalized model for multiple modalities such as Edge, Pose, and Mask guidance.
There is a new text-to-image player in town called RAPHAEL. The model aims to generate highly artistic images that accurately portray text prompts encompassing multiple nouns, adjectives, and verbs. This is all great, but only if someone actually releases the model for open-source use, as the community is craving a model that can achieve Midjourney quality.
Super-Resolution of License Plate Images Using Attention Modules and Sub-Pixel Convolution Layers can enhance low-resolution license plate images. It uses attention and transformer modules to improve details, along with a loss function based on Optical Character Recognition to achieve better image quality.
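The sub-pixel convolution idea is easy to illustrate in isolation: a convolution predicts r² channels per low-resolution pixel, and a pixel-shuffle step rearranges those channels into an r×-larger image. A minimal stdlib-only sketch of that rearrangement (the function name and nested-list layout are illustrative, not taken from the paper):

```python
def pixel_shuffle(x, r):
    """Rearrange a (C*r*r, H, W) nested-list tensor into (C, H*r, W*r).

    Each group of r*r input channels becomes one r x r block of output pixels,
    matching the usual depth-to-space channel ordering.
    """
    c2, h, w = len(x), len(x[0]), len(x[0][0])
    c = c2 // (r * r)
    out = [[[0.0] * (w * r) for _ in range(h * r)] for _ in range(c)]
    for ch in range(c2):
        oc, rem = divmod(ch, r * r)   # output channel, position within block
        dy, dx = divmod(rem, r)       # offset inside the r x r block
        for i in range(h):
            for j in range(w):
                out[oc][i * r + dy][j * r + dx] = x[ch][i][j]
    return out
```

For example, four 1x1 channels with values 0..3 shuffle into a single 2x2 image; in a real network this step sits after the last convolution so upsampling is learned rather than interpolated.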
Break-A-Scene can extract multiple concepts from a single image using segmentation masks. It allows users to re-synthesize individual concepts or combinations in different contexts, enhancing scene generation with a two-phase customization process.
DragGAN can manipulate images by letting users drag points to change the pose, shape, and layout of objects. It produces realistic results even when parts of the image are hidden or deformed.
FastComposer can generate personalized images of multiple unseen individuals in various styles and actions without fine-tuning. It is 300x-2500x faster than traditional methods and requires no extra storage for new subjects, using subject embeddings and localized attention to keep identities clear.
What if you could generate images of an untrained concept by providing just a few images, without having to fine-tune a model first? InstantBooth from Adobe might be the answer. This novel approach builds on pre-trained text-to-image models to enable instant text-guided image personalization without fine-tuning. Compared to methods like DreamBooth and Textual Inversion, InstantBooth generates competitive results on unseen concepts in terms of language-image alignment, image fidelity, and identity preservation while being 100 times faster. Wen open-source?
Ray Conditioning is a lightweight and geometry-free technique for multi-view image generation. You have that perfect portrait shot of a face but the angle is not right? No problem, just use that shot as an input image and generate the portrait from another angle. Done.
Improved Diffusion-based Image Colorization via Piggybacked Models can colorize grayscale images using knowledge from pre-trained Text-to-Image diffusion models. It allows for conditional colorization with user hints and text prompts, achieving high-quality results.
DiFaReli can relight single-view face images by managing lighting effects like shadows and global illumination. It uses a conditional diffusion model to separate lighting information, achieving photorealistic results without needing 3D data.
Expressive Text-to-Image Generation with Rich Text can create detailed images from text by using rich text formatting like font style, size, and color. This method allows for better control over styles and colors, making it easier to generate complex scenes compared to regular text.
Inst-Inpaint can remove objects from images using natural language instructions, which saves time by not needing binary masks. It uses a new dataset called GQA-Inpaint, improving the quality and accuracy of image inpainting significantly.
Reference-based Image Composition with Sketch via Structure-aware Diffusion Model can edit images by filling in missing parts using a reference image and a sketch. This method improves editability and allows for detailed changes in various scenes.
PAIR Diffusion is a generic framework that can enable a diffusion model to control the structure and appearance properties of each object in an image. This allows for various object-level editing operations on real images such as reference image-based appearance editing, free-form shape editing, adding objects, and variations.
LDMs (Latent Diffusion Models) are high-resolution image generators that can inpaint, generate images from text or bounding boxes, and perform super-resolution.
eDiff-I can generate high-resolution images from text prompts using different diffusion models for each stage. It also allows users to control image creation by selecting and moving words on a canvas.
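The ensemble idea behind eDiff-I boils down to routing each denoising step to the expert trained for that noise interval. A hedged sketch of such a dispatcher (the interval boundaries and expert functions are made-up placeholders, not eDiff-I's actual interfaces):

```python
def ensemble_denoise(x_t, t, experts):
    """Pick the denoiser whose noise-level interval contains timestep t, apply it.

    experts: list of (t_min, t_max, denoise_fn) tuples. In an eDiff-I-style
    setup, a high-noise expert handles global layout early in sampling and a
    low-noise expert handles fine detail late in sampling.
    """
    for t_min, t_max, denoise_fn in experts:
        if t_min <= t < t_max:
            return denoise_fn(x_t, t)
    raise ValueError(f"no expert covers timestep {t}")
```

Because each expert only ever sees its own slice of the noise schedule during training, it can specialize without increasing per-step inference cost.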
100kb models? Combining multiple individually learned concepts? 1-shot personalization? Key-Locking? Perfusion just might be a new viable Stable Diffusion fine-tuning method by NVIDIA. No way to try it out yet, as there is, as usual, no code, but I’m keeping an eye on this one.
Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models can quickly personalize text-to-image models using just one image and only 5 training steps. This method reduces training time from minutes to seconds while maintaining quality through regularized weight-offsets.
Reduce, Reuse, Recycle can enable compositional generation using energy-based diffusion models and MCMC samplers. It improves tasks like classifier-guided ImageNet modeling and text-to-image generation by introducing new samplers that enhance performance.
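The core trick behind such compositional samplers can be shown with toy 1-D Gaussians: each "model" contributes a score (the gradient of its log-density), the scores are summed to target the product of the distributions, and unadjusted Langevin dynamics, one of the simplest MCMC samplers in this family, draws samples from the composition. All numbers below are illustrative, not from the paper:

```python
import math
import random

def composed_score(x, mus):
    """Sum of scores of unit-variance Gaussians N(mu, 1): d/dx log p_i(x) = mu - x.

    Summing scores targets the (unnormalized) product of the densities.
    """
    return sum(mu - x for mu in mus)

def langevin_sample(mus, steps=500, eps=0.05, n_chains=2000, seed=0):
    """Unadjusted Langevin dynamics: x <- x + eps*score(x) + sqrt(2*eps)*noise."""
    rng = random.Random(seed)
    xs = [rng.gauss(0.0, 1.0) for _ in range(n_chains)]
    for _ in range(steps):
        xs = [x + eps * composed_score(x, mus) + math.sqrt(2 * eps) * rng.gauss(0.0, 1.0)
              for x in xs]
    return xs
```

The product of N(2, 1) and N(4, 1) is N(3, 0.5), so chains composed from those two scores should concentrate around 3; the same score-summing idea, with fancier samplers, is what lets diffusion models combine concepts.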