AI Toolbox · Image

Image Editing

Free image editing AI tools for quickly enhancing photos, creating visuals, and manipulating images for projects in art, marketing, and design.

Image AI Tools

3D Editing 3D Object Generation 3D Scene Generation Brain-to-Image Controllable Image Generation Image Classification Image Colorization Image Depth Estimation Image Editing Image Editing Controllable Image Generation Image Style Transfer Image Generation Image Inpainting Image Inpainting Image Editing Image Object Detection Image Relighting Image Restoration Image Segmentation Image Style Transfer Image-to-3D Image-to-Depth Image-to-Image Image-to-Sketch Image-to-Text Image-to-Video Image Upscaling Personalized Image Generation Text-to-Image Text-to-Image Personalized Image Generation Video Editing Virtual Image Try-On

Pair Customization

Pair Customization can customize text-to-image models by learning style differences from a single image pair. It separates style and content into different weight spaces, allowing for effective style application without overfitting to specific images.

02.05.24 · Project Page · Code · Image Style Transfer · Image Editing

PuLID

Similar to ConsistentID, PuLID is a tuning-free ID customization method for text-to-image generation. This one can also be used to edit images generated by diffusion models by adding or changing the text prompt.

24.04.24 · Code · Text-to-Image · Image Editing

CharacterFactory

CharacterFactory can generate endless characters that look the same across different images and videos. It uses GANs and word embeddings from celebrity names to ensure characters stay consistent, making it easy to integrate with other models.

24.04.24 · Project Page · Code · Image-to-Image · Image Editing · Personalized Image Generation

Training-and-prompt-free General Painterly Harmonization Using Image-wise Attention Sharing

TF-GPH can blend images with disparate visual elements together stylistically!

19.04.24 · Code · Image Editing

Customizing Text-to-Image Diffusion with Camera Viewpoint Control

CustomDiffusion360 brings camera viewpoint control to text-to-image models. Only caveat: it requires a 360 degree multi-view dataset of around 50 images per object to work.

18.04.24 · Project Page · Code · Text-to-Image · Image Editing

StyleBooth

StyleBooth is a unified style editing method supporting text-based, exemplar-based and compositional style editing. So basically, you can take an image and change its style by either giving it a text prompt or an example image.

18.04.24 · Project Page · Code · Image Editing · Image Style Transfer

MOWA

MOWA is a multiple-in-one image warping model that can be used for various tasks such as rectangling panoramic images, unrolling shutter images, rotating images, fisheye images, and image retargeting.

16.04.24 · Project Page · Code · Image-to-Image · Image Editing

GoodDrag

GoodDrag can improve the stability and image quality of drag editing with diffusion models. It reduces distortions by alternating between drag and denoising operations and introduces a new dataset, Drag100, for better quality assessment.

10.04.24 · Project Page · Code · Image Editing · Image Restoration

ZeST

ZeST can change the material of an object in an image to match a material example image. It can also perform multiple material edits in a single image and perform implicit lighting-aware edits on the rendering of a textured mesh.

09.04.24 · Project Page · Code · Image Inpainting · Image Editing

Automatic Controllable Colorization via Imagination

Imagine Colorization leverages pre-trained diffusion models to colorize images while supporting controllable and user-interactive capabilities.

08.04.24 · Project Page · Code · Image Colorization · Image Editing

FlashFace

FlashFace can personalize photos by using one or a few reference face images and a text prompt. It keeps important details like scars and tattoos while balancing text and image guidance, making it useful for face swapping and turning virtual characters into real people.

25.03.24 · Project Page · Code · Personalized Image Generation · Image Editing

ReNoise

ReNoise can be used to reconstruct an input image that can be edited using text prompts.

21.03.24 · Project Page · Code · Image Editing

Desigen

Desigen can generate high-quality design templates, including background images and layout elements. It uses advanced diffusion models for better control and has been tested on over 40,000 advertisement banners, achieving results similar to human designers.

14.03.24 · Project Page · Code · Controllable Image Generation · Image-to-Image · Image Editing

ELLA

ELLA is a lightweight approach to equip existing CLIP-based diffusion models with LLMs to improve prompt-understanding and enables long dense text comprehension for text-to-image models.

08.03.24 · Project Page · Code · Text-to-Image · Image Editing

ResAdapter

ResAdapter can generate images with any resolution and aspect ratio for diffusion models. It works with various personalized models and processes images efficiently, using only 0.5M parameters while keeping the original style.

04.03.24 · Project Page · Code · Image Upscaling · Image Editing · Image Restoration

OHTA

OHTA can create detailed and usable hand avatars from just one image. It allows for text-to-avatar conversion and editing of hand textures and shapes, using data-driven hand priors to improve accuracy with limited input.

29.02.24 · Project Page · Code · Image-to-3D · Image Editing

Multi-LoRA Composition for Image Generation

Multi-LoRA Composition focuses on the integration of multiple Low-Rank Adaptations (LoRAs) to create highly customized and detailed images. The approach is able to generate images with multiple elements without fine-tuning and without losing detail or image quality.

26.02.24 · Project Page · Code · Controllable Image Generation · Image Editing

Learning Continuous 3D Words for Text-to-Image Generation

Continuous 3D Words is a control method that can modify attributes in images with a slider based approach. This allows for more control over illumination, non-rigid shape changes (like wings), and camera orientation for instance.

13.02.24 · Project Page · Code · Text-to-Image · Image Editing

Repositioning the Subject within Image

SEELE can move around objects within an image. It does so by removing it, inpainting occluded portions and harmonizing the appearance of the repositioned object with the surrounding areas.

30.01.24 · Project Page · Code · Image Inpainting · Image Editing

StableIdentity

StableIdentity is a method that can generate diverse customized images in various contexts from a single input image. The cool thing about this method is, that it is able to combine the learned identity with ControlNet and even inject it into video (ModelScope) and 3D (LucidDreamer) generation.

29.01.24 · Project Page · Code · Personalized Image Generation · Image Editing