Image Generation AI with ComfyUI: IPAdapter Edition

IPAdapter (Image Prompt Adapter) is a technique that extends Stable Diffusion, an image generation model. It lets you supply an image prompt in addition to the text prompt, which gives finer control over the style and content of the generated image. For example, you can generate images with a specific art style, composition, or color palette, which is very useful in creative work.

For an explanation of how the method itself works, please refer to the article below.

In this article, I show how to actually use IPAdapter in ComfyUI, and I check its effect through the generated results.

Procedure

1. Install ComfyUI

Please refer to this article.

[Stable Diffusion] Let's Play with Image Generation AI Using ComfyUI [Installation Edition]

blog.otama-playground.com

2. Install the Custom Node

Install ComfyUI_IPAdapter_plus. You can install it either through ComfyUI-Manager or with git clone.

GitHub - cubiq/ComfyUI_IPAdapter_plus

github.com
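
If you take the git clone route, the step can be scripted. The following is a minimal sketch, assuming ComfyUI lives in a folder named ComfyUI under the current directory (adjust the path to your setup) and that the custom node goes into its custom_nodes folder. The helper names build_clone_command and clone are made up for this illustration.

```python
import subprocess
from pathlib import Path

# Repository named in this article.
REPO_URL = "https://github.com/cubiq/ComfyUI_IPAdapter_plus"


def build_clone_command(comfyui_root):
    """Return the git command that clones the node into <root>/custom_nodes."""
    target = Path(comfyui_root) / "custom_nodes" / "ComfyUI_IPAdapter_plus"
    return ["git", "clone", REPO_URL, str(target)]


def clone(comfyui_root):
    """Run the clone; raises CalledProcessError if git fails."""
    subprocess.run(build_clone_command(comfyui_root), check=True)


# Dry run: only print the command. Call clone("ComfyUI") to actually install.
# "ComfyUI" is an assumed install location, not something ComfyUI requires.
print(" ".join(build_clone_command("ComfyUI")))
```

Restart ComfyUI after cloning so that the new nodes are loaded.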

3. Download the IPAdapter Models

Download the models you want to use from the list below and place them in /ComfyUI/models/ipadapter.

Note: Create the ipadapter folder if it does not exist.

The list below is copied from the repository.

ip-adapter_sd15.safetensors, Basic model, average strength
ip-adapter_sd15_light_v11.bin, Light impact model
ip-adapter-plus_sd15.safetensors, Plus model, very strong
ip-adapter-plus-face_sd15.safetensors, Face model, portraits
ip-adapter-full-face_sd15.safetensors, Stronger face model, not necessarily better
ip-adapter_sd15_vit-G.safetensors, Base model, requires bigG clip vision encoder
ip-adapter_sdxl_vit-h.safetensors, SDXL model
ip-adapter-plus_sdxl_vit-h.safetensors, SDXL plus model
ip-adapter-plus-face_sdxl_vit-h.safetensors, SDXL face model
ip-adapter_sdxl.safetensors, vit-G SDXL model, requires bigG clip vision encoder

Download the CLIP vision encoders below and place them in /ComfyUI/models/clip_vision. Rename each file so that it matches the name shown below. (If you forget to rename them, loading fails with an error.)

CLIP-ViT-H-14-laion2B-s32B-b79K.safetensors, download and rename
CLIP-ViT-bigG-14-laion2B-39B-b160k.safetensors, download and rename
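
Because a misplaced or misnamed file only shows up as an error at generation time, it can help to check the folders beforehand. The following is a rough sketch, not part of ComfyUI or the custom node. It assumes that the two models flagged in the list above need the bigG encoder and that every other model uses the ViT-H encoder. The function names and the ComfyUI/models path are assumptions for illustration.

```python
from pathlib import Path

VIT_H = "CLIP-ViT-H-14-laion2B-s32B-b79K.safetensors"
BIG_G = "CLIP-ViT-bigG-14-laion2B-39B-b160k.safetensors"

# The two models that the list above marks as requiring the bigG encoder.
NEEDS_BIG_G = {"ip-adapter_sd15_vit-G.safetensors", "ip-adapter_sdxl.safetensors"}


def expected_files(ipadapter_files):
    """Paths (relative to the models folder) needed by the chosen models."""
    paths = [Path("ipadapter") / name for name in ipadapter_files]
    # Assumption: any model not flagged for bigG pairs with the ViT-H encoder.
    encoders = {BIG_G if name in NEEDS_BIG_G else VIT_H for name in ipadapter_files}
    paths += [Path("clip_vision") / name for name in sorted(encoders)]
    return [p.as_posix() for p in paths]


def find_missing(models_dir, ipadapter_files):
    """Return the expected files that do not exist under models_dir."""
    root = Path(models_dir)
    return [p for p in expected_files(ipadapter_files) if not (root / p).is_file()]


if __name__ == "__main__":
    # Assumed location of the models folder; adjust to your install.
    for path in find_missing("ComfyUI/models", ["ip-adapter_sd15.safetensors"]):
        print("missing:", path)
```

An empty result means every expected file is present under the expected name.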

4. Create the Workflow

The minimal configuration looks like this. The yellow nodes are the ones added to the default workflow.
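
As a textual aid, the wiring of such a minimal configuration can be written down as a list of connections. The following is a schematic sketch only. The node labels are descriptive rather than guaranteed node names (they differ between versions of the custom node), this is not ComfyUI's workflow file format, and it is one plausible wiring rather than necessarily the exact graph used in this article. The key point it illustrates is that the IPAdapter node sits on the MODEL connection between the checkpoint loader and the KSampler.

```python
# Nodes in ComfyUI's default text-to-image workflow.
DEFAULT_NODES = {
    "Load Checkpoint", "CLIP Text Encode (positive)", "CLIP Text Encode (negative)",
    "Empty Latent Image", "KSampler", "VAE Decode", "Save Image",
}

# (source node, data type passed along, destination node)
# Labels are illustrative assumptions, not exact class names.
EDGES = [
    ("Load Checkpoint", "MODEL", "Apply IPAdapter"),
    ("Load IPAdapter Model", "IPADAPTER", "Apply IPAdapter"),
    ("Load CLIP Vision", "CLIP_VISION", "Apply IPAdapter"),
    ("Load Image", "IMAGE", "Apply IPAdapter"),
    ("Apply IPAdapter", "MODEL", "KSampler"),
    ("Load Checkpoint", "CLIP", "CLIP Text Encode (positive)"),
    ("Load Checkpoint", "CLIP", "CLIP Text Encode (negative)"),
    ("CLIP Text Encode (positive)", "CONDITIONING", "KSampler"),
    ("CLIP Text Encode (negative)", "CONDITIONING", "KSampler"),
    ("Empty Latent Image", "LATENT", "KSampler"),
    ("KSampler", "LATENT", "VAE Decode"),
    ("Load Checkpoint", "VAE", "VAE Decode"),
    ("VAE Decode", "IMAGE", "Save Image"),
]


def added_nodes(edges, default_nodes):
    """Nodes that appear in the graph but not in the default workflow."""
    used = {node for src, _, dst in edges for node in (src, dst)}
    return sorted(used - default_nodes)


def model_path(edges, start="Load Checkpoint", end="KSampler"):
    """Follow the MODEL connections from the checkpoint to the sampler."""
    next_node = {src: dst for src, kind, dst in edges if kind == "MODEL"}
    path = [start]
    while path[-1] != end:
        path.append(next_node[path[-1]])
    return path


print(added_nodes(EDGES, DEFAULT_NODES))
print(" -> ".join(model_path(EDGES)))
```

The text prompt, latent, and VAE connections stay as they are in the default workflow; only the MODEL connection is rerouted through the IPAdapter node.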

Generation Result 1

I used an image of the Mona Lisa as the input and tried replacing only the person with a bearded man.

Prompt

a man with beard

Input Image

Generated Image

Generation Result 2 (Specifying Only the Face)

With a face model, you can generate images that use a specified face. When using this feature, use an input image that shows only the face.

Prompt

a man playing soccer, stadium, grass field, dribbling

Conclusion

In this article, I covered the basic usage of IPAdapter (Image Prompt Adapter) for Stable Diffusion and checked its effect. By combining an image prompt with the text prompt, IPAdapter makes image generation more intuitive and gives finer control over the result. It is especially useful when you want images with a specific art style, composition, or color palette. Please give it a try.

If you want to try other image generation techniques, see the link collection below.

Stable Diffusion Guide: A Collection of Useful Links for Image Generation

blog.otama-playground.com
