Stable Diffusion Guide: Image Generation Links

This is a collection of techniques you can use when generating images with Stable Diffusion. This article covers methods using ComfyUI.

How to Generate the Pictures You Want

LoRA

A method to extend the base model and add weights to tweak the output. For example, using a LoRA trained on night views makes it easier to generate night view images.

【Stable Diffusion】ComfyUIを使って画像生成AIで遊んでみよう【LoRA編】

blog.otama-playground.com

ControlNet

A method to extend the base model so it can accept pose data as input. It makes it easier to generate images with the shape specified by the pose data.

ComfyUIでControlNet：姿勢指定した画像生成AIの基本

blog.otama-playground.com

Textual Inversion (embedding)

A method to extend the Text Encoder part (the part that interprets prompts) and embed learned language vectors. Since it does not extend the model itself, it is relatively lightweight, but note that it cannot (or is difficult to) generate concepts that the model has not learned.

【Stable Diffusion】ComfyUIを使って画像生成AIで遊んでみよう【Textual Inversion編】

blog.otama-playground.com

IPAdapter

A method that allows images to be accepted as prompts simultaneously with text prompts.

【Stable Diffusion】ComfyUIを使って画像生成AIで遊んでみよう【IPAdapter編】

blog.otama-playground.com

InstantID

A method that allows you to fix the face of the output image by inputting a face image.

【Stable Diffusion】ComfyUIを使って画像生成AIで遊んでみよう【InstantID編】

blog.otama-playground.com

How to Upscale (High Resolution)

Hires.fix

A method of regenerating a low-resolution image at a high resolution using it as input. Since processing is done in the latent space, details are also added in the process.

ComfyUIでHires.fix：画像を高解像度化する手順

blog.otama-playground.com