Playing with LoRA (LoRAと遊ぶ) - 基素基

This extension is for AUTOMATIC1111's Stable Diffusion web UI. It allows the Web UI to add some networks (e.g. LoRA) to the original Stable Diffusion model to generate images.

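Currently only LoRA is supported.

This extension can use LoRA models (*.ckpt or *.safetensors) trained with the sd-scripts repository. Models trained with other LoRA repositories are not supported. https://github.com/kohya-ss/sd-scripts/blob/main/README-ja.md

In other words, this extension lets you train a LoRA with this person's scripts and then use the result 基素.icon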

So how do you train a LoRA?

sd-scripts provides two training methods

Which one is better? 基素.icon

The DreamBooth method

Uses an identifier (such as sks) and a class, and optionally regularization images
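A minimal sketch of how the identifier and class could be combined into captions. It assumes the common DreamBooth convention where training images are captioned "identifier class" and regularization images are captioned with the class alone. The function name and the class `dog` are hypothetical, chosen only for illustration; this is not code from sd-scripts.

```python
def dreambooth_captions(identifier: str, class_name: str) -> dict[str, str]:
    """Return illustrative captions for training and regularization images."""
    return {
        # Training images: identifier followed by the class, e.g. "sks dog".
        "train": f"{identifier} {class_name}",
        # Regularization images (optional): the class alone, e.g. "dog".
        "regularization": class_name,
    }


captions = dreambooth_captions("sks", "dog")
print(captions["train"])
print(captions["regularization"])
```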

https://github.com/kohya-ss/sd-scripts/blob/main/train_db_README-ja.md

Is this equivalent to LoRAと遊ぶ#63efb606774b17000088f83e? 基素.icon

The method based on NAI's (NovelAI's) proposal

https://github.com/kohya-ss/sd-scripts/blob/main/fine_tune_README_ja.md

Uses captions

The training method proposed by NovelAI

Automatic captioning

Tagging

DeepDanbooru or WD14Tagger

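Fine-tunes the U-Net of Stable Diffusion using Diffusers. It supports the following improvements described in NovelAI's article (NovelAI's code was used as a reference for Aspect Ratio Bucketing, but the final code is entirely original).

By default, the Auto Encoder is not trained. When fine-tuning a whole model, training only the U-Net seems to be the common approach (NovelAI apparently does the same). The Text Encoder can also be made a training target by specifying an option.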

example

LoRAを使ってみよう｜感想日記

https://gyazo.com/cc24c6bf9be57b3f638c8242be8e39b7

https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#lora

A method to fine tune weights for CLIP and Unet, the language model and the actual image de-noiser used by Stable Diffusion, published in 2021. Paper. A good way to train Lora is to use kohya-ss.

Support for Lora is built into the Web UI, but there is an extension with the original implementation by kohya-ss.

Currently, Lora networks for Stable Diffusion 2.0+ models are not supported by Web UI.

Lora is added to the prompt by putting the following text into any location: <lora:filename:multiplier>, where filename is the name of file with Lora on disk, excluding extension, and multiplier is a number, generally from 0 to 1, that lets you choose how strongly Lora will affect the output. Lora cannot be added to the negative prompt.
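As a small illustration of the syntax above, the tag can be built from a LoRA file path by dropping the extension. This is only a sketch: the helper names and the path `models/Lora/myStyle.safetensors` are hypothetical, not part of the Web UI.

```python
from pathlib import Path


def lora_tag(model_path: str, multiplier: float = 1.0) -> str:
    """Build a <lora:filename:multiplier> tag from a LoRA file path."""
    # The tag uses the file name without its extension.
    name = Path(model_path).stem
    return f"<lora:{name}:{multiplier:g}>"


def add_lora(prompt: str, model_path: str, multiplier: float = 1.0) -> str:
    """Append the LoRA tag to a positive prompt (not the negative prompt)."""
    return f"{prompt} {lora_tag(model_path, multiplier)}"


print(add_lora("a portrait of a girl", "models/Lora/myStyle.safetensors", 0.7))
```

The tag goes in the positive prompt only, since Lora cannot be added to the negative prompt.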

The text for adding Lora to the prompt, <lora:filename:multiplier>, is only used to enable Lora, and is erased from prompt afterwards, so you can't do tricks with prompt editing like [<lora:one:1.0>|<lora:two:1.0>]. A batch with multiple different prompts will only use the Lora from the first prompt.
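To see why prompt editing cannot switch between Loras, here is a rough sketch of the "read the tags, then erase them" behaviour described above. The regular expression and the function name are assumptions made for illustration; this is not the Web UI's actual implementation.

```python
import re

# Matches <lora:filename:multiplier>; the multiplier is captured as text.
LORA_TAG = re.compile(r"<lora:([^:>]+):([^>]+)>")


def extract_loras(prompt: str) -> tuple[str, list[tuple[str, float]]]:
    """Return the prompt with Lora tags removed, plus the (name, multiplier) pairs found."""
    loras = [(name, float(mult)) for name, mult in LORA_TAG.findall(prompt)]
    cleaned = LORA_TAG.sub("", prompt)
    return cleaned, loras


cleaned, loras = extract_loras("[<lora:one:1.0>|<lora:two:1.0>] a cat")
print(cleaned)  # the tags are erased before the prompt syntax is interpreted
print(loras)    # both Loras are simply enabled for the whole generation
```

In this sketch both Loras end up enabled and the brackets are left with nothing to alternate between, which matches the note that such tricks do not work.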