CLIP-KO
https://gyazo.com/90ab9fdaac3a925efeac21d9c159edb2
https://github.com/zer0int/CLIP-fine-tune
zer0int
/CLIP-fine-tune
https://github.com/zer0int/CLIP-fine-tune/blob/CLIP-vision/KO-CLIP-teaser/KO-CLIP-paper-final.pdf
Paper
https://huggingface.co/zer0int/CLIP-KO-TypoAttack-Attn-Dropout-ViT-L-14
CLIP-KO-TypoAttack-Attn-Dropout-ViT-L-14
Typographic attack
に堅牢になるように設計された
CLIP
の改良版
CLIP-KO-LITE
https://huggingface.co/zer0int/CLIP-KO-LITE-TypoAttack-Attn-Dropout-ViT-L-14
CLIP-KO-LITE-TypoAttack-Attn-Dropout-ViT-L-14
CLIP-KOに柔軟性をもたせたもの
画像生成でテキストエンコーダで使う場合はこれを使わないといけない?
Long-CLIP-KO
https://huggingface.co/zer0int/LongCLIP-KO-LITE-TypoAttack-Attn-ViT-L-14
LongCLIP-KO-LITE-TypoAttack-Attn-ViT-L-14
長文(248トークン)に対応させたモデル
https://www.reddit.com/r/StableDiffusion/comments/1lyzjkh/clipko_knocking_out_the_text_obsession/
CLIP-KO: Knocking out the text obsession (typographic attack vulnerability) in CLIP. New Model, Text Encoder, Code, Dataset.
https://www.reddit.com/r/StableDiffusion/comments/1m1ntom/followup_longclip_variant_of_clipko_knocking_out/
Follow-Up: Long-CLIP variant of CLIP-KO, Knocking Out the Typographic Attack Vulnerability in CLIP. Models & Code.