paint-with-words-sd
https://gyazo.com/d65fd40dfdd98a1bfb9499c23cc4a980
Recently, researchers from NVIDIA proposed eDiffi. In the paper, they suggest a method that allows "painting with words". Basically, this is like Make-A-Scene, but it works just by adjusting the cross-attention scores. You can see the results and the detailed method in the paper. Their implementation was not open-sourced. Still, paint-with-words can be implemented with Stable Diffusion, since both models share a common cross-attention module. So, I implemented it with Stable Diffusion.
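To make "adjusting the cross-attention scores" concrete, here is a minimal pure-Python sketch of the idea, not the code from this repository or from the paper. It assumes the adjustment is an additive bias on the pre-softmax score wherever a pixel lies inside the region painted for a token. The function names, the `region_mask` layout, and the fixed scalar `weight` are all hypothetical choices for illustration; see the paper for how the weight is actually set.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def painted_attention_probs(q, k, region_mask, weight=1.0):
    """Cross-attention probabilities with an additive per-region bias.

    q           -- list of pixel query vectors, shape (pixels, d)
    k           -- list of text-token key vectors, shape (tokens, d)
    region_mask -- region_mask[i][j] is 1 if pixel i lies in the region
                   painted for token j, else 0 (hypothetical layout)
    weight      -- strength of the bias (assumed constant here)
    """
    scale = 1.0 / math.sqrt(len(q[0]))
    probs = []
    for q_i, mask_row in zip(q, region_mask):
        # Scaled dot-product score, plus the bias for painted regions.
        scores = [
            scale * sum(a * b for a, b in zip(q_i, k_j)) + weight * m
            for k_j, m in zip(k, mask_row)
        ]
        probs.append(softmax(scores))
    return probs


# Toy example: 2 pixels, 3 tokens; pixel 0 is painted with token 1.
q = [[1.0, 0.0], [0.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
mask = [[0, 1, 0], [0, 0, 0]]

plain = painted_attention_probs(q, k, mask, weight=0.0)
painted = painted_attention_probs(q, k, mask, weight=2.0)
```

In this toy case, pixel 0 attends more strongly to token 1 once the bias is applied, while pixel 1, which is not painted, keeps the same attention distribution. In a real model the same bias would be applied inside each cross-attention layer, with the mask resized to that layer's spatial resolution.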