LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
(2024)
Presentation / Conference Contribution
Palaev, A., Khan, A., & Kazmi, A. (2024, November). LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps. Paper presented at The 35th British Machine Vision Conference, Glasgow
The advancement of text-to-image synthesis has introduced powerful generative models capable of creating realistic images from textual prompts. However, precise control over image attributes remains challenging, especially at the instance level. Whil...