Img-Diffusion
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation
Paper
• 2312.12491
• Published
• 75
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
Paper
• 2401.11708
• Published
• 30
Training-Free Consistent Text-to-Image Generation
Paper
• 2402.03286
• Published
• 67
PALP: Prompt Aligned Personalization of Text-to-Image Models
Paper
• 2401.06105
• Published
• 50
ImagenHub: Standardizing the evaluation of conditional image generation models
Paper
• 2310.01596
• Published
• 19
Instruct-Imagen: Image Generation with Multi-modal Instruction
Paper
• 2401.01952
• Published
• 32
Scalable Diffusion Models with Transformers
Paper
• 2212.09748
• Published
• 18
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers
Paper
• 2401.11605
• Published
• 23
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Paper
• 2402.10210
• Published
• 35
Paper
• 2402.13144
• Published
• 100
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition
Paper
• 2402.15504
• Published
• 21
DiffusionGPT: LLM-Driven Text-to-Image Generation System
Paper
• 2401.10061
• Published
• 32
LightIt: Illumination Modeling and Control for Diffusion Models
Paper
• 2403.10615
• Published
• 18
StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control
Paper
• 2403.09055
• Published
• 26
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation
Paper
• 2403.16990
• Published
• 25
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Paper
• 2404.03653
• Published
• 35
Bigger is not Always Better: Scaling Properties of Latent Diffusion Models
Paper
• 2404.01367
• Published
• 22
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
Paper
• 2405.01434
• Published
• 56
Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Paper
• 2404.01197
• Published
• 31
Dynamic Typography: Bringing Words to Life
Paper
• 2404.11614
• Published
• 46
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation
Paper
• 2404.02733
• Published
• 22
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper
• 2404.19427
• Published
• 74
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Paper
• 2312.11392
• Published
• 20
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Paper
• 2404.02905
• Published
• 74
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Paper
• 2403.03206
• Published
• 71
Scalable Pre-training of Large Autoregressive Image Models
Paper
• 2401.08541
• Published
• 38
Stable Flow: Vital Layers for Training-Free Image Editing
Paper
• 2411.14430
• Published
• 22