sascha-kirch's Collections

Diffusion Models

 - Instruct-Imagen: Image Generation with Multi-modal Instruction
   Paper • 2401.01952 • 32
 - ODIN: A Single Model for 2D and 3D Perception
   Paper • 2401.02416 • 13
 - Bigger is not Always Better: Scaling Properties of Latent Diffusion Models
   Paper • 2404.01367 • 22
 - Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models
   Paper • 2404.02747 • 13
 - PointInfinity: Resolution-Invariant Point Diffusion Models
   Paper • 2404.03566 • 16
 - ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
   Paper • 2404.07987 • 48
 - Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
   Paper • 2404.09967 • 21
 - Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
   Paper • 2404.14507 • 23
 - Semantica: An Adaptable Image-Conditioned Diffusion Model
   Paper • 2405.14857 • 11
 - Improved Distribution Matching Distillation for Fast Image Synthesis
   Paper • 2405.14867 • 15
 - Paper • 2405.18407 • 48
 - Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
   Paper • 2403.03206 • 70
 - Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling
   Paper • 2405.21048 • 16
 - 4Diffusion: Multi-view Video Diffusion Model for 4D Generation
   Paper • 2405.20674 • 15
 - Learning Temporally Consistent Video Depth from Video Diffusion Priors
   Paper • 2406.01493 • 23
 - BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
   Paper • 2406.04333 • 38
 - Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
   Paper • 2406.02347 • 3
 - Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step
   Paper • 2406.04314 • 30
 - Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
   Paper • 2406.09416 • 29
 - Interpreting the Weight Space of Customized Diffusion Models
   Paper • 2406.09413 • 20
 - ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning
   Paper • 2406.14130 • 10
 - Paper • 2402.09470 • 14
 - ControlNeXt: Powerful and Efficient Control for Image and Video Generation
   Paper • 2408.06070 • 55
 - Paper • 2408.07009 • 62
 - Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution
   Paper • 2310.16834 • 5
 - Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
   Paper • 2408.11039 • 63
 - CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
   Paper • 2411.18613 • 58
 - SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance
   Paper • 2412.02687 • 113
 - Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models
   Paper • 2312.09608 • 16
 - Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
   Paper • 2504.02542 • 50