h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform

Applied Artificial Intelligence Initiative, Deakin University, Australia
*Equal contribution
CVPR 2025 Β· Accepted
πŸ“ Exhibition Hall D β€” Poster #222
πŸ—“οΈ Poster Session #6 β€” Sunday, June 15th, 16:00–18:00
h-Edit teaser

πŸš€ Summary of h-Edit

A training-free image editing method that works seamlessly with any pre-trained diffusion model (e.g., Stable Diffusion, Flux), delivering high-quality edits without finetuning.

πŸ† Outperforms SOTA methods in both editing accuracy and faithfulness.

🧩 Supports diverse editing tasks such as text-guided editing, face swapping, and combined text-guided & style editing β€” being the first to enable all within a unified framework.

πŸ“ Built on a rigorous theoretical foundation using Doob's h-transform and Langevin Monte Carlo sampling, enabling controllable, high-fidelity editing.

πŸ” Challenges in Training-Free Image Editing

β€’ How can we achieve both editing accuracy and faithfulness to the original image?

β€’ How can we effectively combine multiple editing inputs (e.g., text, image, style) within a single framework?

β€’ Most methods lack a theoretical foundation for the editing process.

h-Edit is designed to solve all of these β€” efficiently and elegantly.


✨ Editing Results

πŸ“ Text-Guided Editing

A. πŸ“Š SOTA Results on PIE-Bench

Text-Guided Editing Result on PIE-Bench

B. βš–οΈ h-Edit-D vs. Baselines

h-Edit-D vs. Baselines

C. βš–οΈ h-Edit-R vs. Baselines

h-Edit-R vs. Baselines

πŸ” Ablation Studies

Through our experiments, we discovered several exciting insights for improving editing effectiveness:

  • πŸ”„ Implicit Form Advantage: Our implicit formulation allows multiple optimization steps, which significantly helps resolve hard editing casesβ€”even when more sampling steps fail.
  • πŸŽ›οΈ Role of \(\hat{w}^{\text{orig}}\): Increasing \( \hat{w}^{\text{orig}} \) towards \( w^{\text{edit}} \) leads to better faithfulness and more effective edits.
  • πŸ›‘οΈ Robustness to Weight Choices: Thanks to our theoretical guarantees, our method remains stable across different values of \( (w^{\text{edit}}, \hat{w}^{\text{orig}}) \).

πŸ”„ Effect of Implicit Multiple Optimization Steps (1 β†’ 3)

Effect of Multiple Optimization Steps

BibTeX

@article{nguyen2025hedit,
 title={h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform},
 author={Nguyen, Toan and Do, Kien and Kieu, Duc and Nguyen, Thin},
 journal={CVPR},
 year={2025}
}