Presentation 2023-05-18
Image Harmonization Using Diffusion Model for Perceptual Quality Improvement
Taito Naruki, Norimichi Ukita,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Image harmonization is the task of eliminating the discomfort of color tones that occurs when images are composited. However, conventional image harmonization datasets were not able to take into account the perceptual aspect of discomfort, and models were learned and evaluated only by the difference from the GT. For this reason, even if the generated image was not uncomfortable, if the difference from the GT was large, it was treated as an insufficiently trained image. Recently, the diffusion model has been attracting attention. The diffusion model generates images by gradually removing noise from a noisy image over multiple steps, and is known for its higher quality and diversity of generated images compared to other generative models due to this gradually restoring generative process and stable objective function. In addition, the method called DPS applies an inverse problem to the diffusion model to restore an image in zero shots such that it satisfies a given condition. In the proposed method, the image is generated by applying constraints based on image gradients and VGG features in DPS, which results in image harmonization with high quality while preserving foreground textures. Furthermore, by gradually increasing the constraints at each step of the reverse process, the restoration is performed with higher perceptual quality.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Image harmonization / Diffusion model / Inverse problem
Paper # PRMU2023-1
Date of Issue 2023-05-11 (PRMU)

Conference Information
Committee PRMU / IPSJ-CVIM
Conference Date 2023/5/18(2days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Seiichi Uchida(Kyushu Univ.)
Vice Chair Takuya Funatomi(NAIST) / Mitsuru Anpai(Denso IT Lab.)
Secretary Takuya Funatomi(CyberAgent) / Mitsuru Anpai(Univ. of Tokyo)
Assistant Nakamasa Inoue(Tokyo Inst. of Tech.) / Yasutomo Kawanishi(Riken)

Paper Information
Registration To Technical Committee on Pattern Recognition and Media Understanding / Special Interest Group on Computer Vision and Image Media
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Image Harmonization Using Diffusion Model for Perceptual Quality Improvement
Sub Title (in English)
Keyword(1) Image harmonization
Keyword(2) Diffusion model
Keyword(3) Inverse problem
1st Author's Name Taito Naruki
1st Author's Affiliation Toyota Technological Institute(TTI)
2nd Author's Name Norimichi Ukita
2nd Author's Affiliation Toyota Technological Institute(TTI)
Date 2023-05-18
Paper # PRMU2023-1
Volume (vol) vol.123
Number (no) PRMU-30
Page pp.pp.1-5(PRMU),
#Pages 5
Date of Issue 2023-05-11 (PRMU)