연구성과

국외저널
논문명 Repurposing existing deep networks for caption and aesthetic-guide image cropping
게재일 20220601
학술지명 Pattern Recognition
책임교수
논문종류 01 SCI
제1저자 Nora Horanyi
교신저자 Hyung Jin Chang
공동저자 Kwang Moo Yi, Abhishake Kumar Bojja, Alex Leonardis, Hyung Jin Chang
Impact Factor 8.51800
Keyword

 We propose a novel optimization framework that crops a given image based on user description and aesthetics. Unlike existing image cropping methods, where one typically trains a deep network to regress to crop parameters or cropping actions, we propose to directly optimize for the cropping parameters by repurposing pre-trained networks on image captioning and aesthetic tasks, without any fine-tuning, thereby avoiding training a separate network. Specifically, we search for the best crop parameters that minimize a combined loss of the initial objectives of these networks. To make the optimization stable, we propose three strategies: (i) multi-scale bilinear sampling, (ii) annealing the scale of the crop region, therefore effectively reducing the parameter space, (iii) aggregation of multiple optimization results. Through various quantitative and qualitative evaluations, we show that our framework can produce crops that are well-aligned to intended user descriptions and aesthetically pleasing.

04620 서울특별시 중구 필동로1길 30 동국대학교 Knowledge Science 연구센터(KSRC) Tel.02-2290-1441
Copyright© 2021 DONGGUK UNIVERSITY. ALL RIGHTS RESERVED.

×