In this paper, we study the content-aware layout generation problem, which aims to automatically generate layouts that are harmonious with a given background image. Existing methods usually deal with this task with a single-step reasoning framework. The lack of a feedback-based self-correction mechanism leads to their failure rates significantly increasing when faced with complex element layout planning. To address this challenge, we introduce SEGA, a novel Stepwise Evolution Paradigm for Content-Aware Layout Generation. Inspired by the systematic mode of human thinking, SEGA employs a hierarchical reasoning framework with a coarse-to-fine strategy: first, a coarse-level module roughly estimates the layout planning results; then, another refining module performs fine-level reasoning regarding the coarse planning results. Furthermore, we incorporate layout design principles as prior knowledge into the model to enhance its layout planning ability. Besides, we present GenPoster-100K that is a new large-scale poster dataset with rich meta-information annotation. The experiments demonstrate the effectiveness of our approach by achieving the state-of-the-art results on multiple benchmark datasets. Our project page is at: https://brucew91.github.io/SEGA.github.io/
翻译:本文研究了内容感知布局生成问题,其目标在于自动生成与给定背景图像相协调的布局。现有方法通常采用单步推理框架处理此任务。由于缺乏基于反馈的自校正机制,在面对复杂元素布局规划时,其失败率显著上升。为应对这一挑战,我们提出了SEGA——一种新颖的用于内容感知布局生成的逐步演化范式。受人类思维系统模式的启发,SEGA采用从粗到细的分层推理框架:首先,粗粒度模块粗略估计布局规划结果;随后,另一优化模块基于粗规划结果进行细粒度推理。此外,我们将布局设计原则作为先验知识融入模型,以增强其布局规划能力。同时,我们提出了GenPoster-100K——一个具有丰富元信息标注的新型大规模海报数据集。实验结果表明,我们的方法在多个基准数据集上取得了最先进的性能,验证了其有效性。项目页面位于:https://brucew91.github.io/SEGA.github.io/