LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer - podcast episode cover

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

Feb 07, 2025•26 min•Ep. 494
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

🤗 Upvotes: 7 | cs.CV

Authors:
Yiren Song, Danze Chen, Mike Zheng Shou

Title:
LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

Arxiv:
http://arxiv.org/abs/2502.01105v1

Abstract:
Generating cognitive-aligned layered SVGs remains challenging due to existing methods' tendencies toward either oversimplified single-layer outputs or optimization-induced shape redundancies. We propose LayerTracer, a diffusion transformer based framework that bridges this gap by learning designers' layered SVG creation processes from a novel dataset of sequential design operations. Our approach operates in two phases: First, a text-conditioned DiT generates multi-phase rasterized construction blueprints that simulate human design workflows. Second, layer-wise vectorization with path deduplication produces clean, editable SVGs. For image vectorization, we introduce a conditional diffusion mechanism that encodes reference images into latent tokens, guiding hierarchical reconstruction while preserving structural integrity. Extensive experiments demonstrate LayerTracer's superior performance against optimization-based and neural baselines in both generation quality and editability, effectively aligning AI-generated vectors with professional design cognition.

For the best experience, listen in Metacast app for iOS or Android