RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

Daily Paper Cast

May 30, 2025•23 min•Ep. 827

--:--

Listen in podcast apps:

Apple Podcasts

Spotify

Download

Listen to this episode in Metacast mobile app

Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

🤗 Upvotes: 26 | cs.GR, cs.CV, cs.LG

Authors:
Chong Zeng, Yue Dong, Pieter Peers, Hongzhi Wu, Xin Tong

Title:
RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

Arxiv:
http://arxiv.org/abs/2505.21925v1

Abstract:
We present RenderFormer, a neural rendering pipeline that directly renders an image from a triangle-based representation of a scene with full global illumination effects and that does not require per-scene training or fine-tuning. Instead of taking a physics-centric approach to rendering, we formulate rendering as a sequence-to-sequence transformation where a sequence of tokens representing triangles with reflectance properties is converted to a sequence of output tokens representing small patches of pixels. RenderFormer follows a two stage pipeline: a view-independent stage that models triangle-to-triangle light transport, and a view-dependent stage that transforms a token representing a bundle of rays to the corresponding pixel values guided by the triangle-sequence from the view-independent stage. Both stages are based on the transformer architecture and are learned with minimal prior constraints. We demonstrate and evaluate RenderFormer on scenes with varying complexity in shape and light transport.

For the best experience, listen in Metacast app for iOS or Android