MAGE: Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model

Haoyuan Wang (Co-first Author), Zhenwei Wang (Co-first Author), Xiaoxiao Long, Cheng Lin, Gerhard Hancke, Rynson W. H. Lau*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

With advances in deep learning models and the availability of large-scale 3D datasets, we have recently witnessed significant progress in single-view 3D reconstruction. However, existing methods often fail to reconstruct physically based material properties given a single image, limiting their applicability in complicated scenarios. This paper presents a novel approach (named MAGE) for generating 3D geometry with realistic decomposed material properties given a single image as input. Our method leverages inspiration from traditional computer graphics deferred rendering pipelines to introduce a multi-view G-buffer estimation model. The proposed model estimates G-buffers for various views as multi-domain images, including XYZ coordinates, normals, albedo, roughness, and metallic properties from the single-view RGB. Furthermore, to address the inherent ambiguity and inconsistency in generating G-buffers simultaneously, we formulate a deterministic network from the pretrained diffusion models and propose a lighting response loss that enforces consistency across these domains using PBR principles. Finally, we propose a large-scale synthetic dataset rich in material diversity for our model training. Experimental results demonstrate the effectiveness of our method in producing high-quality 3D meshes with rich material properties. Our code and dataset can be found at https://www.whyy.site/paper/mage. ©2025 IEEE
Original languageEnglish
Title of host publication2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
PublisherIEEE
Pages10985-10995
ISBN (Electronic)979-8-3315-4365-5
ISBN (Print)979-8-3315-4364-8
DOIs
Publication statusPublished - 13 Aug 2025
Event2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025) - Music City Center, Nashville, United States
Duration: 11 Jun 202515 Jun 2025
https://cvpr.thecvf.com/Conferences/2025
https://cvpr.thecvf.com/

Conference

Conference2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025)
Abbreviated titleCVPR2025
PlaceUnited States
CityNashville
Period11/06/2515/06/25
Internet address

Bibliographical note

Research Unit(s) information for this publication is provided by the author(s) concerned.

Fingerprint

Dive into the research topics of 'MAGE: Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model'. Together they form a unique fingerprint.

Cite this