Abstract
We present Acc3D to tackle the challenge of accelerating the diffusion process to generate 3D models from single images. To derive high-quality reconstructions through few-step inferences, we emphasize the critical issue of regularizing the learning of score function in states of random noise. To this end, we propose edge consistency, i.e., consistent predictions across the high signal-to-noise ratio region, to enhance a pre-trained diffusion model, enabling a distillation-based refinement of the endpoint score function. Building on those distilled diffusion models, we propose an adversarial augmentation strategy to further enrich the generation detail and boost overall generation quality. The two modules complement each other, mutually reinforcing to elevate generative performance. Extensive experiments demonstrate that our Acc3D not only achieves over a $20\times$ increase in computational efficiency but also yields notable quality improvements, compared to the state-of-the-arts.
Original language | English |
---|---|
Title of host publication | The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025 |
Number of pages | 10 |
Publication status | Published - 11 Jun 2025 |
Event | 2025 IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR 2025) - Music City Center, Nashville, United States Duration: 11 Jun 2025 → 15 Jun 2025 https://cvpr.thecvf.com/Conferences/2025 |
Conference
Conference | 2025 IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR 2025) |
---|---|
Country/Territory | United States |
City | Nashville |
Period | 11/06/25 → 15/06/25 |
Internet address |
Funding
This work was supported in part by the NSFC Excellent Young Scientists Fund 62422118, in part by the Hong Kong RGC under Grants 11219324 and 11219422, and in part by the Hong Kong UGC under Grants UGC/FDS11/E02/22 and UGC/FDS11/E03/24.