Skip to main navigation Skip to search Skip to main content

Animus3D: Text-driven 3D Animation via Motion Score Distillation

Qi SUN, Can WANG, Jiaxiang SHANG, Wensen FENG, Jing LIAO*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

We present Animus3D , a text-driven 3D animation framework that generates motion field given a static 3D asset and text prompt. Previous methods mostly leverage the vanilla Score Distillation Sampling (SDS) objective to distill motion from pretrained text-to-video diffusion, leading to animations with minimal movement or noticeable jitter. To address this, our approach introduces a novel SDS alternative, Motion Score Distillation (MSD). Specifically, we introduce a LoRA-enhanced video diffusion model that defines a static source distribution rather than pure noise as in SDS, while another inversion-based noise estimation technique ensures appearance preservation when guiding motion. To further improve motion fidelity, we incorporate explicit temporal and spatial regularization terms that mitigate geometric distortions across time and space. Additionally, we propose a motion refinement module to upscale the temporal resolution and enhance fine-grained details, overcoming the fixed-resolution constraints of the underlying video model. Extensive experiments demonstrate that Animus3D successfully animates static 3D assets from diverse text prompts, generating significantly more substantial and detailed motion than state-of-the-art baselines while maintaining high visual integrity. Code will be released upon acceptance.
© 2025 Copyright held by the owner/author(s). Publication rights licensed to ACM.
Original languageEnglish
Title of host publicationProceedings of the SIGGRAPH Asia 2025 Conference Papers
EditorsTaku Komura, Michael Wimmer, Hongbo Fu
PublisherAssociation for Computing Machinery
Number of pages11
ISBN (Print)9798400721373
DOIs
Publication statusPublished - 14 Dec 2025
Event18th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH ASIA 2025) - Hong Kong Convention and Exhibition Centre (HKCEC), Hong Kong, China
Duration: 15 Dec 202518 Dec 2025
https://asia.siggraph.org/2025/

Conference

Conference18th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH ASIA 2025)
Abbreviated titleSA '25
PlaceHong Kong, China
Period15/12/2518/12/25
Internet address

Bibliographical note

Research Unit(s) information for this publication is provided by the author(s) concerned.

Funding

We thank all anonymous reviewers and area chairs for their valuable comments. This work was supported by Central Media Technology Institute, Huawei [Project No. TC20240927042] and a GRF grant from the Research Grants Council (RGC) of the Hong Kong Special Administrative Region, China [Project No. CityU 11208123].

Research Keywords

  • 3D Animation
  • Score Distillation Sampling
  • Video Diffusion Model

Fingerprint

Dive into the research topics of 'Animus3D: Text-driven 3D Animation via Motion Score Distillation'. Together they form a unique fingerprint.

Cite this