A Multi-Modal Transformer-based Code Summarization Approach for Smart Contracts

Zhen Yang, Jacky Keung, Xiao Yu*, Xiaodong Gu, Zhengyuan Wei, Xiaoxue Ma, Miao Zhang

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

62 Citations (Scopus)

Abstract

Code comment has been an important part of computer programs, greatly facilitating the understanding and maintenance of source code. However, high-quality code comments are often unavailable in smart contracts, the increasingly popular programs that run on the blockchain. In this paper, we propose a Multi-Modal Transformer-based (MMTrans) code summarization approach for smart contracts. Specifically, the MMTrans learns the representation of source code from the two heterogeneous modalities of the Abstract Syntax Tree (AST), i.e., Structure-based Traversal (SBT) sequences and graphs. The SBT sequence provides the global semantic information of AST, while the graph convolution focuses on the local details. The MMTrans uses two encoders to extract both global and local semantic information from the two modalities respectively, and then uses a joint decoder to generate code comments. Both the encoders and the decoder employ the multi-head attention structure of the Transformer to enhance the ability to capture the long-range dependencies between code tokens. We build a dataset with over 300K pairs of smart contracts, and evaluate the MMTrans on it. The experimental results demonstrate that the MMTrans outperforms the state-of-the-art baselines in terms of four evaluation metrics by a substantial margin, and can generate higher quality comments.
Original languageEnglish
Title of host publicationProceedings - 2021 IEEE/ACM 29th International Conference on Program Comprehension
Subtitle of host publicationICPC 2021
PublisherIEEE
Pages1-12
Number of pages12
ISBN (Electronic)9781665414036
ISBN (Print)9781665414043
DOIs
Publication statusPublished - 2021
Event29th IEEE/ACM International Conference on Program Comprehension (ICPC 2021) - Virtual
Duration: 18 May 202122 May 2021
https://conf.researchr.org/home/icpc-2021

Publication series

NameIEEE International Conference on Program Comprehension
ISSN (Print)2643-7147
ISSN (Electronic)2643-7171

Conference

Conference29th IEEE/ACM International Conference on Program Comprehension (ICPC 2021)
Abbreviated titleICPC
Period18/05/2122/05/21
Internet address

Research Keywords

  • Smart Contracts
  • Code Summarization
  • Transformer
  • Graph Convolution
  • Structure-based Traversal

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'A Multi-Modal Transformer-based Code Summarization Approach for Smart Contracts'. Together they form a unique fingerprint.

Cite this