Finding Meta Winning Ticket to Train Your MAML
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining |
Publisher | Association for Computing Machinery |
Pages | 411-420 |
ISBN (print) | 9781450393850 |
Publication status | Published - 2022 |
Externally published | Yes |
Publication series
Name | Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining |
---|
Conference
Title | 28th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2022) |
---|---|
Location | Washington DC Convention Center |
Place | United States |
City | Washington DC |
Period | 14 - 18 August 2022 |
Link(s)
Abstract
The lottery ticket hypothesis (LTH) states that a randomly initialized dense network contains sub-networks that can be trained in isolation to the performance of the dense network. In this paper, to achieve rapid learning with less computational cost, we explore LTH in the context of meta learning. First, we experimentally show that there are sparse sub-networks, known as meta winning tickets, which can be meta-trained to few-shot classification accuracy to the original backbone. The application of LTH in meta learning enables the adaptation of meta-trained networks on various IoT devices with fewer computation. However, the status quo to identify winning tickets requires iterative training and pruning, which is particularly expensive for finding meta winning tickets. To this end, then we investigate the inter- and intra-layer patterns among different meta winning tickets, and propose a scheme for early detection of a meta winning ticket. The proposed scheme enables efficient training in resource-limited devices. Besides, it also designs a lightweight solution to search the meta winning ticket. Evaluations on standard few-shot classification benchmarks show that we can find competitive meta winning tickets with 20% weights of the original backbone, while incurring only 8%-14% (Conv-4) and 19%-29% (ResNet-12) computation overhead (measured by FLOPs) of the standard winning ticket finding scheme. © 2022 ACM.
Research Area(s)
- lottery ticket hypothesis, meta learning, network pruning
Citation Format(s)
Finding Meta Winning Ticket to Train Your MAML. / Gao, Dawei; Xie, Yuexiang; Zhou, Zimu et al.
KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, 2022. p. 411-420 (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining).
KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, 2022. p. 411-420 (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining).
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review