FlashEmbedding : Storing embedding tables in SSD for large-scale recommender systems
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | APSys '21 |
Subtitle of host publication | Proceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems |
Publisher | Association for Computing Machinery |
Pages | 9-16 |
ISBN (print) | 9781450386982 |
Publication status | Published - 2021 |
Publication series
Name | APSys - Proceedings of the ACM SIGOPS Asia-Pacific Workshop on Systems |
---|
Conference
Title | 12th ACM SIGOPS Asia-Pacific Workshop on Systems, APSys 2021 |
---|---|
Location | Virtual |
Place | Hong Kong |
Period | 24 - 25 August 2021 |
Link(s)
Abstract
We present FlashEmbedding, a hardware/software co-design solution for storing embedding tables on SSDs for large-scale recommendation inference under memory capacity-limited systems. FlashEmbedding leverages an embedding semantic-aware SSD, an embedding-oriented software cache, and pipeline techniques to improve the overall performance. We evaluate the performance of FlashEmbedding with our FPGA-based prototype SSD on a real-world public dataset. FlashEmbedding achieves up to 17.44× lower latency in embedding lookups and 2.89× lower end-to-end latency than baseline solution in a memory capacity-limted system.
Research Area(s)
- Embedding, Recommender systems, Solid-state drive (SSD)
Bibliographic Note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).
Citation Format(s)
FlashEmbedding: Storing embedding tables in SSD for large-scale recommender systems. / Wan, Hu; Sun, Xuan; Cui, Yufei et al.
APSys '21: Proceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems. Association for Computing Machinery, 2021. p. 9-16 (APSys - Proceedings of the ACM SIGOPS Asia-Pacific Workshop on Systems).
APSys '21: Proceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems. Association for Computing Machinery, 2021. p. 9-16 (APSys - Proceedings of the ACM SIGOPS Asia-Pacific Workshop on Systems).
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review