FlashEmbedding : Storing embedding tables in SSD for large-scale recommender systems

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

9 Scopus Citations
View graph of relations

Detail(s)

Original languageEnglish
Title of host publicationAPSys '21
Subtitle of host publicationProceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems
PublisherAssociation for Computing Machinery
Pages9-16
ISBN (Print)9781450386982
Publication statusPublished - 2021

Publication series

NameAPSys - Proceedings of the ACM SIGOPS Asia-Pacific Workshop on Systems

Conference

Title12th ACM SIGOPS Asia-Pacific Workshop on Systems, APSys 2021
LocationVirtual
PlaceHong Kong
Period24 - 25 August 2021

Abstract

We present FlashEmbedding, a hardware/software co-design solution for storing embedding tables on SSDs for large-scale recommendation inference under memory capacity-limited systems. FlashEmbedding leverages an embedding semantic-aware SSD, an embedding-oriented software cache, and pipeline techniques to improve the overall performance. We evaluate the performance of FlashEmbedding with our FPGA-based prototype SSD on a real-world public dataset. FlashEmbedding achieves up to 17.44× lower latency in embedding lookups and 2.89× lower end-to-end latency than baseline solution in a memory capacity-limted system.

Research Area(s)

  • Embedding, Recommender systems, Solid-state drive (SSD)

Bibliographic Note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Citation Format(s)

FlashEmbedding: Storing embedding tables in SSD for large-scale recommender systems. / Wan, Hu; Sun, Xuan; Cui, Yufei et al.
APSys '21: Proceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems. Association for Computing Machinery, 2021. p. 9-16 (APSys - Proceedings of the ACM SIGOPS Asia-Pacific Workshop on Systems).

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review