Skip to main navigation Skip to search Skip to main content

FlashEmbedding: Storing embedding tables in SSD for large-scale recommender systems

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

We present FlashEmbedding, a hardware/software co-design solution for storing embedding tables on SSDs for large-scale recommendation inference under memory capacity-limited systems. FlashEmbedding leverages an embedding semantic-aware SSD, an embedding-oriented software cache, and pipeline techniques to improve the overall performance. We evaluate the performance of FlashEmbedding with our FPGA-based prototype SSD on a real-world public dataset. FlashEmbedding achieves up to 17.44× lower latency in embedding lookups and 2.89× lower end-to-end latency than baseline solution in a memory capacity-limted system.
Original languageEnglish
Title of host publicationAPSys '21
Subtitle of host publicationProceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems
PublisherAssociation for Computing Machinery
Pages9-16
ISBN (Print)9781450386982
DOIs
Publication statusPublished - 2021
Event12th ACM SIGOPS Asia-Pacific Workshop on Systems, APSys 2021 - Virtual, Hong Kong, China
Duration: 24 Aug 202125 Aug 2021

Publication series

NameAPSys - Proceedings of the ACM SIGOPS Asia-Pacific Workshop on Systems

Conference

Conference12th ACM SIGOPS Asia-Pacific Workshop on Systems, APSys 2021
PlaceHong Kong, China
Period24/08/2125/08/21

Bibliographical note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Research Keywords

  • Embedding
  • Recommender systems
  • Solid-state drive (SSD)

Fingerprint

Dive into the research topics of 'FlashEmbedding: Storing embedding tables in SSD for large-scale recommender systems'. Together they form a unique fingerprint.

Cite this