Abstract
We present FlashEmbedding, a hardware/software co-design solution for storing embedding tables on SSDs for large-scale recommendation inference under memory capacity-limited systems. FlashEmbedding leverages an embedding semantic-aware SSD, an embedding-oriented software cache, and pipeline techniques to improve the overall performance. We evaluate the performance of FlashEmbedding with our FPGA-based prototype SSD on a real-world public dataset. FlashEmbedding achieves up to 17.44× lower latency in embedding lookups and 2.89× lower end-to-end latency than baseline solution in a memory capacity-limted system.
| Original language | English |
|---|---|
| Title of host publication | APSys '21 |
| Subtitle of host publication | Proceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems |
| Publisher | Association for Computing Machinery |
| Pages | 9-16 |
| ISBN (Print) | 9781450386982 |
| DOIs | |
| Publication status | Published - 2021 |
| Event | 12th ACM SIGOPS Asia-Pacific Workshop on Systems, APSys 2021 - Virtual, Hong Kong, China Duration: 24 Aug 2021 → 25 Aug 2021 |
Publication series
| Name | APSys - Proceedings of the ACM SIGOPS Asia-Pacific Workshop on Systems |
|---|
Conference
| Conference | 12th ACM SIGOPS Asia-Pacific Workshop on Systems, APSys 2021 |
|---|---|
| Place | Hong Kong, China |
| Period | 24/08/21 → 25/08/21 |
Bibliographical note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).Research Keywords
- Embedding
- Recommender systems
- Solid-state drive (SSD)
Fingerprint
Dive into the research topics of 'FlashEmbedding: Storing embedding tables in SSD for large-scale recommender systems'. Together they form a unique fingerprint.Student theses
-
Accelerating Large-Scale Recommender Systems
WAN, H. (Author), XUE, C. J. (Supervisor), 26 Apr 2023Student thesis: Doctoral Thesis
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver