Abstract
Retrieving recipes corresponding to given dish pictures facilitates the estimation of nutrition facts, which is crucial to various health relevant applications. The current approaches mostly focus on recognition of food category based on global dish appearance without explicit analysis of ingredient composition. Such approaches are incapable for retrieval of recipes with unknown food categories, a problem referred to as zero-shot retrieval. On the other hand, content-based retrieval without knowledge of food categories is also difficult to attain satisfactory performance due to large visual variations in food appearance and ingredient composition. As the number of ingredients is far less than food categories, understanding ingredients underlying dishes in principle is more scalable than recognizing every food category and thus is suitable for zero-shot retrieval. Nevertheless, ingredient recognition is a task far harder than food categorization, and this seriously challenges the feasibility of relying on them for retrieval. This paper proposes deep architectures for simultaneous learning of ingredient recognition and food categorization, by exploiting the mutual but also fuzzy relationship between them. The learnt deep features and semantic labels of ingredients are then innovatively applied for zero-shot retrieval of recipes. By experimenting on a large Chinese food dataset with images of highly complex dish appearance, this paper demonstrates the feasibility of ingredient recognition and sheds light on this zero-shot problem peculiar to cooking recipe retrieval.
| Original language | English |
|---|---|
| Title of host publication | MM’16 |
| Subtitle of host publication | Proceedings of the 2016 ACM Multimedia Conference |
| Place of Publication | New York |
| Publisher | Association for Computing Machinery |
| Pages | 32-41 |
| ISBN (Print) | 9781450336031 |
| DOIs | |
| Publication status | Published - Oct 2016 |
| Event | 24th ACM International Conference on Multimedia, MM '16 - Amsterdam, Netherlands Duration: 15 Oct 2016 → 19 Oct 2016 |
Conference
| Conference | 24th ACM International Conference on Multimedia, MM '16 |
|---|---|
| Place | Netherlands |
| City | Amsterdam |
| Period | 15/10/16 → 19/10/16 |
Bibliographical note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).Research Keywords
- Food categorization
- Ingredient recognition
- Multitask deep learning
- Zero-shot retrieval
Fingerprint
Dive into the research topics of 'Deep-based Ingredient Recognition for Cooking Recipe Retrieval'. Together they form a unique fingerprint.Prizes
-
Best Student Paper, ACM Multimedia 2016
CHEN, J. (Recipient) & NGO, C. W. (Recipient), 2016
Prize: RGC 64B - Prizes and awards