Reinforcement Learning Enhanced PicHunter for Interactive Search
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | MultiMedia Modeling |
Subtitle of host publication | 9th International Conference, MMM 2023, Proceedings, Part 1 |
Editors | Duc-Tien Dang-Nguyen, Cathal Gurrin, Martha Larson, Alan F. Smeaton, Stevan Rudinac, Minh-Son Dao, Christoph Trattner, Phoebe Chen |
Publisher | Springer, Cham |
Pages | 690-696 |
Edition | 1 |
ISBN (electronic) | 978-3-031-27077-2 |
ISBN (print) | 978-3-031-27076-5 |
Publication status | Published - 2023 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 13833 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (electronic) | 1611-3349 |
Conference
Title | 29th International Conference on MultiMedia Modeling, MMM 2023 |
---|---|
Place | Norway |
City | Bergen |
Period | 9 - 12 January 2023 |
Link(s)
Abstract
With the tremendous increase in video data size, search performance could be impacted significantly. Specifically, in an interactive system, a real-time system allows a user to browse, search and refine a query. Without a speedy system quickly, the main ingredient to engage a user to stay focused, an interactive system becomes less effective even with a sophisticated deep learning system. This paper addresses this challenge by leveraging approximate search, Bayesian inference, and reinforcement learning. For approximate search, we apply a hierarchical navigable small world, which is an efficient approximate nearest neighbor search algorithm. To quickly prune the search scope, we integrate PicHunter, one of the most popular engines in Video Browser Showdown, with reinforcement learning. The integration enhances PicHunter with the ability of systematic planning. Specifically, PicHunter performs a Bayesian update with a greedy strategy to select a small number of candidates for display. With reinforcement learning, the greedy strategy is replaced with a policy network that learns to select candidates that will result in the minimum number of user iterations, which is analytically defined by a reward function. With these improvements, the interactive system only searches a subset of video datasets relevant to a query while being able to quickly perform Bayesian updates with systematic planning to recommend the most probable candidates that can potentially lead to minimum iteration rounds. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Research Area(s)
- Bayesian method, Interactive video retrieval, Reinforcement learning, Relevance feedback
Citation Format(s)
Reinforcement Learning Enhanced PicHunter for Interactive Search. / Ma, Zhixin; Wu, Jiaxin; Loo, Weixiong et al.
MultiMedia Modeling: 9th International Conference, MMM 2023, Proceedings, Part 1. ed. / Duc-Tien Dang-Nguyen; Cathal Gurrin; Martha Larson; Alan F. Smeaton; Stevan Rudinac; Minh-Son Dao; Christoph Trattner; Phoebe Chen. 1. ed. Springer, Cham, 2023. p. 690-696 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13833 LNCS).
MultiMedia Modeling: 9th International Conference, MMM 2023, Proceedings, Part 1. ed. / Duc-Tien Dang-Nguyen; Cathal Gurrin; Martha Larson; Alan F. Smeaton; Stevan Rudinac; Minh-Son Dao; Christoph Trattner; Phoebe Chen. 1. ed. Springer, Cham, 2023. p. 690-696 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13833 LNCS).
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review