Reinforcement Learning Enhanced PicHunter for Interactive Search

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

3 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationMultiMedia Modeling
Subtitle of host publication9th International Conference, MMM 2023, Proceedings, Part 1
EditorsDuc-Tien Dang-Nguyen, Cathal Gurrin, Martha Larson, Alan F. Smeaton, Stevan Rudinac, Minh-Son Dao, Christoph Trattner, Phoebe Chen
PublisherSpringer, Cham
Pages690-696
Edition1
ISBN (electronic)978-3-031-27077-2
ISBN (print)978-3-031-27076-5
Publication statusPublished - 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13833 LNCS
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Conference

Title29th International Conference on MultiMedia Modeling, MMM 2023
PlaceNorway
CityBergen
Period9 - 12 January 2023

Abstract

With the tremendous increase in video data size, search performance could be impacted significantly. Specifically, in an interactive system, a real-time system allows a user to browse, search and refine a query. Without a speedy system quickly, the main ingredient to engage a user to stay focused, an interactive system becomes less effective even with a sophisticated deep learning system. This paper addresses this challenge by leveraging approximate search, Bayesian inference, and reinforcement learning. For approximate search, we apply a hierarchical navigable small world, which is an efficient approximate nearest neighbor search algorithm. To quickly prune the search scope, we integrate PicHunter, one of the most popular engines in Video Browser Showdown, with reinforcement learning. The integration enhances PicHunter with the ability of systematic planning. Specifically, PicHunter performs a Bayesian update with a greedy strategy to select a small number of candidates for display. With reinforcement learning, the greedy strategy is replaced with a policy network that learns to select candidates that will result in the minimum number of user iterations, which is analytically defined by a reward function. With these improvements, the interactive system only searches a subset of video datasets relevant to a query while being able to quickly perform Bayesian updates with systematic planning to recommend the most probable candidates that can potentially lead to minimum iteration rounds. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Research Area(s)

  • Bayesian method, Interactive video retrieval, Reinforcement learning, Relevance feedback

Citation Format(s)

Reinforcement Learning Enhanced PicHunter for Interactive Search. / Ma, Zhixin; Wu, Jiaxin; Loo, Weixiong et al.
MultiMedia Modeling: 9th International Conference, MMM 2023, Proceedings, Part 1. ed. / Duc-Tien Dang-Nguyen; Cathal Gurrin; Martha Larson; Alan F. Smeaton; Stevan Rudinac; Minh-Son Dao; Christoph Trattner; Phoebe Chen. 1. ed. Springer, Cham, 2023. p. 690-696 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13833 LNCS).

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review