Retrieval-Augmented Multiple Instance Learning
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Title of host publication | 37th Conference on Neural Information Processing Systems (NeurIPS 2023) |
Editors | A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, S. Levine |
Pages | 24859-24878 |
ISBN (electronic) | 9781713899921 |
Publication status | Published - Dec 2023 |
Publication series
Name | Advances in Neural Information Processing Systems |
---|---|
Volume | 36 |
ISSN (Print) | 1049-5258 |
Conference
Title | 37th Conference on Neural Information Processing Systems (NeurIPS 2023) |
---|---|
Location | New Orleans Ernest N. Morial Convention Center |
Place | United States |
City | New Orleans |
Period | 10 - 16 December 2023 |
Link(s)
Document Link | Links |
---|---|
Link to Scopus | https://www.scopus.com/record/display.uri?eid=2-s2.0-85191160670&origin=recordpage |
Permanent Link | https://scholars.cityu.edu.hk/en/publications/publication(0951004f-b47f-47fc-b9f6-c3d6dd1d45e4).html |
Abstract
Multiple Instance Learning (MIL) is a crucial weakly supervised learning method applied across various domains, e.g., medical diagnosis based on whole slide images (WSIs). Recent advancements in MIL algorithms have yielded exceptional performance when the training and test data originate from the same domain, such as WSIs obtained from the same hospital. However, this paper reveals a performance deterioration of MIL models when tested on an out-of-domain test set, exemplified by WSIs sourced from a novel hospital. To address this challenge, this paper introduces the Retrieval-AugMented MIL (RAM-MIL) framework, which integrates Optimal Transport (OT) as the distance metric for nearest neighbor retrieval. The development of RAM-MIL is driven by two key insights. First, a theoretical discovery indicates that reducing the input's intrinsic dimension can minimize the approximation error in attention-based MIL. Second, previous studies highlight a link between input intrinsic dimension and the feature merging process with the retrieved data. Empirical evaluations conducted on WSI classification demonstrate that the proposed RAM-MIL framework achieves state-of-the-art performance in both in-domain scenarios, where the training and retrieval data are in the same domain, and more crucially, in out-of-domain scenarios, where the (unlabeled) retrieval data originates from a different domain. Furthermore, the use of the transportation matrix derived from OT renders the retrieval results interpretable at the instance level, in contrast to the vanilla l2 distance, and allows for visualization for human experts. © 2023 Neural information processing systems foundation. All rights reserved.
Bibliographic Note
Research Unit(s) information for this publication is provided by the author(s) concerned.
Citation Format(s)
Retrieval-Augmented Multiple Instance Learning. / Cui, Yufei; Liu, Ziquan; Chen, Yixin et al.
37th Conference on Neural Information Processing Systems (NeurIPS 2023). ed. / A. Oh; T. Naumann; A. Globerson; K. Saenko; M. Hardt; S. Levine. 2023. p. 24859-24878 (Advances in Neural Information Processing Systems; Vol. 36).
37th Conference on Neural Information Processing Systems (NeurIPS 2023). ed. / A. Oh; T. Naumann; A. Globerson; K. Saenko; M. Hardt; S. Levine. 2023. p. 24859-24878 (Advances in Neural Information Processing Systems; Vol. 36).
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review