Retrieval-Augmented Multiple Instance Learning

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

1 Scopus Citations
View graph of relations

Author(s)

  • Yufei Cui
  • Ziquan Liu
  • Yuchen Lu
  • Xinyue Yu
  • Xue Liu
  • Tei-Wei Kuo
  • Miguel R.D. Rodrigues

Detail(s)

Original languageEnglish
Title of host publication37th Conference on Neural Information Processing Systems (NeurIPS 2023)
EditorsA. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, S. Levine
Pages24859-24878
ISBN (electronic)9781713899921
Publication statusPublished - Dec 2023

Publication series

NameAdvances in Neural Information Processing Systems
Volume36
ISSN (Print)1049-5258

Conference

Title37th Conference on Neural Information Processing Systems (NeurIPS 2023)
LocationNew Orleans Ernest N. Morial Convention Center
PlaceUnited States
CityNew Orleans
Period10 - 16 December 2023

Abstract

Multiple Instance Learning (MIL) is a crucial weakly supervised learning method applied across various domains, e.g., medical diagnosis based on whole slide images (WSIs). Recent advancements in MIL algorithms have yielded exceptional performance when the training and test data originate from the same domain, such as WSIs obtained from the same hospital. However, this paper reveals a performance deterioration of MIL models when tested on an out-of-domain test set, exemplified by WSIs sourced from a novel hospital. To address this challenge, this paper introduces the Retrieval-AugMented MIL (RAM-MIL) framework, which integrates Optimal Transport (OT) as the distance metric for nearest neighbor retrieval. The development of RAM-MIL is driven by two key insights. First, a theoretical discovery indicates that reducing the input's intrinsic dimension can minimize the approximation error in attention-based MIL. Second, previous studies highlight a link between input intrinsic dimension and the feature merging process with the retrieved data. Empirical evaluations conducted on WSI classification demonstrate that the proposed RAM-MIL framework achieves state-of-the-art performance in both in-domain scenarios, where the training and retrieval data are in the same domain, and more crucially, in out-of-domain scenarios, where the (unlabeled) retrieval data originates from a different domain. Furthermore, the use of the transportation matrix derived from OT renders the retrieval results interpretable at the instance level, in contrast to the vanilla l2 distance, and allows for visualization for human experts. © 2023 Neural information processing systems foundation. All rights reserved.

Bibliographic Note

Research Unit(s) information for this publication is provided by the author(s) concerned.

Citation Format(s)

Retrieval-Augmented Multiple Instance Learning. / Cui, Yufei; Liu, Ziquan; Chen, Yixin et al.
37th Conference on Neural Information Processing Systems (NeurIPS 2023). ed. / A. Oh; T. Naumann; A. Globerson; K. Saenko; M. Hardt; S. Levine. 2023. p. 24859-24878 (Advances in Neural Information Processing Systems; Vol. 36).

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review