Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting

Wenting Chen, Pengyu Wang, Hui Ren, Lichao Sun, Quanzheng Li, Yixuan Yuan*, Xiang Li*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

2 Citations (Scopus)

Abstract

Data scarcity and privacy concerns limit the availability of high-quality medical images for public use, which can be mitigated through medical image synthesis. However, current medical image synthesis methods often struggle to accurately capture the complexity of detailed anatomical structures and pathological conditions. To address these challenges, we propose a novel medical image synthesis model that leverages fine-grained image-text alignment and anatomy-pathology prompts to generate highly detailed and accurate synthetic medical images. Our method integrates advanced natural language processing techniques with image generative modeling, enabling precise alignment between descriptive text prompts and the synthesized images' anatomical and pathological details. The proposed approach consists of two key components: an anatomy-pathology prompting module and a fine-grained alignment-based synthesis module. The anatomy-pathology prompting module automatically generates descriptive prompts for high-quality medical images. To further synthesize high-quality medical images from the generated prompts, the fine-grained alignment-based synthesis module pre-defines a visual codebook for the radiology dataset and performs fine-grained alignment between the codebook and generated prompts to obtain key patches as visual clues, facilitating accurate image synthesis. We validate the superiority of our method through experiments on public chest X-ray datasets and demonstrate that our synthetic images preserve accurate semantic information, making them valuable for various medical applications. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
Original languageEnglish
Title of host publicationMedical Image Computing and Computer Assisted Intervention – MICCAI 2024
Subtitle of host publication27th International Conference, Marrakesh, Morocco, October 6–10, 2024, Proceedings, Part XII
EditorsMarius George Linguraru, Qi Dou, Aasa Feragen, Stamatia Giannarou, Ben Glocker, Karim Lekadir, Julia A. Schnabel
Place of PublicationCham, Switzerland
PublisherSpringer 
Pages240-250
ISBN (Electronic)978-3-031-72390-2
ISBN (Print)978-3-031-72389-6
DOIs
Publication statusPublished - 2024
Event27th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024) - Palmeraie Conference Centre, Marrakesh, Morocco
Duration: 6 Oct 202410 Oct 2024
https://conferences.miccai.org/2024/en/

Publication series

NameLecture Notes in Computer Science
Volume15012
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference27th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024)
Abbreviated titleMICCAI2024
Country/TerritoryMorocco
CityMarrakesh
Period6/10/2410/10/24
Internet address

Bibliographical note

Research Unit(s) information for this publication is provided by the author(s) concerned.

Fingerprint

Dive into the research topics of 'Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting'. Together they form a unique fingerprint.

Cite this