TY - JOUR
T1 - Vision Intelligence Assisted Lung Function Estimation Based on Transformer Encoder-Decoder Network with Invertible Modeling
AU - Chen, Liuyin
AU - Lu, Di
AU - Zhai, Jianxue
AU - Cai, Kaican
AU - Wang, Long
AU - Zhang, Zijun
PY - 2024/7
Y1 - 2024/7
N2 - Lung function evaluation is important to many medical applications, but conducting pulmonary function tests is constrained by different conditions. This paper presents a pioneer study of an integrated invertible deep learning method for lung function estimation via using computed tomography (CT) images. First, the projection method is proposed to flatten the 3D image onto a 2D plane, with preserving location information in 3D. Next, the MBConv Transformer-based encoder-decoder structure is developed to extract latent features. Finally, we develop an invertible Normalizing Flow model to infer lung function based on the extracted features and design two loss functions for two directions. The method enables both estimating the lung function based on CT images and metadata as well as generating the corresponding simulated CT image according to the lung function. Computational studies show that the proposed regression model outperforms all state-of-the-art image regression models. A comprehensive comparative analysis also demonstrates the effectiveness of using generated images and confirms the superiority of the proposed method. To the best of our knowledge, this work is the first of its kind in combining encoder-decoder network with Normalizing Flows to ensure the effectiveness of the fully invertible framework, especially in lung CT image analysis. © 2024 IEEE.
AB - Lung function evaluation is important to many medical applications, but conducting pulmonary function tests is constrained by different conditions. This paper presents a pioneer study of an integrated invertible deep learning method for lung function estimation via using computed tomography (CT) images. First, the projection method is proposed to flatten the 3D image onto a 2D plane, with preserving location information in 3D. Next, the MBConv Transformer-based encoder-decoder structure is developed to extract latent features. Finally, we develop an invertible Normalizing Flow model to infer lung function based on the extracted features and design two loss functions for two directions. The method enables both estimating the lung function based on CT images and metadata as well as generating the corresponding simulated CT image according to the lung function. Computational studies show that the proposed regression model outperforms all state-of-the-art image regression models. A comprehensive comparative analysis also demonstrates the effectiveness of using generated images and confirms the superiority of the proposed method. To the best of our knowledge, this work is the first of its kind in combining encoder-decoder network with Normalizing Flows to ensure the effectiveness of the fully invertible framework, especially in lung CT image analysis. © 2024 IEEE.
KW - Generative Model
KW - Image Regression
KW - Lung function
KW - Normalizing flows
KW - ViT
UR - http://www.scopus.com/inward/record.url?scp=85181568372&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85181568372&origin=recordpage
U2 - 10.1109/TAI.2023.3348428
DO - 10.1109/TAI.2023.3348428
M3 - RGC 21 - Publication in refereed journal
SN - 2691-4581
VL - 5
SP - 3336
EP - 3349
JO - IEEE Transactions on Artificial Intelligence
JF - IEEE Transactions on Artificial Intelligence
IS - 7
ER -