TY - JOUR
T1 - PottsMGNet
T2 - A Mathematical Explanation of Encoder-Decoder Based Neural Networks
AU - Tai, Xue-Cheng
AU - Liu, Hao
AU - Chan, Raymond
PY - 2024/3
Y1 - 2024/3
N2 - For problems in image processing and many other fields, a large class of effective neural networks has encoder-decoder-based architectures. Although these networks have shown impressive performance, mathematical explanations of their architectures are still underdeveloped. In this paper, we study the encoder-decoder-based network architecture from the algorithmic perspective and provide a mathematical explanation. We use the two-phase Potts model for image segmentation as an example for our explanations. We associate the segmentation problem with a control problem in the continuous setting. Then, the continuous control model is time discretized by an operatorsplitting scheme, the PottsMGNet, and space discretized by the multigrid method. We show that the resulting discrete PottsMGNet is equivalent to an encoder-decoder-based network. With minor modifications, it is shown that a number of the popular encoder-decoder-based neural networks are just instances of the proposed PottsMGNet. By incorporating the soft-threshold-dynamics into the PottsMGNet as a regularizer, the PottsMGNet has shown to be robust with the network parameters such as network width and depth and has achieved remarkable performance on datasets with very large noise. In nearly all our experiments, the new network always performs better than or as well as on accuracy and dice score compared to existing networks for image segmentation.
AB - For problems in image processing and many other fields, a large class of effective neural networks has encoder-decoder-based architectures. Although these networks have shown impressive performance, mathematical explanations of their architectures are still underdeveloped. In this paper, we study the encoder-decoder-based network architecture from the algorithmic perspective and provide a mathematical explanation. We use the two-phase Potts model for image segmentation as an example for our explanations. We associate the segmentation problem with a control problem in the continuous setting. Then, the continuous control model is time discretized by an operatorsplitting scheme, the PottsMGNet, and space discretized by the multigrid method. We show that the resulting discrete PottsMGNet is equivalent to an encoder-decoder-based network. With minor modifications, it is shown that a number of the popular encoder-decoder-based neural networks are just instances of the proposed PottsMGNet. By incorporating the soft-threshold-dynamics into the PottsMGNet as a regularizer, the PottsMGNet has shown to be robust with the network parameters such as network width and depth and has achieved remarkable performance on datasets with very large noise. In nearly all our experiments, the new network always performs better than or as well as on accuracy and dice score compared to existing networks for image segmentation.
KW - Potts model
KW - operator splitting
KW - deep neural network
KW - image segmentation
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:001183244000001
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85196393850&origin=recordpage
UR - http://www.scopus.com/inward/record.url?scp=85196393850&partnerID=8YFLogxK
U2 - 10.1137/23M1586355
DO - 10.1137/23M1586355
M3 - RGC 21 - Publication in refereed journal
SN - 1936-4954
VL - 17
SP - 540
EP - 594
JO - SIAM Journal on Imaging Sciences
JF - SIAM Journal on Imaging Sciences
IS - 1
ER -