TY - JOUR
T1 - Semi-Supervised Image Classification with Self-Paced Cross-Task Networks
AU - Wu, Si
AU - Ji, Qiujia
AU - Wang, Shufeng
AU - Wong, Hau-San
AU - Yu, Zhiwen
AU - Xu, Yong
PY - 2018/4
Y1 - 2018/4
N2 - In a semi-supervised setting, direct training of a deep discriminative model on partially labeled images often suffers from overfitting and poor performance, because only a small number of labeled images are available, and errors in label propagation are, in many cases, inevitable. In this paper, we introduce an auxiliary clustering task to explore the structure of the image data, and judiciously weigh unlabeled data to alleviate the influence of ambiguous data on model training. For this purpose, we propose a cross-task network composed of two streams to jointly learn two tasks: classification and clustering. Based on the model predictions, a large number of pairwise constraints can be generated from unlabeled images, and are fed to the clustering stream. Since pairwise constraints encode weak supervision information, the clustering is tolerant of errors in labeling. Unlabeled images are weighted according to the distances to the clusters discovered, and a better discriminative model is trained on the classification stream associated with a weighted softmax loss. Furthermore, a self-paced learning paradigm is adopted to gradually train our deep model from easy examples to difficult ones. Experimental results on widely used image classification datasets confirm the effectiveness and superiority of the proposed approach.
AB - In a semi-supervised setting, direct training of a deep discriminative model on partially labeled images often suffers from overfitting and poor performance, because only a small number of labeled images are available, and errors in label propagation are, in many cases, inevitable. In this paper, we introduce an auxiliary clustering task to explore the structure of the image data, and judiciously weigh unlabeled data to alleviate the influence of ambiguous data on model training. For this purpose, we propose a cross-task network composed of two streams to jointly learn two tasks: classification and clustering. Based on the model predictions, a large number of pairwise constraints can be generated from unlabeled images, and are fed to the clustering stream. Since pairwise constraints encode weak supervision information, the clustering is tolerant of errors in labeling. Unlabeled images are weighted according to the distances to the clusters discovered, and a better discriminative model is trained on the classification stream associated with a weighted softmax loss. Furthermore, a self-paced learning paradigm is adopted to gradually train our deep model from easy examples to difficult ones. Experimental results on widely used image classification datasets confirm the effectiveness and superiority of the proposed approach.
KW - cross-task network
KW - Image classification
KW - self-paced paradigm
KW - semi-supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85030784889&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85030784889&origin=recordpage
U2 - 10.1109/TMM.2017.2758522
DO - 10.1109/TMM.2017.2758522
M3 - RGC 21 - Publication in refereed journal
SN - 1520-9210
VL - 20
SP - 851
EP - 865
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
IS - 4
ER -