TY - GEN
T1 - Automatic image dataset construction from click-Through logs using deep neural network
AU - Bai, Yalong
AU - Yang, Kuiyuan
AU - Yu, Wei
AU - Xu, Chang
AU - Ma, Wei-Ying
AU - Zhao, Tiejun
N1 - Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].
PY - 2015/10/13
Y1 - 2015/10/13
N2 - Labelled image datasets are the backbone for high-level im-age understanding tasks with wide application scenarios, and continuously drive and evaluate the progress of fea-ture designing and supervised learning models. Recently, the million scale labelled image dataset further contributes to the rebirth of deep convolutional neural network and by-pass manual designing handcraft features. However, the con-struction process of image dataset is mainly manual-based and quite labor intensive, which often take years' efforts to construct a million scale dataset with high quality. In this paper, we propose a deep learning based method to construc-t large scale image dataset in an automatic way. Specifically, word representation and image representation are learned in a deep neural network from large amount of click-Through logs, and further used to define word-word similarity and image-word similarity. These two similarities are used to automatize the two labor intensive steps in manual-based image dataset construction: query formation and noisy im-age removal. With a new proposed cross convolutional filter regularizer, we can construct a million scale image dataset in one week. Finally, two image datasets are constructed to verify the effectiveness of the method. In addition to scale, the automatically constructed dataset has compara-ble accuracy, diversity and cross-dataset generalization with manually labelled image datasets. © 2015 ACM.
AB - Labelled image datasets are the backbone for high-level im-age understanding tasks with wide application scenarios, and continuously drive and evaluate the progress of fea-ture designing and supervised learning models. Recently, the million scale labelled image dataset further contributes to the rebirth of deep convolutional neural network and by-pass manual designing handcraft features. However, the con-struction process of image dataset is mainly manual-based and quite labor intensive, which often take years' efforts to construct a million scale dataset with high quality. In this paper, we propose a deep learning based method to construc-t large scale image dataset in an automatic way. Specifically, word representation and image representation are learned in a deep neural network from large amount of click-Through logs, and further used to define word-word similarity and image-word similarity. These two similarities are used to automatize the two labor intensive steps in manual-based image dataset construction: query formation and noisy im-age removal. With a new proposed cross convolutional filter regularizer, we can construct a million scale image dataset in one week. Finally, two image datasets are constructed to verify the effectiveness of the method. In addition to scale, the automatically constructed dataset has compara-ble accuracy, diversity and cross-dataset generalization with manually labelled image datasets. © 2015 ACM.
KW - Automatic Image Dataset Construction
KW - Deep Learning
KW - Image Representa-Tion
KW - Word Representation
UR - https://www.scopus.com/pages/publications/84962821266
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-84962821266&origin=recordpage
U2 - 10.1145/2733373.2806243
DO - 10.1145/2733373.2806243
M3 - RGC 32 - Refereed conference paper (with host publication)
SN - 9781450334594
T3 - MM 2015 - Proceedings of the 2015 ACM Multimedia Conference
SP - 441
EP - 450
BT - MM 2015 - Proceedings of the 2015 ACM Multimedia Conference
PB - Association for Computing Machinery
T2 - 23rd ACM International Conference on Multimedia, MM 2015
Y2 - 26 October 2015 through 30 October 2015
ER -