
Automatic image dataset construction from click-through logs using deep neural network

  • Yalong Bai
  • Kuiyuan Yang
  • Wei Yu
  • Chang Xu
  • Wei-Ying Ma
  • Tiejun Zhao

Research output: Chapters, Conference Papers, Creative and Literary Works; RGC 32 - Refereed conference paper (with host publication); peer-review

Abstract

Labelled image datasets are the backbone of high-level image understanding tasks with wide application scenarios, and they continuously drive and evaluate progress in feature design and supervised learning models. Recently, million-scale labelled image datasets have further contributed to the rebirth of deep convolutional neural networks and made it possible to bypass the manual design of handcrafted features. However, the construction of image datasets is mainly manual and quite labor intensive; it often takes years of effort to construct a high-quality million-scale dataset. In this paper, we propose a deep learning based method to construct large-scale image datasets automatically. Specifically, word representations and image representations are learned in a deep neural network from a large amount of click-through logs, and are then used to define word-word similarity and image-word similarity. These two similarities are used to automate the two labor-intensive steps of manual image dataset construction: query formation and noisy image removal. With a newly proposed cross convolutional filter regularizer, we can construct a million-scale image dataset in one week. Finally, two image datasets are constructed to verify the effectiveness of the method. In addition to scale, the automatically constructed datasets have accuracy, diversity, and cross-dataset generalization comparable to those of manually labelled image datasets. © 2015 ACM.
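
The two similarity-driven steps named in the abstract, query formation and noisy image removal, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the similarities are computed, so cosine similarity over a shared embedding space is an assumption here, and the function names, the toy vectors, `top_k`, and `threshold` are all hypothetical stand-ins for representations that the paper learns from click-through logs.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def expand_queries(category, word_vectors, top_k=2):
    """Query formation: return the top_k words most similar to the category word."""
    target = word_vectors[category]
    scored = [
        (cosine(target, vec), word)
        for word, vec in word_vectors.items()
        if word != category
    ]
    scored.sort(reverse=True)
    return [word for _, word in scored[:top_k]]


def filter_images(category, word_vectors, image_vectors, threshold=0.5):
    """Noisy image removal: keep images whose image-word similarity meets the threshold."""
    target = word_vectors[category]
    return [
        image_id
        for image_id, vec in image_vectors.items()
        if cosine(target, vec) >= threshold
    ]


# Hypothetical toy embeddings; real ones would come from the trained network.
word_vectors = {
    "dog": [1.0, 0.1, 0.0],
    "puppy": [0.9, 0.2, 0.0],
    "hound": [0.8, 0.0, 0.1],
    "car": [0.0, 1.0, 0.0],
}
image_vectors = {
    "img_001": [0.95, 0.1, 0.0],  # close to "dog"
    "img_002": [0.05, 0.9, 0.1],  # close to "car", treated as noise for "dog"
}

queries = expand_queries("dog", word_vectors)
kept = filter_images("dog", word_vectors, image_vectors)
print(queries)
print(kept)
```

In this toy setting the related words are ranked by word-word similarity and the off-topic image falls below the image-word threshold; how the actual method selects queries and sets its cut-off is not stated in the abstract.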
Original language: English
Title of host publication: MM 2015 - Proceedings of the 2015 ACM Multimedia Conference
Publisher: Association for Computing Machinery
Pages: 441-450
ISBN (Print): 9781450334594
DOIs
Publication status: Published - 13 Oct 2015
Externally published: Yes
Event: 23rd ACM International Conference on Multimedia, MM 2015 - Brisbane, Australia
Duration: 26 Oct 2015 to 30 Oct 2015

Publication series

Name: MM 2015 - Proceedings of the 2015 ACM Multimedia Conference

Conference

Conference: 23rd ACM International Conference on Multimedia, MM 2015
Place: Australia
City: Brisbane
Period: 26/10/15 to 30/10/15

Research Keywords

  • Automatic Image Dataset Construction
  • Deep Learning
  • Image Representation
  • Word Representation
