Advanced SearchSearch Tips
Automatic Tagging for Social Images using Convolution Neural Networks
facebook(new window)  Pirnt(new window) E-mail(new window) Excel Download
  • Journal title : Journal of KIISE
  • Volume 43, Issue 1,  2016, pp.47-53
  • Publisher : Korean Institute of Information Scientists and Engineers
  • DOI : 10.5626/JOK.2016.43.1.47
 Title & Authors
Automatic Tagging for Social Images using Convolution Neural Networks
Jang, Hyunwoong; Cho, Soosun;
While the Internet develops rapidly, a huge amount of image data collected from smart phones, digital cameras and black boxes are being shared through social media sites. Generally, social images are handled by tagging them with information. Due to the ease of sharing multimedia and the explosive increase in the amount of tag information, it may be considered too much hassle by some users to put the tags on images. Image retrieval is likely to be less accurate when tags are absent or mislabeled. In this paper, we suggest a method of extracting tags from social images by using image content. In this method, CNN(Convolutional Neural Network) is trained using ImageNet images with labels in the training set, and it extracts labels from instagram images. We use the extracted labels for automatic image tagging. The experimental results show that the accuracy is higher than that of instagram retrievals.
automatic image tagging;convolutional neural network;image content;instagram;
 Cited by
이미지와 텍스트 정보의 카테고리 분류에 의한 SNS 팔로잉 추천 방법,홍택은;신주현;

스마트미디어저널, 2016. vol.5. 3, pp.54-61
Z. Lin, G. Ding, M. Hu, J. Wang, and X. Ye, "Image Tag Completion via Image-Specific and Tag-Specific Linear Sparse Reconstructions," Proc. of the 2013 IEEE Conference on Computer Vision and Patten Recognition, pp. 1618-1625, Jun. 2013.

J. Cha, S. Cho, Y. Uh, S. Kim and H. Byun, "Image annotation using tag refinement," Journal of KIISE : Software and Applications, Vol. 39, No. 8, pp. 613-620, 2012. (in Korean)

S. Lee and E. Hwang, "Image Retrieval Scheme using Spatial Similarity and Annotation," Journal of KIISE : Databases, Vol. 30, No. 2, pp. 134-144, 2003. (in Korean)

G. Csurka, C. Bray, C. Dance, and L. Fan, "Visual categorization with bags of keypoints," Workshop on Statistical Learning in Computer Vision, ECCV, Vol. 1, No. 1-22, May. 2004.

L. Meiyu, D. Junping, J. Yingmin, and S. Zengqi, "Image Semantic Description and Automatic Semantic Annotation," Control Automation and Systems (ICCAS), 2010 International Conference on, pp. 1192-1195, 2010.

H. Jang and S. Cho, "Image Classification Using Bag of Visual Words and Visual Saliency Model," KIPS Trans. Software and Data Engineering, Vol. 3, No. 12, pp. 547-552, 2014. (in Korean) crossref(new window)

M. Oquab, L. Bottou, I. Laptev and J. Sivic, "Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks," Computer Vision and Pattern Recognition(CVPR), 2014 IEEE Conference on, pp. 1717-1724, 2014.

J. Schmidhuber, "Deep Learning in Neural Networks: An Overview," Neural Networks, 61, pp. 85-117, 2015. crossref(new window)

M. Nam, J. Kim and J. Shin, "A User motion Information Measurement Using Image and Text on Instagram-Based," Journal of Korea Multimedia Society, Vol. 17, No. 9, pp. 1125-1133, 2014. (in Korean) crossref(new window)

D. G. Lowe, "Distinctive Image Features from Scale-Invariant Keypoints," International Journal of Computer Vision, Vol. 60, No. 2, pp. 91-110, 2004. crossref(new window)

P. Fischer, A. Dosovitskiy and T. Brox, "Descriptor Matching with Convolutional Neural Networks: a Comparison to SIFT," arXiv preprint arXiv:1405.5769, May. 2014.

J. Wu, Y. Yu, C. Huang and K. Yu, "Deep Multiple Instance Learning for Image Classification and Auto-Annotation," Proc. of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3460-3469, 2015.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," Advances in neural information processing systems, pp. 1097-1105, 2012.

Y. Gong, Y. Jia, T. Leung, A. Toshev and S. Ioffe, "Deep Convolutional Ranking for Multilabel Image Annotation," arXiv preprint arXiv:1312.4894, 2013.

G. E. Dahl, T. N. Sainath and G. E. Hinton, "Improving Deep Neural Networks for LVCSR using Rectified Linear Units and Dropout," Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 8609-8613 May. 2013.