AUTHORS: Mario Milicevic, Krunoslav Zubrinic, Ines Obradovic, Tomo Sjekavica
Download as PDF
ABSTRACT: Fine-grained classification consists of learning and understanding the subtle details between visually similar classes, which is a difficult task even for a human expert trained in a corresponding scientific field. Similar performances can be achieved by deep learning algorithms, but this requires a great amount of data in the learning phase. Obtaining data samples and manual data labeling can be time-consuming and expensive. This is why it can be difficult to acquire the required amount of data in real conditions in many areas of application, so in the context of a limited dataset it is necessary to use other techniques, such as data augmentation and transfer learning. In this we paper we study the problem of fine-grained ship type classification with a dataset size which does not allow learning network from scratch. We will show that good classification accuracy can be achieved by artificially creating additional learning examples and by using pretrained models which allow a transfer of knowledge between related source and target domains. In this, the source and target domain can differ in their entirety.
KEYWORDS: Deep Learning, Convolutional Neural Networks, Transfer Learning, Data Augmentation, Finegrained Classification
REFERENCES:
[1] Liu, W., Wang, Z., Liu, X., Zeng, N., Liu, Y., Alsaadi, F.E.: A survey of deep neural network architectures and their applications. Neurocomputing, 234, 11--26 (2017)
[2] Wang, J., Perez, L.: The effectiveness of data augmentation in image classification using deep learning. Technical report (2017)
[3] Taylor, L., Nitschke, G.S.: Improving Deep Learning using Generic Data Augmentation. CoRR, abs/1708.06020. (2017)
[4] Lemley, J., Bazrafkan, S., Corcoran, P.: Smart Augmentation Learning an Optimal Data Augmentation Strategy, The IEEE Access, Vol. 5: 5858–5869 (2017)
[5] Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering 22(10), 1345--1359 (2010)
[6] Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition. Technical report, arXiv preprint arXiv:1310.1531.
[7] Yosinski, J., Clune, J., Bengio, Y., Lipson, H.: How transferable are features in deep neural networks? Advances in Neural Information Processing Systems 27, pp. 3320--3328, Curran Associates (2014)
[8] Chollet, F.: Deep Learning with Python (1st ed.). Manning Publications Co., Greenwich, CT, USA (2017)
[9] Abadi, M. et al.: TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems, software available from http://www.tensorflow.org (2015)
[10] Solmaz, B., Gundogdu, E., Yücesoy, V., & Koc, A.: Generic and attribute-specific deep representations for maritime vessels. IPSJ Transactions on Computer Vision and Applications, 9, 1--18 (2017)
[11] Ship Photos and Ship Tracker, http://www.shipspotting.com. Accessed 10 May 2018
[12] Krizhevsky, A., Sutskever, I. and Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pp. 1097-- 1105 (2012)
[13] Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I. Salakhutdinov, R.: Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research, 15(1), pp.1929--1958 (2014)
[14] Tieleman, T. and Hinton, G.: Lecture 6.5- rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural networks for machine learning, 4(2), pp.26--31 (2012)
[15] Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
[16] Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J. and Wojna, Z.: Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818--2826 (2016)
[17] Chollet, F.: Xception: Deep learning with depthwise separable convolutions. arXiv preprint, pp.1610-02357 (2017)
[18] He, K., Zhang, X., Ren, S. and Sun, J.: Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770--778 (2016)