dc.identifier.citation |
[1] Colorful image colorization. Zhang, Richard, Phillip Isola, and Alexei A. Efros. 2016. s.l. : European Conference on Computer Vision, 2016. [2] Deep neural networks in machine translation: An overview. Zhang, Jiajun, and Chengqing Zong. 2015. s.l. : IEEE Intelligent Systems 30.5 : 16-25., 2015. [3] Deep visual-semantic alignments for generating image descriptions. Karpathy, Andrej, and Li Fei-Fei. 2015. s.l. : IEEE Conference on Computer Vision and Pattern Recognition, 2015. [4] Dropout: A Simple Way to Prevent Neural Networks from Overfitting. Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan Salakhutdinov. 2014. s.l. : The Journal of Machine Learning Research, 15(1), 1929-1958., 2014. [5] Explain images with multimodal recurrent neural networks. Mao, J., Xu, W., Yang, Y., Wang, J., & Yuille, A. L. 2014. s.l. : arXiv preprint arXiv:1410.1090., 2014. [6] Generating sequences with recurrent neural networks. Graves, Alex. 2013. s.l. : arXiv preprint arXiv:1308.0850, 2013. [7] Generative adversarial nets. Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., ... & Bengio, Y. 2014. s.l. : Advances in neural information processing systems (pp. 2672-2680), 2014. [8] ImageNet Classification with Deep Convolutional Neural Networks. Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton. 2012. s.l. : Advances in neural information processing systems (pp. 1097-1105)., 2012. [9] Nielsen, Michael. Neural Networks and Deep Learning. [10] Rectified Linear Units Improve Restricted Boltzmann Machines. Vinod Nair, Geoffrey E. Hinton. 2014. s.l. : Proceedings of the 27th international conference on machine learning (ICML-10) (pp. 807-814)., 2014. [11] Scalable object detection using deep neural networks. Erhan, Dumitru, Christian Szegedy, Alexander Toshev, and Dragomir Anguelov. 2014. s.l. : IEEE Conference on Computer Vision and Pattern Recognition, pp. 2147-2154, 2014. [12] Visualizing and Understanding Convolutional Neural Networks. Matthew D. Zieler, Rob Fergus. 2014. s.l. : European conference on computer vision (pp. 818-833). Springer International Publishing., 2014. 55 [13] Visually indicated sounds. Owens, A., Isola, P., McDermott, J., Torralba, A., Adelson, E.H. and Freeman, W.T. 2016. s.l. : IEEE Conference on Computer Vision and Pattern Recognition (pp. 2405-2413), 2016. [14] YOLO9000: Better. Faster, Stronger. Redmon, J., & Farhadi, A. 2016. s.l. : arXiv preprint, 2016. [15] machinelearningmastery.com. '8 Inspirational Applications of Deep Learning', July 14, 2016. [Online]. Available: https://machinelearningmastery.com/inspirational-applications-deep-learning/. [Accessed: 02-April-2017] [16] Pjreddie.com. 'YOLO: Real-Time Object Detection'. [Online]. Available: https://pjreddie.com/darknet/yolo. [Accessed: 03-Nov-2017] [17] Towardsdatascience.com. 'YOLO — you only look once, real time object detection explained'. 2016. [Online]. Available: https://towardsdatascience.com/yolo-you-only-look-once-real-time-object-detection-explained-492dc9230006. [Accessed: 06-Nov=2017] [18] Hackernoon.com. 'Challenges in Deep Learning'. 2017. [Online]. Available: https://hackernoon.com/challenges-in-deep-learning-57bbf6e73bb. [Accessed: 15-Mar-2017] [19] Adeshpande3.github.io. 'A Beginner's Guide to Understanding Convolutional Neural Networks'. 2016. [Online}. Available: https://adeshpande3.github.io/adeshpande3.github.io/A-Beginner%27s-Guide-To-Understanding-Convolutional-Neural-Networks. [Accessed: 05-Mar-2017] [20] Towardsdatascience.com. 'A Survey of Deep Learning Frameworks'. 2017. [Online] Available: https://towardsdatascience.com/a-survey-of-deep-learning-frameworks-43b88b11af34. [Accessed: 05-Nov-2017] [21] Tensorflow.org. 'Getting started with TensorFlow'. 2017. [Online] Available: https://www.tensorflow.org/get_started/get_started. [Accessed: April-2017] |
en_US |
dc.description.abstract |
Image classification is the task of taking an input image and outputting a class (a cat, dog, etc.) or a probability of classes that best describes the image. For humans, this task of recognition is one of the first skills we learn from the moment we are born and is one that comes naturally and effortlessly as adults. Without even thinking twice, we’re able to quickly and seamlessly identify the environment we are in as well as the objects that surround us. When we see an image or just when we look at the world around us, most of the time we are able to immediately characterize the scene and give each object a label, all without even consciously noticing. These skills of being able to quickly recognize patterns, generalize from prior knowledge, and adapt to different image environments are ones that we do not share with our fellow machines. Convolutional neural networks, Sounds like a weird combination of biology and math with a little CS sprinkled in, but these networks have been some of the most influential innovations in the field of computer vision. 2012 was the first year that neural nets grew to prominence as Alex Krizhevsky used them to win that year’s ImageNet competition (basically, the annual Olympics of computer vision), dropping the classification error record from 26% to 15%, an astounding improvement at the time. Ever since then, a host of companies have been using deep learning at the core of their services.
In our research we experimentd on image classification using different deep learning frameworks. The following sections describes basics of deep learning and how it can be used in case of image classification and convolutional neural networks. We have also discussed the different deep learning frameworks and current applications. Finally we shared our gained results and knowledge. This will help the researchers to get a clear idea about getting knowledge in the field of image classification with deep convolutional neural networks. |
en_US |