Object Classification Using Stacked Autoencoder and Convolutional Neural Network

Object Classification Using Stacked Autoencoder and Convolutional Neural Network

OBJECT CLASSIFICATION USING STACKED AUTOENCODER AND CONVOLUTIONAL NEURAL NETWORK A Paper Submitted to the Graduate Faculty of the North Dakota State University of Agriculture and Applied Science By Vijaya Chander Rao Gottimukkula In Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE Major Department: Computer Science November 2016 Fargo, North Dakota North Dakota State University Graduate School Title OBJECT CLASSIFICATION USING STACKED AUTOENCODER AND CONVOLUTIONAL NEURAL NETWORK By Vijaya Chander Rao Gottimukkula The Supervisory Committee certifies that this disquisition complies with North Dakota State University’s regulations and meets the accepted standards for the degree of MASTER OF SCIENCE SUPERVISORY COMMITTEE: Dr. Simone Ludwig Chair Dr. Anne Denton Dr. María de los Ángeles Alfonseca-Cubero Approved: 11/15/2016 Dr. Brain Slator Date Department Chair ABSTRACT In the recent years, deep learning has shown to have a formidable impact on object classification and has bolstered the advances in machine learning research. Many image datasets such as MNIST, CIFAR-10, SVHN, Imagenet, Caltech, etc. are available which contain a broad spectrum of image data for training and testing purposes. Numerous deep learning architectures have been developed in the last few years, and significant results were obtained upon testing against datasets. However, state-of-the-art results have been achieved through Convolutional Neural Networks (CNN). This paper investigates different deep learning models based on the standard Convolutional Neural Networks and Stacked Auto Encoders architectures for object classification on given image datasets. Accuracy values were computed and presented for these models on three image classification datasets. iii ACKNOWLEDGEMENTS I would like to express my gratitude to Dr. Simone Ludwig for her constant encouragement and support towards the successful completion of my masters paper. Her useful insights and positive critique have helped me understand and develop my skills in the subject. I would also like to thank Dr. Anne Denton and Dr. María de los Ángeles Alfonseca-Cubero for their willingness to serve on my committee and provide useful inputs. Also, I want to thank my parents and friends for their unending support and inspiration. iv TABLE OF CONTENTS ABSTRACT ................................................................................................................................... iii ACKNOWLEDGEMENTS ........................................................................................................... iv LIST OF TABLES ........................................................................................................................ vii LIST OF FIGURES ..................................................................................................................... viii LIST OF ABBREVIATIONS ........................................................................................................ ix 1. INTRODUCTION ...................................................................................................................... 1 1.1. Artificial Intelligence and Machine Learning ...................................................................... 1 1.2. Computer Vision and Object Recognition ........................................................................... 2 1.3. Shortcomings of Conventional Techniques ......................................................................... 2 1.4. Evolution of Deep Learning ................................................................................................. 3 2. RELATED WORK ..................................................................................................................... 5 2.1. Classification Algorithms ..................................................................................................... 7 2.1.1. K-Nearest Neighbors ..................................................................................................... 7 2.1.2. SVM .............................................................................................................................. 8 2.1.3. Boosted Stumps ............................................................................................................. 9 3. APPROACH ............................................................................................................................. 10 3.1. Stacked Autoencoders (SAE) ............................................................................................. 14 3.1.1. Autoencoders ............................................................................................................... 14 3.1.2. Stacked Autoencoders ................................................................................................. 15 3.1.3. SAE Architecture for Classification ............................................................................ 16 3.1.4. Training Details of Stacked Autoencoder ................................................................... 17 3.2. Convolutional Neural Networks (CNN) ............................................................................ 19 3.2.1. Convolutional Layer .................................................................................................... 20 v 3.2.2. ReLu Layer .................................................................................................................. 21 3.2.3. Pooling Layer .............................................................................................................. 21 3.2.4. Fully Connected Layer ................................................................................................ 22 3.2.5. Output Layer ................................................................................................................ 23 3.2.6. Convolutional Neural Network Architecture for Classification .................................. 24 3.2.7. Training Details of CNN ............................................................................................. 25 4. EXPERIMENT AND RESULTS ............................................................................................. 26 4.1. Convolutional Neural Networks (CNN) Models ............................................................... 29 4.1.1. CNN Model-1 Architecture ......................................................................................... 29 4.1.2. CNN Model-2 Architecture ......................................................................................... 35 4.2. Stacked Autoencoder Architectures ................................................................................... 39 4.2.1. Results of SAE on MNIST dataset .............................................................................. 41 4.2.2. Results of SAE on SVHN dataset ............................................................................... 42 4.2.3. Results of SAE on CIFAR-10 dataset ......................................................................... 43 5. CONCLUSION AND FUTURE WORK ................................................................................. 46 REFERENCES ............................................................................................................................. 48 vi LIST OF TABLES Table Page 1. Results of CNN - Model 1 on MNIST dataset .......................................................................... 31 2. Results of CNN - Model 1 on SVHN dataset ........................................................................... 33 3. Results of CNN - Model 1 on CIFAR-10 dataset ..................................................................... 34 4. Results of CNN - Model 2 on MNIST dataset .......................................................................... 36 5. Results of CNN - Model 2 on SVHN dataset ........................................................................... 37 6. Results of CNN - Model 2 on CIFAR-10 dataset ..................................................................... 38 7. Number of nodes in each layer for different datasets ............................................................... 40 8. Results of SAE on MNIST dataset ........................................................................................... 41 9. Results of SAE on SVHN dataset ............................................................................................. 42 10. Results of SAE on CIFAR-10 dataset ..................................................................................... 43 11. Comparison of Results ............................................................................................................ 44 vii LIST OF FIGURES Figure Page 1. History of neural networks [5] .................................................................................................... 3 2. Data mining as a core process in KDD [13] ............................................................................... 5 3. Descriptive and predictive data mining techniques [13] ............................................................. 6 4. SVM Classification [18] ............................................................................................................. 8 5. Steps to solve classification problem [2] .................................................................................. 10 6. Autoencoder [31] ...................................................................................................................... 15

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    61 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us