A Study on Deep Learning: Training, Models and Applications
dc.contributor.advisor | Jiang, Hui | |
dc.creator | Pan, Hengyue | |
dc.date.accessioned | 2018-03-01T13:43:11Z | |
dc.date.available | 2018-03-01T13:43:11Z | |
dc.date.copyright | 2017-04-18 | |
dc.date.issued | 2018-03-01 | |
dc.date.updated | 2018-03-01T13:43:11Z | |
dc.degree.discipline | Computer Science | |
dc.degree.level | Doctoral | |
dc.degree.name | PhD - Doctor of Philosophy | |
dc.description.abstract | In the past few years, deep learning has become a very important research field that has attracted a lot of research interests, attributing to the development of the computational hardware like high performance GPUs, training deep models, such as fully-connected deep neural networks (DNNs) and convolutional neural networks (CNNs), from scratch becomes practical, and using well-trained deep models to deal with real-world large scale problems also becomes possible. This dissertation mainly focuses on three important problems in deep learning, i.e., training algorithm, computational models and applications, and provides several methods to improve the performances of different deep learning methods. The first method is a DNN training algorithm called Annealed Gradient Descent (AGD). This dissertation presents a theoretical analysis on the convergence properties and learning speed of AGD to show its benefits. Experimental results have shown that AGD yields comparable performance as SGD but it can significantly expedite training of DNNs in big data sets. Secondly, this dissertation proposes to apply a novel model, namely Hybrid Orthogonal Projection and Estimation (HOPE), to CNNs. HOPE can be viewed as a hybrid model to combine feature extraction with mixture models. The experimental results have shown that HOPE layers can significantly improve the performance of CNNs in the image classification tasks. The third proposed method is to apply CNNs to image saliency detection. In this approach, a gradient descent method is used to iteratively modify the input images based on pixel-wise gradients to reduce a pre-defined cost function. Moreover, SLIC superpixels and low level saliency features are applied to smooth and refine the saliency maps. Experimental results have shown that the proposed methods can generate high-quality salience maps. The last method is also for image saliency detection. However, this method is based on Generative Adversarial Network (GAN). Different from GAN, the proposed method uses fully supervised learning to learn G-Network and D-Network. Therefore, it is called Supervised Adversarial Network (SAN). Moreover, SAN introduces a different G-Network and conv-comparison layers to further improve the saliency performance. Experimental results show that the SAN model can also generate state-of-the-art saliency maps for complicate images. | |
dc.identifier.uri | http://hdl.handle.net/10315/34243 | |
dc.language.iso | en | |
dc.rights | Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests. | |
dc.subject | Computer science | |
dc.subject.keywords | Deep learning | |
dc.subject.keywords | DNNs | |
dc.subject.keywords | CNNs | |
dc.subject.keywords | Training algorithms | |
dc.subject.keywords | AGD | |
dc.subject.keywords | Models | |
dc.subject.keywords | HOPE | |
dc.subject.keywords | Computer vision | |
dc.subject.keywords | Saliency detection | |
dc.subject.keywords | SAN | |
dc.title | A Study on Deep Learning: Training, Models and Applications | |
dc.type | Electronic Thesis or Dissertation |
Files
Original bundle
1 - 1 of 1