Gaussian RAM: Lightweight image classification via stochastic retina-inspired glimpse and reinforcement learning

D Shim, HJ Kim - … on Control, Automation and Systems (ICCAS), 2020 - ieeexplore.ieee.org
2020 20th International Conference on Control, Automation and …, 2020
Previous studies on image classification have mainly focused on the performance of the networks, not on real-time operation or model compression. We propose a Gaussian Deep Recurrent visual Attention Model (GDRAM), a reinforcement-learning-based lightweight deep neural network for large-scale image classification that outperforms the conventional CNN (Convolutional Neural Network), which uses the entire image as input. Highly inspired by the biological visual recognition process, our model mimics the stochastic location of the retina with a Gaussian distribution. We evaluate the model on the Large cluttered MNIST, Large CIFAR-10 and Large CIFAR-100 datasets, which are resized to 128 pixels in both width and height. The implementation of Gaussian RAM in PyTorch and its pretrained model are available at: https://github.com/dsshim0125/gaussian-ram
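To make the idea of a stochastic, Gaussian-distributed glimpse location more concrete, the following is a minimal sketch in plain Python. It is not taken from the authors' repository. The function names `sample_location` and `extract_glimpse`, the normalised coordinate range [-1, 1], the standard deviation of 0.2, and the 8-pixel glimpse size are all assumptions made for illustration.

```python
import random


def sample_location(mean, std, rng):
    """Draw a 2-D glimpse centre from an isotropic Gaussian, clipped to [-1, 1].

    Hypothetical helper: the mean would normally come from the network,
    and std is an assumed fixed value.
    """
    return tuple(max(-1.0, min(1.0, rng.gauss(m, std))) for m in mean)


def extract_glimpse(image, loc, size):
    """Crop a size x size patch centred at loc; pixels outside the image are zero."""
    height, width = len(image), len(image[0])
    # Map normalised coordinates in [-1, 1] to pixel indices.
    cx = int(round((loc[0] + 1.0) / 2.0 * (width - 1)))
    cy = int(round((loc[1] + 1.0) / 2.0 * (height - 1)))
    half = size // 2
    patch = []
    for y in range(cy - half, cy - half + size):
        row = []
        for x in range(cx - half, cx - half + size):
            inside = 0 <= y < height and 0 <= x < width
            row.append(image[y][x] if inside else 0.0)
        patch.append(row)
    return patch


rng = random.Random(0)  # fixed seed so the sketch is reproducible
# Toy 128 x 128 single-channel image whose pixel value is x + y.
image = [[float(x + y) for x in range(128)] for y in range(128)]
loc = sample_location(mean=(0.0, 0.0), std=0.2, rng=rng)
patch = extract_glimpse(image, loc, size=8)
```

In this sketch the classifier would only ever see `patch`, a small crop, rather than the full 128 x 128 input, which is the sense in which a glimpse-based model can be lighter than a CNN that processes the whole image. How the sampled locations are trained with reinforcement learning is outside the scope of this sketch.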