PPEDNet: Pyramid pooling encoder-decoder network for real-time semantic segmentation

Z Tan, B Liu, N Yu - Image and Graphics: 9th International Conference …, 2017 - Springer
Image and Graphics: 9th International Conference, ICIG 2017, Shanghai, China …, 2017Springer
Image semantic segmentation is a fundamental problem and plays an important role in
computer vision and artificial intelligence. Recent deep neural networks have improved the
accuracy of semantic segmentation significantly. Meanwhile, the number of network
parameters and floating point operations have also increased notably. The real-world
applications not only have high requirements on the segmentation accuracy, but also
demand real-time processing. In this paper, we propose a pyramid pooling encoder-decoder …
Abstract
Image semantic segmentation is a fundamental problem and plays an important role in computer vision and artificial intelligence. Recent deep neural networks have improved the accuracy of semantic segmentation significantly. Meanwhile, the number of network parameters and floating point operations have also increased notably. The real-world applications not only have high requirements on the segmentation accuracy, but also demand real-time processing. In this paper, we propose a pyramid pooling encoder-decoder network named PPEDNet for both better accuracy and faster processing speed. Our encoder network is based on VGG16 and discards the fully connected layers due to their huge amounts of parameters. To extract context feature efficiently, we design a pyramid pooling architecture. The decoder is a trainable convolutional network for upsampling the output of the encoder, and fine-tuning the segmentation details. Our method is evaluated on CamVid dataset, achieving 7.214% mIOU accuracy improvement while reducing 17.9% of the parameters compared with the state-of-the-art algorithm.
Springer
以上显示的是最相近的搜索结果。 查看全部搜索结果