PPEDNet: Pyramid pooling encoder-decoder network for real-time semantic segmentation- 学术资源搜索

PPEDNet: Pyramid pooling encoder-decoder network for real-time semantic segmentation

Z Tan, B Liu, N Yu - Image and Graphics: 9th International Conference …, 2017 - Springer

Image and Graphics: 9th International Conference, ICIG 2017, Shanghai, China …, 2017•Springer

Abstract

Image semantic segmentation is a fundamental problem and plays an important role in computer vision and artificial intelligence. Recent deep neural networks have improved the accuracy of semantic segmentation significantly. Meanwhile, the number of network parameters and floating point operations have also increased notably. The real-world applications not only have high requirements on the segmentation accuracy, but also demand real-time processing. In this paper, we propose a pyramid pooling encoder-decoder network named PPEDNet for both better accuracy and faster processing speed. Our encoder network is based on VGG16 and discards the fully connected layers due to their huge amounts of parameters. To extract context feature efficiently, we design a pyramid pooling architecture. The decoder is a trainable convolutional network for upsampling the output of the encoder, and fine-tuning the segmentation details. Our method is evaluated on CamVid dataset, achieving 7.214% mIOU accuracy improvement while reducing 17.9% of the parameters compared with the state-of-the-art algorithm.

Springer

展开收起

被引用次数：9 相关文章

以上显示的是最相近的搜索结果。查看全部搜索结果

高级搜索

QQ 群

PPEDNet: Pyramid pooling encoder-decoder network for real-time semantic segmentation

引用