Deep reinforcement learning in computer vision: a comprehensive survey

N Le, VS Rathour, K Yamazaki, K Luu… - Artificial Intelligence …, 2022 - Springer
Deep reinforcement learning augments the reinforcement learning framework and utilizes
the powerful representation of deep neural networks. Recent works have demonstrated the …

Object detection in 20 years: A survey

Z Zou, K Chen, Z Shi, Y Guo, J Ye - Proceedings of the IEEE, 2023 - ieeexplore.ieee.org
Object detection, as of one the most fundamental and challenging problems in computer
vision, has received great attention in recent years. Over the past two decades, we have …

A review of object detection based on deep learning

Y Xiao, Z Tian, J Yu, Y Zhang, S Liu, S Du… - Multimedia Tools and …, 2020 - Springer
With the rapid development of deep learning techniques, deep convolutional neural
networks (DCNNs) have become more important for object detection. Compared with …

M3d-rpn: Monocular 3d region proposal network for object detection

G Brazil, X Liu - Proceedings of the IEEE/CVF international …, 2019 - openaccess.thecvf.com
Understanding the world in 3D is a critical component of urban autonomous driving.
Generally, the combination of expensive LiDAR sensors and stereo RGB imaging has been …

Nisp: Pruning networks using neuron importance score propagation

R Yu, A Li, CF Chen, JH Lai… - Proceedings of the …, 2018 - openaccess.thecvf.com
To reduce the significant redundancy in deep Convolutional Neural Networks (CNNs), most
existing methods prune neurons by only considering the statistics of an individual layer or …

Learning rich features for image manipulation detection

P Zhou, X Han, VI Morariu… - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Image manipulation detection is different from traditional semantic object detection because
it pays more attention to tampering artifacts than to image content, which suggests that richer …

Pullnet: Open domain question answering with iterative retrieval on knowledge bases and text

H Sun, T Bedrax-Weiss, WW Cohen - arXiv preprint arXiv:1904.09537, 2019 - arxiv.org
We consider open-domain queston answering (QA) where answers are drawn from either a
corpus, a knowledge base (KB), or a combination of both of these. We focus on a setting in …

Clustered object detection in aerial images

F Yang, H Fan, P Chu, E Blasch… - Proceedings of the …, 2019 - openaccess.thecvf.com
Detecting objects in aerial images is challenging for at least two reasons:(1) target objects
like pedestrians are very small in pixels, making them hardly distinguished from surrounding …

Ar-net: Adaptive frame resolution for efficient action recognition

Y Meng, CC Lin, R Panda, P Sattigeri… - Computer Vision–ECCV …, 2020 - Springer
Action recognition is an open and challenging problem in computer vision. While current
state-of-the-art models offer excellent recognition results, their computational expense limits …

f-brs: Rethinking backpropagating refinement for interactive segmentation

K Sofiiuk, I Petrov, O Barinova… - Proceedings of the …, 2020 - openaccess.thecvf.com
Deep neural networks have become a mainstream approach to interactive segmentation. As
we show in our experiments, while for some images a trained network provides accurate …