Object detection in aerial images: A large-scale benchmark and challenges

J Ding, N Xue, GS Xia, X Bai, W Yang… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
In he past decade, object detection has achieved significant progress in natural images but
not in aerial images, due to the massive variations in the scale and orientation of objects …

Do imagenet classifiers generalize to imagenet?

B Recht, R Roelofs, L Schmidt… - … conference on machine …, 2019 - proceedings.mlr.press
We build new test sets for the CIFAR-10 and ImageNet datasets. Both benchmarks have
been the focus of intense research for almost a decade, raising the danger of overfitting to …

DOTA: A large-scale dataset for object detection in aerial images

GS Xia, X Bai, J Ding, Z Zhu… - Proceedings of the …, 2018 - openaccess.thecvf.com
Object detection is an important and challenging problem in computer vision. Although the
past decade has witnessed major advances in object detection in natural scenes, such …

Action genome: Actions as compositions of spatio-temporal scene graphs

J Ji, R Krishna, L Fei-Fei… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Action recognition has typically treated actions and activities as monolithic events that occur
in videos. However, there is evidence from Cognitive Science and Neuroscience that people …

Visual genome: Connecting language and vision using crowdsourced dense image annotations

R Krishna, Y Zhu, O Groth, J Johnson, K Hata… - International journal of …, 2017 - Springer
Despite progress in perceptual tasks such as image classification, computers still perform
poorly on cognitive tasks such as image description and question answering. Cognition is …

Imagenet large scale visual recognition challenge

O Russakovsky, J Deng, H Su, J Krause… - International journal of …, 2015 - Springer
Abstract The ImageNet Large Scale Visual Recognition Challenge is a benchmark in object
category classification and detection on hundreds of object categories and millions of …

Image retrieval using scene graphs

J Johnson, R Krishna, M Stark, LJ Li… - Proceedings of the …, 2015 - openaccess.thecvf.com
This paper develops a novel framework for semantic image retrieval based on the notion of
a scene graph. Our scene graphs represent objects (" man"," boat"), attributes of objects (" …

Facial-sketch synthesis: A new challenge

DP Fan, Z Huang, P Zheng, H Liu, X Qin… - Machine Intelligence …, 2022 - Springer
This paper aims to conduct a comprehensive study on facial-sketch synthesis (FSS).
However, due to the high cost of obtaining hand-drawn sketch datasets, there is a lack of a …

Cnn-based density estimation and crowd counting: A survey

G Gao, J Gao, Q Liu, Q Wang, Y Wang - arXiv preprint arXiv:2003.12783, 2020 - arxiv.org
Accurately estimating the number of objects in a single image is a challenging yet
meaningful task and has been applied in many applications such as urban planning and …

Advancing image understanding in poor visibility environments: A collective benchmark study

W Yang, Y Yuan, W Ren, J Liu… - … on Image Processing, 2020 - ieeexplore.ieee.org
Existing enhancement methods are empirically expected to help the high-level end
computer vision task: however, that is observed to not always be the case in practice. We …