Introduction to a large-scale general purpose ground truth database: methodology, annotation...

J Ding, N Xue, GS Xia, X Bai, W Yang… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

In he past decade, object detection has achieved significant progress in natural images but
not in aerial images, due to the massive variations in the scale and orientation of objects …

被引用次数：445 相关文章所有 11 个版本

[PDF] mlr.press

Do imagenet classifiers generalize to imagenet?

B Recht, R Roelofs, L Schmidt… - … conference on machine …, 2019 - proceedings.mlr.press

We build new test sets for the CIFAR-10 and ImageNet datasets. Both benchmarks have
been the focus of intense research for almost a decade, raising the danger of overfitting to …

被引用次数：1971 相关文章所有 6 个版本

[PDF] thecvf.com

DOTA: A large-scale dataset for object detection in aerial images

GS Xia, X Bai, J Ding, Z Zhu… - Proceedings of the …, 2018 - openaccess.thecvf.com

Object detection is an important and challenging problem in computer vision. Although the
past decade has witnessed major advances in object detection in natural scenes, such …

被引用次数：2918 相关文章所有 14 个版本

[PDF] thecvf.com

Action genome: Actions as compositions of spatio-temporal scene graphs

J Ji, R Krishna, L Fei-Fei… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com

Action recognition has typically treated actions and activities as monolithic events that occur
in videos. However, there is evidence from Cognitive Science and Neuroscience that people …

被引用次数：397 相关文章所有 9 个版本

[PDF] springer.com

Visual genome: Connecting language and vision using crowdsourced dense image annotations

R Krishna, Y Zhu, O Groth, J Johnson, K Hata… - International journal of …, 2017 - Springer

Despite progress in perceptual tasks such as image classification, computers still perform
poorly on cognitive tasks such as image description and question answering. Cognition is …

被引用次数：6270 相关文章所有 14 个版本

[PDF] arxiv.org

Imagenet large scale visual recognition challenge

O Russakovsky, J Deng, H Su, J Krause… - International journal of …, 2015 - Springer

Abstract The ImageNet Large Scale Visual Recognition Challenge is a benchmark in object
category classification and detection on hundreds of object categories and millions of …

被引用次数：49074 相关文章所有 20 个版本

[PDF] thecvf.com

Image retrieval using scene graphs

J Johnson, R Krishna, M Stark, LJ Li… - Proceedings of the …, 2015 - openaccess.thecvf.com

This paper develops a novel framework for semantic image retrieval based on the notion of
a scene graph. Our scene graphs represent objects (" man"," boat"), attributes of objects (" …

被引用次数：1334 相关文章所有 10 个版本

[PDF] springer.com

Facial-sketch synthesis: A new challenge

DP Fan, Z Huang, P Zheng, H Liu, X Qin… - Machine Intelligence …, 2022 - Springer

This paper aims to conduct a comprehensive study on facial-sketch synthesis (FSS).
However, due to the high cost of obtaining hand-drawn sketch datasets, there is a lack of a …

被引用次数：33 相关文章所有 13 个版本

[PDF] arxiv.org

Cnn-based density estimation and crowd counting: A survey

G Gao, J Gao, Q Liu, Q Wang, Y Wang - arXiv preprint arXiv:2003.12783, 2020 - arxiv.org

Accurately estimating the number of objects in a single image is a challenging yet
meaningful task and has been applied in many applications such as urban planning and …

被引用次数：233 相关文章所有 2 个版本

[PDF] google.com

Advancing image understanding in poor visibility environments: A collective benchmark study

W Yang, Y Yuan, W Ren, J Liu… - … on Image Processing, 2020 - ieeexplore.ieee.org

Existing enhancement methods are empirically expected to help the high-level end
computer vision task: however, that is observed to not always be the case in practice. We …

被引用次数：256 相关文章所有 5 个版本

高级搜索

QQ 群