[HTML][HTML] A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas

J Terven, DM Córdova-Esparza… - Machine Learning and …, 2023 - mdpi.com
YOLO has become a central real-time object detection system for robotics, driverless cars,
and video monitoring applications. We present a comprehensive analysis of YOLO's …

A comprehensive survey on pretrained foundation models: A history from bert to chatgpt

C Zhou, Q Li, C Li, J Yu, Y Liu, G Wang… - International Journal of …, 2024 - Springer
Abstract Pretrained Foundation Models (PFMs) are regarded as the foundation for various
downstream tasks across different data modalities. A PFM (eg, BERT, ChatGPT, GPT-4) is …

[PDF][PDF] 卷积神经网络研究综述

周飞燕, 金林鹏, 董军 - 计算机学报, 2017 - cjc.ict.ac.cn
摘要作为一个十余年来快速发展的崭新领域, 深度学习受到了越来越多研究者的关注,
它在特征提取和模型拟合上都有着相较于浅层模型显然的优势. 深度学习善于从原始输入数据中 …

[PDF][PDF] 深度学习研究综述

尹宝才, 王文通, 王立春 - 北京工业大学学报, 2015 - globalhha.com
鉴于深度学习在学术界和工业界的重要性, 依据数据流向对目前有代表性的深度学习算法进行
归纳和总结, 综述了不同类型深度网络的结构及特点. 首先介绍了深度学习的概念; …

[HTML][HTML] Explainable Artificial Intelligence (XAI): What we know and what is left to attain Trustworthy Artificial Intelligence

S Ali, T Abuhmed, S El-Sappagh, K Muhammad… - Information fusion, 2023 - Elsevier
Artificial intelligence (AI) is currently being utilized in a wide range of sophisticated
applications, but the outcomes of many AI models are challenging to comprehend and trust …

Object detection using YOLO: Challenges, architectural successors, datasets and applications

T Diwan, G Anirudh, JV Tembhurne - multimedia Tools and Applications, 2023 - Springer
Object detection is one of the predominant and challenging problems in computer vision.
Over the decade, with the expeditious evolution of deep learning, researchers have …

Visual attention network

MH Guo, CZ Lu, ZN Liu, MM Cheng, SM Hu - Computational Visual Media, 2023 - Springer
While originally designed for natural language processing tasks, the self-attention
mechanism has recently taken various computer vision areas by storm. However, the 2D …

[HTML][HTML] Review of image classification algorithms based on convolutional neural networks

L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …

[HTML][HTML] Deep learning in computer vision: A critical review of emerging techniques and application scenarios

J Chai, H Zeng, A Li, EWT Ngai - Machine Learning with Applications, 2021 - Elsevier
Deep learning has been overwhelmingly successful in computer vision (CV), natural
language processing, and video/speech recognition. In this paper, our focus is on CV. We …

Unified contrastive learning in image-text-label space

J Yang, C Li, P Zhang, B Xiao, C Liu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Visual recognition is recently learned via either supervised learning on human-annotated
image-label data or language-image contrastive learning with webly-crawled image-text …