Contrastive pre-training of image-text foundation models such as CLIP demonstrated excellent zero-shot performance and improved robustness on a wide range of downstream …
Multi-scale resolution training has seen increased adoption across multiple vision tasks, including classification and detection. Training with smaller resolutions enables faster …
Contrastive language-image pretraining (CLIP) is a standard method for training vision-language models. While CLIP is scalable, promptable, and robust to distribution shifts on …
Contrastive learning has emerged as a transformative method for learning effective visual representations through the alignment of image and text embeddings. However, pairwise …
Vision Foundation Models (VFMs) pretrained on massive datasets exhibit impressive performance on various downstream tasks, especially with limited labeled target data …
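The snippets above all center on pairwise contrastive alignment of image and text embeddings. As a point of reference, below is a minimal sketch of a CLIP-style pairwise contrastive loss, assuming pre-computed image and text embeddings for a batch of matched pairs; the names (`clip_contrastive_loss`, `temperature`) are illustrative and not drawn from any particular codebase.

```python
# Minimal sketch of a pairwise image-text contrastive (CLIP-style) loss.
# Assumes image and text embeddings are already computed and batch-aligned,
# i.e. row i of each tensor comes from the same image-text pair.
import torch
import torch.nn.functional as F


def clip_contrastive_loss(image_features: torch.Tensor,
                          text_features: torch.Tensor,
                          temperature: float = 0.07) -> torch.Tensor:
    # L2-normalize both modalities so the dot product is a cosine similarity.
    image_features = F.normalize(image_features, dim=-1)
    text_features = F.normalize(text_features, dim=-1)

    # Pairwise similarity matrix: entry (i, j) compares image i with text j.
    logits = image_features @ text_features.t() / temperature

    # Matching pairs lie on the diagonal; alignment is treated as a
    # classification problem in both directions and averaged.
    targets = torch.arange(logits.size(0), device=logits.device)
    loss_i2t = F.cross_entropy(logits, targets)
    loss_t2i = F.cross_entropy(logits.t(), targets)
    return (loss_i2t + loss_t2i) / 2


if __name__ == "__main__":
    # Toy example: a batch of 8 random embedding pairs of dimension 512.
    img = torch.randn(8, 512)
    txt = torch.randn(8, 512)
    print(clip_contrastive_loss(img, txt).item())
```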