Dynamic neural networks: A survey

Y Han, G Huang, S Song, L Yang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Dynamic neural network is an emerging research topic in deep learning. Compared to static
models which have fixed computational graphs and parameters at the inference stage …

Adaptive rotated convolution for rotated object detection

Y Pu, Y Wang, Z Xia, Y Han, Y Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Rotated object detection aims to identify and locate objects in images with arbitrary
orientation. In this scenario, the oriented directions of objects vary considerably across …

Rank-DETR for high quality object detection

Y Pu, W Liang, Y Hao, Y Yuan… - Advances in …, 2024 - proceedings.neurips.cc
Modern detection transformers (DETRs) use a set of object queries to predict a list of
bounding boxes, sort them by their classification confidence scores, and select the top …

Recent trends in deep learning based open-domain textual question answering systems

Z Huang, S Xu, M Hu, X Wang, J Qiu, Y Fu… - IEEE …, 2020 - ieeexplore.ieee.org
Open-domain textual question answering (QA), which aims to answer questions from large
data sources like Wikipedia or the web, has gained wide attention in recent years. Recent …

Model tells you what to discard: Adaptive kv cache compression for llms

S Ge, Y Zhang, L Liu, M Zhang, J Han, J Gao - arXiv preprint arXiv …, 2023 - arxiv.org
In this study, we introduce adaptive KV cache compression, a plug-and-play method that
reduces the memory footprint of generative inference for Large Language Models (LLMs) …

Efficient diffusion transformer with step-wise dynamic attention mediators

Y Pu, Z Xia, J Guo, D Han, Q Li, D Li, Y Yuan… - … on Computer Vision, 2025 - Springer
This paper identifies significant redundancy in the query-key interactions within self-attention
mechanisms of diffusion transformer models, particularly during the early stages of …

Adapting Neural Networks at Runtime: Current Trends in At-Runtime Optimizations for Deep Learning

M Sponner, B Waschneck, A Kumar - ACM Computing Surveys, 2024 - dl.acm.org
Adaptive optimization methods for deep learning adjust the inference task to the current
circumstances at runtime to improve the resource footprint while maintaining the model's …

Gra: Detecting oriented objects through group-wise rotating and attention

J Wang, Y Pu, Y Han, J Guo, Y Wang, X Li… - European Conference on …, 2025 - Springer
Oriented object detection, an emerging task in recent years, aims to identify and locate
objects across varied orientations. This requires the detector to accurately capture the …

Transkimmer: Transformer learns to layer-wise skim

Y Guan, Z Li, J Leng, Z Lin, M Guo - arXiv preprint arXiv:2205.07324, 2022 - arxiv.org
Transformer architecture has become the de-facto model for many machine learning tasks
from natural language processing and computer vision. As such, improving its computational …

[HTML][HTML] “Note Bloat” impacts deep learning-based NLP models for clinical prediction tasks

J Liu, D Capurro, A Nguyen, K Verspoor - Journal of biomedical informatics, 2022 - Elsevier
One unintended consequence of the Electronic Health Records (EHR) implementation is the
overuse of content-importing technology, such as copy-and-paste, that creates “bloated” …