Yang Wang
Verified email at microsoft.com
Title / Cited by / Year
Dual-side sparse tensor core
Y Wang, C Zhang, Z Xie, C Guo, Y Liu, J Leng
2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021
Cited by: 67; Year: 2021
Ladabert: Lightweight adaptation of bert through hybrid model compression
Y Mao, Y Wang, C Wu, C Zhang, Y Wang, Y Yang, Q Zhang, Y Tong, J Bai
Proceedings of the 28th International Conference on Computational …, 2020
Cited by: 64; Year: 2020
MOSC: A method to assign the outsourcing of service function chain across multiple clouds
H Chen, X Wang, Y Zhao, T Song, Y Wang, S Xu, L Li
Computer Networks 133, 166-182, 2018
Cited by: 41; Year: 2018
Towards efficient vision transformer inference: A first study of transformers on mobile devices
X Wang, LL Zhang, Y Wang, M Yang
Proceedings of the 23rd annual international workshop on mobile computing …, 2022
Cited by: 39; Year: 2022
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute
N Zheng, B Lin, Q Zhang, L Ma, Y Yang, F Yang, Y Wang, M Yang, L Zhou
16th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2022
Cited by: 37; Year: 2022
Towards optimal outsourcing of service function chain across multiple clouds
H Chen, S Xu, X Wang, Y Zhao, K Li, Y Wang, W Wang
2016 IEEE International Conference on Communications (ICC), 1-7, 2016
Cited by: 21; Year: 2016
Adaptive page migration policy with huge pages in tiered memory systems
T Heo, Y Wang, W Cui, J Huh, L Zhang
IEEE Transactions on Computers 71 (1), 53-68, 2020
Cited by: 19; Year: 2020
LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup
X Tang, Y Wang, T Cao, LL Zhang, Q Chen, D Cai, Y Liu, M Yang
MobiCom '23: Proceedings of the 29th Annual International Conference on …, 2023
Cited by: 11*; Year: 2023
Romou: Rapidly Generate High-Performance Tensor Kernels for Mobile GPUs
R Liang, T Cao, J Wen, M Wang, Y Wang, J Zou, Y Liu
MobiCom '22: Proceedings of the 28th Annual International Conference on …, 2022
Cited by: 11; Year: 2022
FlexMon: A flexible and fine-grained traffic monitor for programmable networks
Y Wang, X Wang, S Xu, C He, Y Zhang, J Ren, S Yu
Journal of Network and Computer Applications 201, 103344, 2022
Cited by: 8; Year: 2022
Low complexity hierarchical scheduling for diverse datacenter jobs
C You, Y Wang, S Xu, L Luo, MH Chen
IEEE Communications Letters 23 (1), 48-51, 2018
Cited by: 3; Year: 2018
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization
C Li, Z Zhou, Y Wang, F Yang, T Cao, M Yang, Y Liang, G Sun
International Conference on Architectural Support for Programming Languages …, 2024
Cited by: 1; Year: 2024
NeuralMon: Graph neural network for flow measurement allocation
Y Wang, X Wang, Z Huang, C He, Y Zhang, S Xu
2021 IEEE Global Communications Conference (GLOBECOM), 1-6, 2021
Cited by: 1; Year: 2021
Toward CXL-Native Memory Tiering via Device-Side Profiling
Z Zhou, Y Chen, T Zhang, Y Wang, R Shu, S Xu, P Cheng, L Qu, Y Xiong, ...
arXiv preprint arXiv:2403.18702, 2024
Year: 2024