Ke Hong - Academic Profile - Scholar Search

Cited by

	Total	Since 2019
Citations	89	89
h-index	6	6
i10-index	4	4

Citations per year: 2022 (4), 2023 (16), 2024 (69)

Number of open-access publications

4 articles

1 article

Available articles

Unavailable articles

Based on funders' open-access mandates

Ke Hong

Tsinghua University

Verified email at mails.tsinghua.edu.cn

Efficient computing, GPU acceleration, sparse computing, ML system


Title	Authors	Venue	Cited by	Year
FlashDecoding++: Faster large language model inference on GPUs	K Hong, G Dai, J Xu, Q Mao, X Li, J Liu, K Chen, H Dong, Y Wang	arXiv preprint arXiv:2311.01282, 2023	21	2023
A learning-based AOA estimation method for device-free localization	K Hong, T Wang, J Liu, Y Wang, Y Shen	IEEE Communications Letters 26 (6), 1264-1267, 2022	16	2022
A survey on efficient inference for large language models	Z Zhou, X Ning, K Hong, T Fu, J Xu, S Li, Y Lou, L Wang, Z Yuan, X Li, ...	arXiv preprint arXiv:2404.14294, 2024	12	2024
TorchSparse++: Efficient training and inference framework for sparse convolution on GPUs	H Tang, S Yang, Z Liu, K Hong, Z Yu, X Li, G Dai, Y Wang, S Han	Proceedings of the 56th Annual IEEE/ACM International Symposium on …, 2023	10	2023
TorchSparse++: Efficient point cloud engine	H Tang, S Yang, Z Liu, K Hong, Z Yu, X Li, G Dai, Y Wang, S Han	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	8	2023
Exploiting hardware utilization and adaptive dataflow for efficient sparse convolution in 3D point clouds	K Hong, Z Yu, G Dai, X Yang, Y Lian, N Xu, Y Wang	Proceedings of Machine Learning and Systems 5, 428-441, 2023	6	2023
LLM-MQ: Mixed-precision quantization for efficient LLM deployment	S Li, X Ning, K Hong, T Liu, L Wang, X Li, K Zhong, G Dai, H Yang, ...	The Efficient Natural Language and Speech Processing Workshop with NeurIPS 9, 2023	6	2023
Ada3D: Exploiting the spatial redundancy with adaptive inference for efficient 3D object detection	T Zhao, X Ning, K Hong, Z Qiu, P Lu, Y Zhao, L Zhang, L Zhou, G Dai, ...	Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	5	2023
An efficient accelerator for point-based and voxel-based point cloud neural networks	X Yang, T Fu, G Dai, S Zeng, K Zhong, K Hong, Y Wang	2023 60th ACM/IEEE Design Automation Conference (DAC), 1-6, 2023	3	2023
FlashDecoding++: Faster Large Language Model Inference with Asynchronization, Flat GEMM Optimization, and Heuristics	K Hong, G Dai, J Xu, Q Mao, X Li, J Liu, Y Dong, Y Wang	Proceedings of Machine Learning and Systems 6, 148-161, 2024	1	2024
FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Algebra in Machine Learning	K Zhong, Z Zhu, G Dai, H Wang, X Yang, H Zhang, J Si, Q Mao, S Zeng, ...	Proceedings of the 29th ACM International Conference on Architectural …, 2024	1	2024
A Point Transformer Accelerator with Fine-Grained Pipelines and Distribution-Aware Dynamic FPS	Y Lian, X Yang, K Hong, Y Wang, G Dai, N Xu	2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD), 1-9, 2023		2023

Articles 1–12