RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

C Jin, Z Zhang, X Jiang, F Liu, X Liu, X Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Retrieval-Augmented Generation (RAG) has shown significant improvements in various
natural language processing tasks by integrating the strengths of large language models …

[PDF][PDF] Towards software-defined FPGA acceleration for big data analytics

FA Lu - 2024 - summit.sfu.ca
With the ever-increasing amount of user data produced worldwide, today's big data ana-
lytics engines are constantly under pressure to keep up with the rapidly increasing demand …