A survey on the memory mechanism of large language model based agents

Z Zhang, X Bo, C Ma, R Li, X Chen, Q Dai, J Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language model (LLM) based agents have recently attracted much attention from the
research and industry communities. Compared with original LLMs, LLM-based agents are …

GraphInsight: Unlocking insights in large language models for graph structure understanding

Y Cao, S Han, Z Gao, Z Ding, X Xie… - arXiv preprint arXiv …, 2024 - arxiv.org
Although Large Language Models (LLMs) have demonstrated potential in processing
graphs, they struggle with comprehending graphical structure information through prompts …

Timer-XL: Long-Context Transformers for Unified Time Series Forecasting

Y Liu, G Qin, X Huang, J Wang, M Long - arXiv preprint arXiv:2410.04803, 2024 - arxiv.org
We present Timer-XL, a generative Transformer for unified time series forecasting. To
uniformly predict 1D and 2D time series, we generalize next token prediction, predominantly …
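A rough sketch of what "next token prediction" over time series can look like: the series is cut into fixed-length patches that play the role of tokens, and a causal Transformer predicts each next patch from the preceding ones. The PatchForecaster below is a hypothetical, illustrative toy, not Timer-XL's actual architecture.

    import torch
    import torch.nn as nn

    class PatchForecaster(nn.Module):
        """Toy autoregressive forecaster: a 1D series is split into fixed-length
        patches ("tokens") and a causal Transformer predicts the next patch from
        all previous ones, i.e. next-token prediction on patches (illustrative only)."""
        def __init__(self, patch_len=16, d_model=128, n_heads=4, n_layers=2):
            super().__init__()
            self.patch_len = patch_len
            self.embed = nn.Linear(patch_len, d_model)
            layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
            self.encoder = nn.TransformerEncoder(layer, n_layers)
            self.head = nn.Linear(d_model, patch_len)

        def forward(self, series):
            # series: (batch, length), with length divisible by patch_len
            b, t = series.shape
            patches = series.view(b, t // self.patch_len, self.patch_len)
            h = self.embed(patches)
            # causal mask so each patch attends only to earlier patches
            mask = nn.Transformer.generate_square_subsequent_mask(h.size(1))
            h = self.encoder(h, mask=mask)
            return self.head(h)  # at each position: prediction of the following patch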

InfiniPot: Infinite context processing on memory-constrained LLMs

M Kim, K Shim, J Choi, S Chang - arXiv preprint arXiv:2410.01518, 2024 - arxiv.org
Handling long input contexts remains a significant challenge for Large Language Models
(LLMs), particularly in resource-constrained environments such as mobile devices. Our work …

Self-Updatable Large Language Models with Parameter Integration

Y Wang, X Liu, X Chen, S O'Brien, J Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite significant advancements in large language models (LLMs), the rapid and frequent
integration of small-scale experiences, such as interactions with surrounding objects …

PCToolkit: A Unified Plug-and-Play Prompt Compression Toolkit of Large Language Models

J Li, Y Lan, L Wang, H Wang - arXiv preprint arXiv:2403.17411, 2024 - arxiv.org
Prompt compression is an innovative method for efficiently condensing input prompts while
preserving essential information. To facilitate quick-start services, user-friendly interfaces …

On the token distance modeling ability of higher RoPE attention dimension

X Hong, C Jiang, B Qi, F Meng, M Yu, B Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
Length extrapolation algorithms based on Rotary position embedding (RoPE) have shown
promising results in extending the context length of language models. However …
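For orientation, rotary position embedding itself (the building block the snippet refers to, not this paper's contribution) rotates pairs of query/key dimensions by position-dependent angles, so attention scores depend on relative token distance; higher dimensions rotate with lower frequencies and therefore encode longer distances. A minimal sketch, with an illustrative function name:

    import torch

    def apply_rope(x, base=10000.0):
        """Apply rotary position embedding to x of shape (seq_len, n_heads, head_dim).
        Dimension pair i is rotated by angle pos * base**(-2i/head_dim); small i
        rotates quickly (short-range), large i slowly (long-range)."""
        seq_len, _, head_dim = x.shape
        half = head_dim // 2
        freqs = base ** (-torch.arange(half, dtype=torch.float32) / half)          # (half,)
        angles = torch.arange(seq_len, dtype=torch.float32)[:, None] * freqs       # (seq_len, half)
        cos, sin = angles.cos()[:, None, :], angles.sin()[:, None, :]
        x1, x2 = x[..., :half], x[..., half:]
        return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)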

HyQE: Ranking Contexts with Hypothetical Query Embeddings

W Zhou, J Zhang, H Hasson, A Singh, W Li - arXiv preprint arXiv …, 2024 - arxiv.org
In retrieval-augmented systems, context ranking techniques are commonly employed to
reorder the retrieved contexts based on their relevance to a user query. A standard …
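The standard reranking step the snippet mentions can be as simple as scoring each retrieved context by embedding similarity to the query; the sketch below shows only that baseline (it does not reproduce HyQE's hypothetical-query method), and the function and variable names are illustrative.

    import numpy as np

    def rerank_by_similarity(query_emb, context_embs):
        """Baseline context reranking: cosine similarity between the query embedding
        (1D vector) and each context embedding (rows of a 2D array), sorted descending."""
        q = query_emb / np.linalg.norm(query_emb)
        c = context_embs / np.linalg.norm(context_embs, axis=1, keepdims=True)
        scores = c @ q
        order = np.argsort(-scores)
        return order, scores[order]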

Towards LifeSpan Cognitive Systems

Y Wang, C Han, T Wu, X He, W Zhou, N Sadeq… - arXiv preprint arXiv …, 2024 - arxiv.org
Building a human-like system that continuously interacts with complex environments--
whether simulated digital worlds or human society--presents several key challenges. Central …

Large Language Models for Constructing and Optimizing Machine Learning Workflows: A Survey

Y Gu, H You, J Cao, M Yu - arXiv preprint arXiv:2411.10478, 2024 - arxiv.org
Building effective machine learning (ML) workflows to address complex tasks is a primary
focus of the Automatic ML (AutoML) community and a critical step toward achieving artificial …