Y Zhang,
D Li, M Okumura - arXiv preprint arXiv:2408.01308, 2024 - arxiv.org
Learning token embeddings based on token co-occurrence statistics has proven effective for
both pre-training and fine-tuning in natural language processing. However, recent studies …