SwissBERT: The multilingual language model for Switzerland

J Vamvas, J Graën, R Sennrich - arXiv preprint arXiv:2303.13310, 2023 - arxiv.org
We present SwissBERT, a masked language model created specifically for processing Switzerland-related text. SwissBERT is a pre-trained model that we adapted to news articles written in the national languages of Switzerland: German, French, Italian, and Romansh. We evaluate SwissBERT on natural language understanding tasks related to Switzerland and find that it tends to outperform previous models on these tasks, especially when processing contemporary news and/or Romansh Grischun. Since SwissBERT uses language adapters, it may be extended to Swiss German dialects in future work. The model and our open-source code are publicly released at https://github.com/ZurichNLP/swissbert.