it often faces the challenge of slow decoding due to the considerable length of speech
sequences. Recently, some research has turned to non-autoregressive (NAR) models to
expedite decoding, yet the translation quality typically lags behind autoregressive (AR)
models significantly. In this paper, we investigate the performance of CTC-based NAR
models in S2ST, as these models have shown impressive results in machine translation …