aims to solve the" who spoke when" problem. Although the standard diarization systems can
achieve satisfactory results in various scenarios, they are composed of several
independently-optimized modules and cannot deal with the overlapped speech. In this
paper, we propose a novel speaker diarization method: Region Proposal Network based
Speaker Diarization (RPNSD). In this method, a neural network generates overlapped …