Yang Yi, Song Hui, Liu Jia. Speaker Diarization and Localization Technology Research Based on NIST Evaluation[J]. Journal of Electronics & Information Technology, 2011, 33(5): 1234-1237. doi: 10.3724/SP.J.1146.2010.00977
Citation:
Yang Yi, Song Hui, Liu Jia. Speaker Diarization and Localization Technology Research Based on NIST Evaluation[J]. Journal of Electronics & Information Technology, 2011, 33(5): 1234-1237. doi: 10.3724/SP.J.1146.2010.00977
Yang Yi, Song Hui, Liu Jia. Speaker Diarization and Localization Technology Research Based on NIST Evaluation[J]. Journal of Electronics & Information Technology, 2011, 33(5): 1234-1237. doi: 10.3724/SP.J.1146.2010.00977
Citation:
Yang Yi, Song Hui, Liu Jia. Speaker Diarization and Localization Technology Research Based on NIST Evaluation[J]. Journal of Electronics & Information Technology, 2011, 33(5): 1234-1237. doi: 10.3724/SP.J.1146.2010.00977
This paper builds one speaker diarization and localization speech processing system based on Multiple Distance Microphone (MDM) for NIST evaluation, and proposes a modified clustering algorithm based on time delay estimation, which can decrease the complexity of speaker diarization and improve the correct rate under the guarantee of stable performance. A new time delay matrix structure is proposed, which can acquire multiple speakers direction angle. It is the real speech data collected under the standard session environment to validate the algorithms. The correct rate of proposed speaker diarization algorithm is similar with other speaker diarization system existed; Location algorithm direction angle error is less than 3. The results show that under appropriate conditions, the MDM system can be a better input device applied to multiple dialogue scenes.
Khne M, Togneri R, and Nordholm S. Robust source localization in reverberant environments based on weighted fuzzy clustering [J].IEEE Signal Processing Letters.2009, 16(2):85-88[6]Knapp C H and Carter G C. The generalized correlation method for estimation of time delay [J].IEEE Transactions on Acoustics, Speech and Signal Processing.1976, 24(4):320-327[8]杨芳, 湛燕, 田学东, 郭宝兰. 使用遗传算法实现K-means聚类算法的K值选择[J].微机发展.2003, 13(1):25-29