Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders

Lam, P., Pham, L., Nguyen, T., Ngo, D., Pham, T., Nguyen, T., Nguyen, L. K., & Schindler, A. (2024). Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders. arXiv preprint. Retrieved from https://arxiv.org/abs/2407.01963.

Download