https://arxiv.org/abs/2505.22013
Abstract page for arXiv paper 2505.22013: Overlap-Adaptive Hybrid Speaker Diarization and ASR-Aware Observation Addition for MISP 2025 Challenge
speaker diarizationoverlapadaptivehybridasr
https://arxiv.org/abs/2508.06372?utm_source=chatgpt.com
Abstract page for arXiv paper 2508.06372: SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models
speaker diarizationendversatile
https://www.sri.com/publication/computer-vision-pubs/multi-modal-data-analytics-pubs/on-the-applicability-of-speaker-diarization-to-audio-indexing-of-non-speech-and-mixed-non-speech-speech-video/
This paper explores how speaker diarization can be adapted to automatically identify low-level sound concepts and how these concepts can be used for audio...
speaker diarizationaudio indexingapplicability
https://deepai.org/publication/turn-to-diarize-online-speaker-diarization-constrained-by-transformer-transducer-speaker-turn-detection
09/23/21 - In this paper, we present a novel speaker diarization system for streaming on-device applications. In this system, we use a transf...
speaker diarizationturnonlineconstrainedtransformer
https://deepai.org/publication/using-active-speaker-faces-for-diarization-in-tv-shows
03/30/22 - Speaker diarization is one of the critical components of computational media intelligence as it enables a character-level analysis...
active speakertv showsusingfacesdiarization