https://www.sri.com/publication/computer-vision-pubs/multi-modal-data-analytics-pubs/on-the-applicability-of-speaker-diarization-to-audio-indexing-of-non-speech-and-mixed-non-speech-speech-video/
This paper explores how speaker diarization can be adapted to automatically identify low-level sound concepts and how these concepts can be used for audio...
speaker diarizationaudio indexingapplicability