Uncertainty-Guided End-to-End Audio-Visual Speaker Diarization for Far-Field Recordings

Publication
Proceedings of the 31st ACM International Conference on Multimedia