Who Spoke What When? Evaluating Spoken Language Models for Conversational ASR with Semantic and Overlap-Aware …
arXiv:2603.22709v1 Announce Type: new Abstract: Conversational automatic speech recognition remains challenging due to overlapping speech, far-field noise, and varying speaker counts. While recent LLM-based systems …
Naohiro Tawara, Samuele Cornell, Alexander Polok, Marc Delcroix, Luk\'a\v{s} Burget, Shinji Watanabe
73 views