Representation Collapse in Machine Translation Through the Lens of Angular Dispersion
arXiv:2602.17287v1. Abstract: Modern neural translation models based on the Transformer architecture are known for their high performance, particularly when trained on high-resource …
Evgeniia Tokarchuk, Maya K. Nachesa, Sergey Troshin, Vlad Niculae