Towards a Diagnostic and Predictive Evaluation Methodology for Sequence Labeling Tasks
arXiv:2602.12759v1 Announce Type: new Abstract: Standard evaluation in NLP typically indicates that system A is better on average than system B, but it provides little …
Elena Alvarez-Mellado, Julio Gonzalo
3 views