Breaking the Correlation Plateau: On the Optimization and Capacity Limits of Attention-Based Regressors
arXiv:2602.17898v1 Announce Type: new Abstract: Attention-based regression models are often trained by jointly optimizing Mean Squared Error (MSE) loss and Pearson correlation coefficient (PCC) loss, …
Jingquan Yan, Yuwei Miao, Peiran Yu, Junzhou Huang
9 views