Articles

Academic · 1 min

Improving Sparse Memory Finetuning

arXiv:2604.05248v1. Abstract: Large Language Models (LLMs) are typically static after training, yet real-world applications require continual adaptation to new knowledge without degrading …

Satyam Goyal, Anirudh Kanchi, Garv Shah, Prakhar Gupta
6 views
Academic · 1 min

Controllable Image Generation with Composed Parallel Token Prediction

arXiv:2604.05730v1. Abstract: Conditional discrete generative models struggle to faithfully compose multiple input conditions. To address this, we derive a theoretically-grounded formulation for …

Jamie Stirling, Noura Al-Moubayed, Chris G. Willcocks, Hubert P. H. Shum
29 views