Skip to main content

Academic

Academic

Academic · 1 min

Avoid What You Know: Divergent Trajectory Balance for GFlowNets

arXiv:2602.17827v1 Announce Type: new Abstract: Generative Flow Networks (GFlowNets) are a flexible family of amortized samplers trained to generate discrete and compositional objects with probability …

Pedro Dall'Antonia, Tiago da Silva, Daniel Csillag, Salem Lahlou, Diego Mesquita
8 views
Academic · 1 min

MePoly: Max Entropy Polynomial Policy Optimization

arXiv:2602.17832v1 Announce Type: new Abstract: Stochastic Optimal Control provides a unified mathematical framework for solving complex decision-making problems, encompassing paradigms such as maximum entropy reinforcement …

Hang Liu, Sangli Teng, Maani Ghaffari
6 views
Academic · 1 min

Dual Length Codes for Lossless Compression of BFloat16

arXiv:2602.17849v1 Announce Type: new Abstract: Training and serving Large Language Models (LLMs) relies heavily on parallelization and collective operations, which are frequently bottlenecked by network …

Aditya Agrawal, Albert Magyar, Hiteshwar Eswaraiah, Patrick Sheridan, Pradeep Janedula, Ravi Krishnan Venkatesan, Krishna Nair, Ravi Iyer
10 views
Academic · 1 min

JAX-Privacy: A library for differentially private machine learning

arXiv:2602.17861v1 Announce Type: new Abstract: JAX-Privacy is a library designed to simplify the deployment of robust and performant mechanisms for differentially private machine learning. Guided …

Ryan McKenna, Galen Andrew, Borja Balle, Vadym Doroshenko, Arun Ganesh, Weiwei Kong, Alex Kurakin, Brendan McMahan, Mikhail Pravilov
3 views