Academic

Academic · 1 min

Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration

arXiv:2603.23889v1 Announce Type: new Abstract: When safety is formulated as a limit of cumulative cost, safe reinforcement learning (RL) aims to learn policies that maximize …

Guopeng Li, Matthijs T. J. Spaan, Julian F. P. Kooij

24 views Mar 26

Academic · 1 min

Optimal Variance-Dependent Regret Bounds for Infinite-Horizon MDPs

arXiv:2603.23926v1 Announce Type: new Abstract: Online reinforcement learning in infinite-horizon Markov decision processes (MDPs) remains less theoretically and algorithmically developed than its episodic counterpart, with …

Guy Zamir, Matthew Zurek, Yudong Chen

50 views Mar 26

Academic · 1 min

GRMLR: Knowledge-Enhanced Small-Data Learning for Deep-Sea Cold Seep Stage Inference

arXiv:2603.23961v1 Announce Type: new Abstract: Deep-sea cold seep stage assessment has traditionally relied on costly, high-risk manned submersible operations and visual surveys of macrofauna. Although …

Chenxu Zhou, Zelin Liu, Rui Cai, Houlin Gong, Yikang Yu, Jia Zeng, Yanru Pei, Liang Zhang, Weishu Zhao, Xiaofeng Gao

21 views Mar 26

Academic · 1 min

Wireless communication empowers online scheduling of partially-observable transportation multi-robot systems in a smart factory

arXiv:2603.23967v1 Announce Type: new Abstract: Achieving agile and reconfigurable production flows in smart factories depends on online multi-robot task assignment (MRTA), which requires online collision-free …

Yaxin Liao, Qimei Cui, Kwang-Cheng Chen, Xiong Li, Jinlian Chen, Xiyu Zhao, Xiaofeng Tao, Ping Zhang

10 views Mar 26

Academic · 1 min

Kirchhoff-Inspired Neural Networks for Evolving High-Order Perception

arXiv:2603.23977v1 Announce Type: new Abstract: Deep learning architectures are fundamentally inspired by neuroscience, particularly the structure of the brain's sensory pathways, and have achieved remarkable …

Tongfei Chen, Jingying Yang, Linlin Yang, Jinhu L\"u, David Doermann, Chunyu Xie, Long He, Tian Wang, Juan Zhang, Guodong Guo, Baochang Zhang

13 views Mar 26

Academic · 1 min

Transcending Classical Neural Network Boundaries: A Quantum-Classical Synergistic Paradigm for Seismic Data Processing

arXiv:2603.23984v1 Announce Type: new Abstract: In recent years, a number of neural-network (NN) methods have exhibited good performance in seismic data processing, such as denoising, …

Zhengyi Yuan, Xintong Dong, Xinyang Wang, Zheng Cong, Shiqi Dong

26 views Mar 26

Academic · 1 min

Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score

arXiv:2603.23985v1 Announce Type: new Abstract: Large language models (LLMs) have demonstrated remarkable capabilities, but their massive scale poses significant challenges for practical deployment. Structured pruning …

Jimyung Hong, Jaehyung Kim

7 views Mar 26

Academic · 1 min

Can we generate portable representations for clinical time series data using LLMs?

arXiv:2603.23987v1 Announce Type: new Abstract: Deploying clinical ML is slow and brittle: models that work at one hospital often degrade under distribution shifts at the …

Zongliang Ji, Yifei Sun, Andre Amaral, Anna Goldenberg, Rahul G. Krishnan

9 views Mar 26

Academic · 1 min

Understanding the Challenges in Iterative Generative Optimization with LLMs

arXiv:2603.23994v1 Announce Type: new Abstract: Generative optimization uses large language models (LLMs) to iteratively improve artifacts (such as code, workflows or prompts) using execution feedback. …

Allen Nie, Xavier Daull, Zhiyi Kuang, Abhinav Akkiraju, Anish Chaudhuri, Max Piasevoli, Ryan Rong, YuCheng Yuan, Prerit Choudhary, Shannon Xiao, Rasool Fakoor, Adith Swaminathan, Ching-An Cheng

15 views Mar 26

Academic · 1 min

Stochastic Dimension-Free Zeroth-Order Estimator for High-Dimensional and High-Order PINNs

arXiv:2603.24002v1 Announce Type: new Abstract: Physics-Informed Neural Networks (PINNs) for high-dimensional and high-order partial differential equations (PDEs) are primarily constrained by the $\mathcal{O}(d^k)$ spatial derivative …

Zhangyong Liang, Ji Zhang

13 views Mar 26

Academic · 1 min

i-IF-Learn: Iterative Feature Selection and Unsupervised Learning for High-Dimensional Complex Data

arXiv:2603.24025v1 Announce Type: new Abstract: Unsupervised learning of high-dimensional data is challenging due to irrelevant or noisy features obscuring underlying structures. It's common that only …

Chen Ma, Wanjie Wang, Shuhao Fan

20 views Mar 26

Academic · 1 min

Lagrangian Relaxation Score-based Generation for Mixed Integer linear Programming

arXiv:2603.24033v1 Announce Type: new Abstract: Predict-and-search (PaS) methods have shown promise for accelerating mixed-integer linear programming (MILP) solving. However, existing approaches typically assume variable independence …

Ruobing Wang, Xin Li, Yujie Fang, Mingzhong Wang

9 views Mar 26

Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration

Optimal Variance-Dependent Regret Bounds for Infinite-Horizon MDPs

GRMLR: Knowledge-Enhanced Small-Data Learning for Deep-Sea Cold Seep Stage Inference

Wireless communication empowers online scheduling of partially-observable transportation multi-robot systems in a smart factory

Kirchhoff-Inspired Neural Networks for Evolving High-Order Perception

Transcending Classical Neural Network Boundaries: A Quantum-Classical Synergistic Paradigm for Seismic Data Processing

Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score

Can we generate portable representations for clinical time series data using LLMs?

Understanding the Challenges in Iterative Generative Optimization with LLMs

Stochastic Dimension-Free Zeroth-Order Estimator for High-Dimensional and High-Order PINNs

i-IF-Learn: Iterative Feature Selection and Unsupervised Learning for High-Dimensional Complex Data

Lagrangian Relaxation Score-based Generation for Mixed Integer linear Programming

JCG, PC

HSOLLC Co., Ltd.