Academic

Academic · 1 min

Self-Directed Task Identification

arXiv:2604.02430v1 Announce Type: new Abstract: In this work, we present a novel machine learning framework called Self-Directed Task Identification (SDTI), which enables models to autonomously …

Timothy Gould, Sidike Paheding

16 views Apr 6

Academic · 1 min

Dynamical structure of vanishing gradient and overfitting in multi-layer perceptrons

arXiv:2604.02393v1 Announce Type: new Abstract: Vanishing gradient and overfitting are two of the most extensively studied problems in the literature about machine learning. However, they …

Alex Alì Maleknia, Yuzuru Sato

27 views Apr 6

Academic · 1 min

YC Bench: a Live Benchmark for Forecasting Startup Outperformance in Y Combinator Batches

arXiv:2604.02378v1 Announce Type: new Abstract: Forecasting startup success is notoriously difficult, partly because meaningful outcomes, such as exits, large funding rounds, and sustained revenue growth, …

Mostapha Benhenda

29 views Apr 6

Academic · 1 min

From Broad Exploration to Stable Synthesis: Entropy-Guided Optimization for Autoregressive Image Generation

arXiv:2604.02355v1 Announce Type: new Abstract: Combining Chain-of-Thought (CoT) with Reinforcement Learning (RL) improves text-to-image (T2I) generation, yet the underlying interaction between CoT's exploration and RL's …

Han Song, Yucheng Zhou, Jianbing Shen, Yu Cheng

9 views Apr 6

Academic · 1 min

Modeling and Controlling Deployment Reliability under Temporal Distribution Shift

arXiv:2604.02351v1 Announce Type: new Abstract: Machine learning models deployed in non-stationary environments are exposed to temporal distribution shift, which can erode predictive reliability over time. …

Naimur Rahman, Naazreen Tabassum

13 views Apr 6

Academic · 1 min

Contextual Intelligence: The Next Leap for Reinforcement Learning

arXiv:2604.02348v1 Announce Type: new Abstract: Reinforcement learning (RL) has produced spectacular results in games, robotics, and continuous control. Yet, despite these successes, learned policies often …

André Biedenkapp

7 views Apr 6

Academic · 1 min

FTimeXer: Frequency-aware Time-series Transformer with Exogenous variables for Robust Carbon Footprint Forecasting

arXiv:2604.02347v1 Announce Type: new Abstract: Accurate and up-to-date forecasting of the power grid's carbon footprint is crucial for effective product carbon footprint (PCF) accounting and …

Qingzhong Li, Yue Hu, Zhou Long, Qingchang Ma, Hui Ma, Jinhai Sa

6 views Apr 6

Academic · 1 min

Characterizing WebGPU Dispatch Overhead for LLM Inference Across Four GPU Vendors, Three Backends, and Three …

arXiv:2604.02344v1 Announce Type: new Abstract: WebGPU's security-focused design imposes per-operation validation that compounds across the many small dispatches in neural network inference, yet the true …

Jędrzej Maczan

23 views Apr 6

Academic · 1 min

Homophily-aware Supervised Contrastive Counterfactual Augmented Fair Graph Neural Network

arXiv:2604.02342v1 Announce Type: new Abstract: In recent years, Graph Neural Networks (GNNs) have achieved remarkable success in tasks such as node classification, link prediction, and …

Mahdi Tavassoli Kejani, Fadi Dornaika, Charlotte Laclau, Jean-Michel Loubes

5 views Apr 6

Academic · 1 min

Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models

arXiv:2604.02340v1 Announce Type: new Abstract: Recent advances in masked diffusion language models (MDLMs) narrow the quality gap to autoregressive LMs, but their sampling remains expensive …

Ivan Sedykh, Nikita Sorokin, Valentin Malykh

4 views Apr 6

Academic · 1 min

SIEVE: Sample-Efficient Parametric Learning from Natural Language

arXiv:2604.02339v1 Announce Type: new Abstract: Natural language context, such as instructions, knowledge, or feedback, contains rich signal for adapting language models. While in-context learning provides adaptation via …

Parth Asawa, Alexandros G. Dimakis, Matei Zaharia

5 views Apr 6

Academic · 1 min

LiME: Lightweight Mixture of Experts for Efficient Multimodal Multi-task Learning

arXiv:2604.02338v1 Announce Type: new Abstract: MoE-PEFT methods combine Mixture of Experts with parameter-efficient fine-tuning for multi-task adaptation, but require separate adapters per expert causing trainable …

Md Kowsher, Haris Mansoor, Nusrat Jahan Prottasha, Ozlem Garibay, Victor Zhu, Zhengping Ji, Chen Chen

15 views Apr 6
