Academic

Academic · 1 min

The Illusion of Stochasticity in LLMs

arXiv:2604.06543v1 Announce Type: new Abstract: In this work, we demonstrate that reliable stochastic sampling is a fundamental yet unfulfilled requirement for Large Language Models (LLMs) …

Xiangming Gu, Soham De, Michalis Titsias, Larisa Markeeva, Petar Veli\v{c}kovi\'c, Razvan Pascanu

3 views Apr 9

Academic · 1 min

Does a Global Perspective Help Prune Sparse MoEs Elegantly?

arXiv:2604.06542v1 Announce Type: new Abstract: Empirical scaling laws for language models have encouraged the development of ever-larger LLMs, despite their growing computational and memory costs. …

Zeliang Zhang, Nikhil Ghosh, Jiani Liu, Bin Yu, Xiaodong Liu

6 views Apr 9

Academic · 1 min

Fine-tuning Whisper for Pashto ASR: strategies and scale

arXiv:2604.06507v1 Announce Type: new Abstract: Pashto is absent from Whisper's pre-training corpus despite being one of CommonVoice's largest language collections, leaving off-the-shelf models unusable: all …

Hanif Rahman

5 views Apr 9

Academic · 1 min

MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts

arXiv:2604.06505v1 Announce Type: new Abstract: Large language models (LLMs) are widely explored for reasoning-intensive research tasks, yet resources for testing whether they can infer scientific …

Weiyue Li, Ruizhi Qian, Yi Li, Yongce Li, Yunfan Long, Jiahui Cai, Yan Luo, Mengyu Wang

15 views Apr 9

Academic · 1 min

ValueGround: Evaluating Culture-Conditioned Visual Value Grounding in MLLMs

arXiv:2604.06484v1 Announce Type: new Abstract: Cultural values are expressed not only through language but also through visual scenes and everyday social practices. Yet existing evaluations …

Zhipin Wang, Christoph Leiter, Christian Frey, Mohamed Hesham Ibrahim Abdalla, Josif Grabocka, Steffen Eger

12 views Apr 9

Academic · 1 min

DataSTORM: Deep Research on Large-Scale Databases using Exploratory Data Analysis and Data Storytelling

arXiv:2604.06474v1 Announce Type: new Abstract: Deep research with Large Language Model (LLM) agents is emerging as a powerful paradigm for multi-step information discovery, synthesis, and …

Shicheng Liu, Yucheng Jiang, Sajid Farook, Camila Nicollier Sanchez, David Fernando Castro Pena, Monica S. Lam

15 views Apr 9

Academic · 1 min

Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

arXiv:2604.06465v1 Announce Type: new Abstract: Reasoning models have demonstrated remarkable capabilities in solving complex problems by leveraging long chains of thought. However, this more deliberate …

Mario Iacobelli, Adrian Robert Minut, Tommaso Mencattini, Donato Crisostomi, Andrea Santilli, Iacopo Masi, Emanuele Rodol\`a

11 views Apr 9

Academic · 1 min

Context-Aware Dialectal Arabic Machine Translation with Interactive Region and Register Selection

arXiv:2604.06456v1 Announce Type: new Abstract: Current Machine Translation (MT) systems for Arabic often struggle to account for dialectal diversity, frequently homogenizing dialectal inputs into Modern …

Afroza Nowshin, Prithweeraj Acharjee Porag, Haziq Jeelani, Fayeq Jeelani Syed

3 views Apr 9

Academic · 1 min

Learning to Interrupt in Language-based Multi-agent Communication

arXiv:2604.06452v1 Announce Type: new Abstract: Multi-agent systems using large language models (LLMs) have demonstrated impressive capabilities across various domains. However, current agent communication suffers from …

Danqing Wang, Da Yin, Ruta Desai, Lei Li, Asli Celikyilmaz, Ansong Ni

10 views Apr 9

Academic · 1 min

Team Fusion@ SU@ BC8 SympTEMIST track: transformer-based approach for symptom recognition and linking

arXiv:2604.06424v1 Announce Type: new Abstract: This paper presents a transformer-based approach to solving the SympTEMIST named entity recognition (NER) and entity linking (EL) tasks. For …

Georgi Grazhdanski, Sylvia Vassileva, Ivan Koychev, Svetla Boytcheva

3 views Apr 9

Academic · 1 min

When to Call an Apple Red: Humans Follow Introspective Rules, VLMs Don't

arXiv:2604.06422v1 Announce Type: new Abstract: Understanding when Vision-Language Models (VLMs) will behave unexpectedly, whether models can reliably predict their own behavior, and if models adhere …

Jonathan Nemitz, Carsten Eickhoff, Junyi Jessy Li, Kyle Mahowald, Michal Golovanevsky, William Rudman

10 views Apr 9

Academic · 1 min

State-of-the-Art Arabic Language Modeling with Sparse MoE Fine-Tuning and Chain-of-Thought Distillation

arXiv:2604.06421v1 Announce Type: new Abstract: This paper introduces Arabic-DeepSeek-R1, an application-driven open-source Arabic LLM that leverages a sparse MoE backbone to address the digital equity …

Navan Preet Singh, Anurag Garikipati, Ahmed Abulkhair, Jyani Akshay Jagdishbhai, Atul Yaduvanshi, Amarendra Chaudhary, Madalina Ciobanu, Qingqing Mao, Ritankar Das

7 views Apr 9

The Illusion of Stochasticity in LLMs

Does a Global Perspective Help Prune Sparse MoEs Elegantly?

Fine-tuning Whisper for Pashto ASR: strategies and scale

MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts

ValueGround: Evaluating Culture-Conditioned Visual Value Grounding in MLLMs

DataSTORM: Deep Research on Large-Scale Databases using Exploratory Data Analysis and Data Storytelling

Multi-objective Evolutionary Merging Enables Efficient Reasoning Models

Context-Aware Dialectal Arabic Machine Translation with Interactive Region and Register Selection

Learning to Interrupt in Language-based Multi-agent Communication

Team Fusion@ SU@ BC8 SympTEMIST track: transformer-based approach for symptom recognition and linking

When to Call an Apple Red: Humans Follow Introspective Rules, VLMs Don't

State-of-the-Art Arabic Language Modeling with Sparse MoE Fine-Tuning and Chain-of-Thought Distillation

JCG, PC

HSOLLC Co., Ltd.