Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Not All Pretraining are Created Equal: Threshold Tuning and Class Weighting for Imbalanced Polarization Tasks …

arXiv:2603.23534v1 Announce Type: new Abstract: This paper describes my submission to the Polarization Shared Task at SemEval-2025, which addresses polarization detection and classification in social …

Abass Oguntade

61 views Mar 26

Academic · 1 min

Revisiting Real-Time Digging-In Effects: No Evidence from NP/Z Garden-Paths

arXiv:2603.23624v1 Announce Type: new Abstract: Digging-in effects, where disambiguation difficulty increases with longer ambiguous regions, have been cited as evidence for self-organized sentence processing, in …

Amani Maina-Kilaas, Roger Levy

82 views Mar 26

Academic · 1 min

Swiss-Bench SBP-002: A Frontier Model Comparison on Swiss Legal and Regulatory Tasks

arXiv:2603.23646v1 Announce Type: new Abstract: While recent work has benchmarked large language models on Swiss legal translation (Niklaus et al., 2025) and academic legal reasoning …

Fatih Uenal

69 views Mar 26

Academic · 1 min

Probing Ethical Framework Representations in Large Language Models: Structure, Entanglement, and Methodological Challenges

arXiv:2603.23659v1 Announce Type: new Abstract: When large language models make ethical judgments, do their internal representations distinguish between normative frameworks, or collapse ethics into a …

Weilun Xu, Alexander Rusnak, Frederic Kaplan

69 views Mar 26

Academic · 1 min

PLACID: Privacy-preserving Large language models for Acronym Clinical Inference and Disambiguation

arXiv:2603.23678v1 Announce Type: new Abstract: Large Language Models (LLMs) offer transformative solutions across many domains, but healthcare integration is hindered by strict data privacy constraints. …

Manjushree B. Aithal, Ph. D., Alexander Kotz, James Mitchell, Ph. D

73 views Mar 26

Academic · 1 min

The Diminishing Returns of Early-Exit Decoding in Modern LLMs

arXiv:2603.23701v1 Announce Type: new Abstract: In Large Language Model (LLM) inference, early-exit refers to stopping computation at an intermediate layer once the prediction is sufficiently …

Rui Wei, Rui Du, Hanfei Yu, Devesh Tiwari, Jian Li, Zhaozhuo Xu, Hao Wang

78 views Mar 26

Academic · 1 min

IslamicMMLU: A Benchmark for Evaluating LLMs on Islamic Knowledge

arXiv:2603.23750v1 Announce Type: new Abstract: Large language models are increasingly consulted for Islamic knowledge, yet no comprehensive benchmark evaluates their performance across core Islamic disciplines. …

Ali Abdelaal, Mohammed Nader Al Haffar, Mahmoud Fawzi, Walid Magdy

72 views Mar 26

Academic · 1 min

Infrequent Child-Directed Speech Is Bursty and May Draw Infant Vocalizations

arXiv:2603.23797v1 Announce Type: new Abstract: Children in many parts of the world hear relatively little speech directed to them, yet still reach major language development …

Margaret Cychosz, Adriana Weisleder

58 views Mar 26

Academic · 1 min

Perturbation: A simple and efficient adversarial tracer for representation learning in language models

arXiv:2603.23821v1 Announce Type: new Abstract: Linguistic representation learning in deep neural language models (LMs) has been studied for decades, for both practical and theoretical reasons. …

Joshua Rozner, Cory Shain

56 views Mar 26

Academic · 1 min

PoliticsBench: Benchmarking Political Values in Large Language Models with Multi-Turn Roleplay

arXiv:2603.23841v1 Announce Type: new Abstract: While Large Language Models (LLMs) are increasingly used as primary sources of information, their potential for political bias may impact …

Rohan Khetan, Ashna Khetan

47 views Mar 26

Academic · 1 min

Language Model Planners do not Scale, but do Formalizers?

arXiv:2603.23844v1 Announce Type: new Abstract: Recent work shows overwhelming evidence that LLMs, even those trained to scale their reasoning trace, perform unsatisfactorily when solving planning …

Owen Jiang, Cassie Huang, Ashish Sabharwal, Li Zhang

49 views Mar 26

Academic · 1 min

BeliefShift: Benchmarking Temporal Belief Consistency and Opinion Drift in LLM Agents

arXiv:2603.23848v1 Announce Type: new Abstract: LLMs are increasingly used as long-running conversational agents, yet every major benchmark evaluating their memory treats user information as static …

Praveen Kumar Myakala, Manan Agrawal, Rahul Manche

170 views Mar 26

← Previous

57 58 59 60 61

Academic

Not All Pretraining are Created Equal: Threshold Tuning and Class Weighting for Imbalanced Polarization Tasks …

Revisiting Real-Time Digging-In Effects: No Evidence from NP/Z Garden-Paths

Swiss-Bench SBP-002: A Frontier Model Comparison on Swiss Legal and Regulatory Tasks

Probing Ethical Framework Representations in Large Language Models: Structure, Entanglement, and Methodological Challenges

PLACID: Privacy-preserving Large language models for Acronym Clinical Inference and Disambiguation

The Diminishing Returns of Early-Exit Decoding in Modern LLMs

IslamicMMLU: A Benchmark for Evaluating LLMs on Islamic Knowledge

Infrequent Child-Directed Speech Is Bursty and May Draw Infant Vocalizations

Perturbation: A simple and efficient adversarial tracer for representation learning in language models

PoliticsBench: Benchmarking Political Values in Large Language Models with Multi-Turn Roleplay

Language Model Planners do not Scale, but do Formalizers?

BeliefShift: Benchmarking Temporal Belief Consistency and Opinion Drift in LLM Agents

JCG, PC

HSOLLC Co., Ltd.