Immigration Law

LOW Academic International

CAMEL: Confidence-Gated Reflection for Reward Modeling

arXiv:2602.20670v1 Announce Type: new Abstract: Reward models play a fundamental role in aligning large language models with human preferences. Existing methods predominantly follow two paradigms: scalar discriminative preference models, which are efficient but lack interpretability, and generative judging models, which...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

The Art of Efficient Reasoning: Data, Reward, and Optimization

arXiv:2602.20945v1 Announce Type: new Abstract: Large Language Models (LLMs) consistently benefit from scaled Chain-of-Thought (CoT) reasoning, but also suffer from heavy computational overhead. To address this issue, efficient reasoning aims to incentivize short yet accurate thinking trajectories, typically through reward...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Linear Reasoning vs. Proof by Cases: Obstacles for Large Language Models in FOL Problem Solving

arXiv:2602.20973v1 Announce Type: new Abstract: To comprehensively evaluate the mathematical reasoning capabilities of Large Language Models (LLMs), researchers have introduced abundant mathematical reasoning datasets. However, most existing datasets primarily focus on linear reasoning, neglecting other parts such as proof by...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning

arXiv:2602.21103v1 Announce Type: new Abstract: Advanced reasoning typically requires Chain-of-Thought prompting, which is accurate but incurs prohibitive latency and substantial test-time inference costs. The standard alternative, fine-tuning smaller models, often sacrifices interpretability while introducing significant resource and operational overhead. To...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Protein Language Models Diverge from Natural Language: Comparative Analysis and Improved Inference

arXiv:2602.20449v1 Announce Type: cross Abstract: Modern Protein Language Models (PLMs) apply transformer-based model architectures from natural language processing to biological sequences, predicting a variety of protein functions and properties. However, protein language has key differences from natural language, such as...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

The Truthfulness Spectrum Hypothesis

arXiv:2602.20273v1 Announce Type: new Abstract: Large language models (LLMs) have been reported to linearly encode truthfulness, yet recent work questions this finding's generality. We reconcile these views with the truthfulness spectrum hypothesis: the representational space contains directions ranging from broadly...

1 min 1 month, 3 weeks ago

tps

LOW Academic International

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

arXiv:2602.20309v1 Announce Type: new Abstract: Vision-language-action (VLA) models unify perception, language, and control for embodied agents but face significant challenges in practical deployment due to rapidly increasing compute and memory demands, especially as models scale to longer horizons and larger...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

cc-Shapley: Measuring Multivariate Feature Importance Needs Causal Context

arXiv:2602.20396v1 Announce Type: new Abstract: Explainable artificial intelligence promises to yield insights into relevant features, thereby enabling humans to examine and scrutinize machine learning models or even facilitating scientific discovery. Considering the widespread technique of Shapley values, we find that...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Benchmarking GNN Models on Molecular Regression Tasks with CKA-Based Representation Analysis

arXiv:2602.20573v1 Announce Type: new Abstract: Molecules are commonly represented as SMILES strings, which can be readily converted to fixed-size molecular fingerprints. These fingerprints serve as feature vectors to train ML/DL models for molecular property prediction tasks in the field of...

1 min 1 month, 3 weeks ago

ead

LOW News International

Gushwork bets on AI search for customer leads — and early results are emerging

Gushwork has raised $9 million in a seed round led by SIG and Lightspeed. The startup has seen early customer traction from AI search tools like ChatGPT.

1 min 1 month, 3 weeks ago

ead

LOW News International

The White House wants AI companies to cover rate hikes. Most have already said they would.

Many hyperscalers have already made public commitments to cover electricity cost increases.

1 min 1 month, 3 weeks ago

ead

LOW News International

Have hard-won scaling lessons to share? Take the stage at TechCrunch Founder Summit 2026

Apply to speak at TechCrunch Founder Summit 2026 by April 17 for a chance to lead a roundtable or breakout session for 1,000 founders and investors. If you’ve built, backed, or operated inside high-growth startups, your experience could shape how...

1 min 1 month, 3 weeks ago

ead

LOW News International

3 days left: Save up to $680 on your TechCrunch Disrupt 2026 ticket

Just 3 days left to save up to $680 on your TechCrunch Disrupt 2026 ticket. Offer ends on Friday, February 27 at 11:59 p.m. PT. Don't miss unparalleled, curated networking and valuable insights from 250+ tech leaders, and discover 300+...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

IAPO: Information-Aware Policy Optimization for Token-Efficient Reasoning

arXiv:2602.19049v1 Announce Type: new Abstract: Large language models increasingly rely on long chains of thought to improve accuracy, yet such gains come with substantial inference-time costs. We revisit token-efficient post-training and argue that existing sequence-level reward-shaping methods offer limited control...

1 min 1 month, 3 weeks ago

tps

LOW Academic International

TriTopic: Tri-Modal Graph-Based Topic Modeling with Iterative Refinement and Archetypes

arXiv:2602.19079v1 Announce Type: new Abstract: Topic modeling extracts latent themes from large text collections, but leading approaches like BERTopic face critical limitations: stochastic instability, loss of lexical precision ("Embedding Blur"), and reliance on a single data perspective. We present TriTopic,...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Astra: Activation-Space Tail-Eigenvector Low-Rank Adaptation of Large Language Models

arXiv:2602.19111v1 Announce Type: new Abstract: Parameter-Efficient Fine-Tuning (PEFT) methods, especially LoRA, are widely used for adapting pre-trained models to downstream tasks due to their computational and storage efficiency. However, in the context of LoRA and its variants, the potential of...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

AgenticRAGTracer: A Hop-Aware Benchmark for Diagnosing Multi-Step Retrieval Reasoning in Agentic RAG

arXiv:2602.19127v1 Announce Type: new Abstract: With the rapid advancement of agent-based methods in recent years, Agentic RAG has undoubtedly become an important research direction. Multi-hop reasoning, which requires models to engage in deliberate thinking and multi-step interaction, serves as a...

1 min 1 month, 3 weeks ago

tps

LOW Academic International

Facet-Level Persona Control by Trait-Activated Routing with Contrastive SAE for Role-Playing LLMs

arXiv:2602.19157v1 Announce Type: new Abstract: Personality control in Role-Playing Agents (RPAs) is commonly achieved via training-free methods that inject persona descriptions and memory through prompts or retrieval-augmented generation, or via supervised fine-tuning (SFT) on persona-specific corpora. While SFT can be...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Learning to Reason for Multi-Step Retrieval of Personal Context in Personalized Question Answering

arXiv:2602.19317v1 Announce Type: new Abstract: Personalization in Question Answering (QA) requires answers that are both accurate and aligned with users' background, preferences, and historical context. Existing state-of-the-art methods primarily rely on retrieval-augmented generation (RAG) solutions that construct personal context by...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Anatomy of Agentic Memory: Taxonomy and Empirical Analysis of Evaluation and System Limitations

arXiv:2602.19320v1 Announce Type: new Abstract: Agentic memory systems enable large language model (LLM) agents to maintain state across long interactions, supporting long-horizon reasoning and personalization beyond fixed context windows. Despite rapid architectural development, the empirical foundations of these systems remain...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Pyramid MoA: A Probabilistic Framework for Cost-Optimized Anytime Inference

arXiv:2602.19509v1 Announce Type: new Abstract: Large Language Models (LLMs) face a persistent trade-off between inference cost and reasoning capability. While "Oracle" models (e.g., Llama-3-70B) achieve state-of-the-art accuracy, they are prohibitively expensive for high-volume deployment. Smaller models (e.g., 8B parameters) are...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

arXiv:2602.19548v1 Announce Type: new Abstract: One of the first pre-processing steps for constructing web-scale LLM pretraining datasets involves extracting text from HTML. Despite the immense diversity of web content, existing open-source datasets predominantly apply a single fixed extractor to all...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Sculpting the Vector Space: Towards Efficient Multi-Vector Visual Document Retrieval via Prune-then-Merge Framework

arXiv:2602.19549v1 Announce Type: new Abstract: Visual Document Retrieval (VDR), which aims to retrieve relevant pages within vast corpora of visually-rich documents, is of significance in current multimodal retrieval applications. The state-of-the-art multi-vector paradigm excels in performance but suffers from prohibitive...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

KGHaluBench: A Knowledge Graph-Based Hallucination Benchmark for Evaluating the Breadth and Depth of LLM Knowledge

arXiv:2602.19643v1 Announce Type: new Abstract: Large Language Models (LLMs) possess a remarkable capacity to generate persuasive and intelligible language. However, coherence does not equate to truthfulness, as the responses often contain subtle hallucinations. Existing benchmarks are limited by static and...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Wide Open Gazes: Quantifying Visual Exploratory Behavior in Soccer with Pose Enhanced Positional Data

arXiv:2602.18519v1 Announce Type: new Abstract: Traditional approaches to measuring visual exploratory behavior in soccer rely on counting visual exploratory actions (VEAs) based on rapid head movements exceeding 125{\deg}/s, but this method suffer from player position bias (i.e., a focus on...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

The Geometry of Multi-Task Grokking: Transverse Instability, Superposition, and Weight Decay Phase Structure

arXiv:2602.18523v1 Announce Type: new Abstract: Grokking -- the abrupt transition from memorization to generalization long after near-zero training loss -- has been studied mainly in single-task settings. We extend geometric analysis to multi-task modular arithmetic, training shared-trunk Transformers on dual-task...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Audio-Visual Continual Test-Time Adaptation without Forgetting

arXiv:2602.18528v1 Announce Type: new Abstract: Audio-visual continual test-time adaptation involves continually adapting a source audio-visual model at test-time, to unlabeled non-stationary domains, where either or both modalities can be distributionally shifted, which hampers online cross-modal learning and eventually leads to...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Adaptive Time Series Reasoning via Segment Selection

arXiv:2602.18645v1 Announce Type: new Abstract: Time series reasoning tasks often start with a natural language question and require targeted analysis of a time series. Evidence may span the full series or appear in a few short intervals, so the model...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

Robustness of Deep ReLU Networks to Misclassification of High-Dimensional Data

arXiv:2602.18674v1 Announce Type: new Abstract: We present a theoretical study of the robustness of parameterized networks to random input perturbations. Specifically, we analyze local robustness at a given network input by quantifying the probability that a small additive random perturbation...

1 min 1 month, 3 weeks ago

ead

LOW Academic International

In-Context Planning with Latent Temporal Abstractions

arXiv:2602.18694v1 Announce Type: new Abstract: Planning-based reinforcement learning for continuous control is bottlenecked by two practical issues: planning at primitive time scales leads to prohibitive branching and long horizons, while real environments are frequently partially observable and exhibit regime shifts...

1 min 1 month, 3 weeks ago

ead

CAMEL: Confidence-Gated Reflection for Reward Modeling

The Art of Efficient Reasoning: Data, Reward, and Optimization

Linear Reasoning vs. Proof by Cases: Obstacles for Large Language Models in FOL Problem Solving

Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning

Protein Language Models Diverge from Natural Language: Comparative Analysis and Improved Inference

The Truthfulness Spectrum Hypothesis

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

cc-Shapley: Measuring Multivariate Feature Importance Needs Causal Context

Benchmarking GNN Models on Molecular Regression Tasks with CKA-Based Representation Analysis

Gushwork bets on AI search for customer leads — and early results are emerging

The White House wants AI companies to cover rate hikes. Most have already said they would.

Have hard-won scaling lessons to share? Take the stage at TechCrunch Founder Summit 2026

3 days left: Save up to $680 on your TechCrunch Disrupt 2026 ticket

IAPO: Information-Aware Policy Optimization for Token-Efficient Reasoning

TriTopic: Tri-Modal Graph-Based Topic Modeling with Iterative Refinement and Archetypes

Astra: Activation-Space Tail-Eigenvector Low-Rank Adaptation of Large Language Models

AgenticRAGTracer: A Hop-Aware Benchmark for Diagnosing Multi-Step Retrieval Reasoning in Agentic RAG

Facet-Level Persona Control by Trait-Activated Routing with Contrastive SAE for Role-Playing LLMs

Learning to Reason for Multi-Step Retrieval of Personal Context in Personalized Question Answering

Anatomy of Agentic Memory: Taxonomy and Empirical Analysis of Evaluation and System Limitations

Pyramid MoA: A Probabilistic Framework for Cost-Optimized Anytime Inference

Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

Sculpting the Vector Space: Towards Efficient Multi-Vector Visual Document Retrieval via Prune-then-Merge Framework

KGHaluBench: A Knowledge Graph-Based Hallucination Benchmark for Evaluating the Breadth and Depth of LLM Knowledge

Wide Open Gazes: Quantifying Visual Exploratory Behavior in Soccer with Pose Enhanced Positional Data

The Geometry of Multi-Task Grokking: Transverse Instability, Superposition, and Weight Decay Phase Structure

Audio-Visual Continual Test-Time Adaptation without Forgetting

Adaptive Time Series Reasoning via Segment Selection

Robustness of Deep ReLU Networks to Misclassification of High-Dimensional Data

In-Context Planning with Latent Temporal Abstractions

Impact Distribution

Related Practice Areas

JCG, PC

HSOLLC Co., Ltd.