Skip to main content

Tag: cs.CV

#cs.CV

Academic · 1 min

Scaling View Synthesis Transformers

arXiv:2602.21341v1 Announce Type: cross Abstract: Geometry-free view synthesis transformers have recently achieved state-of-the-art performance in Novel View Synthesis (NVS), outperforming traditional approaches that rely on …

Evan Kim, Hyunwoo Ryu, Thomas W. Mitchel, Vincent Sitzmann
4 views
Academic · 1 min

Towards single-shot coherent imaging via overlap-free ptychography

arXiv:2602.21361v1 Announce Type: cross Abstract: Ptychographic imaging at synchrotron and XFEL sources requires dense overlapping scans, limiting throughput and increasing dose. Extending coherent diffractive imaging …

Oliver Hoidn, Aashwin Mishra, Steven Henke, Albert Vong, Matthew Seaberg
4 views
Academic · 1 min

Towards Controllable Video Synthesis of Routine and Rare OR Events

arXiv:2602.21365v1 Announce Type: cross Abstract: Purpose: Curating large-scale datasets of operating room (OR) workflow, encompassing rare, safety-critical, or atypical events, remains operationally and ethically challenging. …

Dominik Schneider, Lalithkumar Seenivasan, Sampath Rapuri, Vishalroshan Anil, Aiza Maksutova, Yiqing Shen, Jan Emily Mangulabnan, Hao Ding, Jose L. Porras, Masaru Ishii, Mathias Unberath
4 views
Academic · 1 min

FedVG: Gradient-Guided Aggregation for Enhanced Federated Learning

arXiv:2602.21399v1 Announce Type: cross Abstract: Federated Learning (FL) enables collaborative model training across multiple clients without sharing their private data. However, data heterogeneity across clients …

Alina Devkota, Jacob Thrasher, Donald Adjeroh, Binod Bhattarai, Prashnna K. Gyawali
3 views
Academic · 1 min

OmniGAIA: Towards Native Omni-Modal AI Agents

arXiv:2602.22897v1 Announce Type: new Abstract: Human intelligence naturally intertwines omni-modal perception -- spanning vision, audio, and language -- with complex reasoning and tool usage to …

Xiaoxi Li, Wenxiang Jiao, Jiarui Jin, Shijian Wang, Guanting Dong, Jiajie Jin, Hao Wang, Yinuo Wang, Ji-Rong Wen, Yuan Lu, Zhicheng Dou
3 views
Academic · 1 min

Certified Circuits: Stability Guarantees for Mechanistic Circuits

arXiv:2602.22968v1 Announce Type: new Abstract: Understanding how neural networks arrive at their predictions is essential for debugging, auditing, and deployment. Mechanistic interpretability pursues this goal …

Alaa Anani, Tobias Lorenz, Bernt Schiele, Mario Fritz, Jonas Fischer
3 views
Academic · 1 min

Entropy-Controlled Flow Matching

arXiv:2602.22265v1 Announce Type: new Abstract: Modern vision generators transport a base distribution to data through time-indexed measures, implemented as deterministic flows (ODEs) or stochastic diffusions …

Chika Maduabuchi
5 views
Academic · 1 min

ECHOSAT: Estimating Canopy Height Over Space And Time

arXiv:2602.21421v1 Announce Type: cross Abstract: Forest monitoring is critical for climate change mitigation. However, existing global tree height maps provide only static snapshots and do …

Jan Pauls, Karsten Schr\"odter, Sven Ligensa, Martin Schwartz, Berkant Turan, Max Zimmer, Sassan Saatchi, Sebastian Pokutta, Philippe Ciais, Fabian Gieseke
5 views