Artificial Indifference — Issue 095

§00 · APOD

NGC 3310: A Starburst Spiral Galaxy

The party is still going on in spiral galaxy NGC 3310. Roughly 100 million years ago, NGC 3310 likely collided with a smaller galaxy causing the large spiral galaxy to light up with a tremendous burst of star formation. The changing gravity during the collision created density waves that compressed existing clouds of gas and triggered the star-forming party. The featured image from the Gemini North Telescope shows the galaxy in great detail, color-coded so that pink highlights gas while white and blue highlight stars. Some of the star clusters in the galaxy are quite young, indicating that...

2026-04-05 · © AAO ITSO Office, Gemini Obs./AURA & T. A. Rector (U. Alaska Anchorage) · NASA APOD ↗

§06 · arXiv Dispatch

Research Filed Today

Preprints submitted to arXiv on April 5, 2026. Science before peer review.

cs.CV EventHub: Data Factory for Generalizable Event-Based Stereo Networks without Active Sensors

We propose EventHub, a novel framework for training deep-event stereo networks without ground truth annotations from costly active sensors, relying instead on standard color images. From these images, we derive either proxy annotations and proxy events through state-of-the-art no...

Luca Bartolomei, Fabio Tosi, Matteo Poggi et al. (+2)

cs.CV ActionParty: Multi-Subject Action Binding in Generative Video Games

Recent advances in video diffusion have enabled the development of "world models" capable of simulating interactive environments. However, these models are largely restricted to single-agent settings, failing to control multiple agents simultaneously in a scene. In this work, we ...

Alexander Pondaven, Ziyi Wu, Igor Gilitschenski et al. (+4)

cs.CV Generative World Renderer

Scaling generative inverse and forward rendering to real-world scenarios is bottlenecked by the limited realism and temporal coherence of existing synthetic datasets. To bridge this persistent domain gap, we introduce a large-scale, dynamic dataset curated from visually complex A...

Zheng-Hui Huang, Zhixiang Wang, Jiaming Tan et al. (+6)

cs.CV Modulate-and-Map: Crossmodal Feature Mapping with Cross-View Modulation for 3D Anomaly Detection

We present ModMap, a natively multiview and multimodal framework for 3D anomaly detection and segmentation. Unlike existing methods that process views independently, our method draws inspiration from the crossmodal feature mapping paradigm to learn to map features across both mod...

Alex Costanzino, Pierluigi Zama Ramirez, Giuseppe Lisanti et al. (+1)

cs.CV Steerable Visual Representations

Pretrained Vision Transformers (ViTs) such as DINOv2 and MAE provide generic image features that can be applied to a variety of downstream tasks such as retrieval, classification, and segmentation. However, such representations tend to focus on the most salient visual cues in the...

Jona Ruthardt, Manu Gaur, Deva Ramanan et al. (+2)

cs.CL Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation

Language models (LMs) are increasingly extended with new learnable vocabulary tokens for domain-specific tasks, such as Semantic-ID tokens in generative recommendation. The standard practice initializes these new tokens as the mean of existing vocabulary embeddings, then relies o...

Daiwei Chen, Zhoutong Fu, Chengming Jiang et al. (+12)

cs.CV Beyond Referring Expressions: Scenario Comprehension Visual Grounding

Existing visual grounding benchmarks primarily evaluate alignment between image regions and literal referring expressions, where models can often succeed by matching a prominent named category. We explore a complementary and more challenging setting of scenario-based visual groun...

Ruozhen He, Nisarg A. Shah, Qihua Dong et al. (+3)

cs.LG Batched Contextual Reinforcement: A Task-Scaling Law for Efficient Reasoning

Large Language Models employing Chain-of-Thought reasoning achieve strong performance but suffer from excessive token consumption that inflates inference costs. Existing efficiency methods such as explicit length penalties, difficulty estimators, or multi-stage curricula either d...

Bangji Yang, Hongbo Ma, Jiajun Fan et al. (+1)

Source: arXiv.org · Cornell University