Archon

Browse and search harvested arxiv metadata.

1613351 results (page 1 of 64535)

Zero-Shot Long-Horizon Dexterous Manipulation via Multi-View 3D-Grounded VLM Reasoning

2606.19340 cs.RO 2026-06-17 PDF (arxiv)

Jisoo Kim, Sangwon Baik, Taeksoo Kim, Sungjoo Kim, Junyoung Lee, Mingi Choi, Hanbyul Joo

We present a zero-shot framework for long-horizon dexterous manipulation that grounds language instructions into executable 3D task plans from calibrated multi-view RGB images. Rather than training an end-to-end policy, our system uses a vision-language model (VLM) to produce reference-frame task grounding and primitive-level 2D keypoints, then lifts them into 3D via multi-view fusion. This liftin…

Open PDF (arxiv)
Native Active Perception as Reasoning for Omni-Modal Understanding

2606.19341 cs.CV 2026-06-17 PDF (arxiv)

Zhenghao Xing, Ruiyang Xu, Yuxuan Wang, Jinzheng He, Ziyang Ma, Qize Yang, Yunfei Chu, Jin Xu, Junyang Lin, Chi-Wing Fu, Pheng-Ann Heng

Passive models for long video understanding typically rely on a "watch-it-all" paradigm, processing frames uniformly regardless of query difficulty, causing computational cost to grow with video duration. Although interactive frameworks have emerged, they often rely on global pre-scanning, and their context cost still scales with video length. We propose OmniAgent, the first native omni-modal agen…

Open PDF (arxiv)
Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games

2606.19338 cs.CV 2026-06-17 PDF (arxiv)

Shengyuan Ding, Xilin Wei, Xinyu Fang, Haodong Duan, Dahua Lin, Jiaqi Wang, Yuhang Zang

Deploying multimodal foundation models as closed-loop policies increasingly requires conditioning actions on observations that are no longer visible. However, existing benchmarks either expose the full state, conflate hidden-state reconstruction with other agent skills, or test recall only after an episode has ended. We introduce RNG-Bench (Reconstructive Non-Markov Games), a benchmark suite desig…

Open PDF (arxiv)
Constraints on Cosmic Strings from the Curl-Mode CMB Lensing Power Spectrum measured by ACT DR6

2606.19337 astro-ph.CO 2026-06-17 PDF (arxiv)

A. I. Lonappan, K. Ramesh, T. Namikawa, F. J. Qu, B. Keating

A network of cosmic strings is one of the few well-motivated cosmological sources of vector and tensor metric perturbations on the largest observable scales. Such perturbations imprint a characteristic curl component in the deflection angle of cosmic microwave background (CMB) photons that, unlike the scalar lensing potential, vanishes for adiabatic density fluctuations at linear order. We exploit…

Open PDF (arxiv)
Learning User Simulators with Turing Rewards

2606.19336 cs.CL 2026-06-17 PDF (arxiv)

Yingshan Susan Wang, Cedegao E. Zhang, Linlu Qiu, Zexue He, Pengyuan Li, Alex Pentland, Roger P. Levy, Yoon Kim

Learning to simulate human users in interactive settings could advance the training of agent assistants, evaluation of personalization systems, research in the social sciences, and more. Existing approaches generally do so by training a large language model (LLM) to match a single ground truth response, either by maximizing the log probability or by using a similarity reward. We instead propose {T…

Open PDF (arxiv)
Fine-scale downflows above flare ribbons captured by Solar Orbiter/EUI

2606.19335 astro-ph.SR 2026-06-17 PDF (arxiv)

Zheng Sun, Alexander G. M. Pietrow, Malcolm K. Druett, Hui Tian, Julián D. Alvarado-Gómez, Song Tan, Alexander Warmuth, Jiasheng Wang, Yuhang Gao, Zhenyong Hou, Alexandros Stork

In solar flares, flare ribbons map chromospheric footpoints where flare energy deposition occurs. These locations are associated with field aligned energy transport from the corona that results from energy liberated during magnetic reconnection. Recent chromospheric observations in the H$α$ and H$β$ bands have revealed fine-scale downflow structures above flare ribbons, referred to as riblets. In …

Open PDF (arxiv)
Freeing the Law with LOCUS: A Local Ordinance Corpus for the United States

2606.19334 cs.CL 2026-06-17 PDF (arxiv)

Denis Peskoff, Joe Barrow, Christopher Vu, Diag Davenport

Progress in legal AI increasingly depends on access to authoritative legal text at scale. Yet one of the most consequential layers of American law remains largely absent from existing machine-readable corpora: local ordinances. Local codes govern zoning, housing, business licensing, public health, noise, animal control, and many other domains of everyday regulation, but they are fragmented across …

Open PDF (arxiv)
Do as I Do: Dexterous Manipulation Data from Everyday Human Videos

2606.19333 cs.RO 2026-06-17 PDF (arxiv)

Bhawna Paliwal, Haritheja Etukuru, William Liang, Pieter Abbeel, Nur Muhammad Mahi Shafiullah, Jitendra Malik

How can we scalably generate data for robotic manipulation, especially on human-like platforms such as dexterous multi-fingered hands? Learning from human videos has recently emerged as a likely answer to this question. However, difficulties in estimating hand-object interaction and crossing the human-to-robot embodiment gap have hindered the adoption of abundant monocular RGB-only human videos as…

Open PDF (arxiv)
The Chandra-Gaia Catalog of Counterparts: Resolving ambiguous Gaia matches to X-ray sources in the Chandra Source Catalog using Machine Learning

2606.19329 astro-ph.IM 2026-06-17 PDF (arxiv)

V. Samuel Pérez-Díaz, Vinay L. Kashyap, Joshua D. Ingram, David Fouhey, Juan Rafael Martínez-Galarza, Pavlos Protopapas, Jeremy J. Drake, Dong-Woo Kim, Cecilia Garraffo

We present a framework to cross-match sources from the Chandra Source Catalog (CSC v2.1) with optical sources from Gaia Data Release 3. Unlike purely spatial approaches, we use source properties such as magnitudes, colors, and distances to identify true counterparts, detect chance coincidences, and resolve ambiguities when multiple plausible candidates exist. We define a training set of high-confi…

Open PDF (arxiv)
UBP2: Uncertainty-Balanced Preference Planning for Efficient Preference-based Reinforcement Learning

2606.19328 cs.LG 2026-06-17 PDF (arxiv)

Mohamed Nabail, Leo Cheng, Jingmin Wang, Nicholas Rhinehart

Preference-based RL provides an approach to learning reward models from pairwise comparisons of behaviors, bypassing the need for explicit reward design. However, existing methods typically rely on passive data collection and suffer from poor sample efficiency, especially during the early stages of learning. We introduce a model-based approach that actively directs exploration by jointly reasoning…

Open PDF (arxiv)
Rethinking Reward Supervision: Rubric-Conditioned Self-Distillation

2606.19327 cs.AI 2026-06-17 PDF (arxiv)

Siyi Gu, Jialin Chen, Sophia Zhou, Arman Cohan, Rex Ying

Post-training of reasoning language models is commonly driven by supervised distillation and reinforcement learning with verifiable rewards. Distillation often relies on chain-of-thought annotations that are expensive to obtain and may themselves be noisy, incomplete, or partially incorrect; even when the final solution is correct, an imperfect rationale can interfere with learning. Reinforcement …

Open PDF (arxiv)
Reference-Driven Multi-Speaker Audio Scene Generation from In-the-Wild Priors

2606.19325 cs.SD 2026-06-17 PDF (arxiv)

Michael Finkelson, Daniel Segal, Eitan Richardson, Shahar Armon, Nani Goldring, Poriya Panet, Nir Zabari, Benjamin Brazowski, Or Patashnik, Yoav HaCohen

Existing multi-speaker dialogue systems bind speakers to utterances through structured supervision: per-turn tags, multi-stream transcriptions, or learnable speaker embeddings. These systems operate within speech-only pipelines that produce clean vocal sequences without the ambient texture of real conversations. We take a different approach. Our method, ScenA, conditions a text-to-audio flow-match…

Open PDF (arxiv)
Mean-Payoff-Parity and Lifting Strategies from MDPs to 2-Player Stochastic Games

2606.19324 cs.GT 2026-06-17 PDF (arxiv)

Mohan Dantam, Richard Mayr

We consider the strategy complexity (i.e., memory and randomization) of optimal strategies in turn-based 2-player zero-sum stochastic games. Results in [Gimbert,Kelmendi:2023] show how to lift optimal memoryless strategies for shift-invariant inverse-submixing objectives from MDPs to 2-player stochastic games with an exponential increase in the number of memory modes. We show the corresponding low…

Open PDF (arxiv)
GRMHD and GRRT Simulations of Black Hole Accretion: Flares, Precession, and Complex Spacetimes

2606.19320 astro-ph.HE 2026-06-17 PDF (arxiv)

Hong-Xuan Jiang

This dissertation studies the electromagnetic signatures of accreting supermassive black holes using general relativistic magnetohydrodynamic simulations and covariant radiative-transfer calculations. It develops a unified numerical framework for modeling black-hole accretion, jet launching, flaring activity, and multi-band variability in Kerr, non-Kerr, and binary black-hole spacetimes. For iso…

Open PDF (arxiv)
Data Intelligence Agents: Interpreting, Modeling, and Querying Enterprise Data via Autonomous Coding Agents

2606.19319 cs.MA 2026-06-17 PDF (arxiv)

Anoushka Vyas, Aarushi Dhanuka, Sina Khoshfetrat Pakazad, Henrik Ohlsson

Production data integration is bottlenecked by repeated, lossy handoffs between data owners, engineers, and analysts who must collaboratively discover, structure, and query enterprise data. We present Data Intelligence Agents (DIA), a system of three agents (Data Interpreter, Schema Creator, and Query Generator) that compresses this workflow by treating autonomous coding agents (ACAs) as a first-c…

Open PDF (arxiv)
Explaining Attention with Program Synthesis

2606.19317 cs.LG 2026-06-17 PDF (arxiv)

Amiri Hayes, Belinda Li, Jacob Andreas

A longstanding goal of research on interpretable deep learning is to replace opaque neural computations with human-meaningful symbolic descriptions. In this paper, we propose an approach for approximating the behavior of components of deep networks with executable programs. We focus on attention heads in transformer language models. For a given head, we first compute its associated attention matri…

Open PDF (arxiv)
NeuMesh++: Towards Versatile and Efficient Volumetric Editing with Disentangled Neural Mesh-based Implicit Field

2606.19316 cs.CV 2026-06-17 PDF (arxiv)

Chong Bao, Yuan Li, Bangbang Yang, Yujun Shen, Hujun Bao, Zhaopeng Cui, Yinda Zhang, Guofeng Zhang

Recently neural implicit rendering techniques have evolved rapidly and demonstrated significant advantages in novel view synthesis and 3D scene reconstruction. However, existing neural rendering methods for editing purposes offer limited functionalities, e.g., rigid transformation and category-specific editing. In this paper, we present a novel mesh-based representation by encoding the neural radi…

Open PDF (arxiv)
Diffusion-Proof: Recipe for Formal Theorem Proving Beyond Auto-Regressive Generation

2606.19315 cs.LG 2026-06-17 PDF (arxiv)

Ruida Wang, Rui Pan, Pengcheng Wang, Shizhe Diao, Tong Zhang

Enhancing the formal math reasoning capabilities of Large Language Models (LLMs) has become a key focus in both mathematical and computer science communities in recent years. While significant progress has been made in using state-of-the-art Auto-Regressive (AR) LLMs for formal theorem proving, these models suffer from inherent limitations. Their next-token prediction generation methods may yield …

Open PDF (arxiv)
Modeling Branches for Active Manipulation using Iterative Parameter Estimation

2606.19314 cs.RO 2026-06-17 PDF (arxiv)

Madhav Rijal, Rashik Shrestha, Trevor Smith, Yu Gu

This study presents a method for modeling diverse plant branches by iteratively estimating material parameters to support delicate branch manipulation. Branch manipulation is necessary in agricultural robotics for plant repositioning, stabilizing, and clearing visual obstructions in dense foliage. The proposed method builds a tetrahedral branch model from point-cloud data and simulates its behavio…

Open PDF (arxiv)
QDSV: A Semantic Problem Representation and Multi-Backend Execution Framework for Quantum-Oriented Computation

2606.19312 cs.PL 2026-06-17 PDF (arxiv)

Jaime Alexander Jimenez Lozano, Sebastian Jimenez Giraldo

Predicate-based computation over state spaces separates a problem specification from the backend that realizes it. Building on the model introduced in arXiv:2606.15027, this paper studies QDSV as a semantic, multi-backend execution framework for quantum-oriented computation. We describe how QDSV, QIntent, and Qruba connect declarative problem intent to a structured semantic representation, reali…

Open PDF (arxiv)
Enhancing Decision-Making with Large Language Models through Multi-Agent Fictitious Play

2606.19308 cs.CL 2026-06-17 PDF (arxiv)

Leyang Shen, Yang Zhang, Xiaoyan Zhao, Chun Kai Ling, Tat-Seng Chua

Large language model (LLM)-based multi-agent systems (MAS) have demonstrated great potential in solving tasks with execution complexity, by distributing subtasks across cooperative agents. However, this divide-and-conquer paradigm falls short on decision-making tasks that are also prevalent in the real world. These tasks require simultaneous reasoning from the stances of all involved stakeholders …

Open PDF (arxiv)
Observability and Consistency Analysis for Visual-Inertial Navigation with Anchored Feature Parameterizations

2606.19307 cs.RO 2026-06-17 PDF (arxiv)

Mitchell Cohen, Vassili Korotkine, James Richard Forbes

This paper presents an analysis of the observability and consistency properties of filtering-based visual-inertial navigation systems (VINS) that utilize anchored feature representations. The unobservable subspace of VINS with anchored landmark parameterizations is shown to be independent of the estimated landmark state, which leads to improved estimator consistency properties without any addition…

Open PDF (arxiv)
Secret key-distribution over networks with node-based adversarial errors

2606.19305 cs.IT 2026-06-17 PDF (arxiv)

Reza Sayyari, Michael Langberg

We study the multiple key-cast problem in network coding under active node-based adversaries. In multiple key-cast, a source generates independent secret keys to be securely and reliably delivered to designated terminal subsets. The network adversary can observe \(\ell_o\) nodes, inject additive or overwrite errors into \(\ell_e\) nodes, and simultaneously observe and corrupt \(\ell_{oe}\) nodes, …

Open PDF (arxiv)
P-K-GCN: Physics-augmented Koopman-enhanced Graph Convolutional Network for Deep Spatiotemporal Super-resolution

2606.19303 cs.LG 2026-06-17 PDF (arxiv)

Xizhuo, Zhang, Zekai Wang, Fei Liu, Bing Yao

High-fidelity simulation of spatiotemporal dynamics is computationally prohibitive, necessitating efficient super-resolution techniques to reconstruct high-resolution data from coarse-grained inputs. Traditional data-driven methods often lack physical constraints, and simple physics-informed learning struggles with irregular spatial geometries and intricately evolving temporal dynamics. To tackle …

Open PDF (arxiv)
Optimal scenario design for climate emulation

2606.19302 physics.ao-ph 2026-06-17 PDF (arxiv)

Christopher B. Womack, Shahine Bouabid, Andrei Sokolov, Popat Salunke, Glenn Flierl, Sebastian D. Eastham, Noelle E. Selin

As deep learning for physical systems continues to grow in popularity, efforts to improve generalizability have primarily focused on designing architectures that embed physical constraints. However, for machine-learning surrogate climate models (emulators), we show that the low structural diversity in existing scenarios commonly used to generate training data places a ceiling on predictive skill. …

Open PDF (arxiv)