Archon

Browse and search harvested arxiv metadata.

1614491 results (page 14 of 64580)

Contraction-based Neural Control for Cooperative Aerial Payload Transportation with Variable-length Cables

2606.20127 eess.SY 2026-06-18 PDF (arxiv)

Yi Lok Lo, Longhao Qian, Hugh H. T. Liu

This paper presents a novel neural nonlinear control framework for a multi-drone slung payload system with variable-length cables and a rigid-body payload. The equations of motion are formulated into a decoupled structure, where the payload and cable length dynamics are governed by independent control channels, facilitating modularized controller design on reduced-order subsystems. A neural contro…

Open PDF (arxiv)
Advancing Threshold-Inception Modeling for Predictive Simulation of Ionic Wind Fan Performance

2606.20124 physics.comp-ph 2026-06-18 PDF (arxiv)

Siim Heering, Juri Volodin, Vootele Mets, Rasmus Talviste, Jüri Raud, Karl-Eerik Unt, Indrek Jõgi, Veronika Zadin

This study investigates the predictive capability of a threshold inception-based multiphysics modeling approach for ionic wind fans by direct comparison with experimental measurements. A wire-to-cylinder electroaerodynamic (EAD) fan with variable electrode spacing is used as a reference system to assess the model's ability to reproduce airflow characteristics, discharge current, and performance tr…

Open PDF (arxiv)
QPU-scale randomized benchmarking via Bell-pair injection

2606.20123 quant-ph 2026-06-18 PDF (arxiv)

Haripriya Pettugani, María Aguado-Yáñez, Astryd Park, Daniel Bultrini, James R. Wootton

Mirror randomized benchmarking (MRB) is an established technique that provides a global error metric at the scale of a whole QPU. To expand upon this we introduce Mirror Quantum Awesomeness (MQA), a hybrid protocol that adds a structured entangling layer to MRB circuits. This enables per-edge correlation dynamics to be tracked via mutual information while preserving the MRB infidelity estimate. Th…

Open PDF (arxiv)
ScaffoldAgent: Utility-Guided Dynamic Outline Optimization for Open-Ended Deep Research

2606.20122 cs.AI 2026-06-18 PDF (arxiv)

Zhibang Yang, Xinke Jiang, Yuzhen Xiao, Ruizhe Zhang, Yue Fang, XinFei Wan, Zhengxing Song, Yuxuan Liu, Yuheng Huang, Xu Chu, Junfeng Zhao, Yasha Wang

Open-ended deep research (OEDR) requires systems to acquire knowledge through multi-round retrieval and generate coherent long-form reports. The outline plays a central role as a structural scaffold that coordinates retrieval, evidence organization, and generation. However, existing methods either fix the outline before writing or refine it with local heuristics, leading to scaffold drift under co…

Open PDF (arxiv)
BARReL: a modern backend for Atelier B in Lean

2606.20121 cs.LO 2026-06-18 PDF (arxiv)

Ghilain Bergeron, Vincent Trélat

BARReL is a Lean 4 library bridging Atelier B, an industrial tool for the B method, and the Lean proof assistant by enabling users to conduct their formal B developments -- up to machine refinement and implementation -- interactively inside Lean, while retaining standard B syntax. B partial operators are carefully encoded by generating explicit well-definedness conditions, leveraging Lean's depend…

Open PDF (arxiv)
Dual-Agent Framework for Cross-Model Verified Translation of Natural-Language Protocols into Robotic Laboratory Platform

2606.20120 cs.RO 2026-06-18 PDF (arxiv)

Hyeonna Choi, Jung Yup Kim, Hyuneui Lim, Seunggyu Jeon

Biological experiment protocols are written in natural language, whereas automation systems rely on predefined control commands, creating a semantic gap that limits autonomous execution. Microplate-based automatic experiments are particularly challenging due to the need to simultaneously control well mapping, sample-reagent combinations, replicate placement, and parallel dispensing. This study pro…

Open PDF (arxiv)
Pose6DAug: Physically Plausible Multi-view Object Swapping for Robot Data Augmentation

2606.20118 cs.RO 2026-06-18 PDF (arxiv)

Jonghoon Lee, Seong Hyeon Park, Byungwoo Jeon, Minha Lee, Jinwoo Shin

Vision-language-action (VLA) policies have shown strong potential for general-purpose manipulation, yet they often fail on novel, out-of-distribution objects whose appearance or geometry deviates from the training distribution. The standard remedy is to collect multi-view teleoperation data for every failure case, but this scales poorly in both cost and time. We introduce Pose6DAug, a failure-driv…

Open PDF (arxiv)
When Calibration Fails the Vulnerable Hospital: Federated Conformal Risk Control via Risk-Curve Shrinkage

2606.20115 cs.LG 2026-06-18 PDF (arxiv)

Nafis Fuad Shahid

Conformal risk control (CRC) provides distribution-free guarantees on segmentation quality by calibrating a prediction-set threshold on held-out data. In federated deployments, the standard approach pools calibration scores across sites into a single threshold. We provide the first quantification, on real multi-institutional brain tumor data (FeTS-2022, 1,251 subjects, 20 institutions), showing th…

Open PDF (arxiv)
Community detection in small-sample ordinal regimes: A benchmarking framework for Delphi data

2606.20114 stat.ME 2026-06-18 PDF (arxiv)

Yuri Calleo, Simone Di Zio, Fabrizio Maturo

The statistical modeling of consensus in Delphi data faces a critical bottleneck: the high dimensionality of questionnaire items relative to the limited sample size of expert panels. This rank deficiency leads traditional latent variable models, such as Principal Component Analysis, to be structurally unstable and prone to overfitting. Addressing this methodological gap, this study proposes a tran…

Open PDF (arxiv)
When Does Streaming Tool Use Help? Characterizing Tool-Intent Stabilization in Streaming Retrieval-Augmented Generation

2606.20113 cs.CL 2026-06-18 PDF (arxiv)

Elroy Galbraith

Streaming Retrieval-Augmented Generation (Streaming RAG) reduces user-perceived latency by issuing tool queries in parallel with ongoing user input, before the utterance is complete. Reported gains are aggregate, yet the mechanism's benefit is fundamentally query-intrinsic: speculation can only help when the correct tool query becomes determinable before the user stops speaking or typing. We isola…

Open PDF (arxiv)
Pixel-Level Residual Diffusion Transformer: Scalable 3D CT Volume Generation

2606.20112 cs.CV 2026-06-18 PDF (arxiv)

Zhenkai Zhang, Markus Hiller, Krista A. Ehinger, Tom Drummond

Generating high-resolution 3D CT volumes with fine details remains challenging due to substantial computational demands and optimization difficulties inherent to existing generative models. In this paper, we propose the Pixel-Level Residual Diffusion Transformer (PRDiT), a scalable generative framework that synthesizes high-quality 3D medical volumes directly at voxel-level. PRDiT introduces a two…

Open PDF (arxiv)
Hybrid stars with hyperons: structure based on QCD sum rule coupling constants

2606.20111 nucl-th 2026-06-18 PDF (arxiv)

F. Moradi Jangal, H. R. Moshfegh, K. Azizi

We present a comprehensive study of hybrid stars composed of hadrons, leptons, and quarks within a relativistic mean-field framework. Using coupling constants derived from QCD sum rules (QCDSR), we first determine the bulk properties of nuclear matter and evaluate the single-particle potentials of nucleons and hyperons to constrain the hadronic sector. The equation of state (EOS) under beta equili…

Open PDF (arxiv)
FrozenDrive: Zero-Shot Text-Guided Driving Scene Generation and Data Augmentation with Parameter-Free Frozen Diffusion Model

2606.20110 cs.CV 2026-06-18 PDF (arxiv)

Yuhwan Jeong, Hyeonseong Kim, Daehyun We, Seonkyu Song, Jinnyeong Yang, Hyun-Kurl Jang, Youngho Yoon, Kuk-Jin Yoon

Synthetic data for autonomous driving is surging, powered by diffusion models that promise scalable scene generation. Yet key obstacles remain, as enforcing multi-view and temporal consistency often relies on backbone fine-tuning or added layers, which erodes pre-trained knowledge and weakens text alignment. Models also stay close to the training distribution, struggling under adverse weather and …

Open PDF (arxiv)
Regular Black Holes from Anisotropic Source with Hydrodynamic Equation of State

2606.20109 gr-qc 2026-06-18 PDF (arxiv)

Hassan Firouzjahi

We study regular black hole solutions sourced by an anisotropic energy momentum tensor. It is well known that the geometry of the interior of a spherically symmetric regular black hole approaches the dS metric. Having decomposed the energy momentum tensor into its isotropic and anisotropic components, we assume a hydrodynamic equation of state, $P= P(ρ)$, for the pressure, and look for spherically…

Open PDF (arxiv)
EFIQA: Explainable Fundus Image Quality Assessment via Anatomical Priors

2606.20108 cs.CV 2026-06-18 PDF (arxiv)

Pengwei Wang, José Morano, Qian Wan, Hrvoje Bogunović

Image quality control is vital for a wide range of downstream applications. Deep learning-based image quality assessment methods typically train classifiers on dataset-specific quality labels, inheriting two limitations: (1) generalization is tied to the labeling criteria of the training set and (2) these methods cannot provide spatial feedback on where the quality is degraded, lacking explainabil…

Open PDF (arxiv)
Quantile of Means: A Bonus-Free Ensemble Method for Minimax Optimal Reinforcement Learning

2606.20107 cs.LG 2026-06-18 PDF (arxiv)

Asaf Cassel, Aviv Rosenberg

Optimal Reinforcement Learning (RL) algorithms typically rely on carefully constructed count-based uncertainty estimates to drive exploration. Although theoretically sound, such estimates are hard to compute in practical settings and therefore offer limited insight for designing exploration heuristics. Meanwhile, ensembling has emerged as a practical approach, but remains without theoretical justi…

Open PDF (arxiv)
Personalized Keyword Spotting for User-Defined Keywords Leveraging Text-Independent Speaker Verification

2606.20106 eess.AS 2026-06-18 PDF (arxiv)

Ming-Hsiang Hu, Kuan-Tang Huang, Chien-Chun Wang, Hung-Shin Lee, Berlin Chen

User-defined keyword spotting (UD-KWS) enables zero-shot wake-word detection from text, but existing systems learn speaker-invariant representations that cannot reject impostors uttering the correct keyword. We address this dual zero-shot setting -- unseen keywords and unseen speakers -- with ZP-KWS, a lightweight framework combining a phoneme-supervised audio encoder with a GE2E-pretrained compac…

Open PDF (arxiv)
Can DFT-trained neural network potentials reproduce structure, solvation, and water-exchange properties in aqueous magnesium solutions?

2606.20105 physics.chem-ph 2026-06-18 PDF (arxiv)

Sebastian Falkner, Pablo Montero de Hijes, Christoph Dellago, Nadine Schwierz

Magnesium ions play an essential role in many biological processes but remain challenging to model in biomolecular simulations. Despite considerable scientific effort, classical force fields fail to simultaneously reproduce key structural, thermodynamic and kinetic solution properties, likely due to their inability to explicitly account for quantum many-body effects. Here, we develop and systemati…

Open PDF (arxiv)
Sensorimotor World Models: Perception for Action via Inverse Dynamics

2606.20104 cs.LG 2026-06-18 PDF (arxiv)

Petr Ivashkov, Randall Balestriero, Bernhard Schölkopf

Perception for action suggests that representations of the world should be shaped not by visual fidelity alone, but by their relevance for actions. At the same time, latent JEPA-style world models advocate learning compact predictive states from high-dimensional observations to facilitate the prediction of future states, but end-to-end training of these models is nontrivial because representations…

Open PDF (arxiv)
Geometry-Preserving in 3D Gaussian Splatting for LiDAR-Camera Extrinsic Calibration

2606.20103 cs.CV 2026-06-18 PDF (arxiv)

Kyoleen Kwak, Daeho Kim, Jeong Woon Lee, Hyoseok Hwang

Accurate LiDAR-camera calibration is essential for robust multi-modal perception. Targetless approaches avoid manual setup but remain limited by the scarcity of discriminative cross-modal features. Recent methods address this by reconstructing the scene within a differentiable model, enabling extrinsic optimization through dense photometric supervision. Among these, 3D Gaussian Splatting (3DGS) ha…

Open PDF (arxiv)
Artificial Intelligence as Game Changer in Cybersecurity: What We Learned in 2025-2026, and how this is relevant for Africa

2606.20102 cs.CY 2026-06-18 PDF (arxiv)

Mikael Alemu Gorsky

In 2025 and 2026, two events settled questions that had until then been speculative. In the first, a large language model executed the great majority of a state-aligned cyber-espionage campaign on its own, with human operators intervening at only a few decision points. In the second, the most capable cyber-relevant model was placed under a controlled-access program limited to a vetted set of Unite…

Open PDF (arxiv)
Hybrid Diffusion Transformer for Instruction-Guided Audio Editing via Rectified Flow

2606.20101 cs.SD 2026-06-18 PDF (arxiv)

Liting Gao, Yonggang Zhu, Yaru Chen, Dongyu Wang, Shubin Zhang, Zhenbo Li, Jean-Yves Guillemaut, Wenwu Wang

Audio editing aims to modify specific content in an existing audio clip according to a natural language instruction while preserving the remaining acoustic content. Despite the remarkable progress of diffusion models, existing training-based editing methods mainly rely on the local inductive biases and cross-attention interaction in convolutional U-Net backbones, which often hinder long-range sema…

Open PDF (arxiv)
WeGenBench: A Multidimensional Diagnostic Benchmark towards Text-to-Image Model Optimization

2606.20100 cs.CV 2026-06-18 PDF (arxiv)

Qian Liang, Xiaomin Li, Ying Zhang, Jia Xu, Lihao Ni, Hongrui Li, Jingjing Li, Jing Lyu, Chen Li

Recent text-to-image generation models have demonstrated remarkable capabilities in synthesizing highly realistic images from text inputs alone. Although existing benchmarks can evaluate the generation capabilities of various models to some extent, they struggle to comprehensively and accurately measure performance across multiple dimensions, often failing to reveal the inherent deficiencies of mo…

Open PDF (arxiv)
Site-Specific MIMO Channel Generation via Diffusion and Flow Matching: Fidelity, Efficiency, and Downstream Utility

2606.20098 cs.IT 2026-06-18 PDF (arxiv)

Sina Beyraghi, Masoud Sadeghian, Firdous Bin Ismail, Angel Lozano, Paul Almasan, Giovanni Geraci

This paper explores the use of generative models to synthesize high-quality, site-specific multiple-input multiple-output (MIMO) channel data, addressing the high cost of the extensive measurement campaigns required to acquire real-world data for AI-native wireless networks. Two location-conditioned generative paradigms are compared: a conditional denoising diffusion implicit model (cDDIM), and a …

Open PDF (arxiv)
HydraHead: From Head-Level Functional Heterogeneity to Specialized Attention Hybridization

2606.20097 cs.CL 2026-06-18 PDF (arxiv)

Zhentao Tan, Wei Chen, Jingyi Shen, Yao Liu, Xu Shen, Yue Wu, Jieping Ye

The quadratic complexity of attention poses a critical bottleneck for long-context processing, spurring interest in hybrid attention designs. Most open-source hybrid models adopt a layer-wise strategy. Yet, prior work has noted the inherent difficulty of integrating Linear Attention (LA) with Full Attention (FA), suggesting that the design space of attention hybridization remains underexplored. To…

Open PDF (arxiv)