1273993 results (page 118 of 50960)
-
MCMC with Adaptive Principal-Component Transformation: Rotation-Invariant Universal Samplers for Bayesian Structural System Identification
Over decades, Markov chain Monte Carlo (MCMC) methods have been widely studied, with a typical application being the quantification of posterior uncertainties in Bayesian system identification of structural dynamic models. To address the issue of excessively low sampling efficiency in generic MCMC methods when applied to specific problems, researchers developed several MCMC algorithms that integra…
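The general idea of rotating proposals into a principal-component frame is easy to illustrate. Below is a minimal, generic sketch of a random-walk Metropolis sampler that periodically re-aligns its proposal with the principal components of the chain history; it is an illustration of the adaptive-PCA idea only, not the specific algorithm proposed in this paper:

```python
import numpy as np

def adaptive_pca_rwm(log_post, x0, n_steps=5000, adapt_every=500, step=0.5, seed=0):
    """Random-walk Metropolis whose proposal is rotated/scaled along the
    principal components of the chain history (generic sketch, not any
    specific published sampler)."""
    rng = np.random.default_rng(seed)
    d = len(x0)
    x, lp = np.asarray(x0, float), log_post(x0)
    L = np.eye(d)                           # proposal "square root": identity to start
    samples = [x.copy()]
    for t in range(1, n_steps + 1):
        prop = x + step * L @ rng.standard_normal(d)
        lp_prop = log_post(prop)
        if np.log(rng.uniform()) < lp_prop - lp:   # Metropolis accept/reject
            x, lp = prop, lp_prop
        samples.append(x.copy())
        if t % adapt_every == 0:            # re-estimate principal axes from history
            cov = np.cov(np.array(samples).T) + 1e-8 * np.eye(d)
            vals, vecs = np.linalg.eigh(cov)
            L = vecs @ np.diag(np.sqrt(vals))   # rotate proposal into the PCA frame
    return np.array(samples)
```

On a strongly correlated posterior, the adapted proposal aligns with the dominant correlation direction, which is the mechanism such samplers exploit to raise sampling efficiency.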
-
V-GRPO: Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think
Aligning denoising generative models with human preferences or verifiable rewards remains a key challenge. While policy-gradient online reinforcement learning (RL) offers a principled post-training framework, its direct application is hindered by the intractable likelihoods of these models. Prior work therefore either optimizes an induced Markov decision process (MDP) over sampling trajectories, w…
-
Constraint-Based Analysis of Reasoning Shortcuts in Neurosymbolic Learning
Neurosymbolic systems can satisfy logical constraints during learning without achieving the intended concept-label correspondence; this is a problem known as reasoning shortcuts. We formalize reasoning shortcuts as a constraint satisfaction problem and investigate under which conditions concept mappings are uniquely determined by the constraints. We prove that a discrimination property (requiring …
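The non-uniqueness at the heart of reasoning shortcuts can be seen in a toy constraint-satisfaction setup (illustrative only; the paper's formalism is more general). Here two concept extractors must jointly satisfy `f(a) XOR g(b) == a XOR b` on every input, and brute-force enumeration shows the intended mapping is not the only solution:

```python
from itertools import product

# The four unary Boolean functions a concept extractor could implement.
UNARY = {"id": lambda x: x, "not": lambda x: 1 - x,
         "zero": lambda x: 0, "one": lambda x: 1}

def consistent_mappings():
    """Enumerate concept-extractor pairs (f, g) satisfying the symbolic
    constraint f(a) XOR g(b) == a XOR b on all inputs -- a toy CSP view
    of reasoning shortcuts."""
    sols = []
    for (fn, f), (gn, g) in product(UNARY.items(), repeat=2):
        if all((f(a) ^ g(b)) == (a ^ b) for a in (0, 1) for b in (0, 1)):
            sols.append((fn, gn))
    return sols
```

Both `("id", "id")` (the intended concepts) and `("not", "not")` (a shortcut that flips both concepts) satisfy the constraint on every input, so the constraint alone cannot pin down the concept-label correspondence.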
-
Hierarchical Spatio-Channel Clustering for Efficient Model Compression in Medical Image Analysis
Convolutional neural networks (CNNs) have become increasingly difficult to deploy in resource-constrained environments due to their large memory and computational requirements. Although low-rank compression methods can reduce this burden, most existing approaches compress spatial and channel redundancy independently and therefore do not fully exploit the localised structure within convolutional fe…
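The generic building block behind low-rank CNN compression is a truncated-SVD factorization of the (flattened) convolution kernel; this sketch shows that baseline operation only, not the paper's hierarchical spatio-channel clustering:

```python
import numpy as np

def low_rank_compress(W, rank):
    """Truncated-SVD factorization of a 2D weight matrix: W ~ A @ B with
    A (out, rank) and B (rank, in*k*k). Storage drops from out*in*k*k to
    rank*(out + in*k*k) parameters."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * s[:rank]     # absorb singular values into the left factor
    B = Vt[:rank]
    return A, B

# A 4D conv weight (out, in, k, k) flattened to 2D for factorization.
W4 = np.random.default_rng(0).standard_normal((8, 4, 3, 3))
W2 = W4.reshape(8, -1)             # (8, 36)
A, B = low_rank_compress(W2, rank=4)
```

Compressing spatial and channel dimensions jointly, as the abstract argues, amounts to choosing a smarter reshaping/grouping of `W4` than this naive flatten before factorizing.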
-
Ghost in the Agent: Redefining Information Flow Tracking for LLM Agents
Autonomous Large Language Model (LLM) agents are increasingly deployed to conduct complex tasks by interacting with external tools, APIs, and memory stores. However, processing untrusted external data exposes these agents to severe security threats, such as indirect prompt injection and unauthorized tool execution. Securing these systems requires effective information flow tracking. Yet, tradition…
-
Physics-Informed Temporal U-Net for High-Fidelity Fluid Interpolation
Reconstructing high-fidelity fluid dynamics from sparse temporal observations is challenging, mainly due to the chaotic and non-linear nature of fluid transport. Standard deep learning-based interpolation methods tend to regress to the mean, resulting in spatial blurring and temporal strobing, especially around the observed anchor frames where transitions become disconti…

-
When Context Sticks: Studying Interference in In-Context Learning
This paper investigates context stickiness in in-context learning (ICL), a phenomenon where earlier examples in a prompt interfere with a transformer's ability to adapt to later tasks. Using synthetic regression tasks over linear and quadratic functions, we examine how models trained under sequential, mixed, and random curricula handle abrupt task switches during inference. By sweeping over struct…
-
Nonlinear Non-Gaussian Density Steering with Input and Noise Channel Mismatch: Sinkhorn with Memory for Solving the Control-affine Schrödinger Bridge Problem
Solutions to the Schrödinger bridge problem and its generalizations yield feedback control policies for optimal density steering over a controlled diffusion. To compute these solutions numerically, the dynamic Sinkhorn recursion has become a standard approach. The mathematical engine behind this approach is the Hopf-Cole transform, which recasts the conditions for optimality into a system of boundary-coupl…
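The classic (static, discrete) Sinkhorn recursion that the dynamic variants build on is a short fixed-point iteration; this textbook sketch is for context only and is not the memory-augmented scheme proposed here:

```python
import numpy as np

def sinkhorn(mu, nu, C, eps=0.1, n_iter=200):
    """Classic Sinkhorn recursion for the entropic optimal transport /
    static Schrodinger bridge problem between discrete marginals mu, nu
    with cost matrix C (textbook version)."""
    K = np.exp(-C / eps)               # Gibbs kernel
    u = np.ones_like(mu)
    for _ in range(n_iter):
        v = nu / (K.T @ u)             # scale to match the second marginal
        u = mu / (K @ v)               # scale to match the first marginal
    return u[:, None] * K * v[None, :] # coupling with the prescribed marginals
```

At the fixed point the returned coupling has row sums `mu` and column sums `nu`; the dynamic recursion for controlled diffusions iterates an analogous pair of half-steps over the boundary-coupled potentials.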
-
TEMPO: Transformers for Temporal Disease Progression from Cross-Sectional Data
Event-Based Models (EBMs) infer biomarker progression from cross-sectional data but typically only as ordinal sequences and rely on rigid model assumptions. We propose Tempo, a Transformer architecture that learns both ordinal and continuous event sequences through simulation-based supervised learning. Tempo uses two Transformer modules: one treats biomarkers as tokens to infer e…
-
GSAR: Typed Grounding for Hallucination Detection and Recovery in Multi-Agent LLMs
Autonomous multi-agent LLM systems are increasingly deployed to investigate operational incidents and produce structured diagnostic reports. Their trustworthiness hinges on whether each claim is grounded in observed evidence rather than model-internal inference. Existing groundedness evaluators (binary classifiers, LLM-as-judge scalars, self-correction loops) treat supporting evidence as interchan…
-
Spectral Butterfly Effect and Resilient Ringdown in Thick Braneworlds
The quasinormal mode spectrum is a unique fingerprint linking gravitational-wave observations to extra-dimensional geometry. In this Letter, we show that thick braneworlds exhibit a spectral butterfly effect: infinitesimal deformations of the effective potential trigger dramatic migrations of quasinormal modes, challenging the presumed stability of this fingerprint. Frequency-domain instabilities …
-
UniAda: Universal Adaptive Multi-objective Adversarial Attack for End-to-End Autonomous Driving Systems
Adversarial attacks play a pivotal role in testing and improving the reliability of deep learning (DL) systems. Existing literature has demonstrated that subtle perturbations to the input can elicit erroneous outcomes, thereby substantially compromising the security of DL systems. This has emerged as a critical concern in the development of DL-based safety-critical systems like Autonomous Driving …
-
An Empirical Evaluation of Locally Deployed LLMs for Bug Detection in Python Code
Large language models (LLMs) have demonstrated strong performance on a wide range of software engineering tasks, including code generation and analysis. However, most prior work relies on cloud-based models or specialized hardware, limiting practical applicability in privacy-sensitive or resource-constrained environments. In this paper, we present a systematic empirical evaluation of two locally d…
-
Learning from Demonstration with Failure Awareness for Safe Robot Navigation
Learning from demonstration is widely used for robot navigation, yet it suffers from a fundamental limitation: demonstrations consist predominantly of successful behaviors and provide limited coverage of unsafe states. This limitation leads to poor safety when the robot encounters scenarios beyond the demonstration distribution. Failure experiences, such as collisions, contain essential informatio…
-
Modelling spatial heterogeneity in the effects of area-level covariates on income distributions using Bayesian nonparametric methods
Understanding how the distribution of an economic outcome, such as income, changes with respect to space and covariates is a key concern for policy makers. To address this, we develop a Bayesian nonparametric model, the Normalised Latent Measure Factor Model with Covariates (NLMFM-C), which expresses a large collection of related densities as mixtures of latent factor densities and allows for …
-
VeriLLMed: Interactive Visual Debugging of Medical Large Language Models with Knowledge Graphs
Large language models (LLMs) show promise in medical diagnosis, but real-world deployment remains challenging due to high-stakes clinical decisions and imperfect reasoning reliability. As a result, careful inspection of model behavior is essential for assessing whether diagnostic reasoning is reliable and clinically grounded. However, debugging medical LLMs remains difficult. First, developers oft…
-
LEGO: An LLM Skill-Based Front-End Design Generation Platform
Existing LLM-based EDA agents are often isolated task-specific systems. This leads to repeated engineering effort and limited reuse of successful design and debugging strategies. We present LEGO, a unified skill-based platform for front-end design generation. It decomposes the digital front-end flow into six independent steps and represents every agent capability as a standardized composable circu…
-
Explainable AI in Speaker Recognition -- Making Latent Representations Understandable
Neural networks can be trained to learn task-relevant representations from data. Understanding how these networks make decisions falls within the Explainable AI (XAI) domain. This paper studies an XAI topic: uncovering unknown organisational patterns in network representations, particularly those learned by the speaker recognition network that recognises the speaker ident…
-
Testing Scalar Field Dark Matter models in M31 galaxy through the Rotation Curve analysis
We explore the viability of scalar field dark matter halo models through the rotation curve analysis of the Andromeda galaxy (M31), taking into account a realistic description of its baryonic structure. The mass model includes a stellar disk described by the Freeman profile and two alternative bulge configurations: a classical single de Vaucouleurs bulge and a two-component structure consisting of…
-
When Chain-of-Thought Fails, the Solution Hides in the Hidden States
Whether intermediate reasoning is computationally useful or merely explanatory depends on whether chain-of-thought (CoT) tokens contain task-relevant information. We present a mechanistic causal analysis of CoT on GSM8K using activation patching: transferring token-level hidden states from a CoT generation to a direct-answer run for the same question, then measuring the effect on final-answer accu…
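The core activation-patching operation, caching hidden states from one run and transplanting them into another, can be shown on a toy model (a two-layer numpy network here, not a transformer, and not the paper's GSM8K setup):

```python
import numpy as np

def forward(x, W1, W2, patch_hidden=None):
    """Two-layer toy network; optionally overwrite ("patch") the hidden
    activations with a cached vector from another run -- the basic
    activation-patching operation."""
    h = np.maximum(0.0, W1 @ x)        # hidden states of this run
    if patch_hidden is not None:
        h = patch_hidden               # transplant cached hidden states
    return W2 @ h, h

rng = np.random.default_rng(0)
W1 = rng.standard_normal((4, 3))
W2 = rng.standard_normal((2, 4))
x_src, x_dst = rng.standard_normal(3), rng.standard_normal(3)

y_src, h_src = forward(x_src, W1, W2)          # source run: cache activations
y_clean, _ = forward(x_dst, W1, W2)            # clean run on a different input
y_patched, _ = forward(x_dst, W1, W2, h_src)   # patched run
```

Because the patched hidden states fully determine the output here, the patched run reproduces the source run's output; in a transformer, comparing `y_patched` against `y_clean` at specific token positions is what localizes where task-relevant information is carried.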
-
GeoFunFlow-3D: A Physics-Guided Generative Flow Matching Framework for High-Fidelity 3D Aerodynamic Inference over Complex Geometries
Deep generative models and neural operators have demonstrated significant potential for 3D aerodynamic inference. However, they often face inherent challenges in maintaining physical consistency and preserving high-frequency features, primarily due to spectral bias and gradient conflicts within the governing equations. To address these issues, we propose GeoFunFlow-3D, a physics-guided generative …
-
EmoTrans: A Benchmark for Understanding, Reasoning, and Predicting Emotion Transitions in Multimodal LLMs
Recent multimodal large language models (MLLMs) have shown strong capabilities in perception, reasoning, and generation, and are increasingly used in applications such as social robots and human-computer interaction, where understanding human emotions is essential. However, existing benchmarks mainly formulate emotion understanding as a static recognition problem, leaving it largely unclear whethe…
-
Evaluating Large Language Models on Computer Science University Exams in Data Structures
We present a comprehensive evaluation of Large Language Models (LLMs) on Computer Science (CS) Data Structures examination questions. Our work introduces a new benchmark dataset comprising exam questions from Tel Aviv University (TAU), curated to assess LLMs' abilities in handling closed and multiple-choice questions. We evaluated the performance of OpenAI's GPT-4o and Anthropic's Claude 3.5, popul…
-
Bridging Reasoning and Action: Hybrid LLM-RL Framework for Efficient Cross-Domain Task-Oriented Dialogue
Cross-domain task-oriented dialogue requires reasoning over implicit and explicit feasibility constraints while planning long-horizon, multi-turn actions. Large language models (LLMs) can infer such constraints but are unreliable over long horizons, while reinforcement learning (RL) optimizes long-horizon behavior yet cannot recover constraints from raw dialogue. Naively coupling LLMs with RL is t…
-
Exploring Hierarchical Consistency and Unbiased Objectness for Open-Vocabulary Object Detection
Conventional object detectors typically operate under a closed-set assumption, limiting recognition to a predefined set of base classes seen during training. Open-vocabulary object detection (OVD) addresses this limitation by leveraging vision-language models (VLMs) to generate pseudo labels for novel object classes. However, existing OVD methods suffer from two critical drawbacks: (1) inaccurate …