Archon

Browse and search harvested arxiv metadata.

1124467 results (page 70 of 44979)

Dual-Guard: Dual-Channel Latent Watermarking for Provenance and Tamper Localization in Diffusion Images

2604.19090 cs.CR 2026-04-21 PDF (arxiv)

JinFeng Xie, Chengfu Ou, Peipeng Yu, Xiaoyu Zhou, Dingding Huang, Jianwei Fei, Zixuan Shen, Zhihua Xia

The rapid adoption of diffusion-based generative models has intensified concerns over the attribution and integrity of AI-generated content (AIGC). Existing single-domain watermarking methods either fail under regeneration, remain vulnerable to black-box reprompting that enables adversarial framing, or provide no spatial evidence for tampered regions. We propose Dual-Guard, a dual-channel latent w…

Open PDF (arxiv)
Towards Scalable Lifelong Knowledge Editing with Selective Knowledge Suppression

2604.19089 cs.AI 2026-04-21 PDF (arxiv)

Dahyun Jung, Jaewook Lee, Heuiseok Lim

Large language models (LLMs) require frequent knowledge updates to reflect changing facts and mitigate hallucinations. To meet this demand, lifelong knowledge editing has emerged as a continual approach to modify specific pieces of knowledge without retraining the entire model. Existing parameter editing methods struggle with stability during sequential edits due to catastrophic forgetting. While …

Open PDF (arxiv)
Cultural Newcomers Dining Across Borders: Need-Based Design Envision of Mixed Media Integration in MR for Foreign Menu Understanding and Ordering

2604.19088 cs.HC 2026-04-21 PDF (arxiv)

Ying Zhang, Daoxin Chen

Cultural newcomers (CNs), including new immigrants and international students, often encounter cognitive barriers and social anxiety, exacerbated by unfamiliar cultural terminology in daily interactions. This research examines these challenges in the context of ordering in foreign restaurants. Current translation tools have significant limitations in their information delivery with current media p…

Open PDF (arxiv)
OLLM: Options-based Large Language Models

2604.19087 cs.AI 2026-04-21 PDF (arxiv)

Shashank Sharma, Janina Hoffmann, Vinay Namboodiri

We introduce Options LLM (OLLM), a simple, general method that replaces the single next-token prediction of standard LLMs with a \textit{set of learned options} for the next token, indexed by a discrete latent variable. Instead of relying on temperature or sampling heuristics to induce diversity, OLLM models variation explicitly: a small latent space parametrizes multiple plausible next-token opti…

Open PDF (arxiv)
MUCOCO: Automated Consistency Testing of Code LLMs

2604.19086 cs.SE 2026-04-21 PDF (arxiv)

Chua Jin Chou, Khant That Lwin, Ezekiel Soremekun

Code LLMs often portray inconsistent program behaviors. Developers typically employ benchmarks to assess Code LLMs, but most benchmarks are hand-crafted, static and do not target consistency property. In this work, we pose the scientific question: how can we automatically discover inconsistent program behaviors in Code LLMs? To address this challenge, we propose an automated consistency testing me…

Open PDF (arxiv)
PROMETHEE-based Modeling of Endogenous Behavioral Uncertainty of EV Owners

2604.19085 eess.SY 2026-04-21 PDF (arxiv)

Dipayan Sarkar, Qifeng Li

The electric vehicle (EV) charging demands (CD) are jointly determined by the EV owners' behavior (i.e., human factor) and the electricity prices (i.e., decisions of distribution system operators (DSO)). However, most existing studies either neglect the decision-dependent nature of EVCD uncertainty or idealistically treat EV owners as perfect decision-makers. This paper formulates the optimal oper…

Open PDF (arxiv)
DUSG-Tomo-Net: A Deep Unfolded Neural Network for Super-Resolving Gridless Spaceborne SAR Tomography via Learned Toeplitz-Structured Covariance Representation

2604.19084 eess.SP 2026-04-21 PDF (arxiv)

Kun Qian, Zhuge Xia, Qian Ma, Qi Zhang, Weijian Liu, Xiufeng He

Synthetic aperture radar tomography (TomoSAR) enables 3-D imaging by exploiting multibaseline acquisitions and has become an important tool for urban mapping. To achieve super-resolution inversion, sparse reconstruction methods based on compressive sensing (CS) are widely adopted. However, most CS-based TomoSAR methods rely on grid-based formulations and therefore suffer from off-grid bias. Gridle…

Open PDF (arxiv)
ProjLens: Unveiling the Role of Projectors in Multimodal Model Safety

2604.19083 cs.CR 2026-04-21 PDF (arxiv)

Kun Wang, Cheng Qian, Miao Yu, Lilan Peng, Liang Lin, Jiaming Zhang, Tianyu Zhang, Yu Cheng, Yang Wang

Multimodal Large Language Models (MLLMs) have achieved remarkable success in cross-modal understanding and generation, yet their deployment is threatened by critical safety vulnerabilities. While prior works have demonstrated the feasibility of backdoors in MLLMs via fine-tuning data poisoning to manipulate inference, the underlying mechanisms of backdoor attacks remain opaque, complicating the un…

Open PDF (arxiv)
Proactive Detection of GUI Defects in Multi-Window Scenarios via Multimodal Reasoning

2604.19081 cs.SE 2026-04-21 PDF (arxiv)

Xinyao Zhang, Rui Wang, Jinhao Cui, Haotian Huang, Wei Xue, Wenhua Hu, Jianwen Xiang, Rui Hao

Multi-window mobile scenarios, such as split-screen and foldable modes, make GUI display defects more likely by forcing applications to adapt to changing window sizes and dynamic layout reflow. Existing detection techniques are limited in two ways: they are largely passive, analyzing screenshots only after problematic states have been reached, and they are mainly designed for conventional full-scr…

Open PDF (arxiv)
Reducing the Offline-Streaming Gap for Unified ASR Transducer with Consistency Regularization

2604.19079 eess.AS 2026-04-21 PDF (arxiv)

Andrei Andrusenko, Vladimir Bataev, Lilit Grigoryan, Nune Tadevosyan, Vitaly Lavrukhin, Boris Ginsburg

Unification of automatic speech recognition (ASR) systems reduces development and maintenance costs, but training a single model to perform well in both offline and low-latency streaming settings remains challenging. We present a Unified ASR framework for Transducer (RNNT) training that supports both offline and streaming decoding within a single model, using chunk-limited attention with right con…

Open PDF (arxiv)
High-Order Multi-Scale Method and Its Convergence Analysis for Nonlinear Thermo-Electro-Mechanical Coupling Problems of Composite Structures

2604.19077 math.NA 2026-04-21 PDF (arxiv)

Hao Dong

This study proposes a high-order multi-scale method tailored for time-dependent nonlinear thermo-electro-mechanical coupling problems of composite structures with highly spatial heterogeneity, which incorporate temperature-dependent material properties and Joule heating effect. By employing the multi-scale asymptotic approach and the Taylor series technique, a high-accuracy multi-scale asymptotic …

Open PDF (arxiv)
What is Powering the Enigmatic He II Emitter Hebe: The First Stars or Black Holes?

2604.19075 astro-ph.GA 2026-04-21 PDF (arxiv)

Junehyoung Jeon, Tae Bong Jeong, Saiyang Zhang, Volker Bromm

Recent high-resolution spectroscopy with the James Webb Space Telescope (JWST) has confirmed the presence of a strong He II, $\lambda1640$ emitting clump in the vicinity of GN-z11, with only upper limits on its metallicity. To explain the peculiar properties of this source, now termed Hebe, a cluster of metal-free, Population III (Pop III) stars has been invoked. A less likely source for the hard …

Open PDF (arxiv)
A comprehensive framework for phase-coherent mapping of the gravitational-wave sky with pulsar timing arrays

2604.19073 astro-ph.HE 2026-04-21 PDF (arxiv)

Małgorzata Curyło, Eric Thrane, Paul D. Lasky, Dawson S. Gaynor

We present a practical implementation of a phase-coherent mapping technique for pulsar timing arrays that resolves the full complex polarisation state of the gravitational-wave sky as a function of direction and frequency. Unlike standard cross-correlation methods, this approach preserves the amplitude, phase, and polarisation of the signal in every sky pixel. The resulting maps constitute a compa…

Open PDF (arxiv)
S2MAM: Semi-supervised Meta Additive Model for Robust Estimation and Variable Selection

2604.19072 cs.LG 2026-04-21 PDF (arxiv)

Xuelin Zhang, Hong Chen, Yingjie Wang, Tieliang Gong, Bin Gu

Semi-supervised learning with manifold regularization is a classical framework for jointly learning from both labeled and unlabeled data, where the key requirement is that the support of the unknown marginal distribution has the geometric structure of a Riemannian manifold. Typically, the Laplace-Beltrami operator-based manifold regularization can be approximated empirically by the Laplacian regul…

Open PDF (arxiv)
HoWToBench: Holistic Evaluation for LLM's Capability in Human-level Writing using Tree of Writing

2604.19071 cs.CL 2026-04-21 PDF (arxiv)

Andrew Zhuoer Feng, Cunxiang Wang, Yu Luo, Lin Fan, Yilin Zhou, Zikang Wang, Xiaotao Gu, Jie Tang, Hongning Wang, Minlie Huang

Evaluating the writing capabilities of large language models (LLMs) remains a significant challenge due to the multidimensional nature of writing skills and the limitations of existing metrics. LLM's performance in thousand-words level and open-ended writing is inadequately assessed by traditional reference-based metrics or modern LLM-as-a-judge methods. We propose Tree-of-Writing (ToW), to resolv…

Open PDF (arxiv)
TRN-R1-Zero: Text-rich Network Reasoning via LLMs with Reinforcement Learning Only

2604.19070 cs.CL 2026-04-21 PDF (arxiv)

Yilun Liu, Ruihong Qiu, Zi Huang

Zero-shot reasoning on text-rich networks (TRNs) remains a challenging frontier, as models must integrate textual semantics with relational structure without task-specific supervision. While graph neural networks rely on fixed label spaces and supervised objectives, recent large language model (LLM)-based approaches often overlook graph context or depend on distillation from larger models, limitin…

Open PDF (arxiv)
Product-of-Experts Training Reduces Dataset Artifacts in Natural Language Inference

2604.19069 cs.CL 2026-04-21 PDF (arxiv)

Aby Mammen Mathew

Neural NLI models overfit dataset artifacts instead of truly reasoning. A hypothesis-only model gets 57.7% in SNLI, showing strong spurious correlations, and 38.6% of the baseline errors are the result of these artifacts. We propose Product-of-Experts (PoE) training, which downweights examples where biased models are overconfident. PoE nearly preserves accuracy (89.10% vs. 89.30%) while cutting bi…

Open PDF (arxiv)
Taylor Tube Method for Validated IVP

2604.19068 math.NA 2026-04-21 PDF (arxiv)

Bingwei Zhang, Chee Yap

We recently introduced a novel architecture for the design of validated IVP algorithms. This architecture forms the basis of our complete validated algorithm for IVP. A key subroutine in our algorithm is the \textbf{Euler Tube}: it gave a technique for refining end- and full-enclosures and is also key to deriving a complexity bound of our IVP solver. In this paper, we generalize it…

Open PDF (arxiv)
Age-Dependent Heterogeneity in the Association Between Physical Activity and Mental Distress: A Causal Machine Learning Analysis of 3.2 Million U.S. Adults

2604.19066 cs.LG 2026-04-21 PDF (arxiv)

Yuan Shan

Physical activity (PA) is widely recognized as protective against mental distress, yet whether this benefit varies systematically across population subgroups remains poorly understood. Using pooled data from ten consecutive annual waves of the U.S. Behavioral Risk Factor Surveillance System (2015-2024; n = 3,242,218), we investigate heterogeneity in the association between leisure-time PA and freq…

Open PDF (arxiv)
Last-Iterate Guarantees for Learning in Co-coercive Games

2604.19065 cs.GT 2026-04-21 PDF (arxiv)

Siddharth Chandak, Ramanan Tamizholi, Nicholas Bambos

We establish finite-time last-iterate guarantees for vanilla stochastic gradient descent in co-coercive games under noisy feedback. This is a broad class of games that is more general than strongly monotone games, allows for multiple Nash equilibria, and includes examples such as quadratic games with negative semidefinite interaction matrices and potential games with smooth concave potentials. Pri…

Open PDF (arxiv)
The Essence of Balance for Self-Improving Agents in Vision-and-Language Navigation

2604.19064 cs.CV 2026-04-21 PDF (arxiv)

Zhen Liu, Yuhan Liu, Jinjun Wang, Jianyi Liu, Wei Song, Jingwen Fu

In vision-and-language navigation (VLN), self-improvement from policy-induced experience, using only standard VLN action supervision, critically depends on balancing behavioral diversity and learning stability, which governs whether the agent can extract a reliable learning signal for improvement. Increasing behavioral diversity is necessary to expose alternative action hypotheses but can destabil…

Open PDF (arxiv)
Differentiable Satellite Constellation Configuration via Relaxed Coverage and Revisit Objectives

2604.19062 cs.RO 2026-04-21 PDF (arxiv)

Shreeyam Kacker, Kerri Cahoy

Satellite constellation design requires optimizing orbital parameters across multiple satellites to maximize mission specific metrics. For many types of mission, it is desirable to maximize coverage and minimize revisit gaps over ground targets. Existing approaches to constellation design either restrict the design space to symmetric parametric families such as Walker constellations, or rely on me…

Open PDF (arxiv)
Three-Module SC-VAMP for LDPC-Coded Nonlinear Channels

2604.19061 cs.IT 2026-04-21 PDF (arxiv)

Tadashi Wadayama, Takumi Takahashi

We propose a three-module extension of score-based VAMP (SC-VAMP) for signal recovery in nonlinear channels, where the received signal is obtained by applying a nonlinearity to a linear mixture of the transmitted signal, followed by additive Gaussian noise. The key idea is to introduce a latent variable representing the output of the linear mixing stage, which decomposes the inference problem into…

Open PDF (arxiv)
Reinforcement Learning Improves LLM Accuracy and Reasoning in Disease Classification from Radiology Reports

2604.19060 cs.AI 2026-04-21 PDF (arxiv)

Yishu Wei, Yi Lin, Adam Flanders, George Shih, Yifan Peng

Accurate disease classification from radiology reports is essential for many applications. While supervised fine-tuning (SFT) of lightweight LLMs improves accuracy, it can degrade reasoning. We propose a two-stage approach: SFT on disease labels followed by Group Relative Policy Optimization (GRPO) to refine predictions by optimizing accuracy and format without reasoning supervision. Across three …

Open PDF (arxiv)
AeroBridge-TTA: Test-Time Adaptive Language-Conditioned Control for UAVs

2604.19059 cs.RO 2026-04-21 PDF (arxiv)

Lingxue Lyu

Language-guided unmanned aerial vehicles (UAVs) often fail not from bad reasoning or perception, but from execution mismatch: the gap between a planned trajectory and the controller's ability to track it when the real dynamics differ from training (mass changes, drag shifts, actuator delay, wind). We propose AeroBridge-TTA, a language-conditioned control pipeline that targets this gap with t…

Open PDF (arxiv)