Archon

Browse and search harvested arxiv metadata.

1187419 results (page 90 of 47497)

FedProxy: Federated Fine-Tuning of LLMs via Proxy SLMs and Heterogeneity-Aware Fusion

2604.19015 cs.LG 2026-04-21 PDF (arxiv)

Tao Fan, Guoqiang Ma, Yuanfeng Song, Lixin Fan, Kai Chen, Qiang Yang

Federated fine-tuning of Large Language Models (LLMs) is obstructed by a trilemma of challenges: protecting LLMs intellectual property (IP), ensuring client privacy, and mitigating performance loss on heterogeneous data. Existing methods like Offsite-Tuning (OT) secure the LLMs IP by having clients train only lightweight adapters, yet our analysis reveals they suffer from a fundamental performance…

Open PDF (arxiv)
Quantitative Verification of Finite-Time Constrained Occupation Measures for Continuous-time Stochastic Systems

2604.19014 eess.SY 2026-04-21 PDF (arxiv)

Bai Xue, C. -H. Luke Ong

This paper addresses the quantitative verification of finite-time constrained occupation time for stochastic continuous-time systems governed by stochastic differential equations (SDEs). Unlike classical reachability analysis, which focuses on single-event properties such as entering a target set, many autonomous tasks-including surveillance, wireless charging, and chemical mixing-require a system…

Open PDF (arxiv)
Security Is Relative: Training-Free Vulnerability Detection via Multi-Agent Behavioral Contract Synthesis

2604.19012 cs.CR 2026-04-21 PDF (arxiv)

Yongchao Wang, Zhiqiu Huang

Deep learning for vulnerability detection has shown promising results on early benchmarks, but recent evaluations reveal catastrophic degradation: models achieving F1 > 0.68 on legacy datasets collapse to 0.031 under strict deduplication. We identify the root cause as the semantic ambiguity problem: identical code can be secure or vulnerable depending on project-specific behavioral contracts, rend…

Open PDF (arxiv)
Accelerating trajectory optimization with Sobolev-trained diffusion policies

2604.19011 cs.LG 2026-04-21 PDF (arxiv)

Théotime Le Hellard, Franki Nguimatsia Tiofack, Quentin Le Lidec, Justin Carpentier

Trajectory Optimization (TO) solvers exploit known system dynamics to compute locally optimal trajectories through iterative improvements. A downside is that each new problem instance is solved independently; therefore, convergence speed and quality of the solution found depend on the initial trajectory proposed. To improve efficiency, a natural approach is to warm-start TO with initial guesses pr…

Open PDF (arxiv)
SSB-Based Sensing-Assisted Robust Beamforming for High-Mobility UAV Communications in LAWN

2604.19010 eess.SP 2026-04-21 PDF (arxiv)

Aimin Tang, Shuhan Wang, Yin Xu

High-mobility uncrewed aerial vehicle (UAV) communications in low-altitude wireless networks (LAWN) demand reliable beamforming, while conventional feedback-based schemes suffer from excessive overhead and severe misalignment under rapid trajectory variations. To address this challenge, this paper proposes an SSB-based sensing-assisted predictive robust beamforming framework that replaces explicit…

Open PDF (arxiv)
Guiding Distribution Matching Distillation with Gradient-Based Reinforcement Learning

2604.19009 cs.LG 2026-04-21 PDF (arxiv)

Linwei Dong, Ruoyu Guo, Ge Bai, Zehuan Yuan, Yawei Luo, Changqing Zou

Diffusion distillation, exemplified by Distribution Matching Distillation (DMD), has shown great promise in few-step generation but often sacrifices quality for sampling speed. While integrating Reinforcement Learning (RL) into distillation offers potential, a naive fusion of these two objectives relies on suboptimal raw sample evaluation. This sample-based scoring creates inherent conflicts with …

Open PDF (arxiv)
Optimal Online and Offline Algorithms for Contextual MNL with Applications to Assortment and Pricing

2604.19008 math.OC 2026-04-21 PDF (arxiv)

Yunfan Zhang, Yuxuan Han, Hongyu Shan, Jose Blanchet, Zhengyuan Zhou

Selecting which products to display and at what prices is a central decision in retail and e-commerce operations. In many applications, these two choices must be made jointly under limited display capacity and uncertain customer demand. In this paper, we study the joint assortment and pricing problem under a price-based contextual multinomial logit model, where customer preferences depend on both …

Open PDF (arxiv)
ExplainS2A: Explainable Spectral-Spatial Duality Model for Fast Transforming Sentinel-2 Image to AVIRIS-Level Hyperspectral Image

2604.19007 eess.IV 2026-04-21 PDF (arxiv)

Chia-Hsiang Lin, Zi-Chao Leng

Mainstream optical satellites often acquire multispectral multi-resolution images, which have limited material identifiability compared to the HSIs. Thus, spectrally super-resolving the MSI into their hyperspectral counterparts greatly facilitates remote material identification and the downstream tasks. However, spectrally super-resolving the MSI into an HSI is often constrained by the multi-resol…

Open PDF (arxiv)
Debating the Unspoken: Role-Anchored Multi-Agent Reasoning for Half-Truth Detection

2604.19005 cs.CL 2026-04-21 PDF (arxiv)

Yixuan Tang, Yirui Zhang, Hang Feng, Anthony K. H. Tung

Half-truths, claims that are factually correct yet misleading due to omitted context, remain a blind spot for fact verification systems focused on explicit falsehoods. Addressing such omission-based manipulation requires reasoning not only about what is said, but also about what is left unsaid. We propose RADAR, a role-anchored multi-agent debate framework for omission-aware fact verification unde…

Open PDF (arxiv)
Ocean: Fast Estimation-Based Sparse General Matrix-Matrix Multiplication on GPU

2604.19004 cs.DC 2026-04-21 PDF (arxiv)

Yifan Li, Giulia Guidi

In computational science and data analytics, many workloads involve irregular and sparse computations that are inherently difficult to optimize for modern hardware. A key kernel is Sparse General Matrix-Matrix Multiplication (SpGEMM), which underpins simulations, graph analytics, and machine learning applications. SpGEMM exhibits irregular memory access patterns and workload imbalance, making it c…

Open PDF (arxiv)
When Safety Fails Before the Answer: Benchmarking Harmful Behavior Detection in Reasoning Chains

2604.19001 cs.CL 2026-04-21 PDF (arxiv)

Ishita Kakkar, Enze Zhang, Rheeya Uppaal, Junjie Hu

Large reasoning models (LRMs) produce complex, multi-step reasoning traces, yet safety evaluation remains focused on final outputs, overlooking how harm emerges during reasoning. When jailbroken, harm does not appear instantaneously but unfolds through distinct behavioral steps such as suppressing refusal, rationalizing compliance, decomposing harmful tasks, and concealing risk. However, no existi…

Open PDF (arxiv)
Decompose, Structure, and Repair: A Neuro-Symbolic Framework for Autoformalization via Operator Trees

2604.19000 cs.LG 2026-04-21 PDF (arxiv)

Xiaoyang Liu, Zineng Dong, Yifan Bai, Yantao Li, Yuntian Liu, Tao Luo

Statement autoformalization acts as a critical bridge between human mathematics and formal mathematics by translating natural language problems into formal language. While prior works have focused on data synthesis and diverse training paradigms to optimize end-to-end Large Language Models (LLMs), they typically treat formal code as flat sequences, neglecting the hierarchical logic inherent in mat…

Open PDF (arxiv)
A Data-embedded Solution Paradigm for Nonconvex Probable Event Constrained Optimization

2604.18997 math.OC 2026-04-21 PDF (arxiv)

Qifeng Li

This paper introduces a new modeling framework for optimization under uncertainty, called Probable Event Constrained Optimization (PECO). Unlike conventional chance-constrained formulations, which only limit the probability of constraint violation, PECO also explicitly requires feasibility for all events whose probability exceeds a prescribed threshold. This guarantees that solutions remain valid …

Open PDF (arxiv)
$R^2$-dLLM: Accelerating Diffusion Large Language Models via Spatio-Temporal Redundancy Reduction

2604.18995 cs.CL 2026-04-21 PDF (arxiv)

Zhenbang Du, Kejing Xia, Xinrui Zhong, Yonggan Fu, Nicolai Oswald, Binfei Ji, Brucek Khailany, Pavlo Molchanov, Yingyan Lin

Diffusion Large Language Models (dLLMs) have emerged as a promising alternative to autoregressive generation by enabling parallel token prediction. However, practical dLLM decoding still suffers from high inference latency, which limits deployment. In this work, we observe that a substantial part of this inefficiency comes from recurring redundancy in the decoding process, including spatial redund…

Open PDF (arxiv)
AutoAWG: Adverse Weather Generation with Adaptive Multi-Controls for Automotive Videos

2604.18993 cs.CV 2026-04-21 PDF (arxiv)

Jiagao Hu, Daiguo Zhou, Danzhen Fu, Fuhao Li, Zepeng Wang, Fei Wang, Wenhua Liao, Jiayi Xie, Haiyang Sun

Perception robustness under adverse weather remains a critical challenge for autonomous driving, with the core bottleneck being the scarcity of real-world video data in adverse weather. Existing weather generation approaches struggle to balance visual quality and annotation reusability. We present AutoAWG, a controllable Adverse Weather video Generation framework for Autonomous driving. Our method…

Open PDF (arxiv)
Estimating galactic foreground with the population of resolved galactic binaries

2604.18992 astro-ph.CO 2026-04-21 PDF (arxiv)

Yang Jiang, Qing-Guo Huang

The stochastic gravitational wave background in the mHz band is a key target for future spaceborne interferometers. Detecting such a signal presents multiple challenges for data processing, especially complicated by the presence of numerous compact binaries in our galaxy. The superposition of gravitational waves from their inspiral stages creates a confusion foreground that need to be estimated ac…

Open PDF (arxiv)
A Multi-Agent Framework with Structured Reasoning and Reflective Refinement for Multimodal Empathetic Response Generation

2604.18988 cs.CV 2026-04-21 PDF (arxiv)

Liping Wang, Cheng Ye, Weidong Chen, Peipei Song, Bo Hu, Zhendong Mao

Multimodal empathetic response generation (MERG) aims to generate emotionally engaging and empathetic responses based on users' multimodal contexts. Existing approaches usually rely on an implicit one-pass generation paradigm from multimodal context to the final response, which overlooks two intrinsic characteristics of MERG: (1) Human perception of emotional cues is inherently structured rather t…

Open PDF (arxiv)
Inertia Matching Principle: Improving Transient Synchronization Stability in Hybrid Power Systems With VSGs and SGs

2604.18987 eess.SY 2026-04-21 PDF (arxiv)

Changjun He, Li Zhang, Qi Liu, Rui Zou

This paper investigates the transient synchronization stability in power systems hybridized with virtual synchronous generators (VSGs) and synchronous generators (SGs). A relative swing equation model is established to capture the transient synchronization dynamics between the VSG and the SG. Based on this model, both static and dynamic characteristics are systematically analyzed, and a quantitati…

Open PDF (arxiv)
A Tight Channel-Capacity Lower Bound for the Simultaneous Wireless Information and Power Transfer Integrated Receiver

2604.18986 cs.IT 2026-04-21 PDF (arxiv)

Konstantinos Ntontin, Symeon Chatzinotas

Contrary to the vast majority of works on simultaneous wireless information and power transfer that provide information-theoretic limits for the separate receiver architecture, in this work we focus on the integrated receiver and provide a channel-capacity lower bound. Towards this, we provide a closed-form tight approximation for the probability transition matrix of the channel by leveraging the …

Open PDF (arxiv)
The Gamma-Ray Monitor onboard the SVOM satellite

2604.18985 astro-ph.IM 2026-04-21 PDF (arxiv)

Jian-Chao Sun, Yong-Wei Dong, Jiang He, Jiang-Tao Liu, Lu Li, Rui-Jie Wang, Xin Liu, Li Zhang, Min Gao, Yue Huang, Hao-Li Shi, Li-Ming Song, Wen-Jun Tan, Chen-Wei Wang, Jin Wang, Jin-Zhou Wang, Ping Wang, Xing Wen, Bo-Bing Wu, Shao-Lin Xiong, Juan Zhang, Shuang-Nan Zhang, Xiao-Yun Zhao, Shi-Jie Zheng

The Gamma-Ray Monitor (GRM) is a key scientific payload onboard the Space-based Multi-band Variable Object Monitor (SVOM) satellite, designed specifically for the detection and study of gamma-ray bursts (GRBs). Launched into a 625 km low-Earth orbit on 22 June 2024, GRM serves as a large-area, wide-field-of-view instrument capable of observing the hard X-ray and soft gamma-ray emissions in the ene…

Open PDF (arxiv)
SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution

2604.18982 cs.AI 2026-04-21 PDF (arxiv)

Xiachong Feng, Yi Jiang, Xiaocheng Feng, Deyi Yin, Libo Qin, Yangfan Ye, Lei Huang, Weitao Ma, Yuxuan Gu, Chonghan Qin, Bing Qin, Lingpeng Kong

Social intelligence, the ability to navigate complex interpersonal interactions, presents a fundamental challenge for language agents. Training such agents via reinforcement learning requires solving the credit assignment problem: determining how individual utterances contribute to multi-turn dialogue outcomes. Existing approaches directly employ language models to distribute episode-level rewards…

Open PDF (arxiv)
AdaGScale: Viewpoint-Adaptive Gaussian Scaling in 3D Gaussian Splatting to Reduce Gaussian-Tile Pairs

2604.18980 cs.CV 2026-04-21 PDF (arxiv)

Joongho Jo, Hyerin Lim, Hanjun Choi, Jongsun Park

Reducing the number of Gaussian-tile pairs is one of the most promising approaches to improve 3D Gaussian Splatting (3D-GS) rendering speed on GPUs. However, the importance difference existing among Gaussian-tile pairs has never been considered in the previous works. In this paper, we propose AdaGScale, a novel viewpoint-adaptive Gaussian scaling technique for reducing the number of Gaussian-tile …

Open PDF (arxiv)
Low-Rank Adaptation for Critic Learning in Off-Policy Reinforcement Learning

2604.18978 cs.LG 2026-04-21 PDF (arxiv)

Yuan Zhuang, Yuexin Bian, Sihong He, Jie Feng, Qing Su, Songyang Han, Jonathan Petit, Shihao Ji, Yuanyuan Shi, Fei Miao

Scaling critic capacity is a promising direction for enhancing off-policy reinforcement learning (RL). However, larger critics are prone to overfitting and unstable in replay-buffer-based bootstrap training. This paper leverages Low-Rank Adaptation (LoRA) as a structural-sparsity regularizer for off-policy critics. Our approach freezes randomly initialized base matrices and solely optimizes low-ra…

Open PDF (arxiv)
STAR-Teaming: A Strategy-Response Multiplex Network Approach to Automated LLM Red Teaming

2604.18976 cs.CL 2026-04-21 PDF (arxiv)

MinJae Jung, YongTaek Lim, Chaeyun Kim, Junghwan Kim, Kihyun Kim, Minwoo Kim

While Large Language Models (LLMs) are widely used, they remain susceptible to jailbreak prompts that can elicit harmful or inappropriate responses. This paper introduces STAR-Teaming, a novel black-box framework for automated red teaming that effectively generates such prompts. STAR-Teaming integrates a Multi-Agent System (MAS) with a Strategy-Response Multiplex Network and employs network-driven…

Open PDF (arxiv)
Gated Coordination for Efficient Multi-Agent Collaboration in Minecraft Game

2604.18975 cs.MA 2026-04-21 PDF (arxiv)

HuaDong Jian, Chenghao Li, Haoyu Wang, Jiajia Shuai, Jinyu Guo, Yang Yang, Chaoning Zhang

In long-horizon open-world multi-agent systems, existing methods often treat local anomalies as automatic triggers for communication. This default design introduces coordination noise, interrupts local execution, and overuses public interaction in cases that could be resolved locally. To address this issue, we propose a partitioned information architecture for MLLM agents that explicitly separates…

Open PDF (arxiv)