Archon

Browse and search harvested arxiv metadata.

1129886 results (page 72 of 45196)

RoomRecon: High-Quality Textured Room Layout Reconstruction on Mobile Devices

2604.19025 cs.RO 2026-04-21 PDF (arxiv)

Seok Joon Kim, Dinh Duc Cao, Federica Spinola, Se Jin Lee, Kyu Sung Cho

Widespread RGB-Depth (RGB-D) sensors and advanced 3D reconstruction technologies facilitate the capture of indoor spaces, improving the fields of augmented reality (AR), virtual reality (VR), and extended reality (XR). Nevertheless, current technologies still face limitations, such as the inability to reflect minor scene changes without a complete recapture, the lack of semantic scene understandin…

Open PDF (arxiv)
Policy Gradient Primal-Dual Method for Safe Reinforcement Learning from Human Feedback

2604.19024 cs.LG 2026-04-21 PDF (arxiv)

Qiang Liu, Adrienne Kline, Ermin Wei

Safe Reinforcement Learning from Human Feedback (Safe RLHF) has recently achieved empirical success in developing helpful and harmless large language models by decoupling human preferences regarding helpfulness and harmlessness. Existing approaches typically rely on fitting fixed horizon reward models from human feedback and have only been validated empirically. In this paper, we formulate safe RL…

Open PDF (arxiv)
Seeing at Timau National Observatory Based on ERA5 Dataset

2604.19023 astro-ph.IM 2026-04-21 PDF (arxiv)

Rhorom Priyatikanto, Gerhana P. Putri, Clara Y. Yatini, Isfahani Rusyda, Evan I. Akbar, Agustinus G. Admiranto, Ferdhiansyah Noor, Siti Maryam, Elyyani, Abd Rachman

Understanding the seeing conditions is crucial for astronomical observations using a ground-based telescope. This study analyzes long-term atmospheric data (2002-2021) from the ERA5 dataset to assess the seeing conditions at the new Timau National Observatory in Indonesia, which will house a 3.8-meter optical telescope. While the ERA5 dataset shows good agreement with radiosonde data for temperatu…

Open PDF (arxiv)
On Accelerating Grounded Code Development for Research

2604.19022 cs.AI 2026-04-21 PDF (arxiv)

Santosh Ganji

A major challenge for niche scientific and technical domains in leveraging coding agents is the lack of access to up-to-date, domain- specific knowledge. Foundational models often demonstrate limited reasoning capabilities in specialized fields and cannot inherently incorporate knowledge that evolves through ongoing research and experimentation. Materials scientists exploring novel compounds, comm…

Open PDF (arxiv)
FG$^2$-GDN: Enhancing Long-Context Gated Delta Networks with Doubly Fine-Grained Control

2604.19021 cs.LG 2026-04-21 PDF (arxiv)

Pingwei Sun, Yuxuan Hu, Jianchao Tan, Xue Wang, Jiaqi Zhang, Yifan Lu, Yerui Sun, Yuchen Xie, Xunliang Cai

Linear attention mechanisms have emerged as promising alternatives to softmax attention, offering linear-time complexity during inference. Recent advances such as Gated DeltaNet (GDN) and Kimi Delta Attention (KDA) have demonstrated that the delta rule, an online gradient descent update, enables superior associative recall compared to simple additive updates. While KDA refined the coarse head-wise…

Open PDF (arxiv)
Local Linearity of LLMs Enables Activation Steering via Model-Based Linear Optimal Control

2604.19018 cs.LG 2026-04-21 PDF (arxiv)

Julian Skifstad, Xinyue Annie Yang, Glen Chou

Inference-time LLM alignment methods, particularly activation steering, offer an alternative to fine-tuning by directly modifying activations during generation. Existing methods, however, often rely on non-anticipative interventions that ignore how perturbations propagate through transformer layers and lack online error feedback, resulting in suboptimal, open-loop control. To address this, we show…

Open PDF (arxiv)
AlignCultura: Towards Culturally Aligned Large Language Models?

2604.19016 cs.CL 2026-04-21 PDF (arxiv)

Gautam Siddharth Kashyap, Mark Dras, Usman Naseem

Cultural alignment in Large Language Models (LLMs) is essential for producing contextually aware, respectful, and trustworthy outputs. Without it, models risk generating stereotyped, insensitive, or misleading responses that fail to reflect cultural diversity w.r.t Helpful, Harmless, and Honest (HHH) paradigm. Existing benchmarks represent early steps toward cultural alignment; yet, no benchmarks …

Open PDF (arxiv)
FedProxy: Federated Fine-Tuning of LLMs via Proxy SLMs and Heterogeneity-Aware Fusion

2604.19015 cs.LG 2026-04-21 PDF (arxiv)

Tao Fan, Guoqiang Ma, Yuanfeng Song, Lixin Fan, Kai Chen, Qiang Yang

Federated fine-tuning of Large Language Models (LLMs) is obstructed by a trilemma of challenges: protecting LLMs intellectual property (IP), ensuring client privacy, and mitigating performance loss on heterogeneous data. Existing methods like Offsite-Tuning (OT) secure the LLMs IP by having clients train only lightweight adapters, yet our analysis reveals they suffer from a fundamental performance…

Open PDF (arxiv)
Quantitative Verification of Finite-Time Constrained Occupation Measures for Continuous-time Stochastic Systems

2604.19014 eess.SY 2026-04-21 PDF (arxiv)

Bai Xue, C. -H. Luke Ong

This paper addresses the quantitative verification of finite-time constrained occupation time for stochastic continuous-time systems governed by stochastic differential equations (SDEs). Unlike classical reachability analysis, which focuses on single-event properties such as entering a target set, many autonomous tasks-including surveillance, wireless charging, and chemical mixing-require a system…

Open PDF (arxiv)
Security Is Relative: Training-Free Vulnerability Detection via Multi-Agent Behavioral Contract Synthesis

2604.19012 cs.CR 2026-04-21 PDF (arxiv)

Yongchao Wang, Zhiqiu Huang

Deep learning for vulnerability detection has shown promising results on early benchmarks, but recent evaluations reveal catastrophic degradation: models achieving F1 > 0.68 on legacy datasets collapse to 0.031 under strict deduplication. We identify the root cause as the semantic ambiguity problem: identical code can be secure or vulnerable depending on project-specific behavioral contracts, rend…

Open PDF (arxiv)
Accelerating trajectory optimization with Sobolev-trained diffusion policies

2604.19011 cs.LG 2026-04-21 PDF (arxiv)

Théotime Le Hellard, Franki Nguimatsia Tiofack, Quentin Le Lidec, Justin Carpentier

Trajectory Optimization (TO) solvers exploit known system dynamics to compute locally optimal trajectories through iterative improvements. A downside is that each new problem instance is solved independently; therefore, convergence speed and quality of the solution found depend on the initial trajectory proposed. To improve efficiency, a natural approach is to warm-start TO with initial guesses pr…

Open PDF (arxiv)
SSB-Based Sensing-Assisted Robust Beamforming for High-Mobility UAV Communications in LAWN

2604.19010 eess.SP 2026-04-21 PDF (arxiv)

Aimin Tang, Shuhan Wang, Yin Xu

High-mobility uncrewed aerial vehicle (UAV) communications in low-altitude wireless networks (LAWN) demand reliable beamforming, while conventional feedback-based schemes suffer from excessive overhead and severe misalignment under rapid trajectory variations. To address this challenge, this paper proposes an SSB-based sensing-assisted predictive robust beamforming framework that replaces explicit…

Open PDF (arxiv)
Guiding Distribution Matching Distillation with Gradient-Based Reinforcement Learning

2604.19009 cs.LG 2026-04-21 PDF (arxiv)

Linwei Dong, Ruoyu Guo, Ge Bai, Zehuan Yuan, Yawei Luo, Changqing Zou

Diffusion distillation, exemplified by Distribution Matching Distillation (DMD), has shown great promise in few-step generation but often sacrifices quality for sampling speed. While integrating Reinforcement Learning (RL) into distillation offers potential, a naive fusion of these two objectives relies on suboptimal raw sample evaluation. This sample-based scoring creates inherent conflicts with …

Open PDF (arxiv)
Optimal Online and Offline Algorithms for Contextual MNL with Applications to Assortment and Pricing

2604.19008 math.OC 2026-04-21 PDF (arxiv)

Yunfan Zhang, Yuxuan Han, Hongyu Shan, Jose Blanchet, Zhengyuan Zhou

Selecting which products to display and at what prices is a central decision in retail and e-commerce operations. In many applications, these two choices must be made jointly under limited display capacity and uncertain customer demand. In this paper, we study the joint assortment and pricing problem under a price-based contextual multinomial logit model, where customer preferences depend on both …

Open PDF (arxiv)
ExplainS2A: Explainable Spectral-Spatial Duality Model for Fast Transforming Sentinel-2 Image to AVIRIS-Level Hyperspectral Image

2604.19007 eess.IV 2026-04-21 PDF (arxiv)

Chia-Hsiang Lin, Zi-Chao Leng

Mainstream optical satellites often acquire multispectral multi-resolution images, which have limited material identifiability compared to the HSIs. Thus, spectrally super-resolving the MSI into their hyperspectral counterparts greatly facilitates remote material identification and the downstream tasks. However, spectrally super-resolving the MSI into an HSI is often constrained by the multi-resol…

Open PDF (arxiv)
Debating the Unspoken: Role-Anchored Multi-Agent Reasoning for Half-Truth Detection

2604.19005 cs.CL 2026-04-21 PDF (arxiv)

Yixuan Tang, Yirui Zhang, Hang Feng, Anthony K. H. Tung

Half-truths, claims that are factually correct yet misleading due to omitted context, remain a blind spot for fact verification systems focused on explicit falsehoods. Addressing such omission-based manipulation requires reasoning not only about what is said, but also about what is left unsaid. We propose RADAR, a role-anchored multi-agent debate framework for omission-aware fact verification unde…

Open PDF (arxiv)
Ocean: Fast Estimation-Based Sparse General Matrix-Matrix Multiplication on GPU

2604.19004 cs.DC 2026-04-21 PDF (arxiv)

Yifan Li, Giulia Guidi

In computational science and data analytics, many workloads involve irregular and sparse computations that are inherently difficult to optimize for modern hardware. A key kernel is Sparse General Matrix-Matrix Multiplication (SpGEMM), which underpins simulations, graph analytics, and machine learning applications. SpGEMM exhibits irregular memory access patterns and workload imbalance, making it c…

Open PDF (arxiv)
When Safety Fails Before the Answer: Benchmarking Harmful Behavior Detection in Reasoning Chains

2604.19001 cs.CL 2026-04-21 PDF (arxiv)

Ishita Kakkar, Enze Zhang, Rheeya Uppaal, Junjie Hu

Large reasoning models (LRMs) produce complex, multi-step reasoning traces, yet safety evaluation remains focused on final outputs, overlooking how harm emerges during reasoning. When jailbroken, harm does not appear instantaneously but unfolds through distinct behavioral steps such as suppressing refusal, rationalizing compliance, decomposing harmful tasks, and concealing risk. However, no existi…

Open PDF (arxiv)
Decompose, Structure, and Repair: A Neuro-Symbolic Framework for Autoformalization via Operator Trees

2604.19000 cs.LG 2026-04-21 PDF (arxiv)

Xiaoyang Liu, Zineng Dong, Yifan Bai, Yantao Li, Yuntian Liu, Tao Luo

Statement autoformalization acts as a critical bridge between human mathematics and formal mathematics by translating natural language problems into formal language. While prior works have focused on data synthesis and diverse training paradigms to optimize end-to-end Large Language Models (LLMs), they typically treat formal code as flat sequences, neglecting the hierarchical logic inherent in mat…

Open PDF (arxiv)
A Data-embedded Solution Paradigm for Nonconvex Probable Event Constrained Optimization

2604.18997 math.OC 2026-04-21 PDF (arxiv)

Qifeng Li

This paper introduces a new modeling framework for optimization under uncertainty, called Probable Event Constrained Optimization (PECO). Unlike conventional chance-constrained formulations, which only limit the probability of constraint violation, PECO also explicitly requires feasibility for all events whose probability exceeds a prescribed threshold. This guarantees that solutions remain valid …

Open PDF (arxiv)
$R^2$-dLLM: Accelerating Diffusion Large Language Models via Spatio-Temporal Redundancy Reduction

2604.18995 cs.CL 2026-04-21 PDF (arxiv)

Zhenbang Du, Kejing Xia, Xinrui Zhong, Yonggan Fu, Nicolai Oswald, Binfei Ji, Brucek Khailany, Pavlo Molchanov, Yingyan Lin

Diffusion Large Language Models (dLLMs) have emerged as a promising alternative to autoregressive generation by enabling parallel token prediction. However, practical dLLM decoding still suffers from high inference latency, which limits deployment. In this work, we observe that a substantial part of this inefficiency comes from recurring redundancy in the decoding process, including spatial redund…

Open PDF (arxiv)
AutoAWG: Adverse Weather Generation with Adaptive Multi-Controls for Automotive Videos

2604.18993 cs.CV 2026-04-21 PDF (arxiv)

Jiagao Hu, Daiguo Zhou, Danzhen Fu, Fuhao Li, Zepeng Wang, Fei Wang, Wenhua Liao, Jiayi Xie, Haiyang Sun

Perception robustness under adverse weather remains a critical challenge for autonomous driving, with the core bottleneck being the scarcity of real-world video data in adverse weather. Existing weather generation approaches struggle to balance visual quality and annotation reusability. We present AutoAWG, a controllable Adverse Weather video Generation framework for Autonomous driving. Our method…

Open PDF (arxiv)
Estimating galactic foreground with the population of resolved galactic binaries

2604.18992 astro-ph.CO 2026-04-21 PDF (arxiv)

Yang Jiang, Qing-Guo Huang

The stochastic gravitational wave background in the mHz band is a key target for future spaceborne interferometers. Detecting such a signal presents multiple challenges for data processing, especially complicated by the presence of numerous compact binaries in our galaxy. The superposition of gravitational waves from their inspiral stages creates a confusion foreground that need to be estimated ac…

Open PDF (arxiv)
A Multi-Agent Framework with Structured Reasoning and Reflective Refinement for Multimodal Empathetic Response Generation

2604.18988 cs.CV 2026-04-21 PDF (arxiv)

Liping Wang, Cheng Ye, Weidong Chen, Peipei Song, Bo Hu, Zhendong Mao

Multimodal empathetic response generation (MERG) aims to generate emotionally engaging and empathetic responses based on users' multimodal contexts. Existing approaches usually rely on an implicit one-pass generation paradigm from multimodal context to the final response, which overlooks two intrinsic characteristics of MERG: (1) Human perception of emotional cues is inherently structured rather t…

Open PDF (arxiv)
Inertia Matching Principle: Improving Transient Synchronization Stability in Hybrid Power Systems With VSGs and SGs

2604.18987 eess.SY 2026-04-21 PDF (arxiv)

Changjun He, Li Zhang, Qi Liu, Rui Zou

This paper investigates the transient synchronization stability in power systems hybridized with virtual synchronous generators (VSGs) and synchronous generators (SGs). A relative swing equation model is established to capture the transient synchronization dynamics between the VSG and the SG. Based on this model, both static and dynamic characteristics are systematically analyzed, and a quantitati…

Open PDF (arxiv)