-
Analytical Extraction of Conditional Sobol' Indices via Basis Decomposition of Polynomial Chaos Expansions
In uncertainty quantification, evaluating sensitivity measures under specific conditions (i.e., conditional Sobol' indices) is essential for systems with parameterized responses, such as spatial fields or varying operating conditions. Traditional approaches often rely on point-wise modeling, which is computationally expensive and may lack consistency across the parameter space. This paper demonstr…
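For context, in the standard (unconditional) setting, first-order Sobol' indices can be read directly off the coefficients of an orthonormal polynomial chaos expansion: the variance contributed by input i is the sum of squared coefficients of basis terms in which only variable i is active. The sketch below illustrates that basis-decomposition idea only; the function name and the multi-index coefficient encoding are ours, not the paper's, and the paper's conditional extension is not shown.

```python
# Illustrative sketch: first-order Sobol' indices from PCE coefficients.
# Assumes an orthonormal basis; coefficients are keyed by multi-index tuples
# (one degree entry per input variable). Not the paper's conditional method.
def first_order_sobol(coeffs):
    """Return the first-order Sobol' index of each input variable."""
    # Total variance: sum of squared coefficients over all non-constant terms.
    total_var = sum(c * c for a, c in coeffs.items() if any(a))
    dim = len(next(iter(coeffs)))
    indices = []
    for i in range(dim):
        # Partial variance: terms where only variable i has nonzero degree.
        var_i = sum(
            c * c for a, c in coeffs.items()
            if a[i] > 0 and all(d == 0 for j, d in enumerate(a) if j != i)
        )
        indices.append(var_i / total_var)
    return indices

# Toy 2-input expansion: constant + linear terms + one interaction term.
coeffs = {(0, 0): 1.0, (1, 0): 2.0, (0, 1): 1.0, (1, 1): 1.0}
print(first_order_sobol(coeffs))  # S1 = 4/6, S2 = 1/6
```

Note that the interaction term (1, 1) contributes to total variance but to neither first-order index, which is why the two indices sum to less than one here.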
-
Mind the Unseen Mass: Unmasking LLM Hallucinations via Soft-Hybrid Alphabet Estimation
This paper studies uncertainty quantification for large language models (LLMs) under black-box access, where only a small number of responses can be sampled for each query. In this setting, estimating the effective semantic alphabet size--that is, the number of distinct meanings expressed in the sampled responses--provides a useful proxy for downstream risk. However, frequency-based estimators ten…
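As background for the "unseen mass" problem named in the title, a classical frequency-based baseline is the Good-Turing estimate: the probability mass of meanings not yet observed is approximated by the fraction of sampled responses whose meaning occurred exactly once. The sketch below shows only that standard baseline, assuming responses are already clustered into discrete meaning labels; it is not the paper's soft-hybrid estimator.

```python
# Illustrative Good-Turing missing-mass estimate: a standard frequency-based
# baseline for unseen semantic mass, not the paper's method. Assumes each
# sampled response has already been mapped to a discrete meaning label.
from collections import Counter

def good_turing_missing_mass(meaning_labels):
    """Estimate the probability mass of unseen meanings as N1 / n,
    where N1 counts meanings observed exactly once among n samples."""
    counts = Counter(meaning_labels)
    n = len(meaning_labels)
    n1 = sum(1 for c in counts.values() if c == 1)
    return n1 / n

# Five sampled meanings, three of which are singletons: estimate 3/5.
responses = ["yes", "yes", "no", "maybe", "unsure"]
print(good_turing_missing_mass(responses))  # → 0.6
```

With only a handful of samples per query, singleton counts dominate, which is exactly the small-sample regime the abstract highlights.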
-
Distributed Multi-Sensor Control for Multi-Target Tracking Using Adaptive Complementary Fusion for LMB Densities
Tracking multiple targets in dynamic environments using distributed sensor networks is a fundamental problem in statistical signal processing. In such scenarios, the network of mobile sensors must coordinate their actions to accurately estimate the locations and trajectories of multiple targets, balancing limited computation and communication resources with multi-target tracking accuracy. Multi-se…
-
MSDS: Deep Structural Similarity with Multiscale Representation
Deep-feature-based perceptual similarity models have demonstrated strong alignment with human visual perception in Image Quality Assessment (IQA). However, most existing approaches operate at a single spatial scale, implicitly assuming that structural similarity at a fixed resolution is sufficient. The role of spatial scale in deep-feature similarity modeling thus remains insufficiently understood…
-
SAW-INT4: System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving
KV-cache memory is a major bottleneck in real-world LLM serving, where systems must simultaneously support latency-sensitive small-batch requests and high-throughput concurrent workloads. Although many KV-cache compression methods improve offline accuracy or compression ratio, they often violate practical serving constraints such as paged memory layouts, regular memory access, and fused attention …
-
Transfer Learning for Degree-Corrected Mixed Membership Network Models
Statistical analysis of network data has attracted considerable attention in recent years, due to the rapid advancement of well-trained network models and the accessibility of large public network datasets. In this article, we propose a transfer learning procedure for boosting estimation accuracy of a target network structure based on the well-known Degree-Corrected Mixed-Membership (DCMM) model i…
-
Voice of India: A Large-Scale Benchmark for Real-World Speech Recognition in India
Existing Indic ASR benchmarks often use scripted, clean speech and leaderboard-driven evaluation that encourages dataset-specific overfitting. In addition, strict single-reference WER penalizes natural spelling variation in Indian languages, including non-standardized spellings of code-mixed English-origin words. To address these limitations, we introduce Voice of India, a closed-source benchmark …
-
The General Formulation of Loss-Based Priors for Parameter Spaces
Loss-based priors assign probability mass to parameter values according to the inferential loss incurred when they are excluded from the parameter space, and provide a general solution for discrete parameters. Extending this idea to continuous settings is challenging, as the exclusion of a single point induces no loss. We propose a neighbourhood-exclusion framework in which inferential loss is def…
-
How Do Answer Tokens Read Reasoning Traces? Self-Reading Patterns in Thinking LLMs for Quantitative Reasoning
Thinking LLMs produce reasoning traces before answering. Prior activation-steering work mainly targets shaping these traces; it remains less understood how answer tokens actually read and integrate the reasoning to produce reliable outcomes. Focusing on quantitative reasoning, we analyze the answer-to-reasoning attention and observe a benign self-reading pattern aligned with correctness, charac…
-
Multi-Step Gaussian Process Propagation for Adaptive Path Planning
Efficient and robust path planning hinges on combining all accessible information sources. In particular, the task of path planning for robotic environmental exploration and monitoring depends highly on the current belief of the world. To capture the uncertainty in the belief, we present a Gaussian process based path planning method that adapts to multi-modal environmental sensing data and incorpo…
-
Nexusformer: Nonlinear Attention Expansion for Stable and Inheritable Transformer Scaling
Scaling Transformers typically necessitates training larger models from scratch, as standard architectures struggle to expand without discarding learned representations. We identify the primary bottleneck in the attention mechanism's linear projections, which strictly confine feature extraction to fixed-dimensional subspaces, limiting both expressivity and incremental capacity. To address this, we…
-
RL-ABC: Reinforcement Learning for Accelerator Beamline Control
Particle accelerator beamline optimization is a high-dimensional control problem traditionally requiring significant expert intervention. We present RL-ABC (Reinforcement Learning for Accelerator Beamline Control), an open-source Python framework that automatically transforms standard Elegant beamline configurations into reinforcement learning environments. RL-ABC integrates with the widely used Ele…
-
ST-Prune: Training-Free Spatio-Temporal Token Pruning for Vision-Language Models in Autonomous Driving
Vision-Language Models (VLMs) have become central to autonomous driving systems, yet their deployment is severely bottlenecked by the massive computational overhead of multi-view camera and multi-frame video input. Existing token pruning methods, primarily designed for single-image inputs, treat each frame or view in isolation and thus fail to exploit the inherent spatio-temporal redundancies in d…
-
ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation
Recent years have witnessed growing interest in applying Large Reasoning Models (LRMs) to Machine Translation (MT). Existing approaches predominantly adopt a "think-first-then-translate" paradigm. Although explicit reasoning trajectories significantly enhance translation quality, they incur prohibitive inference costs and latency. To address these limitations, we propose ReflectMT, a two-stage ref…
-
Towards More Empathic Programming Environments: An Experimental Empathic AI-Enhanced IDE
As generative AI becomes integral to software development, the risk of over-reliance and diminished critical thinking grows. This study introduces "Ceci," our Caring Empathic C IDE designed to support novice programmers by prioritizing learning and emotional support over direct code generation. We conducted a comparative pilot study between Ceci and VSCode + ChatGPT [9, 40]. Participa…
-
Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
Diffusion- and flow-based models usually allocate compute uniformly across space, updating all patches with the same timestep and number of function evaluations. While convenient, this ignores the heterogeneity of natural images: some regions are easy to denoise, whereas others benefit from more refinement or additional context. Motivated by this, we explore patch-level noise scales for image synt…
-
The Rise of Verbal Tics in Large Language Models: A Systematic Analysis Across Frontier Models
As Large Language Models (LLMs) continue to evolve through alignment techniques such as Reinforcement Learning from Human Feedback (RLHF) and Constitutional AI, a growing and increasingly conspicuous phenomenon has emerged: the proliferation of verbal tics -- repetitive, formulaic linguistic patterns that pervade model outputs. These range from sycophantic openers ("That's a great question!", "Awe…
-
Moderately beyond clique-width: reduced component max-leaf and related parameters
Reduced parameters [BKW, JCTB '26; BKRT, SODA '22] are defined via contraction sequences. Based on this framework, we introduce the reduced component max-leaf, denoted by $\operatorname{cml}^\downarrow$, where component max-leaf is the maximum number of leaves in any spanning tree of any connected component. Reduced component max-leaf is strictly sandwiched between clique-width and reduced bandwid…
-
Construction of Knowledge Graph based on Language Model
Knowledge Graphs (KGs) can effectively integrate valuable information from massive data, and have therefore been rapidly developed and widely applied in many fields. Traditional KG construction methods rely on manual annotation, which is time- and labor-intensive, while deep-learning-based construction schemes tend to generalize poorly. With the rapid development of Pre-t…
-
A Constrained Formulation for Simultaneous Line Parameter Estimation and Instrument Transformer Calibration
The process of calibrating instrument transformers (ITs) has been greatly simplified by using phasor measurement unit (PMU) data since this process eliminates the need for (a) additional hardware, and (b) taking ITs offline. However, such simplification comes at the cost of knowing the line parameters, whose estimation using PMU data in turn requires calibrated ITs. To solve this interdependency p…
-
Diff-SBSR: Learning Multimodal Feature-Enhanced Diffusion Models for Zero-Shot Sketch-Based 3D Shape Retrieval
This paper presents the first exploration of text-to-image diffusion models for zero-shot sketch-based 3D shape retrieval (ZS-SBSR). Existing sketch-based 3D shape retrieval methods struggle in zero-shot settings due to the absence of category supervision and the extreme sparsity of sketch inputs. Our key insight is that large-scale pretrained diffusion models inherently exhibit open-vocabulary ca…
-
BALTIC: A Benchmark and Cross-Domain Strategy for 3D Reconstruction Across Air and Underwater Domains Under Varying Illumination
Robust 3D reconstruction across varying environmental conditions remains a critical challenge for robotic perception, particularly when transitioning between air and water. To address this, we introduce BALTIC, a controlled benchmark designed to systematically evaluate modern 3D reconstruction methods under variations in medium and lighting. The benchmark comprises 13 datasets spanning two media (…
-
Has Automated Essay Scoring Reached Sufficient Accuracy? Deriving Achievable QWK Ceilings from Classical Test Theory
Automated essay scoring (AES) is commonly evaluated on public benchmarks using quadratic weighted kappa (QWK). However, because benchmark labels are assigned by human raters and inevitably contain scoring errors, it remains unclear both what QWK is theoretically attainable and what level is practically sufficient for deployment. We therefore derive two dataset-specific QWK ceilings based on the re…
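Since the abstract centers on QWK, a minimal reference implementation of the metric itself may be useful; the sketch below assumes integer scores on {0, ..., K-1} and uses the standard quadratic disagreement weights. It computes the metric only and says nothing about the paper's derived ceilings.

```python
# Quadratic weighted kappa (QWK), the standard AES evaluation metric.
# A minimal sketch assuming integer scores in {0, ..., num_classes - 1}.
def quadratic_weighted_kappa(rater_a, rater_b, num_classes):
    """QWK between two equal-length integer score vectors."""
    n = len(rater_a)
    # Observed joint score distribution (normalized confusion matrix).
    O = [[0.0] * num_classes for _ in range(num_classes)]
    for i, j in zip(rater_a, rater_b):
        O[i][j] += 1.0 / n
    # Marginals, used for the expected matrix under independent raters.
    pa = [sum(row) for row in O]
    pb = [sum(O[i][j] for i in range(num_classes)) for j in range(num_classes)]
    num = den = 0.0
    for i in range(num_classes):
        for j in range(num_classes):
            # Quadratic penalty grows with the squared score distance.
            w = (i - j) ** 2 / (num_classes - 1) ** 2
            num += w * O[i][j]
            den += w * pa[i] * pb[j]
    return 1.0 - num / den

print(quadratic_weighted_kappa([0, 1, 2, 3], [0, 1, 2, 3], 4))  # → 1.0
```

Perfect agreement yields 1.0 and chance-level agreement yields 0.0, which is why rater-noise-induced ceilings below 1.0, as studied in the abstract, matter for interpreting benchmark scores.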
-
PortraitDirector: A Hierarchical Disentanglement Framework for Controllable and Real-time Facial Reenactment
Existing facial reenactment methods struggle with a trade-off between expressiveness and fine-grained controllability. Holistic facial reenactment models often sacrifice granular control for expressiveness, while methods designed for control may struggle with fidelity and robust disentanglement. Instead of treating facial motion as a monolithic signal, we explore an alternative compositional persp…
-
GraphRAG-IRL: Personalized Recommendation with Graph-Grounded Inverse Reinforcement Learning and LLM Re-ranking
Personalized recommendation requires models that capture sequential user preferences while remaining robust to sparse feedback and semantic ambiguity. Recent work has explored large language models (LLMs) as recommenders and re-rankers, but pure prompt-based ranking often suffers from poor calibration, sensitivity to candidate ordering, and popularity bias. These limitations make LLMs useful seman…