Archon

Browse and search harvested arxiv metadata.

953555 results (page 36 of 38143)

RL-ABC: Reinforcement Learning for Accelerator Beamline Control

2604.19146 cs.LG 2026-04-21 PDF (arxiv)

Anwar Ibrahim, Fedor Ratnikov, Maxim Kaledin, Alexey Petrenko, Denis Derkach

Particle accelerator beamline optimization is a high-dimensional control problem traditionally requiring significant expert intervention. We present RLABC (Reinforcement Learning for Accelerator Beamline Control), an open-source Python framework that automatically transforms standard Elegant beamline configurations into reinforcement learning environments. RLABC integrates with the widely-used Ele…

Open PDF (arxiv)
ST-Prune: Training-Free Spatio-Temporal Token Pruning for Vision-Language Models in Autonomous Driving

2604.19145 cs.CV 2026-04-21 PDF (arxiv)

Lin Sha, Haiyun Guo, Tao Wang, Cong Zhang, Min Huang, Jinqiao Wang, Qinghai Miao

Vision-Language Models (VLMs) have become central to autonomous driving systems, yet their deployment is severely bottlenecked by the massive computational overhead of multi-view camera and multi-frame video input. Existing token pruning methods, primarily designed for single-image inputs, treat each frame or view in isolation and thus fail to exploit the inherent spatio-temporal redundancies in d…

Open PDF (arxiv)
ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation

2604.19144 cs.CL 2026-04-21 PDF (arxiv)

Kunquan Li, Yingxue Zhang, Fandong Meng, Jinsong Su

Recent years have witnessed growing interest in applying Large Reasoning Models (LRMs) to Machine Translation (MT). Existing approaches predominantly adopt a "think-first-then-translate" paradigm. Although explicit reasoning trajectories significantly enhance translation quality, they incur prohibitive inference costs and latency. To address these limitations, we propose ReflectMT, a two-stage ref…

Open PDF (arxiv)
Towards More Empathic Programming Environments: An Experimental Empathic AI-Enhanced IDE

2604.19142 cs.SE 2026-04-21 PDF (arxiv)

Justin Rainier Go, Kurt Christian Andaya, Roemer Gabriel Caliboso, Aaron Daniel Go, Jocelynn Cu

As generative AI becomes integral to software development, the risk of over-reliance and diminished critical thinking grows. This study introduces "Ceci," our Caring Empathic C IDE designed to support novice programmers by prioritizing learning and emotional support over direct code generation. The researchers conducted a comparative pilot study between Ceci and VSCode + ChatGPT [9, 40]. Participa…

Open PDF (arxiv)
Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation

2604.19141 cs.CV 2026-04-21 PDF (arxiv)

Johannes Schusterbauer, Ming Gui, Yusong Li, Pingchuan Ma, Felix Krause, Björn Ommer

Diffusion- and flow-based models usually allocate compute uniformly across space, updating all patches with the same timestep and number of function evaluations. While convenient, this ignores the heterogeneity of natural images: some regions are easy to denoise, whereas others benefit from more refinement or additional context. Motivated by this, we explore patch-level noise scales for image synt…

Open PDF (arxiv)
The Rise of Verbal Tics in Large Language Models: A Systematic Analysis Across Frontier Models

2604.19139 cs.CL 2026-04-21 PDF (arxiv)

Shuai Wu, Xue Li, Yanna Feng, Yufang Li, Zhijun Wang, Ran Wang

As Large Language Models (LLMs) continue to evolve through alignment techniques such as Reinforcement Learning from Human Feedback (RLHF) and Constitutional AI, a growing and increasingly conspicuous phenomenon has emerged: the proliferation of verbal tics -- repetitive, formulaic linguistic patterns that pervade model outputs. These range from sycophantic openers ("That's a great question!", "Awe…

Open PDF (arxiv)
Moderately beyond clique-width: reduced component max-leaf and related parameters

2604.19138 cs.DS 2026-04-21 PDF (arxiv)

Édouard Bonnet, Yeonsu Chang, Julien Duron, Colin Geniet, O-joung Kwon

Reduced parameters [BKW, JCTB '26; BKRT, SODA '22] are defined via contraction sequences. Based on this framework, we introduce the reduced component max-leaf, denoted by $\operatorname{cml}^\downarrow$, where component max-leaf is the maximum number of leaves in any spanning tree of any connected component. Reduced component max-leaf is strictly sandwiched between clique-width and reduced bandwid…

Open PDF (arxiv)
Construction of Knowledge Graph based on Language Model

2604.19137 cs.CL 2026-04-21 PDF (arxiv)

Qiubai Zhu, Qingwang Wang, Haibin Yuan, Wei Chen, Tao Shen

Knowledge Graph (KG) can effectively integrate valuable information from massive data, and thus has been rapidly developed and widely used in many fields. Traditional KG construction methods rely on manual annotation, which often consumes a lot of time and manpower. And KG construction schemes based on deep learning tend to have weak generalization capabilities. With the rapid development of Pre-t…

Open PDF (arxiv)
A Constrained Formulation for Simultaneous Line Parameter Estimation and Instrument Transformer Calibration

2604.19136 eess.SY 2026-04-21 PDF (arxiv)

Antos Cheeramban Varghese, Rajasekhar Anguluri, Anamitra Pal

The process of calibrating instrument transformers (ITs) has been greatly simplified by using phasor measurement unit (PMU) data since this process eliminates the need for (a) additional hardware, and (b) taking ITs offline. However, such simplification comes at the cost of knowing the line parameters, whose estimation using PMU data in turn requires calibrated ITs. To solve this interdependency p…

Open PDF (arxiv)
Diff-SBSR: Learning Multimodal Feature-Enhanced Diffusion Models for Zero-Shot Sketch-Based 3D Shape Retrieval

2604.19135 cs.CV 2026-04-21 PDF (arxiv)

Hang Cheng, Fanhe Dong, Long Zeng

This paper presents the first exploration of text-to-image diffusion models for zero-shot sketch-based 3D shape retrieval (ZS-SBSR). Existing sketch-based 3D shape retrieval methods struggle in zero-shot settings due to the absence of category supervision and the extreme sparsity of sketch inputs. Our key insight is that large-scale pretrained diffusion models inherently exhibit open-vocabulary ca…

Open PDF (arxiv)
BALTIC: A Benchmark and Cross-Domain Strategy for 3D Reconstruction Across Air and Underwater Domains Under Varying Illumination

2604.19133 cs.CV 2026-04-21 PDF (arxiv)

Michele Grimaldi, David Nakath, Oscar Pizarro, Jonatan Scharff Willners, Ignacio Carlucho, Yvan R. Petillot

Robust 3D reconstruction across varying environmental conditions remains a critical challenge for robotic perception, particularly when transitioning between air and water. To address this, we introduce BALTIC, a controlled benchmark designed to systematically evaluate modern 3D reconstruction methods under variations in medium and lighting. The benchmark comprises 13 datasets spanning two media (…

Open PDF (arxiv)
Has Automated Essay Scoring Reached Sufficient Accuracy? Deriving Achievable QWK Ceilings from Classical Test Theory

2604.19131 cs.AI 2026-04-21 PDF (arxiv)

Masaki Uto

Automated essay scoring (AES) is commonly evaluated on public benchmarks using quadratic weighted kappa (QWK). However, because benchmark labels are assigned by human raters and inevitably contain scoring errors, it remains unclear both what QWK is theoretically attainable and what level is practically sufficient for deployment. We therefore derive two dataset-specific QWK ceilings based on the re…

Open PDF (arxiv)
PortraitDirector: A Hierarchical Disentanglement Framework for Controllable and Real-time Facial Reenactment

2604.19129 cs.CV 2026-04-21 PDF (arxiv)

Chaonan Ji, Jinwei Qi, Sheng Xu, Peng Zhang, Bang Zhang

Existing facial reenactment methods struggle with a trade-off between expressiveness and fine-grained controllability. Holistic facial reenactment models often sacrifice granular control for expressiveness, while methods designed for control may struggle with fidelity and robust disentanglement. Instead of treating facial motion as a monolithic signal, we explore an alternative compositional persp…

Open PDF (arxiv)
GraphRAG-IRL: Personalized Recommendation with Graph-Grounded Inverse Reinforcement Learning and LLM Re-ranking

2604.19128 cs.IR 2026-04-21 PDF (arxiv)

Siqi Liang, Xiawei Wang, Yudi Zhang, Jiaying Zhou

Personalized recommendation requires models that capture sequential user preferences while remaining robust to sparse feedback and semantic ambiguity. Recent work has explored large language models (LLMs) as recommenders and re-rankers, but pure prompt-based ranking often suffers from poor calibration, sensitivity to candidate ordering, and popularity bias. These limitations make LLMs useful seman…

Open PDF (arxiv)
Do Emotions Influence Moral Judgment in Large Language Models?

2604.19125 cs.CL 2026-04-21 PDF (arxiv)

Mohammad Saim, Tianyu Jiang

Large language models have been extensively studied for emotion recognition and moral reasoning as distinct capabilities, yet the extent to which emotions influence moral judgment remains underexplored. In this work, we develop an emotion-induction pipeline that infuses emotion into moral situations and evaluate shifts in moral acceptability across multiple datasets and LLMs. We observe a directio…

Open PDF (arxiv)
Detoxification for LLM: From Dataset Itself

2604.19124 cs.CL 2026-04-21 PDF (arxiv)

Wei Shao, Yihang Wang, Gaoyu Zhu, Ziqiang Cheng, Lei Yu, Jiafeng Guo, Xueqi Cheng

Existing detoxification methods for large language models mainly focus on post-training stage or inference time, while few tackle the source of toxicity, namely, the dataset itself. Such training-based or controllable decoding approaches cannot completely suppress the model's inherent toxicity, whereas detoxifying the pretraining dataset can fundamentally reduce the toxicity that the model learns …

Open PDF (arxiv)
A General Framework for Radial Velocity Calibration in Low-Resolution Spectroscopic Surveys: Correcting Wavelength-Dependent and Global Systematics with Application to LAMOST DR9

2604.19119 astro-ph.SR 2026-04-21 PDF (arxiv)

Jinming Zhang, Haibo Yuan, Zhijia Tian

Radial velocity (RV) is crucial for stellar kinematics and Galactic archaeology. The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) has obtained over ten million low-resolution spectra ($R \sim 1800$), yielding RVs for millions of stars, but these suffer from (1) wavelength-dependent inconsistencies (relative shifts between spectral segments) and (2) global zero-point offsets (…

Open PDF (arxiv)
DP-FlogTinyLLM: Differentially private federated log anomaly detection using Tiny LLMs

2604.19118 cs.CR 2026-04-21 PDF (arxiv)

Isaiah Thompson, Tanmay Sen, Ritwik Bhattacharya

Modern distributed systems generate massive volumes of log data that are critical for detecting anomalies and cyber threats. However, in real world settings, these logs are often distributed across multiple organizations and cannot be centralized due to privacy and security constraints. Existing log anomaly detection methods, including recent large language model (LLM) based approaches, largely re…

Open PDF (arxiv)
LLMs Know They're Wrong and Agree Anyway: The Shared Sycophancy-Lying Circuit

2604.19117 cs.LG 2026-04-21 PDF (arxiv)

Manav Pandey

When a language model agrees with a user's false belief, is it failing to detect the error, or noticing and agreeing anyway? We show the latter. Across twelve open-weight models from five labs, spanning small to frontier scale, the same small set of attention heads carries a "this statement is wrong" signal whether the model is evaluating a claim on its own or being pressured to agree with a user.…

Open PDF (arxiv)
LIVE: Learnable Monotonic Vertex Embedding for Efficient Exact Subgraph Matching (Technical Report)

2604.19116 cs.DB 2026-04-21 PDF (arxiv)

Yutong Ye, Weilong Ren, Yang Liu, Mengyi Yan, Ruijie Wang, Li Sun, Jianxin Li, Philip S. Yu

Exact subgraph matching is a fundamental graph operator that supports many graph analytics tasks, yet it remains computationally challenging due to its NP-completeness. Recent learning-based approaches accelerate query processing via dominance-preserving vertex embeddings, but they suffer from expensive offline training, limited pruning effectiveness, and heavy reliance on complex index structures…

Open PDF (arxiv)
Expert Upcycling: Shifting the Compute-Efficient Frontier of Mixture-of-Experts

2604.19835 cs.LG 2026-04-21 PDF (arxiv)

Chaitanya Dwivedi, Binxuan Huang, Himanshu Gupta, Pratik Jayarao, Neeraj Varshney, Bing Yin

Mixture-of-Experts (MoE) has become the dominant architecture for scaling large language models: frontier models routinely decouple total parameters from per-token computation through sparse expert routing. Scaling laws show that under fixed active computation, model quality scales predictably with total parameters, and MoEs realize this by increasing expert count. However, training large MoEs is …

Open PDF (arxiv)
KD-Judge: A Knowledge-Driven Automated Judge Framework for Functional Fitness Movements on Edge Devices

2604.19834 cs.CV 2026-04-21 PDF (arxiv)

Shaibal Saha, Fan Li, Yunge Li, Arun Iyengar, Lucas Alves, Lanyu Xu

Functional fitness movements are widely used in training, competition, and health-oriented exercise programs, yet consistently enforcing repetition (rep) standards remains challenging due to subjective human judgment, time constraints, and evolving rules. Existing AI-based approaches mainly rely on learned scoring or reference-based comparisons and lack explicit rule-based, limiting transparency a…

Open PDF (arxiv)
OOPrompt: Reifying Intents into Structured Artifacts for Modular and Iterative Prompting

2604.19114 cs.HC 2026-04-21 PDF (arxiv)

Tengyou Xu, Detao Ma, Xiang 'Anthony' Chen

The rise of large language models (LLMs) has given rise to a class of prompt-based interactive systems where users primarily express their input in natural language. However, composing a prompt as a linear text string becomes unwieldy when capturing users' multifaceted intents. We present Object-Oriented Prompting (OOPrompt), an emergent interaction paradigm that enables users to create, edit, ite…

Open PDF (arxiv)
Think Before Writing: Feature-Level Multi-Objective Optimization for Generative Citation Visibility

2604.19113 cs.IR 2026-04-21 PDF (arxiv)

Zikang Liu, Peilan Xu

Generative answer engines expose content through selective citation rather than ranked retrieval, fundamentally altering how visibility is determined. This shift calls for new optimization methods beyond traditional search engine optimization. Existing generative engine optimization (GEO) approaches primarily rely on token-level text rewriting, offering limited interpretability and weak control ov…

Open PDF (arxiv)
Governed Auditable Decisioning Under Uncertainty: Synthesis and Agentic Extension

2604.19112 cs.CY 2026-04-21 PDF (arxiv)

Oleg Solozobov

When automated decision systems fail, organizations frequently discover that formally compliant governance infrastructure cannot reconstruct what happened or why. This paper synthesizes an operational governance evidence framework -- structural accountability collapse diagnostics, decision trace schemas, evidence sufficiency measurement, and label-free monitoring -- into an integrated chain and an…

Open PDF (arxiv)