Archon

Browse and search harvested arxiv metadata.

1137106 results (page 74 of 45485)

Relationships Between Trust, Compliance, and Performance for Novice Programmers Using AI Code Generation

2604.18948 cs.HC 2026-04-21 PDF (arxiv)

Nicholas Gardella, Matthew L. Bolton, Sara L. Riggs

Objective. To explore how novice programmers' trust in Artificial Intelligence-driven Development Environments (AIDEs) relates to their coding performance and AI compliance while programming under time pressure. Background. Computer programming has undergone rapid upheaval due to state-of-the-art AIDEs, which provide clever automation for many aspects of software development. A longstanding intere…

Open PDF (arxiv)
Revisiting the distance and the globular cluster system of the remarkable galaxy UDG1 in the NGC 5846 group

2604.18947 astro-ph.GA 2026-04-21 PDF (arxiv)

Duncan A. Forbes, Bas van Heumen, Yimeng Tang

Two studies that utilised the same HST/WFC3 imaging of NGC5846_UDG1 have reported quite different total counts for its globular cluster (GC) system, i.e. 54 $\pm$ 9 vs 33 $\pm$ 3 GCs. In both cases they counted all GCs, that met their selection criteria, down to the faintest magnitudes. They also disagree as to whether NGC5846_UDG1 lies in the NGC 5846 group or well outside the group, in the field…

Open PDF (arxiv)
Reasoning Structure Matters for Safety Alignment of Reasoning Models

2604.18946 cs.AI 2026-04-21 PDF (arxiv)

Yeonjun In, Wonjoong Kim, Sangwu Park, Chanyoung Park

Large reasoning models (LRMs) achieve strong performance on complex reasoning tasks but often generate harmful responses to malicious user queries. This paper investigates the underlying cause of these safety risks and shows that the issue lies in the reasoning structure itself. Based on this insight, we claim that effective safety alignment can be achieved by altering the reasoning structure. We …

Open PDF (arxiv)
Relaxed Generalized Scalar Auxiliary Variable Exponential Integrator for A Modified Landau-de Gennes Theory for Smectic Liquid Crystals

2604.18945 math.NA 2026-04-21 PDF (arxiv)

Wenshuai Hu, Guanghua Ji, Xiao Li

The Smectic-A (SmA) phase is modeled by a modified Landau-de Gennes (mLdG) model proposed by Xia et al. [Phys. Rev. Lett., 126 (2021), 177801], in which a tensor order parameter Q for the orientational order is coupled with a real scalar $u$ characterizing the positional order. In this paper, we propose and analyze a novel, highly efficient, and unconditionally energy-stable numerical scheme for t…

Open PDF (arxiv)
A Mechanism and Optimization Study on the Impact of Information Density on User-Generated Content Named Entity Recognition

2604.18944 cs.CL 2026-04-21 PDF (arxiv)

Jiang Xiaobo, Dinghong Lai, Song Qiu, Yadong Deng, Xinkai Zhan

Named Entity Recognition (NER) models trained on clean, high-resource corpora exhibit catastrophic performance collapse when deployed on noisy, sparse User-Generated Content (UGC), such as social media. Prior research has predominantly focused on point-wise symptom remediation -- employing customized fine-tuning to address issues like neologisms, alias drift, non-standard orthography, long-tail en…

Open PDF (arxiv)
Personalized Benchmarking: Evaluating LLMs by Individual Preferences

2604.18943 cs.AI 2026-04-21 PDF (arxiv)

Cristina Garbacea, Heran Wang, Chenhao Tan

With the rise in capabilities of large language models (LLMs) and their deployment in real-world tasks, evaluating LLM alignment with human preferences has become an important challenge. Current benchmarks average preferences across all users to compute aggregate ratings, overlooking individual user preferences when establishing model rankings. Since users have varying preferences in different con…

Open PDF (arxiv)
Disparities In Negation Understanding Across Languages In Vision-Language Models

2604.18942 cs.CL 2026-04-21 PDF (arxiv)

Charikleia Moraitaki, Sarah Pan, Skyler Pulling, Gwendolyn Flusche, Kumail Alhamoud, Marzyeh Ghassemi

Vision-language models (VLMs) exhibit affirmation bias: a systematic tendency to select positive captions ("X is present") even when the correct description contains negation ("no X"). While prior work has documented this failure mode in English and proposed solutions, negation manifests differently across languages through varying morphology, word order, and cliticization patterns, raising the qu…

Open PDF (arxiv)
Path Integral Control for Partially Observed Systems with Controlled Sensing

2604.18941 eess.SY 2026-04-21 PDF (arxiv)

Goutam Das, Takashi Tanaka

Path integral control in Gaussian belief space requires a structural matching condition between the observation-driven diffusion of the belief mean and the actuation authority, which a fixed observation matrix cannot enforce. We treat the observation matrix as a control variable and show that constraining the sensing control to a measurable selector from the resulting matching set reduces the Hami…

Open PDF (arxiv)
Localization-Guided Foreground Augmentation in Autonomous Driving

2604.18940 cs.CV 2026-04-21 PDF (arxiv)

Jiawei Yong, Deyuan Qu, Qi Chen, Kentaro Oguchi, Shintaro Fukushima

Autonomous driving systems often degrade under adverse visibility conditions-such as rain, nighttime, or snow-where online scene geometry (e.g., lane dividers, road boundaries, and pedestrian crossings) becomes sparse or fragmented. While high-definition (HD) maps can provide missing structural context, they are costly to construct and maintain at scale. We propose Localization-Guided Foreground A…

Open PDF (arxiv)
TabEmb: Joint Semantic-Structure Embedding for Table Annotation

2604.18939 cs.LG 2026-04-21 PDF (arxiv)

Ehsan Hoseinzade, Ke Wang, Anandharaju Durai Raju

Table annotation is crucial for making web and enterprise tables usable in downstream NLP applications. Unlike textual data where learning semantically rich token or sentence embeddings often suffice, tables are structured combinations of columns wherein useful representations must jointly capture column's semantics and the inter-column relationships. Existing models learn by linearizing the 2D ta…

Open PDF (arxiv)
Fine-Tuning Small Reasoning Models for Quantum Field Theory

2604.18936 cs.LG 2026-04-21 PDF (arxiv)

Nathaniel S. Woodward, Zhiqi Gao, Yurii Kvasiuk, Kendrick M. Smith, Frederic Sala, Moritz Münchmeyer

Despite the growing application of Large Language Models (LLMs) to theoretical physics, there is little academic exploration into how domain-specific physics reasoning ability develops while training these models. To investigate this, we perform the first academic fine-tuning study of small (7B-parameter) reasoning models dedicated specifically to theoretical physics. Because open-source verifiabl…

Open PDF (arxiv)
AutomationBench

2604.18934 cs.AI 2026-04-21 PDF (arxiv)

Daniel Shepard, Robin Salimans

Existing AI benchmarks for software automation rarely combine cross-application coordination, autonomous API discovery, and policy adherence. Real business workflows demand all three: a single task may span a CRM, inbox, calendar, and messaging platform - requiring the agent to find the right endpoints, follow a policy document, and write correct data to each system. To address this gap, we introd…

Open PDF (arxiv)
Gated Memory Policy

2604.18933 cs.RO 2026-04-21 PDF (arxiv)

Yihuai Gao, Jinyun Liu, Shuang Li, Shuran Song

Robotic manipulation tasks exhibit varying memory requirements, ranging from Markovian tasks that require no memory to non-Markovian tasks that depend on historical information spanning single or multiple interaction trials. Surprisingly, simply extending observation histories of a visuomotor policy often leads to a significant performance drop due to distribution shift and overfitting. To address…

Open PDF (arxiv)
Tadabur: A Large-Scale Quran Audio Dataset

2604.18932 cs.SD 2026-04-21 PDF (arxiv)

Faisal Alherran

Despite growing interest in Quranic data research, existing Quran datasets remain limited in both scale and diversity. To address this gap, we present Tadabur, a large-scale Quran audio dataset. Tadabur comprises more than 1400+ hours of recitation audio from over 600 distinct reciters, providing substantial variation in recitation styles, vocal characteristics, and recording conditions. This dive…

Open PDF (arxiv)
Capacity Expansion Planning for Puerto Rico's Electric Power System

2604.18926 math.OC 2026-04-21 PDF (arxiv)

Elizabeth Glista, Tomas Valencia Zuluaga, Amelia Musselman, Minda Monteagudo, Juliette Franzman, Jean-Paul Watson

This study presents a mathematical optimization framework and preliminary analysis for long-term investment planning in Puerto Rico's electric power system. We develop a high-resolution capacity expansion model to identify least-cost generation and storage investments that improve system reliability. The model co-optimizes new investments and thermal generator retirements while representing genera…

Open PDF (arxiv)
Writing Blog Posts Helps Students Connect Experiential Learning to the Workplace

2604.18925 cs.CY 2026-04-21 PDF (arxiv)

Utsab Saha, Lola Egherman, Ramiz Rahman, Mohd Toukir Khan, Kevin Wang, Tyler Menezes

Undergraduates in work-based learning experiences often produce meaningful contributions as viewed by their supervisors, yet report a negative perception of their contributions because they struggled during the process or produced only a few lines of code change. As a result, many omit these contributions from their resumes and job interviews, losing a meaningful signal of technical ability. Thi…

Open PDF (arxiv)
Lindbladian Homotopy Analysis Method to Solve Nonlinear Partial Differential Equations

2604.18924 math.NA 2026-04-21 PDF (arxiv)

Eunsik Choi, Jungin E. Kim, Xueling Lu, Yan Wang

Quantum scientific computing is to solve engineering and science problems such as simulation and optimization on quantum computerss. Solving ordinary and partial differential equations (PDEs) is essential in simulations. However, existing quantum approaches to solve nonlinear PDEs suffer from the issues of curse of dimensionality and convergence during the linearization process. In this paper, a L…

Open PDF (arxiv)
Stellar separation shapes spin-orbit alignment in visual binaries

2604.18921 astro-ph.SR 2026-04-20 PDF (arxiv)

Michael Poon, Dang Pham, Marta L. Bryan, Hanno Rein, Jiayin Dong

Stellar binaries may form through several formation pathways, including disk or core fragmentation. Their spin-orbit angles are a signature of formation, although individual measurements for visual binaries are limited and broad. A seminal work by A. Hale (1994) found that visual binaries with separations $\lesssim 30$ AU tend to be more aligned, which laid the groundwork for binary formation theo…

Open PDF (arxiv)
Comparison of sEMG Encoding Accuracy Across Speech Modes Using Articulatory and Phoneme Features

2604.18920 cs.SD 2026-04-20 PDF (arxiv)

Chenqian Le, Ruisi Li, Beatrice Fumagalli, Xupeng Chen, Amirhossein Khalilian-Gourtani, Tianyu He, Adeen Flinker, Yao Wang

We test whether Speech Articulatory Coding (SPARC) features can linearly predict surface electromyography (sEMG) envelopes across aloud, mimed, and subvocal speech in twenty-four subjects. Using elastic-net multivariate temporal response function (mTRF) with sentence-level cross-validation, SPARC yields higher prediction accuracy than phoneme one-hot representations on nearly all electrodes and in…

Open PDF (arxiv)
Proposing Topic Models and Evaluation Frameworks for Analyzing Associations with External Outcomes: An Application to Leadership Analysis Using Large-Scale Corporate Review Data

2604.18919 cs.CL 2026-04-20 PDF (arxiv)

Yura Yoshida, Masato Kanai, Masataka Nakayama, Haruki Ohsawa, Yukiko Uchida, Arata Yuminaga, Gakuse Hoshina, Nobuo Sayama

Analyzing topics extracted from text data in relation to external outcomes is important across fields such as computational social science and organizational research. However, existing topic modeling methods struggle to simultaneously achieve interpretability, topic specificity (alignment with concrete actions or characteristics), and polarity stance consistency (absence of mixed positive and neg…

Open PDF (arxiv)
From Particles to Perils: SVGD-Based Hazardous Scenario Generation for Autonomous Driving Systems Testing

2604.18918 cs.SE 2026-04-20 PDF (arxiv)

Linfeng Liang, Xiao Cheng, Tsong Yueh Chen, Xi Zheng

Simulation-based testing of autonomous driving systems (ADS) must uncover realistic and diverse failures in dense, heterogeneous traffic. However, existing search-based seeding methods (e.g., genetic algorithms) struggle in high-dimensional spaces, often collapsing to limited modes and missing many failure scenarios. We present PtoP, a framework that combines adaptive random seed generation with S…

Open PDF (arxiv)
An Implicit Compact-Kernel Material Point Method for Computational Solid Mechanics

2604.18917 cs.CE 2026-04-20 PDF (arxiv)

Qirui Fu, Yupeng Jiang, Minchen Li

The numerical performance of the material point method (MPM) is strongly governed by the particle-grid kernel, which controls the trade-off among smoothness, locality, numerical diffusion, contact accuracy, and computational cost. Although wide-support smooth kernels can effectively suppress cell-crossing instability, they often introduce increased numerical diffusion, artificial contact gaps, and…

Open PDF (arxiv)
Error-free Training for MedMNIST Datasets

2604.18916 cs.AI 2026-04-20 PDF (arxiv)

Bo Deng

In this paper, we introduce a new concept called Artificial Special Intelligence by which Machine Learning models for the classification problem can be trained error-free, thus acquiring the capability of not making repeated mistakes. The method is applied to 18 MedMNIST biomedical datasets. Except for three datasets, which suffer from the double-labeling problem, all are trained to perfection.

Open PDF (arxiv)
MORPHOGEN: A Multilingual Benchmark for Evaluating Gender-Aware Morphological Generation

2604.18914 cs.CL 2026-04-20 PDF (arxiv)

Mehul Agarwal, Aditya Aggarwal, Arnav Goel, Medha Hira, Anubha Gupta

While multilingual large language models (LLMs) perform well on high-level tasks like translation and question answering, their ability to handle grammatical gender and morphological agreement remains underexplored. In morphologically rich languages, gender influences verb conjugation, pronouns, and even first-person constructions with explicit and implicit mentions of gender. We introduce MORPHOG…

Open PDF (arxiv)
LogosKG: Hardware-Optimized Scalable and Interpretable Knowledge Graph Retrieval

2604.18913 cs.CL 2026-04-20 PDF (arxiv)

He Cheng, Yifu Wu, Saksham Khatwani, Maya Kruse, Dmitriy Dligach, Timothy A. Miller, Majid Afshar, Yanjun Gao

Knowledge graphs (KGs) are increasingly integrated with large language models (LLMs) to provide structured, verifiable reasoning. A core operation in this integration is multi-hop retrieval, yet existing systems struggle to balance efficiency, scalability, and interpretability. We introduce LogosKG, a novel, hardware-aligned framework that enables scalable and interpretable k-hop retrieval on larg…

Open PDF (arxiv)