Archon

Browse and search harvested arxiv metadata.

897382 results (page 25 of 35896)

Structure-guided molecular design with contrastive 3D protein-ligand learning

2604.19562 cs.LG 2026-04-21 PDF (arxiv)

Carles Navarro, Philipp Tholke, Gianni de Fabritiis

Structure-based drug discovery faces the dual challenge of accurately capturing 3D protein-ligand interactions while navigating ultra-large chemical spaces to identify synthetically accessible candidates. In this work, we present a unified framework that addresses these challenges by combining contrastive 3D structure encoding with autoregressive molecular generation conditioned on commercial comp…

Open PDF (arxiv)
Detecting Data Contamination in Large Language Models

2604.19561 cs.AI 2026-04-21 PDF (arxiv)

Juliusz Janicki, Savvas Chamezopoulos, Evangelos Kanoulas, Georgios Tsatsaronis

Large Language Models (LLMs) utilize large amounts of data for their training, some of which may come from copyrighted sources. Membership Inference Attacks (MIA) aim to detect those documents and whether they have been included in the training corpora of the LLMs. The black-box MIAs require a significant amount of data manipulation; therefore, their comparison is often challenging. We study state…

Open PDF (arxiv)
Separating Geometry from Probability in the Analysis of Generalization

2604.19560 cs.LG 2026-04-21 PDF (arxiv)

Maxim Raginsky, Benjamin Recht

The goal of machine learning is to find models that minimize prediction error on data that has not yet been seen. Its operational paradigm assumes access to a dataset $S$ and articulates a scheme for evaluating how well a given model performs on an arbitrary sample. The sample can be $S$ (in which case we speak of ``in-sample'' performance) or some entirely new $S'$ (in which case we speak of ``ou…

Open PDF (arxiv)
Enhancing Construction Worker Safety in Extreme Heat: A Machine Learning Approach Utilizing Wearable Technology for Predictive Health Analytics

2604.19559 cs.AI 2026-04-21 PDF (arxiv)

Syed Sajid Ullah, Amir Khan

Construction workers are highly vulnerable to heat stress, yet tools that translate real-time physiological data into actionable safety intelligence remain scarce. This study addresses this gap by developing and evaluating deep learning models, specifically a baseline Long Short-Term Memory (LSTM) network and an attention-based LSTM, to predict heat stress among 19 workers in Saudi Arabia. Using G…

Open PDF (arxiv)
On Reasoning-Centric LLM-based Automated Theorem Proving

2604.19558 cs.SE 2026-04-21 PDF (arxiv)

Yican Sun, Chengwei Shi, Hangzhou Lyu, Yingfei Xiong

Automated theorem proving is fundamental to formal methods, and the recent trend is to integrate large language models (LLMs) and proof assistants to form effective proof agents. While existing proof agents show promising performance, they inadequately leverage reasoning capabilities of modern LLMs in high-level planning and self-critique. We argue that proof agents should not merely generate tact…

Open PDF (arxiv)
Understanding supernova gravitational waves with protoneutron star asteroseismology

2604.19557 gr-qc 2026-04-21 PDF (arxiv)

Hajime Sotani

Supernovae are one of the most promising gravitational wave sources. But, since the system of the supernovae is nearly spherically symmetric, the expected gravitational waves from them are relatively weak, compared to the case of the compact binary mergers. Thus, at least using the current gravitational wave detectors, only the gravitational waves from a supernova that occurred in our galaxy could…

Open PDF (arxiv)
Paparazzo: Active Mapping of Moving 3D Objects

2604.19556 cs.CV 2026-04-21 PDF (arxiv)

Davide Allegro, Shiyao Li, Stefano Ghidoni, Vincent Lepetit

Current 3D mapping pipelines generally assume static environments, which limits their ability to accurately capture and reconstruct moving objects. To address this limitation, we introduce the novel task of active mapping of moving objects, in which a mapping agent must plan its trajectory while compensating for the object's motion. Our approach, Paparazzo, provides a learning-free solution that r…

Open PDF (arxiv)
Local power of approximation in hierarchical spline spaces on weakly admissible meshes

2604.19555 math.NA 2026-04-21 PDF (arxiv)

Gustavo A. Fernandez Lezcano, Eduardo M. Garau, Bárbara Ivaniszyn

We study local approximation properties in hierarchical spline spaces through a twofold approach. First, we design and analyze a robust adaptive refinement algorithm to construct locally graded meshes. Second, we establish rigorous stability and approximation results using computationally efficient quasi-interpolation operators. The primary contribution is the analysis of weakly admissible hierarc…

Open PDF (arxiv)
Characterizing and spectrally modeling embedded FUor eruptions in the near-infrared

2604.19551 astro-ph.SR 2026-04-21 PDF (arxiv)

Jiaxun Li, Tinggui Wang, Zheyu Lin

Context. Episodic accretion in young stellar objects (YSOs) is thought to play a critical role in addressing the "luminosity problem" associated with star formation. However, optical surveys tend to bias against sources that are heavily obscured. Infrared time-domain surveys, such as unTimely WISE, facilitate the identification of such sources within the dense star formation regions of our Galaxy.…

Open PDF (arxiv)
LoopCTR: Unlocking the Loop Scaling Power for Click-Through Rate Prediction

2604.19550 cs.IR 2026-04-21 PDF (arxiv)

Jiakai Tang, Runfeng Zhang, Weiqiu Wang, Yifei Liu, Chuan Wang, Xu Chen, Yeqiu Yang, Jian Wu, Yuning Jiang, Bo Zheng

Scaling Transformer-based click-through rate (CTR) models by stacking more parameters brings growing computational and storage overhead, creating a widening gap between scaling ambitions and the stringent industrial deployment constraints. We propose LoopCTR, which introduces a loop scaling paradigm that increases training-time computation through recursive reuse of shared model layers, decoupling…

Open PDF (arxiv)
Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment

2604.19548 cs.CL 2026-04-21 PDF (arxiv)

Bobo Li, Rui Wu, Zibo Ji, Meishan Zhang, Hao Fei, Min Zhang, Mong-Li Lee, Wynne Hsu

Large Language Model agents have rapidly evolved from static text generators into dynamic systems capable of executing complex autonomous workflows. To enhance reliability, multi-agent frameworks assigning specialized roles are increasingly adopted to enable self-reflection and mutual auditing. While such role-playing effectively leverages domain expert knowledge, we find it simultaneously induces…

Open PDF (arxiv)
Emotion-Cause Pair Extraction in Conversations via Semantic Decoupling and Graph Alignment

2604.19547 cs.CL 2026-04-21 PDF (arxiv)

Tianxiang Ma, Weijie Feng, Xinyu Wang, Zhiyong Cheng

Emotion-Cause Pair Extraction in Conversations (ECPEC) aims to identify the set of causal relations between emotion utterances and their triggering causes within a dialogue. Most existing approaches formulate ECPEC as an independent pairwise classification task, overlooking the distinct semantics of emotion diffusion and cause explanation, and failing to capture globally consistent many-to-many co…

Open PDF (arxiv)
Viability of Big Bang Nucleosynthesis in Some Generalized Horizon Entropies

2604.19545 gr-qc 2026-04-21 PDF (arxiv)

Kajal Phukan, Rajdeep Mazumdar, Kalyan Malakar, Kalyan Bhuyan

In this work, we investigate the viability of some cosmological models derived from generalized horizon entropies, using Big Bang Nucleosynthesis (BBN) constraints. By analyzing the deviations in the expansion rate, we derive bounds on the model parameters from freeze-out temperature, helium, and deuterium abundances. Our results show that the freeze-out condition provides the most stringent const…

Open PDF (arxiv)
DT2IT-MRM: Debiased Preference Construction and Iterative Training for Multimodal Reward Modeling

2604.19544 cs.AI 2026-04-21 PDF (arxiv)

Zhihong Zhang, Jie Zhao, Xiaojian Huang, Jin Xu, Zhuodong Luo, Xin Liu, Jiansheng Wei, Xuejin Chen

Multimodal reward models (MRMs) play a crucial role in aligning Multimodal Large Language Models (MLLMs) with human preferences. Training a good MRM requires high-quality multimodal preference data. However, existing preference datasets face three key challenges: lack of granularity in preference strength, textual style bias, and unreliable preference signals. Besides, existing open-source multimo…

Open PDF (arxiv)
Dynamically cold discs in high-redshift galaxies: comparison between ALMA observations and TNG50

2604.19543 astro-ph.GA 2026-04-21 PDF (arxiv)

Yi He, Qi Guo, Filippo Fraternali, Hang Yang, Shihong Liao

Observations of highly rotationally supported gas discs in high redshift ($z$ > 3) star-forming galaxies challenge our understanding of galaxy formation, as the prevailing view holds that galaxies in the early universe are dynamically hot due to frequent mergers, gas accretion, and strong stellar feedback. We examined the kinematic properties of massive ($M_{\star} \geq 10^{10}\,M_{\odot}$) star-f…

Open PDF (arxiv)
FOCAL: Filtered On-device Continuous Activity Logging for Efficient Personal Desktop Summarization

2604.19541 cs.MA 2026-04-21 PDF (arxiv)

Haoran Yin, Zhiyuan Wen, Jiannong Cao, Bo Yuan, Ruosong Yang

Desktop interaction streams provide a continuous, privacy-sensitive record of interleaved user tasks. Transforming these streams into task-organized personal logs on-device faces two main challenges: exhaustive Vision-Language Model (VLM) processing strains local resources, and global stream processing causes cross-task context pollution. We present FOCAL (Filtered On-device Continuous Activity Lo…

Open PDF (arxiv)
Mesh Memory Protocol: Semantic Infrastructure for Multi-Agent LLM Systems

2604.19540 cs.MA 2026-04-21 PDF (arxiv)

Hongwei Xu

Teams of LLM agents increasingly collaborate on tasks spanning days or weeks: multi-day data-generation sprints where generator, reviewer, and auditor agents coordinate in real time on overlapping batches; specialists carrying findings forward across session restarts; product decisions compounding over many review rounds. This requires agents to share, evaluate, and combine each other's cognitive …

Open PDF (arxiv)
Integrating Anomaly Detection into Agentic AI for Proactive Risk Management in Human Activity

2604.19538 cs.AI 2026-04-21 PDF (arxiv)

Farbod Zorriassatine, Ahmad Lotfi

Agentic AI, with goal-directed, proactive, and autonomous decision-making capabilities, offers a compelling opportunity to address movement-related risks in human activity, including the persistent hazard of falls among elderly populations. Despite numerous approaches to fall mitigation through fall prediction and detection, existing systems have not yet functioned as universal solutions across ca…

Open PDF (arxiv)
InvestChat: Exploring Multimodal Interaction via Natural Language, Touch, and Pen in an Investment Dashboard

2604.19537 cs.HC 2026-04-21 PDF (arxiv)

Sarah Lykke Tost, Adson Lucas de Paiva Sales, Henrik Østergaard, Vaishali Dhanoa, Gabriela Molina León

We designed and implemented InvestChat, a multimodal tablet-based application that supports stock market exploration with multiple coordinated views and an LLM-powered chat. We evaluated the application with 12 novice investors. Our findings suggest that combining natural language, touch, and pen input during stock market exploration facilitates user engagement. Participants leveraged the modaliti…

Open PDF (arxiv)
LiveVLN: Breaking the Stop-and-Go Loop in Vision-Language Navigation

2604.19536 cs.RO 2026-04-21 PDF (arxiv)

Xiangchen Wang, Weiye Zhu, Teng Wang, TianTian Geng, Zekai Zhang, Zhiyuan Qi, Jinyu Yang, Feng Zheng

Recent navigation systems achieve strong benchmark results, yet real-world deployment often remains visibly stop-and-go. This bottleneck arises because the sense-inference-execution loop is still blocking: after each new observation, the controller must wait for sensing, transmission, and inference before motion can continue. Reducing action-generation cost alone therefore does not remove redundan…

Open PDF (arxiv)
The swept-back multipolar magnetic field of neutron stars: Application to NICER MSP J0030+0451

2604.19534 astro-ph.HE 2026-04-21 PDF (arxiv)

Anu Kundu, Constantinos Kalapotharakos, Zorawar Wadiasingh, Greg Olmschenk, Wendy F. Wallace, Alice K. Harding, Christo Venter, Demosthenes Kazanas

NICER observations of millisecond pulsars (MSPs) suggest that non-dipolar magnetic fields are required to explain their surface X-ray hotspots. C. Kalapotharakos et al. (2021) modeled the NICER light curve of MSP J0030+0451 (J0030) using a static vacuum offset dipole-plus-quadrupole field and corresponding force-free (FF) solutions to jointly reproduce the X-ray and Fermi-LAT $γ$-ray emission. We …

Open PDF (arxiv)
Cyber Defense Benchmark: Agentic Threat Hunting Evaluation for LLMs in SecOps

2604.19533 cs.CR 2026-04-21 PDF (arxiv)

Alankrit Chona, Igor Kozlov, Ambuj Kumar

We introduce the Cyber Defense Benchmark, a benchmark for measuring how well large language model (LLM) agents perform the core SOC analyst task of threat hunting: given a database of raw Windows event logs with no guided questions or hints, identify the exact timestamps of malicious events. The benchmark wraps 106 real attack procedures from the OTRF Security-Datasets corpus - spanning 86 MITRE…

Open PDF (arxiv)
BEAT: Tokenizing and Generating Symbolic Music by Uniform Temporal Steps

2604.19532 cs.SD 2026-04-21 PDF (arxiv)

Lekai Qian, Haoyu Gu, Jingwei Zhao, Ziyu Wang

Tokenizing music to fit the general framework of language models is a compelling challenge, especially considering the diverse symbolic structures in which music can be represented (e.g., sequences, grids, and graphs). To date, most approaches tokenize symbolic music as sequences of musical events, such as onsets, pitches, time shifts, or compound note events. This strategy is intuitive and has pr…

Open PDF (arxiv)
Hypergraph Mining via Proximity Matrix

2604.19531 cs.SI 2026-04-21 PDF (arxiv)

Junhao Bian, Yilin Bi, Tao Zhou

Hypergraphs serve as an effective tool widely adopted to characterize higher-order interactions in complex systems. The most intuitive and commonly used mathematical instrument for representing a hypergraph is the incidence matrix, in which each entry is binary, indicating whether the corresponding node belongs to the corresponding hyperedge. Although the incidence matrix has become a foundational…

Open PDF (arxiv)
Calibrating Scientific Foundation Models with Inference-Time Stochastic Attention

2604.19530 cs.LG 2026-04-21 PDF (arxiv)

Akash Yadav, Taiwo A. Adebiyi, Ruda Zhang

Transformer-based scientific foundation models are increasingly deployed in high-stakes settings, but current architectures give deterministic outputs and provide limited support for calibrated predictive uncertainty. We propose Stochastic Attention, a lightweight inference-time modification that randomizes attention by replacing softmax weights with normalized multinomial samples controlled by a …

Open PDF (arxiv)