Archon

Browse and search harvested arxiv metadata.

1045714 results (page 54 of 41829)

An Answer is just the Start: Related Insight Generation for Open-Ended Document-Grounded QA

2604.19685 cs.CL 2026-04-21 PDF (arxiv)

Saransh Sharma, Pritika Ramu, Aparna Garimella, Koyel Mukherjee

Answering open-ended questions remains challenging for AI systems because it requires synthesis, judgment, and exploration beyond factual retrieval, and users often refine answers through multiple iterations rather than accepting a single response. Existing QA benchmarks do not explicitly support this refinement process. To address this gap, we introduce a new task, document-grounded related insig…

Open PDF (arxiv)
PREF-XAI: Preference-Based Personalized Rule Explanations of Black-Box Machine Learning Models

2604.19684 cs.LG 2026-04-21 PDF (arxiv)

Salvatore Greco, Jacek Karolczak, Roman Słowiński, Jerzy Stefanowski

Explainable artificial intelligence (XAI) has predominantly focused on generating model-centric explanations that approximate the behavior of black-box models. However, such explanations often overlook a fundamental aspect of interpretability: different users require different explanations depending on their goals, preferences, and cognitive constraints. Although recent work has explored user-cent…

Open PDF (arxiv)
Mask World Model: Predicting What Matters for Robust Robot Policy Learning

2604.19683 cs.RO 2026-04-21 PDF (arxiv)

Yunfan Lou, Xiaowei Chi, Xiaojie Zhang, Zezhong Qian, Chengxuan Li, Rongyu Zhang, Yaoxu Lyu, Guoyu Song, Chuyao Fu, Haoxuan Xu, Pengwei Wang, Shanghang Zhang

World models derived from large-scale video generative pre-training have emerged as a promising paradigm for generalist robot policy learning. However, standard approaches often focus on high-fidelity RGB video prediction, this can result in overfitting to irrelevant factors, such as dynamic backgrounds and illumination changes. These distractions reduce the model's ability to generalize, ultimate…

Open PDF (arxiv)
Frequency-Forcing: From Scaling-as-Time to Soft Frequency Guidance

2604.20902 cs.LG 2026-04-21 PDF (arxiv)

Weitao Du

While standard flow-matching models transport noise to data uniformly, incorporating an explicit generation order - specifically, establishing coarse, low-frequency structure before fine detail - has proven highly effective for synthesizing natural images. Two recent works offer distinct paradigms for this. K-Flow imposes a hard frequency constraint by reinterpreting a frequency scaling variable a…

Open PDF (arxiv)
IR-Flow: Bridging Discriminative and Generative Image Restoration via Rectified Flow

2604.19680 cs.CV 2026-04-21 PDF (arxiv)

Zihao Fan, Xin Lu, Jie Xiao, Dong Li, Jie Huang, Xueyang Fu

In image restoration, single-step discriminative mappings often lack fine details via expectation learning, whereas generative paradigms suffer from inefficient multi-step sampling and noise-residual coupling. To address this dilemma, we propose IR-Flow, a novel image restoration method based on Rectified Flow that serves as a unified framework bridging the gap between discriminative and generativ…

Open PDF (arxiv)
MMControl: Unified Multi-Modal Control for Joint Audio-Video Generation

2604.19679 cs.CV 2026-04-21 PDF (arxiv)

Liyang Li, Wen Wang, Canyu Zhao, Tianjian Feng, Zhiyue Zhao, Hao Chen, Chunhua Shen

Recent advances in Diffusion Transformers (DiTs) have enabled high-quality joint audio-video generation, producing videos with synchronized audio within a single model. However, existing controllable generation frameworks are typically restricted to video-only control. This restricts comprehensive controllability and often leads to suboptimal cross-modal alignment. To bridge this gap, we present M…

Open PDF (arxiv)
Exploring Language-Agnosticity in Function Vectors: A Case Study in Machine Translation

2604.19678 cs.CL 2026-04-21 PDF (arxiv)

Nurkhan Laiyk, Gerard I. Gállego, Javier Ferrando, Fajri Koto

Function vectors (FVs) are vector representations of tasks extracted from model activations during in-context learning. While prior work has shown that multilingual model representations can be language-agnostic, it remains unclear whether the same holds for function vectors. We study whether FVs exhibit language-agnosticity, using machine translation as a case study. Across three decoder-only mul…

Open PDF (arxiv)
Learning Hybrid-Control Policies for High-Precision In-Contact Manipulation Under Uncertainty

2604.19677 cs.RO 2026-04-21 PDF (arxiv)

Hunter L. Brown, Geoffrey Hollinger, Stefan Lee

Reinforcement learning-based control policies have been frequently demonstrated to be more effective than analytical techniques for many manipulation tasks. Commonly, these methods learn neural control policies that predict end-effector pose changes directly from observed state information. For tasks like inserting delicate connectors which induce force constraints, pose-based policies have limite…

Open PDF (arxiv)
MedFlowSeg: Flow Matching for Medical Image Segmentation with Frequency-Aware Attention

2604.19675 cs.CV 2026-04-21 PDF (arxiv)

Zhi Chen, Runze Hu, Le Zhang

Flow matching has recently emerged as a principled framework for learning continuous-time transport maps, enabling efficient deterministic generation without relying on stochastic diffusion processes. While generative modeling has shown promise for medical image segmentation, particularly in capturing uncertainty and complex anatomical variability, existing approaches are predominantly built upon …

Open PDF (arxiv)
Resolved UV-Optical HST Imaging and Spectral Energy Distribution Modeling of Nearby BAT Active Galactic Nuclei

2604.19674 astro-ph.GA 2026-04-21 PDF (arxiv)

Connor Auge, Michael Koss, Kriti K. Gupta, Claudio Ricci, Benny Trakhtenbrot, Franz E. Bauer, Ezequiel Treister, Alessandro Peca, Brad Cenko, Kohei Ichikawa, Arghajit Janna, Darshan Kakkad, Richard Mushotzky, Kyuseok Oh, Alejandra Rojas Lilayú, David Sanders, Roberto Serafinelli, Matilde Signorini, Alessia Tortosa, C. Megan Urry

We use high-resolution UV-to-optical imaging from the Hubble Space Telescope (HST) to construct spatially resolved spectral energy distributions (SEDs) for seven nearby ($z<0.07$) hard (14--195$\,$keV) X-ray-selected broad-line active galactic nuclei (AGN) with $L_{\rm bol}=10^{43.26}-10^{45.34}\,\rm{erg\,s^{-1}}$. The high spatial resolution of HST, which physically resolves structures on the sca…

Open PDF (arxiv)
InHabit: Leveraging Image Foundation Models for Scalable 3D Human Placement

2604.19673 cs.CV 2026-04-21 PDF (arxiv)

Nikita Kister, Pradyumna YM, István Sárándi, Jiayi Wang, Anna Khoreva, Gerard Pons-Moll

Training embodied agents to understand 3D scenes as humans do requires large-scale data of people meaningfully interacting with diverse environments, yet such data is scarce. Real-world motion capture is costly and limited to controlled settings, while existing synthetic datasets rely on simple geometric heuristics that ignore rich scene context. In contrast, 2D foundation models trained on intern…

Open PDF (arxiv)
Budgeted Online Influence Maximization

2604.19672 cs.LG 2026-04-21 PDF (arxiv)

Pierre Perrault, Jennifer Healey, Zheng Wen, Michal Valko

We introduce a new budgeted framework for online influence maximization, considering the total cost of an advertising campaign instead of the common cardinality constraint on a chosen influencer set. Our approach better models the real-world setting where the cost of influencers varies and advertisers want to find the best value for their overall social advertising budget. We propose an algorithm …

Open PDF (arxiv)
Multi-Cycle Spatio-Temporal Adaptation in Human-Robot Teaming

2604.19670 cs.RO 2026-04-21 PDF (arxiv)

Alex Cuellar, Michael Hagenow, Julie Shah

Effective human-robot teaming is crucial for the practical deployment of robots in human workspaces. However, optimizing joint human-robot plans remains a challenge due to the difficulty of modeling individualized human capabilities and preferences. While prior research has leveraged the multi-cycle structure of domains like manufacturing to learn an individual's tendencies and adapt plans over re…

Open PDF (arxiv)
HardNet++: Nonlinear Constraint Enforcement in Neural Networks

2604.19669 cs.LG 2026-04-21 PDF (arxiv)

Andrea Goertzen, Kaveh Alim, Navid Azizan

Enforcing constraint satisfaction in neural network outputs is critical for safety, reliability, and physical fidelity in many control and decision-making applications. While soft-constrained methods penalize constraint violations during training, they do not guarantee constraint adherence during inference. Other approaches guarantee constraint satisfaction via specific parameterizations or a proj…

Open PDF (arxiv)
Abstract null hypersurfaces and characteristic initial value problems in General Relativity

2604.19668 gr-qc 2026-04-21 PDF (arxiv)

Gabriel Sánchez-Pérez

This thesis is framed within the field of Mathematical Relativity and is organized into six chapters. After an introduction to the topic in Chapter 1, Chapter 2 reviews and further develops the formalism of hypersurface data, which provides the unifying framework for the entire thesis. In Chapter 3 we study the characteristic Cauchy problem from a fully detached perspective. Chapter 4 is devoted t…

Open PDF (arxiv)
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language

2604.19667 cs.CL 2026-04-21 PDF (arxiv)

Yi Zhong, Buqiang Xu, Yijun Wang, Zifei Shan, Shuofei Qiao, Guozhou Zheng, Ningyu Zhang

At present, executable visual workflows have emerged as a mainstream paradigm in real-world industrial deployments, offering strong reliability and controllability. However, in current practice, such workflows are almost entirely constructed through manual engineering: developers must carefully design workflows, write prompts for each step, and repeatedly revise the logic as requirements evolve-ma…

Open PDF (arxiv)
ECLASS-Augmented Semantic Product Search for Electronic Components

2604.19664 cs.IR 2026-04-21 PDF (arxiv)

Nico Baumgart, Markus Lange-Hegermann, Jan Henze

Efficient semantic access to industrial product data is a key enabler for factory automation and emerging LLM-based agent workflows, where both human engineers and autonomous agents must identify suitable components from highly structured catalogs. However, the vocabulary mismatch between natural-language queries and attribute-centric product descriptions limits the effectiveness of traditional re…

Open PDF (arxiv)
From Top-1 to Top-K: A Reproducibility Study and Benchmarking of Counterfactual Explanations for Recommender Systems

2604.19663 cs.IR 2026-04-21 PDF (arxiv)

Quang-Huy Nguyen, Thanh-Hai Nguyen, Khac-Manh Thai, Duc-Hoang Pham, Huy-Son Nguyen, Cam-Van Thi Nguyen, Masoud Mansoury, Duc-Trong Le, Hoang-Quynh Le

Counterfactual explanations (CEs) provide an intuitive way to understand recommender systems by identifying minimal modifications to user-item interactions that alter recommendation outcomes. Existing CE methods for recommender systems, however, have been evaluated under heterogeneous protocols, using different datasets, recommenders, metrics, and even explanation formats, which hampers reproducib…

Open PDF (arxiv)
Modelling time-order effects in haptic perception with a Bayesian dynamical framework

2604.19662 q-bio.NC 2026-04-21 PDF (arxiv)

Gastón Avetta, Jose Lobera, Juan José Zárate, Inés Samengo, Damián G. Hernández

Perceptual judgments of sequential stimuli are systematically biased by prior expectations and by the temporal structure of sensory input. In haptic discrimination tasks, these effects often manifest as time-order asymmetries, whereby the perceived difference between two stimuli depends on their presentation order. Here, we introduce a dynamical Bayesian model that accounts for these biases by com…

Open PDF (arxiv)
Pilot-Free Predictive Multi-User Beamforming via Sensing Management in Cell-Free Networks

2604.19660 eess.SP 2026-04-21 PDF (arxiv)

Eren Berk Kama, Murat Babek Salman, Isaac Skog, Emil Björnson

This paper presents a sensing management frame- work for integrated sensing and communications (ISAC) within cell-free massive multiple-input multiple-output (MIMO) systems to reduce pilot-based channel state information (CSI) acquisition overhead. Conventional communication systems rely on frequent channel estimation procedures that impose significant signaling overhead, consuming valuable time-f…

Open PDF (arxiv)
Disentangling Damage from Operational Variability: A Label-Free Self-Supervised Representation Learning Framework for Output-Only Structural Damage Identification

2604.19658 cs.LG 2026-04-21 PDF (arxiv)

Xudong Jian, Charikleia Stoura, Simon Scandella, Eleni Chatzi

Damage identification is a core task in structural health monitoring. In practice, however, its reliability is often compromised by confounding non-damage effects, such as variations in excitation and environmental conditions, which can induce changes comparable to or larger than those caused by structural damage. To address this challenge, this study proposes a self-supervised label-free disentan…

Open PDF (arxiv)
An AI Agent Execution Environment to Safeguard User Data

2604.19657 cs.CR 2026-04-21 PDF (arxiv)

Robert Stanley, Avi Verma, Lillian Tsai, Konstantinos Kallas, Sam Kumar

AI agents promise to serve as general-purpose personal assistants for their users, which requires them to have access to private user data (e.g., personal and financial information). This poses a serious risk to security and privacy. Adversaries may attack the AI model (e.g., via prompt injection) to exfiltrate user data. Furthermore, sharing private data with an AI agent requires users to trust a…

Open PDF (arxiv)
Pause or Fabricate? Training Language Models for Grounded Reasoning

2604.19656 cs.CL 2026-04-21 PDF (arxiv)

Yiwen Qiu, Linjuan Wu, Yizhou Liu, Yuchen Yan, Jin Ma, Xu Tan, Yao Hu, Daoxin Zhang, Wenqi Zhang, Weiming Lu, Jun Xiao, Yongliang Shen

Large language models have achieved remarkable progress on complex reasoning tasks. However, they often implicitly fabricate information when inputs are incomplete, producing confident but unreliable conclusions -- a failure mode we term ungrounded reasoning. We argue that this issue arises not from insufficient reasoning capability, but from the lack of inferential boundary awareness -- the abili…

Open PDF (arxiv)
FEPLB: Exploiting Copy Engines for Nearly Free MoE Load Balancing in Distributed Training

2604.19654 cs.DC 2026-04-21 PDF (arxiv)

Shuyao Qi, Haoyuan Liu, Shizhen Zhao

Fine-grained, per-micro-batch load balancing is essential for efficient Mixture-of-Experts (MoE) training, yet every prior dynamic scheduling scheme pays for it with extra communication that is hard to hide. Especially on modern bulk-transfer backends such as DeepEP. We make a simple but consequential observation: on the NVIDIA Hopper architecture the NVLink Copy Engine can move data between intra…

Open PDF (arxiv)
A Dual Perspective on Synthetic Trajectory Generators: Utility Framework and Privacy Vulnerabilities

2604.19653 cs.AI 2026-04-21 PDF (arxiv)

Aya Cherigui, Florent Guépin, Arnaud Legendre, Jean-François Couchot

Human mobility data are used in numerous applications, ranging from public health to urban planning. Human mobility is inherently sensitive, as it can contain information such as religious beliefs and political affiliations. Historically, it has been proposed to modify the information using techniques such as aggregation, obfuscation, or noise addition, to adequately protect privacy and eliminate …

Open PDF (arxiv)