Archon

Browse and search harvested arxiv metadata.

1088910 results (page 63 of 43557)

Deconstructing Superintelligence: Identity, Self-Modification and Différance

2604.19845 cs.AI 2026-04-21 PDF (arxiv)

Elija Perrier

Self-modification is often taken as constitutive of artificial superintelligence (SI), yet modification is a relative action requiring a supplement outside the operation. When self-modification extends to this supplement, the classical self-referential structure collapses. We formalise this on an associative operator algebra $\mathcal{A}$ with update $\hat{U}$, discrimination $\hat{D}$, and self-r…

Open PDF (arxiv)
FairTree: Subgroup Fairness Auditing of Machine Learning Models with Bias-Variance Decomposition

2604.19357 cs.LG 2026-04-21 PDF (arxiv)

Rudolf Debelak

The evaluation of machine learning models typically relies mainly on performance metrics based on loss functions, which risk to overlook changes in performance in relevant subgroups. Auditing tools such as SliceFinder and SliceLine were proposed to detect such groups, but usually have conceptual disadvantages, such as the inability to directly address continuous covariates. In this paper, we intro…

Open PDF (arxiv)
LASER: Learning Active Sensing for Continuum Field Reconstruction

2604.19355 cs.LG 2026-04-21 PDF (arxiv)

Huayu Deng, Jinghui Zhong, Xiangming Zhu, Yunbo Wang, Xiaokang Yang

High-fidelity measurements of continuum physical fields are essential for scientific discovery and engineering design but remain challenging under sparse and constrained sensing. Conventional reconstruction methods typically rely on fixed sensor layouts, which cannot adapt to evolving physical states. We propose LASER, a unified, closed-loop framework that formulates active sensing as a Partially …

Open PDF (arxiv)
Do Agents Dream of Root Shells? Partial-Credit Evaluation of LLM Agents in Capture The Flag Challenges

2604.19354 cs.AI 2026-04-21 PDF (arxiv)

Ali Al-Kaswan, Maksim Plotnikov, Maxim Hájek, Roland Vízner, Arie van Deursen, Maliheh Izadi

Large Language Model (LLM) agents are increasingly proposed for autonomous cybersecurity tasks, but their capabilities in realistic offensive settings remain poorly understood. We present DeepRed, an open-source benchmark for evaluating LLM-based agents on realistic Capture The Flag (CTF) challenges in isolated virtualized environments. DeepRed places an agent in a Kali attacker environment with t…

Open PDF (arxiv)
Asymptotic e-processes

2604.19353 math.ST 2026-04-21 PDF (arxiv)

Pierre-François Massiani, Sebastian Schulze, Mattes Mollenhauer

We introduce the concept of an asymptotic e-process, which is a doubly indexed stochastic process $(E_{m,n})_{m,n\in\mathbb{N}}$ that approximates an e-process with monitoring time $n$ in terms of a suitable limiting behavior for an approximation parameter $m\to \infty$. This theory is motivated by practical applications in sequential hypothesis testing, in which e-variables can only be constructe…

Open PDF (arxiv)
DASH-KV: Accelerating Long-Context LLM Inference via Asymmetric KV Cache Hashing

2604.19351 cs.CL 2026-04-21 PDF (arxiv)

Jinyu Guo, Zhihan Zhang, Yutong Li, Jiehui Xie, Md. Tamim Iqbal, Dongshen Han, Lik-Hang Lee, Sung-Ho Bae, Jie Zou, Yang Yang, Chaoning Zhang

The quadratic computational complexity of the standard attention mechanism constitutes a fundamental bottleneck for large language models in long-context inference. While existing KV cache compression methods alleviate memory pressure, they often sacrifice generation quality and fail to address the high overhead of floating-point arithmetic. This paper introduces DASH-KV, an innovative acceleratio…

Open PDF (arxiv)
Attend what matters: Leveraging vision foundational models for breast cancer classification using mammograms

2604.19350 cs.CV 2026-04-21 PDF (arxiv)

Samyak Sanghvi, Piyush Miglani, Sarvesh Shashikumar, Kaustubh R Borgavi, Veenu Singla, Chetan Arora

Vision Transformers $(\texttt{ViT})$ have become the architecture of choice for many computer vision tasks, yet their performance in computer-aided diagnostics remains limited. Focusing on breast cancer detection from mammograms, we identify two main causes for this shortfall. First, medical images are high-resolution with small abnormalities, leading to an excessive number of tokens and making it…

Open PDF (arxiv)
RAFT-MSF++: Temporal Geometry-Motion Feature Fusion for Self-Supervised Monocular Scene Flow

2604.19349 cs.CV 2026-04-21 PDF (arxiv)

Xunpei Sun, Zuoxun Hou, Yi Chang, Gang Chen, Wei-Shi Zheng

Monocular scene flow estimation aims to recover dense 3D motion from image sequences, yet most existing methods are limited to two-frame inputs, restricting temporal modeling and robustness to occlusions. We propose RAFT-MSF++, a self-supervised multi-frame framework that recurrently fuses temporal features to jointly estimate depth and scene flow. Central to our approach is the Geometry-Motion Fe…

Open PDF (arxiv)
Geometry-Guided Self-Supervision for Ultra-Fine-Grained Recognition with Limited Data

2604.19345 cs.CV 2026-04-21 PDF (arxiv)

Shijie Wang, Yadan Luo, Zijian Wang, Haojie Li, Zi Huang, Mahsa Baktashmotlagh

This paper investigates the intrinsic geometrical features of highly similar objects and introduces a general self-supervised framework called the Geometric Attribute Exploration Network (GAEor), which is designed to address the ultra-fine-grained visual categorization (Ultra-FGVC) task in data-limited scenarios. Unlike prior work that often captures subtle yet critical distinctions, GAEor generat…

Open PDF (arxiv)
If you're waiting for a sign... that might not be it! Mitigating Trust Boundary Confusion from Visual Injections on Vision-Language Agentic Systems

2604.19844 cs.CV 2026-04-21 PDF (arxiv)

Jiamin Chang, Minhui Xue, Ruoxi Sun, Shuchao Pang, Salil S. Kanhere, Hammond Pearce

Recent advances in embodied Vision-Language Agentic Systems (VLAS), powered by large vision-language models (LVLMs), enable AI systems to perceive and reason over real-world scenes. Within this context, environmental signals such as traffic lights are essential in-band signals that can and should influence agent behavior. However, similar signals could also be crafted to operate as misleading visu…

Open PDF (arxiv)
Quadruped Parkour Learning: Sparsely Gated Mixture of Experts with Visual Input

2604.19344 cs.RO 2026-04-21 PDF (arxiv)

Michael Ziegltrum, Jianhao Jiao, Tianhu Peng, Chengxu Zhou, Dimitrios Kanoulas

Robotic parkour provides a compelling benchmark for advancing locomotion over highly challenging terrain, including large discontinuities such as elevated steps. Recent approaches have demonstrated impressive capabilities, including dynamic climbing and jumping, but typically rely on sequential multilayer perceptron (MLP) architectures with densely activated layers. In contrast, sparsely gated mix…

Open PDF (arxiv)
Scalable Memristive-Friendly Reservoir Computing for Time Series Classification

2604.19343 cs.NE 2026-04-21 PDF (arxiv)

Coşku Can Horuz, Andrea Ceni, Claudio Gallicchio, Sebastian Otte

Memristive devices present a promising foundation for next-generation information processing by combining memory and computation within a single physical substrate. This unique characteristic enables efficient, fast, and adaptive computing, particularly well suited for deep learning applications. Among recent developments, the memristive-friendly echo state network (MF-ESN) has emerged as a promis…

Open PDF (arxiv)
Are Large Language Models Economically Viable for Industry Deployment?

2604.19342 cs.CL 2026-04-21 PDF (arxiv)

Abdullah Mohammad, Sushant Kumar Ray, Pushkar Arora, Rafiq Ali, Ebad Shabbir, Gautam Siddharth Kashyap, Jiechao Gao, Usman Naseem

Generative AI-powered by Large Language Models (LLMs)-is increasingly deployed in industry across healthcare decision support, financial analytics, enterprise retrieval, and conversational automation, where reliability, efficiency, and cost control are critical. In such settings, models must satisfy strict constraints on energy, latency, and hardware utilization-not accuracy alone. Yet prevailing …

Open PDF (arxiv)
Evaluation-driven Scaling for Scientific Discovery

2604.19341 cs.LG 2026-04-21 PDF (arxiv)

Haotian Ye, Haowei Lin, Jingyi Tang, Yizhen Luo, Caiyin Yang, Chang Su, Rahul Thapa, Rui Yang, Ruihua Liu, Zeyu Li, Chong Gao, Dachao Ding, Guangrong He, Miaolei Zhang, Lina Sun, Wenyang Wang, Yuchen Zhong, Zhuohao Shen, Di He, Jianzhu Ma, Stefano Ermon, Tongyang Li, Xiaowen Chu, James Zou, Yuzhi Xu

Language models are increasingly used in scientific discovery to generate hypotheses, propose candidate solutions, implement systems, and iteratively refine them. At the core of these trial-and-error loops lies evaluation: the process of obtaining feedback on candidate solutions via verifiers, simulators, or task-specific scoring functions. While prior work has highlighted the importance of evalua…

Open PDF (arxiv)
Improvements to the post-processing of weather forecasts using machine learning and feature selection

2604.19340 physics.ao-ph 2026-04-21 PDF (arxiv)

Kazuma Iwase, Tomoyuki Takenawa

This study aims to develop and improve machine learning-based post-processing models for precipitation, temperature, and wind speed predictions using the Mesoscale Model (MSM) dataset provided by the Japan Meteorological Agency (JMA) for 18 locations across Japan, including plains, mountainous regions, and islands. By incorporating meteorological variables from grid points surrounding the target l…

Open PDF (arxiv)
Divide-and-Conquer Approach to Holistic Cognition in High-Similarity Contexts with Limited Data

2604.19339 cs.CV 2026-04-21 PDF (arxiv)

Shijie Wang, Zijian Wang, Yadan Luo, Haojie Li, Zi Huang, Mahsa Baktashmotlagh

Ultra-fine-grained visual categorization (Ultra-FGVC) aims to classify highly similar subcategories within fine-grained objects using limited training samples. However, holistic yet discriminative cues, such as leaf contours in extremely similar cultivars, remain under-explored in current studies, thereby limiting recognition performance. Though crucial, modeling holistic cues with complex morphol…

Open PDF (arxiv)
Hybrid Beamforming for Subarray-Level Movable Antenna Enhanced MU-MIMO Communications

2604.19338 eess.SP 2026-04-21 PDF (arxiv)

Shanshan Zhang, Songjie Yang, Wenxuan Zhang, Youzhi Xiong, Siya Yao

This study investigates subarray-level movable antenna (MA) architecture for multi-user MIMO (MU-MIMO) systems. Unlike conventional systems with fixed-position antennas (FPAs), the proposed scheme harnesses the additional positional degrees of freedom (DoFs) of movable subarrays to enhance spatial multiplexing capabilities for both multi-user and multi-stream communications. Our objective is to ma…

Open PDF (arxiv)
POLAR-PIC: A Holistic Framework for Matrixized PIC with Co-Designed Compute, Layout, and Communication

2604.19337 cs.DC 2026-04-21 PDF (arxiv)

Yizhuo Rao, Xingjian Cui, Shangzhi Pang, Jiabin Xie, Guangnan Feng, Jinhui Wei, Ziyan Zhang, Languang Gao, Zhenyu Wang, Zhiguang Chen, Yutong Lu

Particle-in-Cell (PIC) simulations are fundamental to plasma physics but often suffer from limited scalability due to particle-grid interaction bottlenecks and particle redistribution costs. Specifically, the particle-grid interaction computations have not taken full advantage of the emerging Matrix Processing Units (MPUs), the particle motion introduces irregular memory accesses, and the bulk-syn…

Open PDF (arxiv)
FedSEA: Achieving Benefit of Parallelization in Federated Online Learning

2604.19336 cs.LG 2026-04-21 PDF (arxiv)

Harekrushna Sahu, Pratik Jawanpuria, Pranay Sharma

Online federated learning (OFL) has emerged as a popular framework for decentralized decision-making over continuous data streams without compromising client privacy. However, the adversary model assumed in standard OFL typically precludes any potential benefits of parallelization. Further, it fails to adequately capture the different sources of statistical variation in OFL problems. In this paper…

Open PDF (arxiv)
When Active Learning Falls Short: An Empirical Study on Chemical Reaction Extraction

2604.19335 cs.LG 2026-04-21 PDF (arxiv)

Simin Yu, Sufia Fathima

The rapid growth of chemical literature has generated vast amounts of unstructured data, where reaction information is particularly valuable for applications such as reaction predictions and drug design. However, the prohibitive cost of expert annotation has led to a scarcity of training data, severely hindering the performance of automatic reaction extraction. In this work, we conduct a systemati…

Open PDF (arxiv)
Silicon Aware Neural Networks

2604.19334 cs.CV 2026-04-21 PDF (arxiv)

Sebastian Fieldhouse, Kea-Tiong Tang

Recent work in the machine learning literature has demonstrated that deep learning can train neural networks made of discrete logic gate functions to perform simple image classification tasks at very high speeds on CPU, GPU and FPGA platforms. By virtue of being formed by discrete logic gates, these Differentiable Logic Gate Networks (DLGNs) lend themselves naturally to implementation in custom si…

Open PDF (arxiv)
Evaluating LLM-Driven Summarisation of Parliamentary Debates with Computational Argumentation

2604.19331 cs.CL 2026-04-21 PDF (arxiv)

Eoghan Cunningham, Derek Greene, James Cross, Antonio Rago

Understanding how policy is debated and justified in parliament is a fundamental aspect of the democratic process. However, the volume and complexity of such debates mean that outside audiences struggle to engage. Meanwhile, Large Language Models (LLMs) have been shown to enable automated summarisation at scale. While summaries of debates can make parliamentary procedures more accessible, evaluati…

Open PDF (arxiv)
Text-To-Speech with Chain-of-Details: modeling temporal dynamics in speech generation

2604.19330 eess.AS 2026-04-21 PDF (arxiv)

Jianbo Ma, Richard Cartwright

Recent advances in Text-To-Speech (TTS) synthesis have seen the popularity of multi-stage approaches that first predict semantic tokens and then generate acoustic tokens. In this paper, we extend the coarse-to-fine generation paradigm to the temporal domain and introduce Chain-of-Details (CoD), a novel framework that explicitly models temporal coarse-to-fine dynamics in speech generation using a c…

Open PDF (arxiv)
PLaMo 2.1-VL Technical Report

2604.19324 cs.CV 2026-04-21 PDF (arxiv)

Tommi Kerola, Yuya Masuda, Takashi Masuko, Toshiki Nakanishi, Daisuke Nishino, Kuniyuki Takahashi, Hanqin Wang, Yoshihiro Yamada

We introduce PLaMo 2.1-VL, a lightweight Vision Language Model (VLM) for autonomous devices, available in 8B and 2B variants and designed for local and edge deployment with Japanese-language operation. Focusing on Visual Question Answering (VQA) and Visual Grounding as its core capabilities, we develop and evaluate the models for two real-world application scenarios: factory task analysis via tool…

Open PDF (arxiv)
Concept Inconsistency in Dermoscopic Concept Bottleneck Models: A Rough-Set Analysis of the Derm7pt Dataset

2604.19323 cs.LG 2026-04-21 PDF (arxiv)

Gonzalo Nápoles, Isel Grau, Yamisleydi Salgueiro

Concept Bottleneck Models (CBMs) route predictions exclusively through a clinically grounded concept layer, binding interpretability to concept-label consistency. When a dataset contains concept-level inconsistencies, identical concept profiles mapped to conflicting diagnosis labels create an unresolvable bottleneck that imposes a hard ceiling on achievable accuracy. In this paper, we apply rough …

Open PDF (arxiv)