CUEBES

CIBER: A Comprehensive Benchmark for Security Evaluation of Code Interpreter Agents

arXiv:2602.19547v1 Announce Type: new Abstract: LLM-based code interpreter agents are increasingly deployed in critical workflows, yet their robustness against risks introduced by their code execution...

Software Engineering

arXiv CS 3d ago

Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

arXiv:2602.19548v1 Announce Type: new Abstract: One of the first pre-processing steps for constructing web-scale LLM pretraining datasets involves extracting text from HTML. Despite the immense...

Software Policy

arXiv CS 3d ago

Sculpting the Vector Space: Towards Efficient Multi-Vector Visual Document Retrieval via Prune-then-Merge Framework

arXiv:2602.19549v1 Announce Type: new Abstract: Visual Document Retrieval (VDR), which aims to retrieve relevant pages within vast corpora of visually-rich documents, is of significance in...

Software Policy

arXiv CS 3d ago

Hardware-Friendly Randomization: Enabling Random-Access and Minimal Wiring in FHE Accelerators with Low Total Cost

arXiv:2602.19550v1 Announce Type: new Abstract: The Ring-Learning With Errors (RLWE) problem forms the backbone of highly efficient Fully Homomorphic Encryption (FHE) schemes. A significant component...

Hardware Technology

arXiv CS 3d ago

The Sample Complexity of Replicable Realizable PAC Learning

arXiv:2602.19552v1 Announce Type: new Abstract: In this paper, we consider the problem of replicable realizable PAC learning. We construct a particularly hard learning problem and...

Technology Policy

arXiv CS 3d ago

Sound-first immersive training for blind and low-vision learners: A simulation flow for safe, standardized orientation, mobility, and daily living practice

arXiv:2602.19554v1 Announce Type: new Abstract: Orientation and mobility (O&M) instruction for blind and low-vision learners is effective but difficult to standardize and repeat at scale...

Technology Psychology

arXiv CS 3d ago

Agentic AI as a Cybersecurity Attack Surface: Threats, Exploits, and Defenses in Runtime Supply Chains

arXiv:2602.19555v1 Announce Type: new Abstract: Agentic systems built on large language models (LLMs) extend beyond text generation to autonomously retrieve information and invoke tools. This...

Cybersecurity Policy

arXiv CS 3d ago

Identifying, Explaining, and Correcting Ableist Language with AI

arXiv:2602.19560v1 Announce Type: new Abstract: Ableist language perpetuates harmful stereotypes and exclusion, yet its nuanced nature makes it difficult to recognize and address. Artificial intelligence...

Artificial Intelligence Biology

arXiv CS 3d ago

A Multimodal Framework for Aligning Human Linguistic Descriptions with Visual Perceptual Data

arXiv:2602.19562v1 Announce Type: new Abstract: Establishing stable mappings between natural language expressions and visual percepts is a foundational problem for both cognitive science and artificial...

Software Neuroscience

arXiv CS 3d ago

DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces

arXiv:2602.19565v1 Announce Type: new Abstract: Articulated object pose estimation is a core task in embodied AI. Existing methods typically regress poses in a continuous space,...

Engineering Environment

arXiv CS 3d ago

Spritz: Path-Aware Load Balancing in Low-Diameter Networks

arXiv:2602.19567v1 Announce Type: new Abstract: Low-diameter topologies such as Dragonfly and Slim Fly are increasingly adopted in HPC and datacenter networks, yet existing load balancing...

Hardware Technology

arXiv CS 3d ago

Temporal-Aware Heterogeneous Graph Reasoning with Multi-View Fusion for Temporal Question Answering

arXiv:2602.19569v1 Announce Type: new Abstract: Question Answering over Temporal Knowledge Graphs (TKGQA) has attracted growing interest for handling time-sensitive queries. However, existing methods still struggle...

Neuroscience Software

arXiv CS 3d ago

VALD: Multi-Stage Vision Attack Detection for Efficient LVLM Defense

arXiv:2602.19570v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) can be vulnerable to adversarial images that subtly bias their outputs toward plausible yet incorrect responses....

Psychology Software

arXiv CS 3d ago

HOCA-Bench: Beyond Semantic Perception to Predictive World Modeling via Hegelian Ontological-Causal Anomalies

arXiv:2602.19571v1 Announce Type: new Abstract: Video-LLMs have improved steadily on semantic perception, but they still fall short on predictive world modeling, which is central to...

Policy Genetics

arXiv CS 3d ago

ConceptPrism: Concept Disentanglement in Personalized Diffusion Models via Residual Token Optimization

arXiv:2602.19575v1 Announce Type: new Abstract: Personalized text-to-image generation suffers from concept entanglement, where irrelevant residual information from reference images is captured, leading to a trade-off...

Quantum Computing Software

arXiv CS 3d ago

Chasing Ghosts: A Simulation-to-Real Olfactory Navigation Stack with Optional Vision Augmentation

arXiv:2602.19577v1 Announce Type: new Abstract: Autonomous odor source localization remains a challenging problem for aerial robots due to turbulent airflow, sparse and delayed sensory signals,...

Hardware Robotics

arXiv CS 3d ago

Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training

arXiv:2602.19580v1 Announce Type: new Abstract: We introduce Leap+Verify, a framework that applies speculative execution -- predicting future model weights and validating predictions before acceptance --...

Artificial Intelligence Neuroscience

arXiv CS 3d ago

Advantage-based Temporal Attack in Reinforcement Learning

arXiv:2602.19582v1 Announce Type: new Abstract: Extensive research demonstrates that Deep Reinforcement Learning (DRL) models are susceptible to adversarially constructed inputs (i.e., adversarial examples), which can...

Policy Biology

arXiv CS 3d ago

DEEP: Docker-based Execution and Evaluation Platform

arXiv:2602.19583v1 Announce Type: new Abstract: Comparative evaluation of several systems is a recurrent task in researching. It is a key step before deciding which system...

Software Technology

arXiv CS 3d ago

Interpolation-Driven Machine Learning Approaches for Plume Shine Dose Estimation: A Comparison of XGBoost, Random Forest, and TabNet

arXiv:2602.19584v1 Announce Type: new Abstract: Despite the success of machine learning (ML) in surrogate modeling, its use in radiation dose assessment is limited by safety-critical...

Energy Artificial Intelligence

arXiv CS 3d ago

Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis

arXiv:2602.19585v1 Announce Type: new Abstract: Multimodal Sentiment Analysis (MSA) integrates language, visual, and acoustic modalities to infer human sentiment. Most existing methods either focus on...

Energy Quantum Computing

arXiv CS 3d ago

Co-Optimization of Network Topology and Variable Impedance Devices under Dynamic Line Ratings in Power Transmission Systems

arXiv:2602.19587v1 Announce Type: new Abstract: Power system operators are increasingly deploying Grid Enhancing Technologies (GETs) to mitigate operational challenges such as line and transformer congestion,...

Energy Climate & Environment

arXiv CS 3d ago

Detecting High-Potential SMEs with Heterogeneous Graph Neural Networks

arXiv:2602.19591v1 Announce Type: new Abstract: Small and Medium Enterprises (SMEs) constitute 99.9% of U.S. businesses and generate 44% of economic activity, yet systematically identifying high-potential...

Business Politics

arXiv CS 3d ago

ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?

arXiv:2602.19594v1 Announce Type: new Abstract: We introduce ISO-Bench, a benchmark for coding agents to test their capabilities on real-world inference optimization tasks. These tasks were...

Software World News