CUEBES

IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking

arXiv:2602.19416v1 Announce Type: new Abstract: Reinforcement Learning from Human Feedback (RLHF) enables powerful LLM alignment but can introduce reward hacking - models exploit spurious correlations...

Engineering Psychology

arXiv CS 5d ago

PA-Attack: Guiding Gray-Box Attacks on LVLM Vision Encoders with Prototypes and Attention

arXiv:2602.19418v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) are foundational to modern multimodal applications, yet their susceptibility to adversarial attacks remains a critical concern....

Software Policy

arXiv CS 5d ago

RAmmStein: Regime Adaptation in Mean-reverting Markets with Stein Thresholds -- Optimal Impulse Control in Concentrated AMMs

arXiv:2602.19419v1 Announce Type: new Abstract: Concentrated liquidity provision in decentralized exchanges presents a fundamental Impulse Control problem. Liquidity Providers (LPs) face a non-trivial trade-off between...

Business Economics

arXiv CS 5d ago

A Reinforcement Learning-based Transmission Expansion Framework Considering Strategic Bidding in Electricity Markets

arXiv:2602.19421v1 Announce Type: new Abstract: Transmission expansion planning in electricity markets is tightly coupled with the strategic bidding behaviors of generation companies. This paper proposes...

Policy Psychology

arXiv CS 5d ago

Positioning Modular Co-Design in Future HRI Design Research

arXiv:2602.19422v1 Announce Type: new Abstract: Design-oriented HRI is increasingly interested in robots as long-term companions, yet many designs still assume a fixed form and a...

Robotics Materials Science

arXiv CS 5d ago

Prefer-DAS: Learning from Local Preferences and Sparse Prompts for Domain Adaptive Segmentation of Electron Microscopy

arXiv:2602.19423v1 Announce Type: new Abstract: Domain adaptive segmentation (DAS) is a promising paradigm for delineating intracellular structures from various large-scale electron microscopy (EM) without incurring...

Software Biology

arXiv CS 5d ago

Hepato-LLaVA: An Expert MLLM with Sparse Topo-Pack Attention for Hepatocellular Pathology Analysis on Whole Slide Images

arXiv:2602.19424v1 Announce Type: new Abstract: Hepatocellular Carcinoma diagnosis relies heavily on the interpretation of gigapixel Whole Slide Images. However, current computational approaches are constrained by...

Software Health

arXiv CS 5d ago

Sizing of Battery Considering Renewable Energy Bidding Strategy with Reinforcement Learning

arXiv:2602.19428v1 Announce Type: new Abstract: This paper proposes a novel computationally efficient algorithm for optimal sizing of Battery Energy Storage Systems (BESS) considering renewable energy...

Energy Policy

arXiv CS 5d ago

TherA: Thermal-Aware Visual-Language Prompting for Controllable RGB-to-Thermal Infrared Translation

arXiv:2602.19430v1 Announce Type: new Abstract: Despite the inherent advantages of thermal infrared(TIR) imaging, large-scale data collection and annotation remain a major bottleneck for TIR-based perception....

Climate & Environment Software

arXiv CS 5d ago

CountEx: Fine-Grained Counting via Exemplars and Exclusion

arXiv:2602.19432v1 Announce Type: new Abstract: This paper presents CountEx, a discriminative visual counting framework designed to address a key limitation of existing prompt-based methods: the...

Software Policy

arXiv CS 5d ago

Why iCloud Fails: The Category Mistake of Cloud Synchronization

arXiv:2602.19433v1 Announce Type: new Abstract: iCloud Drive presents a filesystem interface but implements cloud synchronization semantics that diverge from POSIX in fundamental ways. This divergence...

Psychology Embedded Systems

arXiv CS 5d ago

FinSight-Net:A Physics-Aware Decoupled Network with Frequency-Domain Compensation for Underwater Fish Detection in Smart Aquaculture

arXiv:2602.19437v1 Announce Type: new Abstract: Underwater fish detection (UFD) is a core capability for smart aquaculture and marine ecological monitoring. While recent detectors improve accuracy...

Psychology Environment

arXiv CS 5d ago

OptiRepair: Closed-Loop Diagnosis and Repair of Supply Chain Optimization Models with LLM Agents

arXiv:2602.19439v1 Announce Type: new Abstract: Problem Definition. Supply chain optimization models frequently become infeasible because of modeling errors. Diagnosis and repair require scarce OR expertise:...

Artificial Intelligence Health

arXiv CS 5d ago

Breaking the Barriers of Database-Agnostic Transactions

arXiv:2602.19440v1 Announce Type: new Abstract: Federated transaction management has long been used as a method to virtually integrate multiple databases from a transactional perspective, ensuring...

Software World News

arXiv CS 5d ago

When AI Teammates Meet Code Review: Collaboration Signals Shaping the Integration of Agent-Authored Pull Requests

arXiv:2602.19441v1 Announce Type: new Abstract: Autonomous coding agents increasingly contribute to software development by submitting pull requests on GitHub; yet, little is known about how...

Software Robotics

arXiv CS 5d ago

UrbanAlign: Post-hoc Semantic Calibration for VLM-Human Preference Alignment

arXiv:2602.19442v1 Announce Type: new Abstract: Aligning vision-language model (VLM) outputs with human preferences in domain-specific tasks typically requires fine-tuning or reinforcement learning, both of which...

Software Policy

arXiv CS 5d ago

PIS: A Physics-Informed System for Accurate State Partitioning of $A\beta_{42}$ Protein Trajectories

arXiv:2602.19444v1 Announce Type: new Abstract: Understanding the conformational evolution of $\beta$-amyloid ($A\beta$), particularly the $A\beta_{42}$ isoform, is fundamental to elucidating the pathogenic mechanisms underlying Alzheimer's...

Biology Psychology

arXiv CS 5d ago

"Write in English, Nobody Understands Your Language Here": A Study of Non-English Trends in Open-Source Repositories

arXiv:2602.19446v1 Announce Type: new Abstract: The open-source software (OSS) community has historically been dominated by English as the primary language for code, documentation, and developer...

Software Policy

arXiv CS 5d ago

Decoupling Vision and Language: Codebook Anchored Visual Adaptation

arXiv:2602.19449v1 Announce Type: new Abstract: Large Vision-Language Models (LVLMs) use their vision encoders to translate images into representations for downstream reasoning, but the encoders often...

Health Software

arXiv CS 5d ago

Red-Teaming Claude Opus and ChatGPT-based Security Advisors for Trusted Execution Environments

arXiv:2602.19450v1 Announce Type: new Abstract: Trusted Execution Environments (TEEs) (e.g., Intel SGX and ArmTrustZone) aim to protect sensitive computation from a compromised operating system, yet...

Artificial Intelligence Policy

arXiv CS 5d ago

Dekker's floating point number system and compensated summation algorithms

arXiv:2602.19452v1 Announce Type: new Abstract: The recent hardware trend towards reduced precision computing has reignited the interest in numerical techniques that can be used to...

Hardware Technology

arXiv CS 5d ago

HD-TTA: Hypothesis-Driven Test-Time Adaptation for Safer Brain Tumor Segmentation

arXiv:2602.19454v1 Announce Type: new Abstract: Standard Test-Time Adaptation (TTA) methods typically treat inference as a blind optimization task, applying generic objectives to all or filtered...

Software Policy

arXiv CS 5d ago

SenTSR-Bench: Thinking with Injected Knowledge for Time-Series Reasoning

arXiv:2602.19455v1 Announce Type: new Abstract: Time-series diagnostic reasoning is essential for many applications, yet existing solutions face a persistent gap: general reasoning large language models...

Software World News

arXiv CS 5d ago

Optimal Error Estimates of a new Multiphysic Finite Element Method for Nonlinear Poroelasticity model with Hencky-Mises Stress Tensor

arXiv:2602.19457v1 Announce Type: new Abstract: In this paper, we develop a new multiphysics finite element method for a nonlinear poroelastic model with Hencky-Mises stress tensor....

Policy Chemistry