CUEBES

Harmonia: Algorithm-Hardware Co-Design for Memory- and Compute-Efficient BFP-based LLM Inference

arXiv:2602.04595v2 Announce Type: replace Abstract: Large Language Models (LLMs) are powerful but incur high memory and computation costs. Quantization is an effective solution, with INT...

Hardware Technology

arXiv CS 3d ago

EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization

arXiv:2602.05165v3 Announce Type: replace Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has proven effective for enhancing the reasoning capabilities of Large Language Models (LLMs). However,...

Policy Artificial Intelligence

arXiv CS 3d ago

Bagpiper: Solving Open-Ended Audio Tasks via Rich Captions

arXiv:2602.05220v2 Announce Type: replace Abstract: Current audio foundation models typically rely on rigid, task-specific supervision, addressing isolated factors of audio rather than the whole. In...

Software Neuroscience

arXiv CS 3d ago

Transport and Merge: Cross-Architecture Merging for Large Language Models

arXiv:2602.05495v2 Announce Type: replace Abstract: Large language models (LLMs) achieve strong capabilities by scaling model capacity and training data, yet many real-world deployments rely on...

Neuroscience Software

arXiv CS 3d ago

SweetSpot: An Analytical Model for Predicting Energy Efficiency of LLM Inference

arXiv:2602.05695v2 Announce Type: replace Abstract: Large Language Models (LLMs) inference is central to modern AI applications, dominating worldwide datacenter workloads, making it critical to predict...

Software Artificial Intelligence

arXiv CS 3d ago

Landscaper: Understanding Loss Landscapes Through Multi-Dimensional Topological Analysis

arXiv:2602.07135v3 Announce Type: replace Abstract: Loss landscapes are a powerful tool for understanding neural network optimization and generalization, yet traditional low-dimensional analyses often miss complex...

Artificial Intelligence Neuroscience

arXiv CS 3d ago

Humanizing AI Grading: Student-Centered Insights on Fairness, Trust, Consistency and Transparency

arXiv:2602.07754v2 Announce Type: replace Abstract: This study investigates students' perceptions of Artificial Intelligence (AI) grading systems in an undergraduate computer science course (n = 27),...

Artificial Intelligence Environment

arXiv CS 3d ago

Geometry-Aware Rotary Position Embedding for Consistent Video World Model

arXiv:2602.07854v3 Announce Type: replace Abstract: Predictive world models that simulate future observations under explicit camera control are fundamental to interactive AI. Despite rapid advances, current...

Psychology World News

arXiv CS 3d ago

ToolSelf: Unifying Task Execution and Self-Reconfiguration via Tool-Driven Intrinsic Adaptation

arXiv:2602.07883v2 Announce Type: replace Abstract: Agentic systems powered by Large Language Models (LLMs) have demonstrated remarkable potential in tackling complex, long-horizon tasks. However, their efficacy...

Psychology Biology

arXiv CS 3d ago

Interpretable Failure Analysis in Multi-Agent Reinforcement Learning Systems

arXiv:2602.08104v2 Announce Type: replace Abstract: Multi-Agent Reinforcement Learning (MARL) is increasingly deployed in safety-critical domains, yet methods for interpretable failure detection and attribution remain underdeveloped....

Policy Health

arXiv CS 3d ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

arXiv:2602.08354v2 Announce Type: replace Abstract: Recent advancements in large reasoning models (LRMs) have greatly improved their capabilities on complex reasoning tasks through Long Chains of...

Software Psychology

arXiv CS 3d ago

Causal Schr\"odinger Bridges: Constrained Optimal Transport on Structural Manifolds

arXiv:2602.08535v4 Announce Type: replace Abstract: Generative modeling typically seeks the path of least action via deterministic flows (ODE). While effective for in-distribution tasks, we argue...

Mathematics Policy

arXiv CS 3d ago

GOT-Edit: Geometry-Aware Generic Object Tracking via Online Model Editing

arXiv:2602.08550v2 Announce Type: replace Abstract: Human perception for effective object tracking in a 2D video stream arises from the implicit use of prior 3D knowledge...

Software Business

arXiv CS 3d ago

Breaking the Simplification Bottleneck in Amortized Neural Symbolic Regression

arXiv:2602.08885v4 Announce Type: replace Abstract: Symbolic regression (SR) aims to discover interpretable analytical expressions that accurately describe observed data. Amortized SR promises to be much...

Genetics Neuroscience

arXiv CS 3d ago

Agent Banana: High-Fidelity Image Editing with Agentic Thinking and Tooling

arXiv:2602.09084v2 Announce Type: replace Abstract: We study instruction-based image editing under professional workflows and identify three persistent challenges: (i) editors often over-edit, modifying content beyond...

Policy

arXiv CS 3d ago

AgentCgroup: Understanding and Controlling OS Resources of AI Agents

arXiv:2602.09345v2 Announce Type: replace Abstract: AI agents are increasingly deployed in multi-tenant cloud environments, where they execute diverse tool calls within sandboxed containers, each call...

Engineering Artificial Intelligence

arXiv CS 3d ago

Toward Linking Declined Proposals and Source Code: An Exploratory Study on the Go Repository

arXiv:2602.09467v3 Announce Type: replace Abstract: Traceability links are key information sources for software developers, connecting software artifacts. Such links play an important role, particularly between...

Software Technology

arXiv CS 3d ago

ReSIM: Re-ranking Binary Similarity Embeddings to Improve Function Search Performance

arXiv:2602.09548v2 Announce Type: replace Abstract: Binary Function Similarity (BFS), the problem of determining whether two binary functions originate from the same source code, has been...

Engineering Embedded Systems

arXiv CS 3d ago

Time2General: Learning Spatiotemporal Invariant Representations for Domain-Generalization Video Semantic Segmentation

arXiv:2602.09648v2 Announce Type: replace Abstract: Domain Generalized Video Semantic Segmentation (DGVSS) is trained on a single labeled driving domain and is directly deployed on unseen...

Software Policy

arXiv CS 3d ago

SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

arXiv:2602.10116v2 Announce Type: replace Abstract: Real-world data collection for embodied agents remains costly and unsafe, calling for scalable, realistic, and simulator-ready 3D environments. However, existing...

Policy Environment

arXiv CS 3d ago

Unconditionally Long-Time Stable Variable-Step Second-Order Exponential Time-Differencing Schemes for the Incompressible NSE

arXiv:2602.10268v2 Announce Type: replace Abstract: We develop an efficient, unconditionally stable, variable step second order exponential time differencing scheme for the incompressible Navier Stokes equations...

Embedded Systems Mathematics

arXiv CS 3d ago

TestExplora: Benchmarking LLMs for Proactive Bug Discovery via Repository-Level Test Generation

arXiv:2602.10471v2 Announce Type: replace Abstract: Given that Large Language Models (LLMs) are increasingly applied to automate software development, comprehensive software assurance spans three distinct goals:...

Software Artificial Intelligence

arXiv CS 3d ago

TSR: Trajectory-Search Rollouts for Multi-Turn RL of LLM Agents

arXiv:2602.11767v2 Announce Type: replace Abstract: Advances in large language models (LLMs) are driving a shift toward using reinforcement learning (RL) to train agents from iterative,...

Environment Software

arXiv CS 3d ago

Wisdom of the LLM Crowd: A Large Scale Benchmark of Multi-Label U.S. Election-Related Harmful Social Media Content

arXiv:2602.11962v2 Announce Type: replace Abstract: The spread of election misinformation and harmful political content conveys misleading narratives and poses a serious threat to democratic integrity....

Politics Psychology