CUEBES

One-Step Flow Q-Learning: Addressing the Diffusion Policy Bottleneck in Offline Reinforcement Learning

arXiv:2508.13904v3 Announce Type: replace Abstract: Diffusion Q-Learning (DQL) has established diffusion policies as a high-performing paradigm for offline reinforcement learning, but its reliance on multi-step...

Policy Energy

arXiv CS Feb 25

Uncertainty Propagation Networks for Neural Ordinary Differential Equations

arXiv:2508.16815v2 Announce Type: replace Abstract: This paper introduces Uncertainty Propagation Network (UPN), a novel family of neural differential equations that naturally incorporate uncertainty quantification into...

Biology Neuroscience

arXiv CS Feb 25

MoSA: Motion-Coherent Human Video Generation via Structure-Appearance Decoupling

arXiv:2508.17404v3 Announce Type: replace Abstract: Existing video generation models predominantly emphasize appearance fidelity while exhibiting limited ability to synthesize complex human motions, such as whole-body...

Engineering Environment

arXiv CS Feb 25

Decouple, Reorganize, and Fuse: A Multimodal Framework for Cancer Survival Prediction

arXiv:2508.18632v2 Announce Type: replace Abstract: Cancer survival analysis commonly integrates information across diverse medical modalities to make survival-time predictions. Existing methods primarily focus on extracting...

Medicine & Health Psychology

arXiv CS Feb 25

Hybrid Deep Searcher: Scalable Parallel and Sequential Search Reasoning

arXiv:2508.19113v2 Announce Type: replace Abstract: Large reasoning models (LRMs) combined with retrieval-augmented generation (RAG) have enabled deep research agents capable of multi-step reasoning with external...

Biology Psychology

arXiv CS Feb 25

Learning Unified Representations from Heterogeneous Data for Robust Heart Rate Modeling

arXiv:2508.21785v3 Announce Type: replace Abstract: Heart rate prediction is vital for personalized health monitoring and fitness, while it frequently faces a critical challenge in real-world...

Software Health

arXiv CS Feb 25

Hierarchical Multi-Agent MCTS for Safety-Critical Coordination in Mixed-Autonomy Roundabouts

arXiv:2509.01856v4 Announce Type: replace Abstract: Navigating unsignalized roundabouts in mixed-autonomy traffic presents significant challenges due to dense vehicle interactions, lane-changing complexities, and behavioral uncertainties of...

Robotics Psychology

arXiv CS Feb 25

Adaptive Evolutionary Framework for Safe, Efficient, and Cooperative Autonomous Vehicle Interactions

arXiv:2509.07411v2 Announce Type: replace Abstract: Modern transportation systems face significant challenges in ensuring road safety, given serious injuries caused by road accidents. The rapid growth...

Biology Robotics

arXiv CS Feb 25

On the Convergence of Elementary Cellular Automata under Sequential Update Modes

arXiv:2509.07797v2 Announce Type: replace Abstract: In this paper, we perform a theoretical analysis of the sequential convergence of elementary cellular automata that have at least...

Policy Biology

arXiv CS Feb 25

PegasusFlow: Parallel Rolling-Denoising Score Sampling for Robot Diffusion Planner Flow Matching

arXiv:2509.08435v2 Announce Type: replace Abstract: Diffusion models offer powerful generative capabilities for robot trajectory planning, yet their practical deployment on robots is hindered by a...

Embedded Systems Energy

arXiv CS Feb 25

Efficiently Computing Equilibria in Budget-Aggregation Games

arXiv:2509.08767v3 Announce Type: replace Abstract: Budget aggregation deals with the social choice problem of distributing an exogenously given budget among a set of public projects,...

Energy Policy

arXiv CS Feb 25

Secure Semantic Communication over Wiretap Channels: Rate-Distortion-Equivocation Tradeoff

arXiv:2509.12142v2 Announce Type: replace Abstract: This paper investigates an information-theoretic model of secure semantic-aware communication. For this purpose, we consider the lossy joint source-channel coding...

Quantum Computing Software

arXiv CS Feb 25

An Adaptive CMSA for Solving the Longest Filled Common Subsequence Problem with an Application in Audio Querying

arXiv:2509.12261v2 Announce Type: replace Abstract: This paper addresses the Longest Filled Common Subsequence (LFCS) problem, a challenging NP-hard problem with applications in bioinformatics, including gene...

Engineering Software

arXiv CS Feb 25

A Simple and Efficient Jailbreak Method Exploiting LLMs' Helpfulness

arXiv:2509.14297v2 Announce Type: replace Abstract: This study reveals a critical safety blind spot in modern LLMs: learning-style queries, which closely resemble ordinary educational questions, can...

Policy Artificial Intelligence

arXiv CS Feb 25

ATTS: Asynchronous Test-Time Scaling via Conformal Prediction

arXiv:2509.15148v3 Announce Type: replace Abstract: Large language models (LLMs) benefit from test-time scaling but are often hampered by high inference latency. Speculative decoding is a...

Artificial Intelligence Software

arXiv CS Feb 25

Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

arXiv:2509.15796v2 Announce Type: replace Abstract: The goal of protein design is to generate amino acid sequences that fold into functional structures with desired properties. Prior...

Biology Engineering

arXiv CS Feb 25

From Samples to Scenarios: A New Paradigm for Probabilistic Forecasting

arXiv:2509.19975v2 Announce Type: replace Abstract: Most state-of-the-art probabilistic time series forecasting models rely on sampling to represent future uncertainty. However, this paradigm suffers from inherent...

Psychology Software

arXiv CS Feb 25

DS-STAR: Data Science Agent for Solving Diverse Tasks across Heterogeneous Formats and Open-Ended Queries

arXiv:2509.21825v4 Announce Type: replace Abstract: While large language models (LLMs) have shown promise in automating data science, existing agents often struggle with the complexity of...

Software World News

arXiv CS Feb 25

Why High-rank Neural Networks Generalize?: An Algebraic Framework with RKHSs

arXiv:2509.21895v2 Announce Type: replace Abstract: We derive a new Rademacher complexity bound for deep neural networks using Koopman operators, group representations, and reproducing kernel Hilbert...

Neuroscience Software

arXiv CS Feb 25

From Parameters to Behaviors: Unsupervised Compression of the Policy Space

arXiv:2509.22566v2 Announce Type: replace Abstract: Despite its recent successes, Deep Reinforcement Learning (DRL) is notoriously sample-inefficient. We argue that this inefficiency stems from the standard...

Policy Environment

arXiv CS Feb 25

RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility

arXiv:2509.23115v3 Announce Type: replace Abstract: Predicting human mobility is inherently challenging due to complex long-range dependencies and multi-scale periodic behaviors. To address this, we introduce...

Psychology Software

arXiv CS Feb 25

Multi-Order Runge-Kutta Methods or how to numerically solve initial value problems of any order

arXiv:2509.23513v4 Announce Type: replace Abstract: When one wishes to numerically solve an initial value problem, it is customary to rewrite it as an equivalent first-order...

Psychology Software

arXiv CS Feb 25

GPM: The Gaussian Pancake Mechanism for Planting Undetectable Backdoors in Differential Privacy

arXiv:2509.23834v2 Announce Type: replace Abstract: Differential privacy (DP) has become the gold standard for preserving individual privacy in data analysis. However, an implicit yet fundamental...

Software Technology

arXiv CS Feb 25

TimeOmni-1: Incentivizing Complex Reasoning with Time Series in Large Language Models

arXiv:2509.24803v3 Announce Type: replace Abstract: Recent advances in multimodal time series learning underscore a paradigm shift from analytics centered on basic patterns toward advanced time...

Psychology World News