CUEBES

R^3: Replay, Reflection, and Ranking Rewards for LLM Reinforcement Learning

arXiv:2601.19620v2 Announce Type: replace Abstract: Large reasoning models (LRMs) aim to solve diverse and complex problems through structured reasoning. Recent advances in group-based policy optimization...

Policy Engineering

arXiv CS Jan 29

ProToken: Token-Level Attribution for Federated Large Language Models

arXiv:2601.19672v2 Announce Type: replace Abstract: Federated Learning (FL) enables collaborative training of Large Language Models (LLMs) across distributed data sources while preserving privacy. However, when...

Neuroscience Software

arXiv CS Jan 29

Benchmarking Multimodal Large Language Models for Missing Modality Completion in Product Catalogues

arXiv:2601.19750v2 Announce Type: replace Abstract: Missing-modality information on e-commerce platforms, such as absent product images or textual descriptions, often arises from annotation errors or incomplete...

Software Policy

arXiv CS Jan 29

GeoDiff3D: Self-Supervised 3D Scene Generation with Geometry-Constrained 2D Diffusion Guidance

arXiv:2601.19785v2 Announce Type: replace Abstract: 3D scene generation is a core technology for gaming, film/VFX, and VR/AR. Growing demand for rapid iteration, high-fidelity detail, and...

Technology Engineering

arXiv CS Jan 29

LVLMs and Humans Ground Differently in Referential Communication

arXiv:2601.19792v2 Announce Type: replace Abstract: For generative AI agents to partner effectively with human users, the ability to accurately predict human intent is critical. But...

Policy Artificial Intelligence

arXiv CS Jan 29

Zero-Shot Stance Detection in the Wild: Dynamic Target Generation and Multi-Target Adaptation

arXiv:2601.19802v2 Announce Type: replace Abstract: Current stance detection research typically relies on predicting stance based on given targets and text. However, in real-world social media...

Psychology World News

arXiv CS Jan 29

Equitable Routing--Rethinking the Multiple Traveling Salesman Problem

arXiv:2404.08157v5 Announce Type: replace-cross Abstract: The Multiple Traveling Salesman Problem (MTSP) extends the traveling salesman problem by assigning multiple salesmen to visit a set of...

Software World News

arXiv CS Jan 29

Analyzing decision tree bias towards the minority class

arXiv:2501.04903v4 Announce Type: replace-cross Abstract: There is a widespread and longstanding belief that machine learning models are biased towards the majority class when learning from...

Software Policy

arXiv CS Jan 29

X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second

arXiv:2503.06382v2 Announce Type: replace-cross Abstract: Sparse-view 3D CT reconstruction aims to recover volumetric structures from a limited number of 2D X-ray projections. Existing feedforward methods...

Software Neuroscience

arXiv CS Jan 29

Fractal and Regular Geometry of Deep Neural Networks

arXiv:2504.06250v2 Announce Type: replace-cross Abstract: We study the geometric properties of random neural networks by investigating the boundary volumes of their excursion sets for different...

Neuroscience Psychology

arXiv CS Jan 29

Improved bounds on the zeros of the chromatic polynomial of graphs and claw-free graphs

arXiv:2505.04366v2 Announce Type: replace-cross Abstract: We prove that for any graph $G$ the (complex) zeros of its chromatic polynomial, $\chi_G(x)$, lie inside the disk centered...

Policy

arXiv CS Jan 29

Simplicity is Key: An Unsupervised Pretraining Approach for Sparse Radio Channels

arXiv:2505.13055v3 Announce Type: replace-cross Abstract: Unsupervised representation learning for wireless channel state information (CSI)reduces reliance on labeled data, thereby lowering annotation costs, and often improves...

Software Politics

arXiv CS Jan 29

Determining unit groups and $\mathrm{K}_1$ of finite rings

arXiv:2506.00266v2 Announce Type: replace-cross Abstract: We consider the computational problem of determining the unit group of a finite ring, by which we mean the computation...

Policy Artificial Intelligence

arXiv CS Jan 29

Can AI Master Econometrics? Evidence from Econometrics AI Agent on Expert-Level Tasks

arXiv:2506.00856v3 Announce Type: replace-cross Abstract: Can AI effectively perform complex econometric analysis traditionally requiring human expertise? This paper evaluates AI agents' capability to master econometrics,...

Software Artificial Intelligence

arXiv CS Jan 29

Confidence intervals for forced alignment boundaries using model ensembles

arXiv:2506.01256v3 Announce Type: replace-cross Abstract: Forced alignment is a common tool to align audio with orthographic and phonetic transcriptions. Most forced alignment tools provide only...

Technology Neuroscience

arXiv CS Jan 29

Online Conformal Model Selection for Nonstationary Time Series

arXiv:2506.05544v2 Announce Type: replace-cross Abstract: This paper introduces the MPS (Model Prediction Set), a novel framework for online model selection for nonstationary time series. Classical...

World News Biology

arXiv CS Jan 29

BRISC: Annotated Dataset for Brain Tumor Segmentation and Classification

arXiv:2506.14318v5 Announce Type: replace-cross Abstract: Accurate segmentation and classification of brain tumors from Magnetic Resonance Imaging (MRI) remain key challenges in medical image analysis, primarily...

Neuroscience Policy

arXiv CS Jan 29

Telegrapher's Generative Model via Kac Flows

arXiv:2506.20641v5 Announce Type: replace-cross Abstract: We break the mold in flow-based generative modeling by proposing a new model based on the damped wave equation, also...

Mathematics Neuroscience

arXiv CS Jan 29

AGFS-Tractometry: A Novel Atlas-Guided Fine-Scale Tractometry Approach for Enhanced Along-Tract Group Statistical Comparison Using Diffusion MRI Tractography

arXiv:2507.10601v2 Announce Type: replace-cross Abstract: Diffusion MRI (dMRI) tractography is currently the only method for in vivo mapping of the brain's white matter (WM) connections....

Software Genetics

arXiv CS Jan 29

Random forest-based out-of-distribution detection for robust lung cancer segmentation

arXiv:2508.19112v4 Announce Type: replace-cross Abstract: Accurate detection and segmentation of cancerous lesions from computed tomography (CT) scans is essential for automated treatment planning and cancer...

Software Medicine & Health

arXiv CS Jan 29

Some Robustness Properties of Label Cleaning

arXiv:2509.11379v2 Announce Type: replace-cross Abstract: We demonstrate that learning procedures that rely on aggregated labels, e.g., label information distilled from noisy responses, enjoy robustness properties...

Software Policy

arXiv CS Jan 29

Blind Source Separation of Radar Signals in Time Domain Using Deep Learning

arXiv:2509.15603v2 Announce Type: replace-cross Abstract: Identification and further analysis of radar emitters in a contested environment requires detection and separation of incoming signals. If they...

Artificial Intelligence Neuroscience

arXiv CS Jan 29

ArchesClimate: Probabilistic Decadal Ensemble Generation With Flow Matching

arXiv:2509.15942v2 Announce Type: replace-cross Abstract: Climate projections have uncertainties related to components of the climate system and their interactions. A typical approach to quantifying these...

Climate & Environment Software

arXiv CS Jan 29

Zeroth-Order Constrained Optimization from a Control Perspective via Feedback Linearization

arXiv:2509.24056v2 Announce Type: replace-cross Abstract: Safe derivative-free optimization under unknown constraints is a fundamental challenge in modern learning and control. Existing zeroth-order (ZO) methods typically...

Technology Psychology