CUEBES

The Double Life of Code World Models: Provably Unmasking Malicious Behavior Through Execution Traces

arXiv:2512.13821v2 Announce Type: replace Abstract: Large language models (LLMs) increasingly generate code with minimal human oversight, raising critical concerns about backdoor injection and malicious behavior....

Psychology Software

arXiv CS Feb 6

Softly Constrained Denoisers for Diffusion Models

arXiv:2512.14980v3 Announce Type: replace Abstract: Diffusion models struggle to produce samples that respect constraints, a common requirement in scientific applications. Recent approaches have introduced regularization...

Software Energy

arXiv CS Feb 6

A 96pJ/Frame/Pixel and 61pJ/Event Anti-UAV System with Hybrid Object Tracking Modes

arXiv:2512.17939v2 Announce Type: replace Abstract: We present an energy-efficient anti-UAV system that integrates frame-based and event-driven object tracking to enable reliable detection of small and...

Technology Robotics

arXiv CS Feb 6

VisionDirector: Vision-Language Guided Closed-Loop Refinement for Generative Image Synthesis

arXiv:2512.19243v3 Announce Type: replace Abstract: Generative models can now produce photorealistic imagery, yet they still struggle with the long, multi-goal prompts that professional designers issue....

World News Policy

arXiv CS Feb 6

(Im)possibility of Incentive Design for Challenge-based Blockchain Protocols

arXiv:2512.20864v2 Announce Type: replace Abstract: Blockchains offer a decentralized and secure execution environment strong enough to host cryptocurrencies, but the state-replication model makes on-chain computation...

Environment Policy

arXiv CS Feb 6

See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

arXiv:2512.22120v2 Announce Type: replace Abstract: Large vision-language models (VLMs) often benefit from intermediate visual cues, either injected via external tools or generated as latent visual...

Biology Software

arXiv CS Feb 6

Diversity or Precision? A Deep Dive into Next Token Prediction

arXiv:2512.22955v3 Announce Type: replace Abstract: Recent advancements have shown that reinforcement learning (RL) can substantially improve the reasoning abilities of large language models (LLMs). The...

Policy Software

arXiv CS Feb 6

Active Perception Agent for Omnimodal Audio-Video Understanding

arXiv:2512.23646v2 Announce Type: replace Abstract: Omnimodal large language models have made significant strides in unifying audio and visual modalities; however, they often face challenges in...

Robotics Software

arXiv CS Feb 6

Colorful Pinball: Density-Weighted Quantile Regression for Conditional Guarantee of Conformal Prediction

arXiv:2512.24139v4 Announce Type: replace Abstract: While conformal prediction provides robust marginal coverage guarantees, achieving reliable conditional coverage for specific inputs remains challenging. Although exact distribution-free...

Software World News

arXiv CS Feb 6

Deep Probabilistic Supervision for Image Classification

arXiv:2512.24162v2 Announce Type: replace Abstract: Supervised training of deep neural networks for classification typically relies on hard targets, which promote overconfidence and can limit calibration,...

Neuroscience Policy

arXiv CS Feb 6

RANGER: A Monocular Zero-Shot Semantic Navigation Framework through Contextual Adaptation

arXiv:2512.24212v2 Announce Type: replace Abstract: Efficiently finding targets in complex environments is fundamental to real-world embodied applications. While recent advances in multimodal foundation models have...

Software Robotics

arXiv CS Feb 6

Quantifying and Inducing Shape Bias in CNNs via Max-Pool Dilation

arXiv:2601.05599v2 Announce Type: replace Abstract: Convolutional Neural Networks (CNNs) exhibit a well-known texture bias, prioritizing local patterns over global shapes - a tendency inherent to...

Neuroscience Engineering

arXiv CS Feb 6

Quantification and Classification of Carbon Nanotubes in Electron Micrographs using Vision Foundation Models

arXiv:2601.06673v2 Announce Type: replace Abstract: Accurate characterization of carbon nanotube morphologies in electron microscopy images is vital for exposure assessment and toxicological studies, yet current...

Materials Science Physics

arXiv CS Feb 6

Test-time Adaptive Hierarchical Co-enhanced Denoising Network for Reliable Multimodal Classification

arXiv:2601.07163v2 Announce Type: replace Abstract: Reliable learning of multimodal data (e.g., multi-omics) is a widely concerning issue, especially in safety-critical applications such as medical diagnosis....

Software Health

arXiv CS Feb 6

SIRR-LMM: Single-image Reflection Removal via Large Multimodal Model

arXiv:2601.07209v2 Announce Type: replace Abstract: Glass surfaces create complex interactions of reflected and transmitted light, making single-image reflection removal (SIRR) challenging. Existing datasets suffer from...

Software Policy

arXiv CS Feb 6

ESDD2: Environment-Aware Speech and Sound Deepfake Detection Challenge Evaluation Plan

arXiv:2601.07303v5 Announce Type: replace Abstract: Audio recorded in real-world environments often contains a mixture of foreground speech and background environmental sounds. With rapid advances in...

World News Biology

arXiv CS Feb 6

Resisting Manipulative Bots in Meme Coin Copy Trading: A Multi-Agent Approach with Chain-of-Thought Reasoning

arXiv:2601.08641v3 Announce Type: replace Abstract: Copy trading has become the dominant entry strategy in meme coin markets. However, due to the market's extremely illiquid and...

Economics Finance

arXiv CS Feb 6

An Example for Domain Adaptation Using CycleGAN

arXiv:2601.08776v2 Announce Type: replace Abstract: Cycle-Consistent Adversarial Network (CycleGAN) is very promising in domain adaptation. In this report, an example in medical domain will be...

Policy Medicine & Health

arXiv CS Feb 6

GeoRA: Geometry-Aware Low-Rank Adaptation for RLVR

arXiv:2601.09361v2 Announce Type: replace Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) is crucial for advancing large-scale reasoning models. However, existing parameter-efficient methods, such as PiSSA...

Hardware Biology

arXiv CS Feb 6

Nonlinear numerical schemes using specular differentiation for initial value problems of first-order ordinary differential equations

arXiv:2601.09900v2 Announce Type: replace Abstract: This paper proposes specular differentiation in one-dimensional Euclidean space and provides its fundamental analysis, including a quasi-Fermat's theorem and quasi-Mean...

Software Mathematics

arXiv CS Feb 6

SolarGPT-QA: A Domain-Adaptive Large Language Model for Educational Question Answering in Space Weather and Heliophysics

arXiv:2601.12131v3 Announce Type: replace Abstract: Solar activity, including solar flares, coronal mass ejections (CMEs), and geomagnetic storms, can significantly impact satellites, aviation, power grids, data...

Energy Artificial Intelligence

arXiv CS Feb 6

Streaming Operator Inference for Model Reduction of Large-Scale Dynamical Systems

arXiv:2601.12161v2 Announce Type: replace Abstract: Projection-based model reduction enables efficient simulation of complex dynamical systems by constructing low-dimensional surrogate models from high-dimensional data. The Operator...

Software Psychology

arXiv CS Feb 6

AlphaSyndrome: Tackling the Syndrome Measurement Circuit Scheduling Problem for QEC Codes

arXiv:2601.12509v2 Announce Type: replace Abstract: Quantum error correction (QEC) is essential for scalable quantum computing, yet repeated syndrome-measurement cycles dominate its spacetime and hardware cost....

Hardware Quantum Computing

arXiv CS Feb 6

Improving Low-Resource Machine Translation via Round-Trip Reinforcement Learning

arXiv:2601.12535v2 Announce Type: replace Abstract: Low-resource machine translation (MT) has gained increasing attention as parallel data from low-resource language communities is collected, but many potential...

Software Policy