CUEBES

V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction

arXiv:2503.17736v2 Announce Type: replace Abstract: Large Vision-Language Models (LVLMs) have made significant strides in the field of video understanding in recent times. Nevertheless, existing video...

Cybersecurity Policy

arXiv CS Feb 4

An Overview of Low-Rank Structures in the Training and Adaptation of Large Models

arXiv:2503.19859v3 Announce Type: replace Abstract: The substantial computational demands of modern large-scale deep learning present significant challenges for efficient training and deployment. Recent research has...

Software Technology

arXiv CS Feb 4

Patronus: Interpretable Diffusion Models with Prototypes

arXiv:2503.22782v2 Announce Type: replace Abstract: Uncovering the opacity of diffusion-based generative models is urgently needed, as their applications continue to expand while their underlying procedures...

Software Energy

arXiv CS Feb 4

Language-Integrated Recursive Queries

arXiv:2504.02443v3 Announce Type: replace Abstract: Performance-critical industrial applications, including large-scale program, network, and distributed system analyses, rely on fixed-point computations. The introduction of recursive common...

Software Embedded Systems

arXiv CS Feb 4

GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning

arXiv:2504.02546v4 Announce Type: replace Abstract: Reinforcement Learning (RL) can directly enhance the reasoning capabilities of large language models without extensive reliance on Supervised Fine-Tuning (SFT)....

Software Policy

arXiv CS Feb 4

Align to Structure: Aligning Large Language Models with Structural Information

arXiv:2504.03622v2 Announce Type: replace Abstract: Generating long, coherent text remains a challenge for large language models (LLMs), as they lack hierarchical planning and structured organization...

Software Policy

arXiv CS Feb 4

Towards Quantum Universal Hypothesis Testing

arXiv:2504.16299v3 Announce Type: replace Abstract: Hoeffding's formulation and solution to the universal hypothesis testing (UHT) problem had a profound impact on many subsequent works dealing...

Quantum Computing Software

arXiv CS Feb 4

Crypto-ncRNA: a bio-inspired post-quantum cryptographic primitive exploiting RNA folding complexity

arXiv:2504.17878v2 Announce Type: replace Abstract: The imminent realization of fault-tolerant quantum computing precipitates a systemic collapse of classical public-key infrastructure and necessitates an urgent transition...

Quantum Computing Technology

arXiv CS Feb 4

Adaptive Helpfulness-Harmlessness Alignment with Preference Vectors

arXiv:2504.20106v2 Announce Type: replace Abstract: Ensuring that large language models (LLMs) are both helpful and harmless is a critical challenge, as overly strict constraints can...

Psychology Software

arXiv CS Feb 4

Advancing AI Research Assistants with Expert-Involved Learning

arXiv:2505.04638v4 Announce Type: replace Abstract: Large language models (LLMs) and large multimodal models (LMMs) promise to accelerate biomedical discovery, yet their reliability remains unclear. We...

Engineering Artificial Intelligence

arXiv CS Feb 4

Scene-Adaptive Motion Planning with Explicit Mixture of Experts and Interaction-Oriented Optimization

arXiv:2505.12311v3 Announce Type: replace Abstract: Despite over a decade of development, autonomous driving trajectory planning in complex urban environments continues to encounter significant challenges. These...

Robotics Environment

arXiv CS Feb 4

Neural Thermodynamics: Entropic Forces in Deep and Universal Representation Learning

arXiv:2505.12387v4 Announce Type: replace Abstract: With the rapid discovery of emergent phenomena in deep learning and large language models, understanding their cause has become an...

Artificial Intelligence Neuroscience

arXiv CS Feb 4

SpecFLASH: A Latent-Guided Semi-autoregressive Speculative Decoding Framework for Efficient Multimodal Generation

arXiv:2505.12728v3 Announce Type: replace Abstract: Large language models and large multimodal models (LLMs and LMMs) deliver strong generative performance but suffer from slow decoding, a...

Software Engineering

arXiv CS Feb 4

Multi-Level Monte Carlo Training of Neural Operators

arXiv:2505.12940v2 Announce Type: replace Abstract: Operator learning is a rapidly growing field that aims to approximate nonlinear operators related to partial differential equations (PDEs) using...

Neuroscience Software

arXiv CS Feb 4

Lightweight and Interpretable Transformer via Mixed Graph Algorithm Unrolling for Traffic Forecast

arXiv:2505.13102v3 Announce Type: replace Abstract: Unlike conventional "black-box" transformers with classical self-attention mechanism, we build a lightweight and interpretable transformer-like neural net by unrolling a...

Neuroscience Policy

arXiv CS Feb 4

Inferring stochastic dynamics with growth from cross-sectional data

arXiv:2505.13197v3 Announce Type: replace Abstract: Time-resolved single-cell omics data offers high-throughput, genome-wide measurements of cellular states, which are instrumental to reverse-engineer the processes underpinning cell...

Mathematics Biology

arXiv CS Feb 4

Building spatial world models from sparse transitional episodic memories

arXiv:2505.13696v2 Announce Type: replace Abstract: Many animals possess a remarkable capacity to rapidly construct flexible cognitive maps of their environments. These maps are crucial for...

Neuroscience Psychology

arXiv CS Feb 4

AudioJailbreak: Jailbreak Attacks against End-to-End Large Audio-Language Models

arXiv:2505.14103v3 Announce Type: replace Abstract: Jailbreak attacks to Large audio-language models (LALMs) are studied recently, but they exclusively focused on the attack scenario where the...

Technology Software

arXiv CS Feb 4

Abacus: A Cost-Based Optimizer for Semantic Operator Systems

arXiv:2505.14661v3 Announce Type: replace Abstract: LLMs enable an exciting new class of data processing applications over large collections of unstructured documents. Several new programming frameworks...

Software Energy

arXiv CS Feb 4

SPAR: Self-supervised Placement-Aware Representation Learning for Distributed Sensing

arXiv:2505.16936v4 Announce Type: replace Abstract: We present SPAR, a framework for self-supervised placement-aware representation learning in distributed sensing. Distributed sensing spans applications where multiple spatially...

Software Engineering

arXiv CS Feb 4

MedFrameQA: A Multi-Image Medical VQA Benchmark for Clinical Reasoning

arXiv:2505.16964v2 Announce Type: replace Abstract: Real-world clinical practice demands multi-image comparative reasoning, yet current medical benchmarks remain limited to single-frame interpretation. We present MedFrameQA, the...

World News Medicine & Health

arXiv CS Feb 4

Seeing through Satellite Images at Street Views

arXiv:2505.17001v2 Announce Type: replace Abstract: This paper studies the task of SatStreet-view synthesis, which aims to render photorealistic street-view panorama images and videos given any...

Neuroscience Chemistry

arXiv CS Feb 4

Redirection for Erasing Memory (REM): Towards a universal unlearning method for corrupted data

arXiv:2505.17730v2 Announce Type: replace Abstract: Machine unlearning is studied for a multitude of tasks, but specialization of unlearning methods to particular tasks has made their...

Neuroscience Policy

arXiv CS Feb 4

Thalia: A Global, Multi-Modal Dataset for Volcanic Activity Monitoring

arXiv:2505.17782v2 Announce Type: replace Abstract: Monitoring volcanic activity is of paramount importance to safeguarding lives, infrastructure, and ecosystems. However, only a small fraction of known...

Software Artificial Intelligence