CUEBES

LO-BCQ: Block Clustered Quantization for 4-bit (W4A4) LLM Inference

arXiv:2502.05376v2 Announce Type: replace Abstract: Post-training quantization (PTQ) is a promising approach to reducing the storage and computational requirements of large language models (LLMs) without...

Software Artificial Intelligence

arXiv CS 1d ago

V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multimodal Large Language Models

arXiv:2502.09980v4 Announce Type: replace Abstract: Current autonomous driving vehicles rely mainly on their individual sensors to understand surrounding scenes and plan for future trajectories, which...

Software Robotics

arXiv CS 1d ago

Fenchel-Young Variational Learning

arXiv:2502.10295v3 Announce Type: replace Abstract: From a variational perspective, many statistical learning criteria involve seeking a distribution that balances empirical risk and regularization. In this...

Biology Software

arXiv CS 1d ago

BoundPlanner: A convex-set-based approach to bounded manipulator trajectory planning

arXiv:2502.13286v2 Announce Type: replace Abstract: Online trajectory planning enables robot manipulators to react quickly to changing environments or tasks. Many robot trajectory planners exist for...

Software Robotics

arXiv CS 1d ago

Less is More: Improving LLM Alignment via Preference Data Selection

arXiv:2502.14560v4 Announce Type: replace Abstract: Direct Preference Optimization (DPO) has emerged as a promising approach for aligning large language models with human preferences. While prior...

Software Mathematics

arXiv CS 1d ago

RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents

arXiv:2502.16730v2 Announce Type: replace Abstract: We present RapidPen, a fully automated penetration testing (pentesting) framework that addresses the challenge of achieving an initial foothold (IP-to-Shell)...

Software Cybersecurity

arXiv CS 1d ago

HIPPO: Enhancing the Table Understanding Capability of LLMs through Hybrid-Modal Preference Optimization

arXiv:2502.17315v2 Announce Type: replace Abstract: Tabular data contains rich structural semantics and plays a crucial role in organizing and manipulating information. Recent methods employ Multi-modal...

Engineering Software

arXiv CS 1d ago

Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application

arXiv:2502.20326v2 Announce Type: replace Abstract: This paper presents the first end-to-end framework that combines guidance, navigation, and centralised task allocation for multiple UAVs performing autonomous...

Robotics Software

arXiv CS 1d ago

Steering Dialogue Dynamics for Robustness against Multi-turn Jailbreaking Attacks

arXiv:2503.00187v3 Announce Type: replace Abstract: Large language models (LLMs) are shown to be vulnerable to jailbreaking attacks where adversarial prompts are designed to elicit harmful...

Neuroscience Psychology

arXiv CS 1d ago

Policy Design in Long-Run Welfare Dynamics

arXiv:2503.00632v2 Announce Type: replace Abstract: Improving social welfare is a complex challenge requiring policymakers to optimize objectives across multiple time horizons. Evaluating the impact of...

Policy Psychology

arXiv CS 1d ago

Contextual Quantum Neural Networks for Stock Price Prediction

arXiv:2503.01884v2 Announce Type: replace Abstract: In this paper, we apply quantum machine learning (QML) to predict the stock prices of multiple assets using a contextual...

Quantum Computing Artificial Intelligence

arXiv CS 1d ago

Simulating the Real World: A Unified Survey of Multimodal Generative Models

arXiv:2503.04641v3 Announce Type: replace Abstract: Understanding and replicating the real world is a critical challenge in Artificial General Intelligence (AGI) research. To achieve this, many...

Psychology Software

arXiv CS 1d ago

Robust Multi-Objective Controlled Decoding of Large Language Models

arXiv:2503.08796v2 Announce Type: replace Abstract: We introduce Robust Multi-Objective Decoding (RMOD), a novel inference-time algorithm that robustly aligns Large Language Models (LLMs) to multiple human...

Software Policy

arXiv CS 1d ago

Measure Twice, Cut Once: A Semantic-Oriented Approach to Video Temporal Localization with Video LLMs

arXiv:2503.09027v2 Announce Type: replace Abstract: Temporally localizing user-queried events through natural language is a crucial capability for video models. Recent methods predominantly adapt video LLMs...

Software Engineering

arXiv CS 1d ago

Learning Rate Annealing Improves Tuning Robustness in Stochastic Optimization

arXiv:2503.09411v2 Announce Type: replace Abstract: The learning rate in stochastic gradient methods is a critical hyperparameter that is notoriously costly to tune via standard grid...

Policy

arXiv CS 1d ago

AudioX: A Unified Framework for Anything-to-Audio Generation

arXiv:2503.10522v3 Announce Type: replace Abstract: Audio and music generation based on flexible multimodal control signals is a widely applicable topic, with the following key challenges:...

Software Energy

arXiv CS 1d ago

Autoregressive Image Generation with Randomized Parallel Decoding

arXiv:2503.10568v3 Announce Type: replace Abstract: We introduce ARPG, a novel visual autoregressive model that enables randomized parallel generation, addressing the inherent limitations of conventional raster-order...

Software Policy

arXiv CS 1d ago

Car-1000: A New Large Scale Fine-Grained Visual Categorization Dataset

arXiv:2503.12385v2 Announce Type: replace Abstract: Fine-grained visual categorization (FGVC) is a challenging but significant task in computer vision, which aims to recognize different sub-categories of...

Software Robotics

arXiv CS 1d ago

A representational framework for learning and encoding structurally enriched trajectories in complex agent environments

arXiv:2503.13194v3 Announce Type: replace Abstract: The ability of artificial intelligence agents to make optimal decisions and generalise them to different domains and tasks is compromised...

Engineering Environment

arXiv CS 1d ago

Heuristic Methods are Good Teachers to Distill MLPs for Graph Link Prediction

arXiv:2504.06193v3 Announce Type: replace Abstract: Link prediction is a crucial graph-learning task with applications including citation prediction and product recommendation. Distilling Graph Neural Networks (GNNs)...

Software Neuroscience

arXiv CS 1d ago

S2R-HDR: A Large-Scale Rendered Dataset for HDR Fusion

arXiv:2504.07667v2 Announce Type: replace Abstract: The generalization of learning-based high dynamic range (HDR) fusion is often limited by the availability of training data, as collecting...

Software Technology

arXiv CS 1d ago

Epsilon-Neighborhood Decision-Boundary Governed Estimation (EDGE) of 2D Black Box Classifier Functions

arXiv:2504.09733v3 Announce Type: replace Abstract: Accurately estimating decision boundaries in black box systems is critical when ensuring safety, quality, and feasibility in real-world applications. However,...

Software Energy

arXiv CS 1d ago

Multiharmonic algorithms for contrast-enhanced ultrasound

arXiv:2504.13335v2 Announce Type: replace Abstract: Harmonic generation plays a crucial role in contrast-enhanced ultrasound, both for imaging and therapeutic applications. However, accurately capturing these nonlinear...

Software Mathematics

arXiv CS 1d ago

Diffusion-based Dynamic Contract for Federated AI Agent Construction in Mobile Metaverses

arXiv:2504.14326v2 Announce Type: replace Abstract: Mobile metaverses are envisioned as a transformative digital ecosystem that delivers immersive, intelligent, and ubiquitous services through mobile devices. Driven...

Artificial Intelligence Energy