Characterizing LLM Inference Energy-Performance Tradeoffs across Workloads and GPU Scaling
arXiv:2501.08219v4 Announce Type: replace

Abstract: LLM inference exhibits substantial variability across queries and execution phases, yet inference configurations are often applied uniformly. We present a...