CUEBES

GRASP: Guided Region-Aware Sparse Prompting for Adapting MLLMs to Remote Sensing

arXiv:2601.17089v1 Announce Type: new Abstract: In recent years, Multimodal Large Language Models (MLLMs) have made significant progress in visual question answering tasks. However, directly applying...

Software Energy

arXiv CS Jan 28

SFO: Learning PDE Operators via Spectral Filtering

arXiv:2601.17090v1 Announce Type: new Abstract: Partial differential equations (PDEs) govern complex systems, yet neural operators often struggle to efficiently capture the long-range, nonlocal interactions inherent...

Neuroscience Chemistry

arXiv CS Jan 28

CUROCKET: Optimizing ROCKET for GPU

arXiv:2601.17091v1 Announce Type: new Abstract: ROCKET (RandOm Convolutional KErnel Transform) is a feature extraction algorithm created for Time Series Classification (TSC), published in 2019. It...

Software Artificial Intelligence

arXiv CS Jan 28

The Triangle of Similarity: A Multi-Faceted Framework for Comparing Neural Network Representations

arXiv:2601.17093v1 Announce Type: new Abstract: Comparing neural network representations is essential for understanding and validating models in scientific applications. Existing methods, however, often provide a...

Software Biology

arXiv CS Jan 28

Boltzmann-GPT: Bridging Energy-Based World Models and Language Generation

arXiv:2601.17094v1 Announce Type: new Abstract: Large Language Models (LLMs) generate fluent text, yet whether they truly understand the world or merely produce plausible language about...

Artificial Intelligence Neuroscience

arXiv CS Jan 28

LoD Sketch Extraction from Architectural Models Using Generative AI: Dataset Construction for Multi-Level Architectural Design Generation

arXiv:2601.17095v1 Announce Type: new Abstract: For architectural design, representation across multiple Levels of Details (LoD) is essential for achieving a smooth transition from conceptual massing...

Software Artificial Intelligence

arXiv CS Jan 28

Beyond Instrumental and Substitutive Paradigms: Introducing Machine Culture as an Emergent Phenomenon in Large Language Models

arXiv:2601.17096v1 Announce Type: new Abstract: Recent scholarship typically characterizes Large Language Models (LLMs) through either an \textit{Instrumental Paradigm} (viewing models as reflections of their developers'...

Quantum Computing Software

arXiv CS Jan 28

Sink or SWIM: Tackling Real-Time ASR at Scale

arXiv:2601.17097v1 Announce Type: new Abstract: Real-time automatic speech recognition systems are increasingly integrated into interactive applications, from voice assistants to live transcription services. However, scaling...

Software

arXiv CS Jan 28

StealthMark: Harmless and Stealthy Ownership Verification for Medical Segmentation via Uncertainty-Guided Backdoors

arXiv:2601.17107v1 Announce Type: new Abstract: Annotating medical data for training AI models is often costly and limited due to the shortage of specialists with relevant...

Software Medicine & Health

arXiv CS Jan 28

MambaNet: Mamba-assisted Channel Estimation Neural Network With Attention Mechanism

arXiv:2601.17108v1 Announce Type: new Abstract: This paper proposes a Mamba-assisted neural network framework incorporating self-attention mechanism to achieve improved channel estimation with low complexity for...

Neuroscience Artificial Intelligence

arXiv CS Jan 28

Forecasting Energy Consumption using Recurrent Neural Networks: A Comparative Analysis

arXiv:2601.17110v1 Announce Type: new Abstract: Accurate short-term energy consumption forecasting is essential for efficient power grid management, resource allocation, and market stability. Traditional time-series models...

Software Energy

arXiv CS Jan 28

Least-Loaded Expert Parallelism: Load Balancing An Imbalanced Mixture-of-Experts

arXiv:2601.17111v1 Announce Type: new Abstract: Mixture-of-Experts (MoE) models are typically pre-trained with explicit load-balancing constraints to ensure statistically balanced expert routing. Despite this, we observe...

Artificial Intelligence Technology

arXiv CS Jan 28

Low-Rank Tensor Approximation of Weights in Large Language Models via Cosine Lanczos Bidiagonalization

arXiv:2601.17112v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse natural language tasks but suffer from extremely large memory footprints...

Software Artificial Intelligence

arXiv CS Jan 28

Acoustic Field Video for Multimodal Scene Understanding

arXiv:2601.17123v1 Announce Type: new Abstract: We introduce and explore a new multimodal input representation for vision-language models: acoustic field video. Unlike conventional video (RGB with...

Robotics Energy

arXiv CS Jan 28

iFSQ: Improving FSQ for Image Generation with 1 Line of Code

arXiv:2601.17124v2 Announce Type: new Abstract: The field of image generation is currently bifurcated into autoregressive (AR) models operating on discrete tokens and diffusion models utilizing...

Software Energy

arXiv CS Jan 28

How does Graph Structure Modulate Membership-Inference Risk for Graph Neural Networks?

arXiv:2601.17130v1 Announce Type: new Abstract: Graph neural networks (GNNs) have become the standard tool for encoding data and their complex relationships into continuous representations, improving...

Software Artificial Intelligence

arXiv CS Jan 28

Equilibrium Refinements Improve Subgame Solving in Imperfect-Information Games

arXiv:2601.17131v1 Announce Type: new Abstract: Subgame solving is a technique for scaling algorithms to large games by locally refining a precomputed blueprint strategy during gameplay....

Technology Artificial Intelligence

arXiv CS Jan 28

From Emotion to Expression: Theoretical Foundations and Resources for Fear Speech

arXiv:2601.17132v1 Announce Type: new Abstract: Few forces rival fear in their ability to mobilize societies, distort communication, and reshape collective behavior. In computational linguistics, fear...

Engineering Software

arXiv CS Jan 28

Learning to Collaborate: An Orchestrated-Decentralized Framework for Peer-to-Peer LLM Federation

arXiv:2601.17133v1 Announce Type: new Abstract: Fine-tuning Large Language Models (LLMs) for specialized domains is constrained by a fundamental challenge: the need for diverse, cross-organizational data...

Artificial Intelligence Software

arXiv CS Jan 28

Deconstructing Taste: Toward a Human-Centered AI Framework for Modeling Consumer Aesthetic Perceptions

arXiv:2601.17134v1 Announce Type: new Abstract: Understanding and modeling consumers' stylistic taste such as "sporty" is crucial for creating designs that truly connect with target audiences....

Artificial Intelligence Biology

arXiv CS Jan 28

ConceptACT: Episode-Level Concepts for Sample-Efficient Robotic Imitation Learning

arXiv:2601.17135v1 Announce Type: new Abstract: Imitation learning enables robots to acquire complex manipulation skills from human demonstrations, but current methods rely solely on low-level sensorimotor...

Software Robotics

arXiv CS Jan 28

Communication-Avoiding Linear Algebraic Kernel K-Means on GPUs

arXiv:2601.17136v1 Announce Type: new Abstract: Clustering is an important tool in data analysis, with K-means being popular for its simplicity and versatility. However, it cannot...

Software Energy

arXiv CS Jan 28

Bowling Online: Accounting for Civil Society Reshaped into Streamlined Photons within a Fiber Network

arXiv:2601.17139v1 Announce Type: new Abstract: Civil society has been deemed by various scholars, such as Robert D. Putnam, to be a predictor and a cornerstone...

Technology Engineering

arXiv CS Jan 28

Exploring EEG-driven brain-heart coupling across sleep stages in individuals with sleep disorders

arXiv:2601.17149v1 Announce Type: new Abstract: The interactions between the brain and heart during sleep are responsible for regulating autonomic function. While brain-heart coupling has been...

Neuroscience Energy