CUEBES

Intent Laundering: AI Safety Datasets Are Not What They Seem

arXiv:2602.16729v2 Announce Type: replace Abstract: We systematically evaluate the quality of widely used AI safety datasets from two perspectives: in isolation and in practice. In...

Technology Psychology

arXiv CS Feb 25

AI-Mediated Feedback Improves Student Revisions: A Randomized Trial with FeedbackWriter in a Large Undergraduate Course

arXiv:2602.16820v2 Announce Type: replace Abstract: Despite growing interest in using LLMs to generate feedback on students' writing, little is known about how students respond to...

Artificial Intelligence Economics

arXiv CS Feb 25

SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation

arXiv:2602.16863v2 Announce Type: replace Abstract: The ability to manipulate tools significantly expands the set of tasks a robot can perform. Yet, tool manipulation represents a...

Engineering Policy

arXiv CS Feb 25

Beyond Message Passing: A Symbolic Alternative for Expressive and Interpretable Graph Learning

arXiv:2602.16947v2 Announce Type: replace Abstract: Graph Neural Networks (GNNs) have become essential in high-stakes domains such as drug discovery, yet their black-box nature remains a...

Biology Neuroscience

arXiv CS Feb 25

Neural Proposals, Symbolic Guarantees: Neuro-Symbolic Graph Generation with Hard Constraints

arXiv:2602.16954v2 Announce Type: replace Abstract: We challenge black-box purely deep neural approaches for molecules and graph generation, which are limited in controllability and lack formal...

Chemistry Neuroscience

arXiv CS Feb 25

Multi-Probe Zero Collision Hash (MPZCH): Mitigating Embedding Collisions and Enhancing Model Freshness in Large-Scale Recommenders

arXiv:2602.17050v2 Announce Type: replace Abstract: Embedding tables are critical components of large-scale recommendation systems, facilitating the efficient mapping of high-cardinality categorical features into dense vector...

Software Policy

arXiv CS Feb 25

On the complexity of covering points by guillotine cuts

arXiv:2602.17294v2 Announce Type: replace Abstract: We show that the problem of covering a set of points in the plane with a minimum number of guillotine...

Policy

arXiv CS Feb 25

Tree crop mapping of South America reveals links to deforestation and conservation

arXiv:2602.17372v2 Announce Type: replace Abstract: Monitoring tree crop expansion is vital for zero-deforestation policies like the European Union's Regulation on Deforestation-free Products (EUDR). However, these...

European Affairs Policy

arXiv CS Feb 25

EAGLE: Expert-Augmented Attention Guidance for Tuning-Free Industrial Anomaly Detection in Multimodal Large Language Models

arXiv:2602.17419v2 Announce Type: replace Abstract: Industrial anomaly detection is important for smart manufacturing, but many deep learning approaches produce only binary decisions and provide limited...

Software Artificial Intelligence

arXiv CS Feb 25

MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning

arXiv:2602.17550v2 Announce Type: replace Abstract: Existing Reinforcement Learning with Verifiable Rewards (RLVR) algorithms, such as GRPO, rely on rigid, uniform, and symmetric trust region mechanisms...

Policy Artificial Intelligence

arXiv CS Feb 25

A Theoretical Framework for Modular Learning of Robust Generative Models

arXiv:2602.17554v2 Announce Type: replace Abstract: Training large-scale generative models is resource-intensive and relies heavily on heuristic dataset weighting. We address two fundamental questions: Can we...

Artificial Intelligence Engineering

arXiv CS Feb 25

Multi-Round Human-AI Collaboration with User-Specified Requirements

arXiv:2602.17646v2 Announce Type: replace Abstract: As humans increasingly rely on multiround conversational AI for high stakes decisions, principled frameworks are needed to ensure such interactions...

Artificial Intelligence Psychology

arXiv CS Feb 25

GPU Memory and Utilization Estimation for Training-Aware Resource Management: Opportunities and Limitations

arXiv:2602.17817v2 Announce Type: replace Abstract: Collocating deep learning training tasks improves GPU utilization but risks resource contention, severe slowdowns, and out-of-memory (OOM) failures. Accurate memory...

Hardware Technology

arXiv CS Feb 25

Games That Teach, Chats That Convince: Comparing Interactive and Static Formats for Persuasive Learning

arXiv:2602.17905v2 Announce Type: replace Abstract: Interactive systems such as chatbots and games are increasingly used to persuade and educate on sustainability-related topics, yet it remains...

Environment Policy

arXiv CS Feb 25

Context-Aware Mapping of 2D Drawing Annotations to 3D CAD Features Using LLM-Assisted Reasoning for Manufacturing Automation

arXiv:2602.18296v2 Announce Type: replace Abstract: Manufacturing automation in process planning, inspection planning, and digital-thread integration depends on a unified specification that binds the geometric features...

Engineering Embedded Systems

arXiv CS Feb 25

INSURE-Dial: A Phase-Aware Conversational Dataset & Benchmark for Compliance Verification and Phase Detection

arXiv:2602.18448v2 Announce Type: replace Abstract: Administrative phone tasks drain roughly 1 trillion USD annually from U.S. healthcare, with over 500 million insurance-benefit verification calls manually...

Health Policy

arXiv CS Feb 25

How Well Can LLM Agents Simulate End-User Security and Privacy Attitudes and Behaviors?

arXiv:2602.18464v2 Announce Type: replace Abstract: A growing body of research assumes that large language model (LLM) agents can serve as proxies for how people form...

Cybersecurity Psychology

arXiv CS Feb 25

Transforming Science Learning Materials in the Era of Artificial Intelligence

arXiv:2602.18470v2 Announce Type: replace Abstract: The integration of artificial intelligence (AI) into science education is transforming the design and function of learning materials, offering new...

Artificial Intelligence Technology

arXiv CS Feb 25

RPU -- A Reasoning Processing Unit

arXiv:2602.18568v2 Announce Type: replace Abstract: Large language model (LLM) inference performance is increasingly bottlenecked by the memory wall. While GPUs continue to scale raw compute...

Software Energy

arXiv CS Feb 25

Refactoring for Novices in Java: An Eye Tracking Study on the Extract vs. Inline Methods

arXiv:2602.18579v2 Announce Type: replace Abstract: Developers often extract methods to improve readability, understanding, and reuse, while inlining keeps logic in one block. Prior work based...

Software Neuroscience

arXiv CS Feb 25

Soft Surfaced Vision-Based Tactile Sensing for Bipedal Robot Applications

arXiv:2602.18638v2 Announce Type: replace Abstract: Legged locomotion benefits from embodied sensing, where perception emerges from the physical interaction between body and environment. We present a...

Software Robotics

arXiv CS Feb 25

MIRROR: Multimodal Iterative Reasoning via Reflection on Visual Regions

arXiv:2602.18746v2 Announce Type: replace Abstract: In the era of Vision-Language Models (VLMs), enhancing multimodal reasoning capabilities remains a critical challenge, particularly in handling ambiguous or...

Policy Biology

arXiv CS Feb 25

CRAFT-LoRA: Content-Style Personalization via Rank-Constrained Adaptation and Training-Free Fusion

arXiv:2602.18936v2 Announce Type: replace Abstract: Personalized image generation requires effectively balancing content fidelity with stylistic consistency when synthesizing images based on text and reference examples....

Software Quantum Computing

arXiv CS Feb 25

The Metaphysics We Train: A Heideggerian Reading of Machine Learning

arXiv:2602.19028v2 Announce Type: replace Abstract: This paper offers a phenomenological reading of contemporary machine learning through Heideggerian concepts, aimed at enriching practitioners' reflexive understanding of...

Artificial Intelligence World News