Evaluating Actionability in Explainable AI
arXiv:2601.20086v1 Announce Type: new Abstract: A core assumption of Explainable AI (XAI) is that explanations are useful to users -- that is, users will do...
Stay updated with the latest research and technology news
arXiv:2601.20086v1 Announce Type: new Abstract: A core assumption of Explainable AI (XAI) is that explanations are useful to users -- that is, users will do...
arXiv:2601.20090v1 Announce Type: new Abstract: Large language model (LLM)-powered agents can translate high-level user intents into plans and actions in an environment. Yet after observing...
arXiv:2601.20099v1 Announce Type: new Abstract: Humans and large language models (LLMs) now co-produce and co-consume the web's shared knowledge archives. Such human-AI collective knowledge ecosystems...
arXiv:2601.20100v1 Announce Type: new Abstract: Generative AI chatbots have proven surprisingly effective at persuading people to change their beliefs and attitudes in lab settings. However,...
arXiv:2601.20102v1 Announce Type: new Abstract: Engineering sustainable and equitable healthcare requires medical language models that do not change clinically correct diagnoses when presented with non-decisive...
arXiv:2601.20103v1 Announce Type: new Abstract: Recent advances in reinforcement learning for code generation have made robust environments essential to prevent reward hacking. As LLMs increasingly...
arXiv:2601.20104v1 Announce Type: new Abstract: Nuclei instance segmentation in hematoxylin and eosin (H&E)-stained images plays an important role in automated histological image analysis, with various...
arXiv:2601.20105v1 Announce Type: new Abstract: Figurative language, particularly fixed figurative expressions (FFEs) such as idioms and proverbs, poses persistent challenges for large language models (LLMs)....
arXiv:2601.20106v1 Announce Type: new Abstract: Autonomous AI agents are transforming software development and redefining how developers collaborate with AI. Prior research shows that the adoption...
arXiv:2601.20107v1 Announce Type: new Abstract: Recent Vision-Language Models (e.g., ColPali) enable fine-grained Visual Document Retrieval (VDR) but incur prohibitive index vector size overheads. Training-free pruning...
arXiv:2601.20109v1 Announce Type: new Abstract: The increasing adoption of AI coding agents has increased the number of agent-generated pull requests (PRs) merged with little or...
arXiv:2601.20112v1 Announce Type: new Abstract: The rise of large language models (LLMs) has accelerated the development of automated techniques and tools for supporting various software...
arXiv:2601.20113v1 Announce Type: new Abstract: The growing volume of scientific simulation data presents a significant challenge for storage and transfer. Error-bounded lossy compression has emerged...
arXiv:2601.20115v1 Announce Type: new Abstract: Graphics Processing Units (GPUs) are the state-of-the-art architecture for essential tasks, ranging from rendering 2D/3D graphics to accelerating workloads in...
arXiv:2601.20116v1 Announce Type: new Abstract: Transformer models have achieved remarkable empirical successes, largely due to their in-context learning capabilities. Inspired by this, we explore training...
arXiv:2601.20118v1 Announce Type: new Abstract: To advance Polar code design for 6G applications, we develop a reinforcement learning-based universal sequence design framework that is extensible...
arXiv:2601.20119v1 Announce Type: new Abstract: Strength-of-connection algorithms play a key role in algebraic multigrid (AMG). Specifically, they determine which matrix nonzeros are classified as weak...
arXiv:2601.20120v1 Announce Type: new Abstract: Since its introduction, Facebook Prophet has attracted positive attention from both classical statisticians and the Bayesian statistics community. The model...
arXiv:2601.20125v1 Announce Type: new Abstract: Diffusion Language Models (DLMs) represent a promising alternative to autoregressive language models, using bidirectional masked token prediction. Yet their susceptibility...
arXiv:2601.20126v1 Announce Type: new Abstract: Large Language Models (LLMs) often produce hallucinated or unverifiable content, undermining their reliability in factual domains. This work investigates Reinforcement...
arXiv:2601.20129v1 Announce Type: new Abstract: Sentiment analysis for the Bengali language has attracted increasing research interest in recent years. However, progress remains constrained by the...
arXiv:2601.20130v1 Announce Type: new Abstract: Real-time execution is essential for cyber-physical systems such as robots. These systems operate in dynamic real-world environments where even small...
arXiv:2601.20131v1 Announce Type: new Abstract: Designing an embedding retrieval system requires navigating a complex design space of conflicting trade-offs between efficiency and effectiveness. This work...
arXiv:2601.20135v1 Announce Type: new Abstract: This paper gives an overview of the use of control systems engineering in synthetic biology, motivated by applications such as...