Pancake: Hierarchical Memory System for Multi-Agent LLM Serving
arXiv:2602.21477v1 Announce Type: new Abstract: In this work, we identify and address the core challenges of agentic memory management in LLM serving, where large-scale storage,...
arXiv:2602.21480v1 Announce Type: new Abstract: Text-to-SQL and Big Data are both extensively benchmarked fields, yet there is limited research that evaluates them jointly. In the...
arXiv:2602.21481v1 Announce Type: new Abstract: Recruiting academically strong students into NSF S-STEM scholarship programs remains a persistent challenge in computer science education. This paper presents...
arXiv:2602.21484v1 Announce Type: new Abstract: 3D object detection is essential for autonomous driving and robotic perception, yet its reliance on large-scale manually annotated data limits...
arXiv:2602.21485v1 Announce Type: new Abstract: In AI, most evaluations of natural language understanding tasks are conducted in standardized dialects such as Standard American English (SAE)....
arXiv:2602.21486v1 Announce Type: new Abstract: GenAI's ability to produce text and images is increasingly incorporated into human-AI co-creation tasks such as storytelling and video editing....
arXiv:2602.21492v1 Announce Type: new Abstract: Reinforcement learning (RL) has become a central post-training paradigm for large language models (LLMs), but its performance is highly sensitive...
arXiv:2602.21494v1 Announce Type: new Abstract: In this paper, we present new constructions of $q$-ary Singleton-optimal locally repairable codes (LRCs) with minimum distance $d=6$ and locality...
arXiv:2602.21495v1 Announce Type: new Abstract: Congestion pricing has emerged as an effective tool for mitigating traffic congestion, yet implementing welfare or revenue-optimal dynamic tolls is...
arXiv:2602.21496v1 Announce Type: new Abstract: While defenses for structured PII are mature, Large Language Models (LLMs) pose a new threat: Semantic Sensitive Information (SemSI), where...
arXiv:2602.21497v1 Announce Type: new Abstract: Recent large vision-language models (LVLMs) have demonstrated impressive reasoning ability by generating long chain-of-thought (CoT) responses. However, CoT reasoning in...
arXiv:2602.21498v1 Announce Type: new Abstract: Irregular Multivariate Time Series (IMTS) are characterized by uneven intervals between consecutive timestamps, which carry sampling pattern information valuable and...
arXiv:2602.21499v1 Announce Type: new Abstract: Existing 3D editing methods rely on computationally intensive scene-by-scene iterative optimization and suffer from multi-view inconsistency. We propose an effective...
arXiv:2602.21503v1 Announce Type: new Abstract: Identical twin face verification represents an extreme fine-grained recognition challenge where even state-of-the-art systems fail due to overwhelming genetic similarity....
arXiv:2602.21508v1 Announce Type: new Abstract: Robust watermarking is critical for intellectual property protection, whereas existing methods face a severe vulnerability against regeneration-based AIGC attacks. We...
arXiv:2602.21514v1 Announce Type: new Abstract: Approximate nearest neighbor (ANN) search on SSD-backed indexes is increasingly I/O-bound (I/O accounts for 70-90% of query latency). We present...
arXiv:2602.21515v1 Announce Type: new Abstract: Many emerging agentic paradigms require agents to collaborate with one another (or people) to achieve shared goals. Unfortunately, existing approaches...
arXiv:2602.21517v1 Announce Type: new Abstract: AI agents with tool-use capabilities show promise for integrating the domain expertise of various tools. In the medical field, however,...
arXiv:2602.21524v1 Announce Type: new Abstract: The advent of Cryptographically Relevant Quantum Computers (CRQCs) presents a fundamental and existential threat to the forensic integrity and operational...
arXiv:2602.21525v1 Announce Type: new Abstract: In this paper, we investigate the optimal real-time fusion of data collected by multiple sensors. In our set-up, the sensor...
arXiv:2602.21528v1 Announce Type: new Abstract: Guided wireless technology is an innovative approach that combines the strengths of guided waves and wireless communication. In traditional wireless...
arXiv:2602.21529v1 Announce Type: new Abstract: Rug-pull attacks pose a systemic threat across the blockchain ecosystem, yet research into early detection is hindered by the lack...
arXiv:2602.21531v1 Announce Type: new Abstract: General-purpose robots must master long-horizon manipulation, defined as tasks involving multiple kinematic structure changes (e.g., attaching or detaching objects) in...
arXiv:2602.21534v1 Announce Type: new Abstract: Agentic reinforcement learning (ARL) has rapidly gained attention as a promising paradigm for training agents to solve complex, multi-step interactive...