CQ-CiM: Hardware-Aware Embedding Shaping for Robust CiM-Based Retrieval
arXiv:2602.20083v2 Announce Type: replace Abstract: Deploying Retrieval-Augmented Generation (RAG) on edge devices is in high demand, but is hindered by the latency of massive data...