Low-Rank Tensor Approximation of Weights in Large Language Models via Cosine Lanczos Bidiagonalization
arXiv:2601.17112v1 Announce Type: new Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse natural language tasks but suffer from extremely large memory footprints...