Temperature-Aware Scheduling of LLM Inference in Large-Scale Geo-Distributed Edge Data Centers with Distributed Optimization
arXiv:2603.07810v1 Announce Type: new Abstract: The environmental impact of Large Language Models (LLMs) on data centers hosting these models is becoming a significant concern. While...