GRASP: Guided Region-Aware Sparse Prompting for Adapting MLLMs to Remote Sensing
arXiv:2601.17089v1 Announce Type: new Abstract: In recent years, Multimodal Large Language Models (MLLMs) have made significant progress in visual question answering tasks. However, directly applying...