Visual Self-Refine: A Pixel-Guided Paradigm for Accurate Chart Parsing
arXiv:2602.16455v1 Announce Type: new Abstract: While Large Vision-Language Models (LVLMs) have demonstrated remarkable capabilities for reasoning and self-correction at the textual level, these strengths provide...