From Pixels to Predicates: Learning Symbolic World Models via Pretrained Vision-Language Models
arXiv:2501.00296v4 Announce Type: replace Abstract: Our aim is to learn to solve long-horizon decision-making problems in complex robotics domains given low-level skills and a handful...