PGPL: enhancing spatial awareness abilities of multimodal large language models based on precise geometric position learning
Crossref DOI link: https://doi.org/10.1007/s11432-024-4416-8
Published Online: 2025-10-09
Published Print: 2026-02
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Authors
Zhao, Yongqiang
Li, Zhenyu
Jin, Zhi
Zhang, Feng
Wang, Ziliang
Wu, Lianwei
Dou, Chengfeng
Zhao, Haiyan
Xu, Xinhai
Text and Data Mining valid from 2025-10-09
Version of Record valid from 2025-10-09
Article History
Received: 30 July 2024
Revised: 13 February 2025
Accepted: 24 April 2025
First Online: 9 October 2025