PGPL: enhancing spatial awareness abilities of multimodal large language models based on precise geometric position learning

Published Online: 2025-10-09

Published Print: 2026-02

Authors

Zhao, Yongqiang

Li, Zhenyu

Jin, Zhi

Zhang, Feng

Wang, Ziliang

Wu, Lianwei

Dou, Chengfeng

Zhao, Haiyan

Xu, Xinhai

License Information

Text and Data Mining valid from 2025-10-09

Version of Record valid from 2025-10-09

More Information

Article History

Received: 30 July 2024

Revised: 13 February 2025

Accepted: 24 April 2025

First Online: 9 October 2025

Document is current