InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
Crossref DOI link: https://doi.org/10.1007/978-3-031-94969-2_2
Published Online: 2025-08-31
Published Print: 2026
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Wang, Wenhai
Chen, Zhe
Liu, Yangzhou
Cao, Yue
Wang, Weiyun
Zhu, Xizhou
Lu, Lewei
Lu, Tong
Qiao, Yu
Dai, Jifeng
Text and Data Mining valid from 2025-08-31
Version of Record valid from 2025-08-31
Chapter History
First Online: 31 August 2025