Visual spatial relationship sensitive transformer for image captioning
Crossref DOI link: https://doi.org/10.1038/s41598-025-28290-1
Published Online: 2025-12-24
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Piao, Xianghua
Jin, Dong
Kwon, Min Jung
Gu, Yeong Hyeon
Text and Data Mining valid from 2025-12-24
Version of Record valid from 2025-12-24
Article History
Received: 10 June 2025
Accepted: 10 November 2025
First Online: 24 December 2025
Declarations
The authors declare no competing interests.