Video action recognition meets vision-language models exploring human factors in scene interaction: a review
Crossref DOI link: https://doi.org/10.1007/s11801-025-5058-9
Published Online: 2025-09-25
Published Print: 2025-10
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Guo, Yuping
Gao, Hongwei
Yu, Jiahui
Ge, Jinchao
Han, Meng
Ju, Zhaojie
Text and Data Mining valid from 2025-09-25
Version of Record valid from 2025-09-25
Article History
Received: 16 March 2025
Revised: 26 May 2025
First Online: 25 September 2025
Ethics declarations
:
: The authors declare no conflict of interest.