Hierarchical multi-modal fusion with vision transformers for robust action recognition in infrared-visible videos
Crossref DOI link: https://doi.org/10.1007/s13735-025-00386-4
Published Online: 2025-10-19
Published Print: 2025-12
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Imran, Javed
Wasid, Mohammed
Text and Data Mining valid from 2025-10-19
Version of Record valid from 2025-10-19
Article History
Received: 17 May 2025
Revised: 5 September 2025
Accepted: 6 October 2025
First Online: 19 October 2025
Declarations
The authors have no competing interests to declare that are relevant to the content of this work.