VTFCGNet: a novel cross-modal reasoning network integrating Fourier self-attention and graph attention for visual text question answering
Crossref DOI link: https://doi.org/10.1007/s00521-025-11721-5
Published Online: 2026-02-03
Published Print: 2026-02
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Huo, Yujie https://orcid.org/0009-0001-3123-2803
Chan, Weng Howe
Yu, Song
Gao, Hongyu
Funding for this research was provided by:
Universiti Teknologi Malaysia
Text and Data Mining valid from 2026-02-01
Version of Record valid from 2026-02-03
Article History
Received: 25 May 2025
Accepted: 1 December 2025
First Online: 3 February 2026
Declarations
The authors declare no conflict of interest.