MutualFormer: Multi-modal Representation Learning via Cross-Diffusion Attention
Crossref DOI link: https://doi.org/10.1007/s11263-024-02067-x
Published Online: 2024-04-24
Published Print: 2024-09
Wang, Xixi
Wang, Xiao
Jiang, Bo
Tang, Jin
Luo, Bin
Article History
Received: 3 March 2023
Accepted: 23 March 2024
First Online: 24 April 2024