Enhancing semantic audio-visual representation learning with supervised multi-scale attention
Crossref DOI link: https://doi.org/10.1007/s10044-025-01414-z
Published Online: 2025-02-11
Published Print: 2025-06
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Zhang, Jiwei
Yu, Yi
Tang, Suhua
Qi, GuoJun
Wu, Haiyuan
Hachiya, Hirotaka
Text and Data Mining valid from 2025-02-11
Version of Record valid from 2025-02-11
Article History
Received: 11 September 2024
Accepted: 9 January 2025
First Online: 11 February 2025
Declarations
All authors declare that they have no conflict of interest.