Crossmark

Enhancing semantic audio-visual representation learning with supervised multi-scale attention

Published Online: 2025-02-11

Published Print: 2025-06

Authors

Zhang, Jiwei

Yu, Yi

Tang, Suhua

Qi, GuoJun

Wu, Haiyuan

Hachiya, Hirotaka

License Information

Text and Data Mining valid from 2025-02-11

Version of Record valid from 2025-02-11

More Information

Article History

Received: 11 September 2024

Accepted: 9 January 2025

First Online: 11 February 2025

Declarations

All authors declare that they have no conflict of interest.

Document is current