Multimodal diffusion framework for collaborative text image audio generation and applications
Crossref DOI link: https://doi.org/10.1038/s41598-025-05794-4
Published Online: 2025-07-01
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Wang, Junhua
Zhang, Ouya
Jiang, Yuan
Text and Data Mining valid from 2025-07-01
Version of Record valid from 2025-07-01
Article History
Received: 7 March 2025
Accepted: 4 June 2025
First Online: 1 July 2025
Declarations
:
: The authors declare that they have no conflict of interest.