A high-performance batched matrix multiplication framework for GPUs under unbalanced input distribution
Crossref DOI link: https://doi.org/10.1007/s11227-021-03936-9
Published Online: 2021-06-21
Published Print: 2022-02
Authors:
Wang, Ruimin
Yang, Zhiwei
Xu, Hao
Lu, Lu
Funding for this research was provided by:
Guangzhou Produce & Research Fund (201902020004)
Article History
Accepted: 7 June 2021
First Online: 21 June 2021