Accelerating BERT inference with GPU-efficient exit prediction
Crossref DOI link: https://doi.org/10.1007/s11704-022-2341-9
Published Online: 2024-01-22
Published Print: 2024-06
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Li, Lei
Wang, Chengyu
Qiu, Minghui
Chen, Cen
Gao, Ming
Zhou, Aoying
Text and Data Mining valid from 2024-01-22
Version of Record valid from 2024-01-22
Article History
Received: 7 June 2022
Accepted: 30 November 2022
First Online: 22 January 2024