Trustworthy evaluation of large language models
Crossref DOI link: https://doi.org/10.1007/s11704-025-50442-9
Published Online: 2025-10-17
Published Print: 2026-02
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Zhang, Xin-Yi
Ye, Han-Jia
Zhan, De-Chuan
Text and Data Mining valid from 2025-10-17
Version of Record valid from 2025-10-17
Article History
Received: 13 April 2025
Accepted: 16 June 2025
First Online: 17 October 2025
Competing interests
: The authors declare that they have no competing interests or financial conflicts to disclose.