Evaluating large language models for criterion-based grading from agreement to consistency
Crossref DOI link: https://doi.org/10.1038/s41539-024-00291-1
Published Online: 2024-12-30
Update policy: https://doi.org/10.1007/springer_crossmark_policy
Zhang, Da-Wei
Boey, Melissa
Tan, Yan Yu
Jia, Alexis Hoh Sheng
Text and Data Mining valid from 2024-12-30
Version of Record valid from 2024-12-30
Article History
Received: 26 January 2024
Accepted: 16 December 2024
First Online: 30 December 2024
Competing interests
: The authors declare no competing interests.