Crossmark

Evaluating large language models for accuracy incentivizes hallucinations

Published Online: 2026-04-22

Published Print: 2026-05-28

Authors

Kalai, Adam Tauman https://orcid.org/0000-0002-4559-8574

Nachum, Ofir

Vempala, Santosh S.

Zhang, Edwin

License Information

Text and Data Mining valid from 2026-04-22

Version of Record valid from 2026-05-20

More Information

Article History

Received: 1 July 2025

Accepted: 15 April 2026

First Online: 22 April 2026

Competing interests: A.T.K., O.N. and E.Z. are (or were) employed by OpenAI. E.Z. is currently employed by Isara Laboratories. S.S.V. declares no competing interests.

Document is current