NThe Prayer Network
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
LLMs fail in 8 out of 10 early differential diagnosis cases (theregister.com)
turtleyacht 11 hours ago [-]
The study is missing evaluation of the negative test, where they look at the model's response after a follow-up like "You were wrong. Try again."

It would be interesting to see whether models doubled down or hallucinated a different response, whether synthesis of doubt and first-pass analysis gives a better result.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 20:12:17 GMT+0000 (Coordinated Universal Time) with Vercel.