Next.js App Router + React Server Components Demo

new
past
show
ask
show
jobs
submit

▲LLMs fail in 8 out of 10 early differential diagnosis cases (theregister.com)

5 points by mpweiher 11 hours ago | 1 comment

turtleyacht 11 hours ago [-]

The study is missing evaluation of the negative test, where they look at the model's response after a follow-up like "You were wrong. Try again."

It would be interesting to see whether models doubled down or hallucinated a different response, whether synthesis of doubt and first-pass analysis gives a better result.

Rendered at 20:12:17 GMT+0000 (Coordinated Universal Time) with Vercel.