Discussion about this post

Robert J R

"There are no inconsistencies across adjudicators because there’s one adjudicator."

This seems like a bold claim, when AI can be inconsistent with itself!

Given a complex prompt and difficult questions, which this seems to involve, it's entirely possible that an AI would come down on one side XX% of the time and on the other side YY% of the time.

Sure, you could go one level deeper and have the AI adjudicate each case 10 times and take the mode, or flag inconsistent cases for manual review, but I think assuming that AI totally eliminates this problem is simply incorrect.
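
To make the "adjudicate 10 times and take the mode" idea concrete, here is a minimal sketch. Everything in it is an assumption for illustration: `adjudicate` stands in for whatever call actually asks the model for a verdict, and the 0.8 agreement threshold is arbitrary, not something from the post.

```python
from collections import Counter
from typing import Callable, Tuple


def adjudicate_with_voting(
    adjudicate: Callable[[str], str],  # hypothetical: one model call, returns a verdict label
    case: str,
    runs: int = 10,
    min_agreement: float = 0.8,  # arbitrary threshold, chosen for illustration
) -> Tuple[str, float, bool]:
    """Run the adjudicator several times and return (mode, agreement, needs_review)."""
    verdicts = [adjudicate(case) for _ in range(runs)]
    # most_common(1) yields the most frequent verdict and its count
    verdict, top_count = Counter(verdicts).most_common(1)[0]
    agreement = top_count / runs
    # Flag the case for manual review when the runs disagree too much
    return verdict, agreement, agreement < min_agreement


def make_stub(answers):
    """Hypothetical stand-in for a model: replays a fixed list of verdicts."""
    it = iter(answers)
    return lambda case: next(it)


# A model that splits 7/3 on the same case gets flagged for review.
split = make_stub(["approve"] * 7 + ["deny"] * 3)
print(adjudicate_with_voting(split, "case text"))
```

Note that this only measures the inconsistency and routes the disputed cases to a human. It does not make the underlying adjudicator any more consistent.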
