Raw LLM Responses

Inspect the exact model output for any coded comment.

Comment
The question is: did it spat out candidates who were unqualified at a lower rate than humans? It's likely the F1 score and logloss was better than human performance.
reddit Cross-Cultural 1539223174.0 ♥ 8
Coding Result
Dimension       Value
Responsibility  none
Reasoning       consequentialist
Policy          none
Emotion         approval
Coded at        2026-04-25T08:33:43.502452
Raw LLM Response
[ {"id":"rdc_e7il5t7","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"indifference"}, {"id":"rdc_e7inq3u","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"indifference"}, {"id":"rdc_e7j81fp","responsibility":"developer","reasoning":"mixed","policy":"none","emotion":"outrage"}, {"id":"rdc_e7jals0","responsibility":"developer","reasoning":"consequentialist","policy":"none","emotion":"mixed"}, {"id":"rdc_e7juc4u","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"approval"} ]
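The raw response is a batch covering five comments, and the page does not say which `id` belongs to the comment shown above. As a minimal sketch, the batch can be parsed and filtered for the record whose four fields equal the values in the Coding Result table. The names `displayed` and `matching_ids` are hypothetical helpers introduced for illustration, and matching on field values (rather than on a known comment id) is an assumption, not something the tool is known to do.

```python
import json

# Raw batch response copied from the page (whitespace reflowed only).
raw_response = """
[ {"id":"rdc_e7il5t7","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"indifference"},
  {"id":"rdc_e7inq3u","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"indifference"},
  {"id":"rdc_e7j81fp","responsibility":"developer","reasoning":"mixed","policy":"none","emotion":"outrage"},
  {"id":"rdc_e7jals0","responsibility":"developer","reasoning":"consequentialist","policy":"none","emotion":"mixed"},
  {"id":"rdc_e7juc4u","responsibility":"none","reasoning":"consequentialist","policy":"none","emotion":"approval"} ]
"""

# Values shown in the Coding Result table for the displayed comment.
displayed = {
    "responsibility": "none",
    "reasoning": "consequentialist",
    "policy": "none",
    "emotion": "approval",
}


def matching_ids(records, expected):
    """Return the ids of records whose fields all equal the expected values."""
    return [
        record["id"]
        for record in records
        if all(record.get(key) == value for key, value in expected.items())
    ]


records = json.loads(raw_response)
matches = matching_ids(records, displayed)
print(matches)  # ['rdc_e7juc4u']
```

Only one record in this batch agrees with the table on all four dimensions, so it is the likely source of the displayed coding. If several records matched, field values alone would not be enough to identify the comment.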