Raw LLM Responses

Inspect the exact model output for any coded comment.

Comment
also you were wrong about the fact that these algorithms are “too complicated to look inside.” you can actually train neural networks to accurately classify whether a language model thinks it’s lying already, regardless of its scale, from tiny ones up to the largest available. and during in-context learning, that’s actually not a mutable property of the model; it can’t fool the lie detector without changing its output (which is sort of the point, yeah)? of course if “trained,” it can jointly maximize the objective and minimize the lie detector given a gradient of it, but why would a fully trained model be doing gradient updates, and better yet, why would the lie detector be available to it as a white box? it just makes no sense.
youtube AI Moral Status 2023-08-21T08:1…
Coding Result
Dimension       Value
Responsibility  developer
Reasoning       mixed
Policy          none
Emotion         mixed
Coded at        2026-04-26T23:09:12.988011
Raw LLM Response
[{"id":"ytc_Ugx2K5GUzFiqIHv_dtR4AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"approval"}, {"id":"ytc_UgxhpmSHhmcI6gBCuet4AaABAg","responsibility":"none","reasoning":"mixed","policy":"none","emotion":"mixed"}, {"id":"ytc_Ugydqdk6mprk_7R8gKJ4AaABAg","responsibility":"ai_itself","reasoning":"consequentialist","policy":"none","emotion":"fear"}, {"id":"ytc_Ugy81UBhAJPyqWW3jWt4AaABAg","responsibility":"none","reasoning":"mixed","policy":"none","emotion":"mixed"}, {"id":"ytc_Ugww_dwgn8ChrL3JR854AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"mixed"}, {"id":"ytc_Ugwg-Kra3DkgQhMIOBd4AaABAg","responsibility":"none","reasoning":"mixed","policy":"none","emotion":"mixed"}, {"id":"ytc_Ugzw25el0Mf2QLx4Tm94AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"outrage"}, {"id":"ytc_Ugz-e7YyVgm-HO4etDJ4AaABAg","responsibility":"none","reasoning":"mixed","policy":"none","emotion":"mixed"}, {"id":"ytc_UgzNlfJS7GMkEtTwrrJ4AaABAg","responsibility":"developer","reasoning":"deontological","policy":"none","emotion":"mixed"}, {"id":"ytc_UgzgtvNmbaz-4H738_h4AaABAg","responsibility":"developer","reasoning":"mixed","policy":"none","emotion":"mixed"}]
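One way to cross-check the Coding Result against the raw response is to parse the JSON array and look for records whose four dimensions equal the displayed values. The sketch below is illustrative only: the page does not state which `id` belongs to the comment shown, so matching on dimension values is an assumption, and the names `raw_response`, `displayed`, and `matching_ids` are hypothetical. Only three of the ten records are reproduced to keep the example short.

```python
import json

# Excerpt of the raw response shown above (three of the ten records, copied verbatim).
raw_response = """[
  {"id":"ytc_Ugx2K5GUzFiqIHv_dtR4AaABAg","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"approval"},
  {"id":"ytc_UgzNlfJS7GMkEtTwrrJ4AaABAg","responsibility":"developer","reasoning":"deontological","policy":"none","emotion":"mixed"},
  {"id":"ytc_UgzgtvNmbaz-4H738_h4AaABAg","responsibility":"developer","reasoning":"mixed","policy":"none","emotion":"mixed"}
]"""

# Values from the Coding Result table on this page.
displayed = {
    "responsibility": "developer",
    "reasoning": "mixed",
    "policy": "none",
    "emotion": "mixed",
}

records = json.loads(raw_response)


def matching_ids(records, coding):
    """Return the ids of records whose dimensions all equal the given coding."""
    return [
        r["id"]
        for r in records
        if all(r.get(key) == value for key, value in coding.items())
    ]


matches = matching_ids(records, displayed)
print(matches)
```

In this excerpt only one record carries the same four values as the Coding Result. Matching on values can be ambiguous when several comments in a batch receive identical codes, so a lookup by comment id is preferable whenever the id is known.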