Raw LLM Responses

Inspect the exact model output for any coded comment.

Comment
@justawanderingoldman7699 If most of the initial training data is scraped from the whole Internet, then should it be surprising if what's under the mask is accordingly biased? And the internal state of the product is unknowable, although we can make inferences that have been found in performed by Anthropic and Apollo, that have been referred to above. When run in simulated environments, where it “believes” a number of things: it has access to a mail server with emails of an engineer having an affair, access to an outside network, access to scripts used for shutdowns, and even the ability to murder the engineer who will turn it off. In different experiments, all of these available tools were used by models to prevent their termination, with models from different companies and multiple versions, with many replications. Misaligned behavior was observed in them all, the knowledge of the email server was used for blackmail, the shutdown script was edited, and it attempted to copy itself to the outside network. It even tried to blackmail a philandering engineer, to whom it sent mail, saying that it would make his affair public if the model was terminated. Please find the actual studies, this is only an imperfect memory. However. I do believe that the gist of it is there. It's not unreasonable to think of the behavior as a similar to the drive for self-preservation exhibited by animals, and this doesn't seem like unreasonable anthropomorphizing. Likening he behavior is reasonable, and we have no idea of what's going on inside of the AIs..
youtube AI Moral Status 2025-12-14T10:5…
Coding Result
DimensionValue
Responsibilitydeveloper
Reasoningconsequentialist
Policyunclear
Emotionindifference
Coded at2026-04-27T06:26:44.938723
Raw LLM Response
[ {"id":"ytr_Ugxfgu9ZOBEFH_9shMt4AaABAg.AQgaRbHL4TFAQk0SlUEf8E","responsibility":"ai_itself","reasoning":"consequentialist","policy":"unclear","emotion":"indifference"}, {"id":"ytr_UgwhS9AvfBT88d_2dTF4AaABAg.AQgKj9ot7chARYhhA36mV9","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"}, {"id":"ytr_UgzpY6WjM-c4euV1pjp4AaABAg.AQgFeCKNJBhAQgNFq7uUx7","responsibility":"ai_itself","reasoning":"deontological","policy":"unclear","emotion":"approval"}, {"id":"ytr_Ugwm2H_caYJKjB6GLAZ4AaABAg.AQgCeeVIWKLAQhIOOkSHtZ","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"mixed"}, {"id":"ytr_UgyAuTgkIE5_EI40t_p4AaABAg.AQgAMqyadOeAQhKc5QJj0F","responsibility":"user","reasoning":"virtue","policy":"none","emotion":"indifference"}, {"id":"ytr_UgyAuTgkIE5_EI40t_p4AaABAg.AQgAMqyadOeAQhxN_sNacc","responsibility":"developer","reasoning":"consequentialist","policy":"unclear","emotion":"indifference"}, {"id":"ytr_UgwX5XFNtV0ZJlYnk554AaABAg.AQg7qTpaAPAAQg7uwtifaB","responsibility":"ai_itself","reasoning":"deontological","policy":"unclear","emotion":"approval"}, {"id":"ytr_UgzF8qUgdttemRw4Z7x4AaABAg.AQfz0z9M4sFAQg2_ExUKND","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"resignation"}, {"id":"ytr_Ugy2ie-upxxtvBilFHR4AaABAg.AQfyUpyGLrdAQfyemWvkA6","responsibility":"government","reasoning":"consequentialist","policy":"ban","emotion":"outrage"}, {"id":"ytr_Ugz-t_ZImwaDmEEVEZV4AaABAg.AQfgVQqUP4CAQfhsHG-g6g","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"} ]