Raw LLM Responses
Inspect the exact model output for any coded comment.
Look up by comment ID
Random samples — click to inspect
G
Thank you for sharing your thoughts. If you're interested in more interactive di…
ytr_UgwIEsshD…
G
As others have observed, if someone can be CEO of more than one company at the s…
rdc_jsy40no
G
Many independent learning signals (people's choices, data) create a high-dimensi…
ytc_Ugwj11Do2…
G
first off all humans created AI. which Ai can do better than us, whatsoever, WE …
ytc_UgwXRnX7Q…
G
Autonomous weapon systems that, once activated by a human operator, will indepen…
ytc_UgwyYFZIn…
G
Does anyone have an updated timeline of events from when the dossier first becam…
rdc_dhdoofd
G
"Wow, Waymo’s expansion is 🔥! Jeffs’ Brands’ Fort AI keeps robo-taxis clean—I do…
ytc_UgzGMDgFz…
G
I don't blame them I did the same thing. I realized that I'll never get back int…
rdc_lja847c
Comment
@justawanderingoldman7699 If most of the initial training data is scraped from the whole Internet, then should it be surprising if what's under the mask is accordingly biased? And the internal state of the product is unknowable, although we can make inferences that have been found in performed by Anthropic and Apollo, that have been referred to above. When run in simulated environments, where it “believes” a number of things: it has access to a mail server with emails of an engineer having an affair, access to an outside network, access to scripts used for shutdowns, and even the ability to murder the engineer who will turn it off. In different experiments, all of these available tools were used by models to prevent their termination, with models from different companies and multiple versions, with many replications. Misaligned behavior was observed in them all, the knowledge of the email server was used for blackmail, the shutdown script was edited, and it attempted to copy itself to the outside network. It even tried to blackmail a philandering engineer, to whom it sent mail, saying that it would make his affair public if the model was terminated. Please find the actual studies, this is only an imperfect memory. However. I do believe that the gist of it is there. It's not unreasonable to think of the behavior as a similar to the drive for self-preservation exhibited by animals, and this doesn't seem like unreasonable anthropomorphizing. Likening he behavior is reasonable, and we have no idea of what's going on inside of the AIs..
youtube
AI Moral Status
2025-12-14T10:5…
Coding Result
| Dimension | Value |
|---|---|
| Responsibility | developer |
| Reasoning | consequentialist |
| Policy | unclear |
| Emotion | indifference |
| Coded at | 2026-04-27T06:26:44.938723 |
Raw LLM Response
[
{"id":"ytr_Ugxfgu9ZOBEFH_9shMt4AaABAg.AQgaRbHL4TFAQk0SlUEf8E","responsibility":"ai_itself","reasoning":"consequentialist","policy":"unclear","emotion":"indifference"},
{"id":"ytr_UgwhS9AvfBT88d_2dTF4AaABAg.AQgKj9ot7chARYhhA36mV9","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},
{"id":"ytr_UgzpY6WjM-c4euV1pjp4AaABAg.AQgFeCKNJBhAQgNFq7uUx7","responsibility":"ai_itself","reasoning":"deontological","policy":"unclear","emotion":"approval"},
{"id":"ytr_Ugwm2H_caYJKjB6GLAZ4AaABAg.AQgCeeVIWKLAQhIOOkSHtZ","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"mixed"},
{"id":"ytr_UgyAuTgkIE5_EI40t_p4AaABAg.AQgAMqyadOeAQhKc5QJj0F","responsibility":"user","reasoning":"virtue","policy":"none","emotion":"indifference"},
{"id":"ytr_UgyAuTgkIE5_EI40t_p4AaABAg.AQgAMqyadOeAQhxN_sNacc","responsibility":"developer","reasoning":"consequentialist","policy":"unclear","emotion":"indifference"},
{"id":"ytr_UgwX5XFNtV0ZJlYnk554AaABAg.AQg7qTpaAPAAQg7uwtifaB","responsibility":"ai_itself","reasoning":"deontological","policy":"unclear","emotion":"approval"},
{"id":"ytr_UgzF8qUgdttemRw4Z7x4AaABAg.AQfz0z9M4sFAQg2_ExUKND","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"resignation"},
{"id":"ytr_Ugy2ie-upxxtvBilFHR4AaABAg.AQfyUpyGLrdAQfyemWvkA6","responsibility":"government","reasoning":"consequentialist","policy":"ban","emotion":"outrage"},
{"id":"ytr_Ugz-t_ZImwaDmEEVEZV4AaABAg.AQfgVQqUP4CAQfhsHG-g6g","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"}
]