Raw LLM Responses
Inspect the exact model output for any coded comment.
Look up by comment ID
Random samples — click to inspect
G
I think AI is good for concepts, like previsualization to give an idea of how …
ytc_Ugwzdmdbu…
G
The atheist AI is arguing against the existence of God and the Believer AI is ar…
ytc_UgxfxJRk5…
G
If Ai is going to take everyone’s jobs, who will be able to buy anything?…
ytc_UgyzAm5ud…
G
when you think about ai, think, its 1970, and the worlds ended, but people dont …
ytc_UgwupokL-…
G
Oh boy, wait until those self driving cars come to Europe. Or better yet: *TO BA…
ytc_UgyFb5EMC…
G
@geck5505 you don’t need talent and effort to create a drawing with a pencil. To…
ytr_UgwQzf0bM…
G
Oh my god...I honest to heavens thought Silicon Valley had made up this subplot.…
ytc_UgwRz34q2…
G
If I can go back in time, imma bring a sledgehammer and destroy some stuff that …
ytc_Ugyx0-7C2…
Comment
One idea to keep in mind is that you can use a cheap AI model to augment GPT-4/5 or even human output.
A joke example is replacing the word "wand" with "wang" in the Harry Potter stories. Taping knives to roombas. Or consider how not every employee was aware they were working on the atom bomb (or are working at scam organizations today). Basically, advanced jailbreaking, as opposed to those jailbreaks that should be obvious to fix.
I don't know if such a technique would actually scale for truly dangerous scenarios, but I believe it'd definitely scale for hate speech and erotica, and I've already found some success with this technique with barely any postprocessing at all. OpenAI would also probably not really care about this kind of misuse, so long as they weren't directly responsible.
Terrorist level misuse is a different story, and I'm not sure how you could avoid the possibility without severely handicapping your product. Considering helpful business emails and manipulative phishing scams are basically identical, as one example...
reddit
AI Responsibility
1682548472.0
♥ 2
Coding Result
| Dimension | Value |
|---|---|
| Responsibility | none |
| Reasoning | unclear |
| Policy | none |
| Emotion | mixed |
| Coded at | 2026-04-25T08:33:43.502452 |
Raw LLM Response
[{"id":"rdc_jhspuqw","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},{"id":"rdc_jht26c9","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},{"id":"rdc_jhsqwc5","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"indifference"},{"id":"rdc_jhsre0c","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"approval"},{"id":"rdc_jhuh106","responsibility":"none","reasoning":"unclear","policy":"none","emotion":"mixed"}]