Raw LLM — Corpus Dashboard

Look up by comment ID

Random samples — click to inspect

G though the biased artist is right that it has no soul...it is incorrect in the a… ytc_UgyipqYSV… G Being able to poison AI databases with one simple tool is insane Hopefully we c… ytc_Ugx0d4MTD… G I make some of the strangest and bizarre art, like for example: replacing people… ytc_UgzxP44kl… G You can't teach A I to have a concept of empathy if the use of Pavlovian style o… ytc_UgxWn8b4A… G These issues are not really an issue. Kind of like how you know facebook and goo… ytc_UgyBZjgr0… G What about AI slop? Code lines are being created, it works then crashes. Now you… ytc_Ugw6HSSJY… G And yet these (possibly) abled ai artists feel like they’re inferior because the… ytr_UgyTmBTVo… G You can’t do art without a consciousness. The concept of AI art doesn’t make any… ytc_UgwNWVik6…

Comment

My impression has been that Opus is actually much worse at coding than Sonnet. It overcomplicates everything, overgeneralizes simple discrete tasks, creates more bugs, and really does nothing better except cost more money, so idiots and Anthropic bots promote it the hardest. Like bear boxes in national parks, any heuristic you develop to defeat bots will also defeat the dumbest quintile of humanity… My impression was also that 4.5 was moderately better than 3.7 at remaining “on track” with more complex tasks and managing context rot. Similar to the incremental change from 3.5 to 3.7. I do think Claude 3.5 was a real step change forward. LLMs did not impress me much before that. Perhaps due to my own ignorance of NLP, I didn’t foresee the evolution from the initial release of ChatGPT to the near-future applications of semantic search and RAG. Like you, I am skeptical of the integrity of anyone who acts as if we’re not *deep* into the curve of diminishing returns with blindly scaling the current system architecture of LLMs. I have not met anyone IRL who thinks 4.5 was anything more than incremental progress, or that 4.6 was noticeably different in any way. Perhaps now that Claude Code’s embedded system prompts have been optimized for Claude 4.6, then 4.5 will seem worse if you downgraded, but judging model performance by the quality of prompt engineering is a category error in my book. That being said: to play devil’s advocate, perhaps there are people who know less than I did, who are genuinely impressed by the latest models, that they are now able to use to do new things. Perhaps it’s not the model capabilities that upgraded, but the user’s capabilities. This is all moving very fast. It’s genuinely exciting. Anyone who starts building applications with AI today is still an early adopter. Skepticism is warranted IMO. Crypto burned a lot of people, the neutered chat interfaces are utter garbage, and OpenAI and Anthropic are SoftBank-level financial dumpster fire

reddit AI Jobs 1774268079.0 ♥ 5

Coding Result

Dimension	Value
Responsibility	company
Reasoning	deontological
Policy	none
Emotion	outrage
Coded at	2026-04-25T08:33:43.502452

Raw LLM Response

[
{"id":"rdc_obw7q2f","responsibility":"company","reasoning":"consequentialist","policy":"none","emotion":"indifference"},
{"id":"rdc_obvmgt7","responsibility":"ai_itself","reasoning":"mixed","policy":"none","emotion":"mixed"},
{"id":"rdc_oc0674s","responsibility":"company","reasoning":"deontological","policy":"none","emotion":"outrage"},
{"id":"rdc_obv69wi","responsibility":"company","reasoning":"consequentialist","policy":"none","emotion":"outrage"},
{"id":"rdc_obv8w5z","responsibility":"company","reasoning":"consequentialist","policy":"none","emotion":"outrage"}
]

Raw LLM Responses