- Sep 4, 2005
- 30,140
- 17,591
- Country
- United States
- Gender
- Male
- Faith
- Atheist
- Marital Status
- Single
- Politics
- US-Others
Anthropic is out from what it seems. They refuse to remove guardrails from their model that Hegseth is insisting on.
And no matter what version (free/professional) you are using, if it's an LLM, you will never be able to get rid of hallucinations. It's unfortunately baked into how the models function and you can only try to mitigate it, but it will always be in here in LLMs.
Seems like Anthropic is changing their tune pretty quickly
“Rather than being hard commitments, these are public goals that we will openly grade our progress towards,” the company said in its blog post.
The change comes a day after Defense Secretary Pete Hegseth gave Anthropic CEO Dario Amodei a Friday deadline to roll back the company’s AI safeguards, or risk losing a $200 million Pentagon contract and being put on what is effectively a government blacklist.
With regards to the other aspect you mentioned, while the hallucinations you speak of are certainly pronounced in the free tiered versions, the higher tier versions and enterprise solutions are a different user experience. Where the free tiered ones on the web are basically just a glorified LLM/google search hybrid made for speed, the more robust offerings are more tailored to accuracy and depth.
So while one may be used to hopping on ChatGPT, Perplexity, or Claude and typing a question, and getting an answer back in 5 seconds. The more robust solutions are asking follow-up questions, and giving users the option to prioritize different aspects of what they're trying to do. For example, when utilizing the top tier version of Claude Code, it's not unusual to basically have an 1 hour+ discussion with it, and have it "think" for 5-10 minutes between answers and build a complex solution that can pass SOC2, HIPAA, and PCI audits, where as if you hop on the free version of Claude that's online, it'll spit out a quick and dirty html page with some vanilla JavaScript.
To describe the difference in the past, I've used the steak dinner analogy.
You can order a steak dinner at Cracker Barrel or Applebee's.
vs
You can order a steak dinner from Ruth's Chris or Fleming's
Both will involve sitting a table and having someone bringing you a steak to eat that's edible, but you'll be getting a very different product offering and experience at the latter.
That's not to say that you can't end up with a poorly cooked steak at Ruth's Chris, it obviously happens once in a while. But your chances of getting an overcooked piece of shoe leather, or one that's still cold in the middle (because they didn't thaw properly) when you ordered medium-rare is much less likely at the latter.
Upvote
0