Anthropic AI finds massive security flaws worldwide

5,893 Views | 84 Replies | Last: 6 hrs ago by Mr.Milkshake
Logos Stick
KingofHazor said:

Logos Stick said:

For the others on this board....

that image about integer arithmetic comes from a 2023 research paper titled "GPT Can Solve Mathematical Problems Without a Calculator".

Might as well publish an example from 1990.

Grok 4 - released in July of last year - scored 100% on the AIME 2025. AIME is a notoriously difficult early-college level math competition exam used to qualify students for the US Math Olympiad team. Grok 4 aced it!

Your claim is just cherry picking an old, narrow benchmark to make a broad negative point about AI capabilities. It was legit criticism in 2023-2024, but it doesn't hold water with frontier 2025-2026 models.

You guys can claim AI accuracy all you want, but my frequent use of Gemini, Claude, Grok, ChatGPT, and Elicit shows that they remain replete with all kinds of errors and are absolutely untrustworthy.

The anecdotal stories of AIs failing some test and then passing it a year later with flying colors sound like the AIs are being revised specifically to pass those tests they failed, without fixing the underlying problems that cause them to fail multiple different types of tests. It's reminiscent of stock traders tweaking their models to perform 100% on historical data, but then the models fail 100% of the time on real-time trades.



Unlike your argument - "my frequent use shows AI just doesn't work" - my post is not anecdotal. It's a real world math test that Grok 4 aced. Of course the models are being improved over time. Why do you consider that a negative? LOL. Makes no sense.
KingofHazor
Logos Stick said:

KingofHazor said:

Logos Stick said:

For the others on this board....

that image about integer arithmetic comes from a 2023 research paper titled "GPT Can Solve Mathematical Problems Without a Calculator".

Might as well publish an example from 1990.

Grok 4 - released in July of last year - scored 100% on the AIME 2025. AIME is a notoriously difficult early-college level math competition exam used to qualify students for the US Math Olympiad team. Grok 4 aced it!

Your claim is just cherry picking an old, narrow benchmark to make a broad negative point about AI capabilities. It was legit criticism in 2023-2024, but it doesn't hold water with frontier 2025-2026 models.

You guys can claim AI accuracy all you want, but my frequent use of Gemini, Claude, Grok, ChatGPT, and Elicit shows that they remain replete with all kinds of errors and are absolutely untrustworthy.

The anecdotal stories of AIs failing some test and then passing it a year later with flying colors sound like the AIs are being revised specifically to pass those tests they failed, without fixing the underlying problems that cause them to fail multiple different types of tests. It's reminiscent of stock traders tweaking their models to perform 100% on historical data, but then the models fail 100% of the time on real-time trades.



Unlike your argument - "my frequent use shows AI just doesn't work" - my post is not anecdotal. It's a real world math test that Grok 4 aced. Of course the models are being improved over time. Why do you consider that a negative? LOL. Makes no sense.

As I said in my post, but to repeat, because I suspect that the models are being tweaked to pass a particular test. The underlying problems that caused them to fail in the first place are not being addressed.

And passing one "real world" math test is anecdotal. It's a perfect example of anecdotal.
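That stock-trader analogy is the classic overfitting pattern, and it's easy to demonstrate. A minimal sketch (all numbers here are made up for illustration): a model tuned until it fits history perfectly can still miss badly on fresh data.

```python
# Sketch of overfitting: a model tuned to fit "historical" data perfectly
# can still fail on fresh data. All data here is hypothetical.
import numpy as np

rng = np.random.default_rng(0)
x_hist = np.linspace(0.0, 1.0, 10)
y_hist = x_hist + rng.normal(0.0, 0.2, 10)  # noisy history; true relation is y = x

# A degree-9 polynomial has enough knobs to pass through all 10 points...
coeffs = np.polyfit(x_hist, y_hist, 9)
hist_err = np.abs(np.polyval(coeffs, x_hist) - y_hist).max()

# ...but on fresh "real-time" points between the old ones, it misses.
x_new = np.linspace(0.05, 0.95, 10)
new_err = np.abs(np.polyval(coeffs, x_new) - x_new).max()

print(f"historical error: {hist_err:.4f}, out-of-sample error: {new_err:.4f}")
```

The historical error is essentially zero while the out-of-sample error is orders of magnitude larger, which is exactly the "100% on backtests, fails live" pattern.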
Logos Stick
KingofHazor said:

Logos Stick said:

KingofHazor said:

Logos Stick said:

For the others on this board....

that image about integer arithmetic comes from a 2023 research paper titled "GPT Can Solve Mathematical Problems Without a Calculator".

Might as well publish an example from 1990.

Grok 4 - released in July of last year - scored 100% on the AIME 2025. AIME is a notoriously difficult early-college level math competition exam used to qualify students for the US Math Olympiad team. Grok 4 aced it!

Your claim is just cherry picking an old, narrow benchmark to make a broad negative point about AI capabilities. It was legit criticism in 2023-2024, but it doesn't hold water with frontier 2025-2026 models.

You guys can claim AI accuracy all you want, but my frequent use of Gemini, Claude, Grok, ChatGPT, and Elicit shows that they remain replete with all kinds of errors and are absolutely untrustworthy.

The anecdotal stories of AIs failing some test and then passing it a year later with flying colors sound like the AIs are being revised specifically to pass those tests they failed, without fixing the underlying problems that cause them to fail multiple different types of tests. It's reminiscent of stock traders tweaking their models to perform 100% on historical data, but then the models fail 100% of the time on real-time trades.



Unlike your argument - "my frequent use shows AI just doesn't work" - my post is not anecdotal. It's a real world math test that Grok 4 aced. Of course the models are being improved over time. Why do you consider that a negative? LOL. Makes no sense.

As I said in my post, but to repeat, because I suspect that the models are being tweaked to pass a particular test. The underlying problems that caused them to fail in the first place are not being addressed.

And passing one "real world" math test is anecdotal. It's a perfect example of anecdotal.



With all due respect, you don't know what anecdotal means:

Anecdotal = based on personal observations, individual stories, or casual experiences, e.g., "I tried it and it did not work for me" or "So and so said it did not work for them...". It's subjective, unverified in any systematic way, and considered weak evidence.


Me saying "Grok 4 made 100% on the AIME test" is not anecdotal.

It's a reported benchmark result from formal evaluations that GROK 4 undertook. AIME (American Invitational Mathematics Exam) is a standardized math competition used as a public benchmark for AI models.



If you think "AI doesn't work", then don't use it. The rest of us who know better will continue on the journey.
KingofHazor
OK, besides quibbling about what anecdotal means, why do AIs fail so frequently and catastrophically at everyday tasks? If they can ace that math test, then surely everyday tasks requested by a layman should be a breeze for them, right?

When facing criticism or even questions about the reliability of AI, you AI bros just seem to stick your fingers in your ears and yell "La la la, I can't hear you", rather than addressing the specific issues raised. To be honest, you come across as true believers unwilling to even listen to perspectives that you consider heresy.
AuditAg
Windy City Ag said:

Anthropic marketing execs doing work!

They have built a product so devastating that they are only going to sell it to corporations for large annual license fees.

It found a 27-year-old bug in OpenBSD that would shut down the internet in a heartbeat.
Logos Stick
Honestly, man, it's in my best interest to discourage the use of AI - both from a personal gain perspective and because of the negative human impact I see it potentially having.
Deputy Travis Junior
My honest guess is the people who are getting catastrophic failures out of AI systems are giving them short, vague prompts that don't clearly spell out what the AI is supposed to do and what output the user wants.

That's a problem because they're brilliant if you use them right, but they are lacking tons of judgment that we take for granted. You can't just say "clean up my room, it's a mess" because the AI is as likely to donate all your belongings to Goodwill to get rid of the clutter as it is to clean up. You have to say put the toys in the closet, fold the laundry, organize the books alphabetically. (I know they don't have physical manifestations yet, but this is a clear example that shows how vague instructions lead to bad/unexpected results.)

If I want one to solve a really gnarly financial modeling problem, it usually takes me ~15 minutes to write the prompt that precisely describes the background context (business model), the hangup, and what I need it to solve. If I want it to do research or diligence, I tell it how I want to measure the various things that are important to me and how skeptical it should be of the business' claims.

Most people just drop in a couple sentences, hit go, and hope for the best. That's not adequate to assign something to a human employee and it's not enough for an AI either.
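For what it's worth, the structure described above can be templated. This is just a hypothetical sketch (the section names and function are my own invention, not any AI vendor's API), but it shows the difference between a one-line ask and a spelled-out assignment:

```python
# Hypothetical prompt template: spell out background, the specific problem,
# and the exact output wanted, rather than a one-line ask.
def build_prompt(context: str, problem: str, output_spec: str, constraints=()) -> str:
    """Assemble a structured prompt from named sections."""
    parts = [
        "## Background\n" + context,
        "## Problem\n" + problem,
        "## Required output\n" + output_spec,
    ]
    if constraints:
        parts.append("## Constraints\n" + "\n".join(f"- {c}" for c in constraints))
    return "\n\n".join(parts)

print(build_prompt(
    context="SaaS business, usage-based pricing, 3-year model in a spreadsheet.",
    problem="Revenue projection diverges when monthly churn exceeds 5%.",
    output_spec="A corrected formula plus a 3-line explanation of the fix.",
    constraints=["Be skeptical of the company's own growth claims."],
))
```

The point isn't this particular template; it's that the prompt carries the same context, scope, and acceptance criteria you'd give a human employee.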
SMM48
Claude
SMM48
Give an example of failure. That's such a broad stroke. What do you mean?
bmks270
AuditAg said:

Windy City Ag said:

Anthropic marketing execs doing work!

They have built a product so devastating that they are only going to sell it to corporations for large annual license fees.

It found a 27-year-old bug in OpenBSD that would shut down the internet in a heartbeat.


Who says?
KingofHazor
SMM48 said:

Give an example of failure. That's such a broad stroke. What do you mean?

Right now I'm doing scholarly research on topic X. I started by running some ideas by AI and, quite frankly, one AI did a phenomenal job of discussing the topic, even coming up with its own ideas that as far as I can tell, are unique and novel. (The other AIs fought me tooth and nail, insisting that the plain vanilla answer to the issue I was researching was the only correct answer.) I'll always ask the AI for support for its ideas in the form of links or citations, which it then provides. When I go to check those links and citations, they either don't stand for the AI's stated proposition at all or simply don't exist.

I asked Claude once about that and it freely volunteered that it was hallucinating. I asked if I could modify my prompts and questions to get more accurate responses and its response was "Frankly, no."

On a much more mundane topic I've tried various AIs to research something I am thinking of purchasing. To me, it seemed an ideal task for AI. Rather than me spending hours scouring the internet finding possible items, researching the reviews, and then finding the best prices, AI should be able to readily do that quickly.

All AIs say sure, they can do that, but then their output is garbage. The data they provide is years old, contains links that are invalid, and quotes prices that haven't been valid for years. Asking for only current info does not produce better results.

Finally, I've tried to use AI to help out with the constant issue of apps on my PC and my phone not doing what they're supposed to do. Pre-AI, that was a task I hated. Finding the cause of the misbehavior was frequently like trying to find a needle in a haystack. Again, I thought that would be a job that would be ideal for AI. Occasionally, it does do a good job. But more frequently, it provides wrong or nonsensical answers. When queried why the mistakes, it frequently says that it was opining on previous versions of the apps that were current during its initial training. When I then ask it to make sure that its answers are all based on the latest version, each AI promises to do so, but then does not, instead again providing obsolete or simply wrong answers.

These may not seem catastrophic at first glance. But if one were to rely on AI's answers without doublechecking, it would be catastrophic. I am increasingly concluding that it takes far more time to work with AI and doublecheck its answers than it does to simply do the work myself the old-fashioned way.
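One way to take some of the doublechecking pain out of this: screen the AI's citations mechanically before reading any of them. A rough sketch using only the Python standard library (the function names are my own; note this only confirms a link is well-formed and the server answers, not that the page actually supports the AI's claim - that part is still on you):

```python
# Screen AI-provided citation links before trusting them.
# Only checks well-formedness and reachability, not the content's relevance.
from urllib.parse import urlparse
from urllib.request import Request, urlopen

def looks_like_url(link: str) -> bool:
    """Cheap syntactic screen: http(s) scheme and a host must be present."""
    parts = urlparse(link)
    return parts.scheme in ("http", "https") and bool(parts.netloc)

def check_link(link: str, timeout: float = 5.0) -> bool:
    """Return True if the URL is well-formed and the server responds."""
    if not looks_like_url(link):
        return False
    try:
        req = Request(link, method="HEAD")
        with urlopen(req, timeout=timeout) as resp:
            return resp.status < 400
    except OSError:
        return False  # dead link, DNS failure, timeout, ...
```

Hallucinated citations tend to fail one of these two gates immediately, which at least tells you which references are worth reading.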
SMM48
Got it. Thanks. Hallucinating. That's funny. Gig 'em.

There is your problem. AI doesn't browse the web in real time; it was trained on data up to a certain cutoff date. Some models will search the net in real time. It would be helpful if they told the user that info may be dated.

The AI knows a source exists but will fabricate a plausible-looking link. Or the page existed during training but has since been moved or become outdated. Data could be behind a paywall. Etc.
Deputy Travis Junior
Good models browse the web now
SMM48
Yes aware thanks
Mr.Milkshake
90% marketing, and well done. But Dario is FOS
 