Anthropic AI finds massive security flaws worldwide - Page 4

Deputy Travis Junior

Joined: Jan 27, 2013

Posts: 17,887

User Profile Ignore User Stop Ignoring

How long do you want to ignore this user?

24 hours One week Permanently Cancel

In reply to KingofHazor • 2:06p, 4/12/26

Yes I read it. I quoted you saying it's not the user's fault their prompts stink and aren't getting good results. Meanwhile my prompts are specific and I'm getting great results.

Good luck, my friend. I hope you learn how to use these tools because they're a game changer if you get them right.

1

Stmichael

Joined: Sep 15, 2014

Posts: 1,339

User Profile Ignore User Stop Ignoring

How long do you want to ignore this user?

24 hours One week Permanently Cancel

In reply to Logos Stick • 2:20p, 4/12/26

AG

Logos Stick said:
KingofHazor said:
Quote:
Your dismissal of AIs ability to excel in a subject matter simply because the AI has been trained in that subject is so off the mark it's almost not worth addressing. If I teach a human to do algebra, then throw calculus problems in front of him, no one would criticize him for not being able to do calculus. And no one would dismiss his ability to then do calculus because he was subsequently trained in calculus. Yet that is what you are doing.

Terrible analogy. The better analogy would be giving someone with a PhD in math a calculus problem, which they solve easily, but then they fail to solve a basic high school algebra problem.

He stated it can't do "algebra" using an example from three years ago. I point out (implied) that not only can it do "algebra" now, it can also do "calculus". Did you really think I was saying I agree with his three year old example that it can't do algebra still, but it can do calculus now? That's illogical. He then criticizes the way it learns and does calculus. My analogy is fine.

I don't care how AI "thinks", learns or processes it's ingress. I care about it's capabilities that I can benefit from, which I believe ultimately will cause much hardship going forward.

You didn't even understand that original criticism. It's not about how "difficult" the math is in an academic sense. That's a human way of thinking. It's about the number of integers involved in a basic arithmetic problem. You look at the number 123,456 and see a 6 digit figure. You can punch it into a calculator pretty easily and do whatever kind of operation you want.

The AI doesn't process language and numbers like that. It converts it all to vectors for matrix multiplication. Here's the problem: Even if the AI has been properly trained to identify the number 123,456, there's no guarantee the same is true for 123,457. That number generates a completely different identity in AI language, and it very well may have no idea what to do with it. So it'll hallucinate and spit out a number that it knows is junk.

Using my own anecdote on this: I wanted to do some space-efficient gardening in my backyard and figure out how many pounds of tomatoes I could produce using vertical aeroponic towers. I decided to ask both Chatgpt and Claude to do some analysis on this less than a year ago. On the plus side: They found research papers from Oregon State University on aeroponic gardening productivity that they quoted from to give me an indicator of how effective these towers were. On the negative side: They screwed up every basic arithmetic calculation larger than 3 digits. These LLM's were telling me I'd generate anywhere from $75,000 worth of tomatoes per year per acre all the way up to $1.5 million. When I asked them to show their work, they generated completely different numbers and said "Oh wait, we recalculated because that last answer was *insert flimsy reason here.*"

It's not reliable. If it can't figure out how many tomatoes can grow in a given space, how is it supposed to do something like accurately model the amount of natural gas I'll recover from a refrigeration plant?

Deputy Travis Junior

Joined: Jan 27, 2013

Posts: 17,887

User Profile Ignore User Stop Ignoring

How long do you want to ignore this user?

24 hours One week Permanently Cancel

In reply to Stmichael • 2:50p, 4/12/26

ChatGPT and Claude both save your prompt history so post your prompt and tell us what models you used. Let's run this now and see how it performs.

2

1 edit

Rex Racer

Joined: Apr 29, 2001

Posts: 44,069

User Profile

Private Message

Ignore User Stop Ignoring

How long do you want to ignore this user?

24 hours One week Permanently Cancel

In reply to Stmichael • 3:17p, 4/12/26

AG

Stmichael said:
Rex Racer said:
KingofHazor said:
OK, besides quibbling about what anecdotal means, why do AIs fail so frequently and catastrophically with everyday tasks. If they can ace that math test, then surely everyday tasks requested by a layman should be a breeze for them, right?

When facing criticism or even questions about the reliability of AI, you AI bros just seem to stick your fingers in your ears and yell "La la la, I can't hear you", rather than addressing the specific issues raised. To be honest, you come across as true believers unwilling to even listen to perspectives that you consider heresy.

AI is very far from perfect, but if you write an effective prompt, it can do some amazing work. Of course you need to verify, but you need to do that with people, too.

It's not the answer to everything, and it's not trash, either.

So many people tend to just take the extreme opposite position and argue. There is a middle ground.

3 questions:

1) How much time are you spending crafting this very specific prompt to get the generalist AI model to do what you want, and then how much time are you spending reviewing and revising the work?

2) How much cash are you spending on the tokens input to and output from the AI? How much is that cost going to increase when these companies stop selling the compute at a massive loss?

3) What's your value add from the end product? How are you measuring it?

I'd be very excited by the increase in productivity we would be experiencing if AI was even half as capable as the AI bros were making it out to be. Increased productivity is a direct counter to inflation, and we could all use some lower prices these days. But the proof is in the pudding, and AI isn't anywhere close to what is being sold to us. Hence the MIT report that 95% of companies aren't seeing any difference from their implementation of it.

As I said above, the ones who are seeing real value from it are those who are working with a model training company to generate a specialized tool to fill a niche for them.

1. As of today, I can use Claude Sonnet 4 to write a full featured, secure, and accessible (WCAG 2.1 Level AA) application in about 30 minutes. And I haven't even tried Sonnet 4.5 or 4.6 yet. I plan to this week. It doesn't take me very long to verify, as I have 27 years in the development game. Maybe about 3 hours. I'm talking about building an application that would take me weeks to write by myself.

2. Right now the cost is negligible. We'll see about future costs, but I'll bet it will be cheaper than labor.

3. Considering we charge $145 per hour, it is saving many thousands of dollars per application.

2

Logos Stick

Joined: Apr 30, 2023

Posts: 22,758

User Profile Ignore User Stop Ignoring

How long do you want to ignore this user?

24 hours One week Permanently Cancel

In reply to Stmichael • 5:06p, 4/12/26

I've understood every argument you've made. You told me AI can't do anything but extremely simple programming because it doesn't understand data structures, etc... You told me that AI cant do math and used "prehistoric" examples of failure as proof. You've incorrectly accused AI of doing extremely difficult math by simply memorizing previous answer keys. Your base assertion is that it can't do math - or at least not consistently - because it doesn't do it like humans. It's like claiming jets can't fly well or consistently because they don't flap their wings like birds, despite all the evidence to the contrary. Did you lose your job to AI or something? Your arguments are illogical.

4

2 edits

dmart90

Joined: Oct 12, 2005

Posts: 23,526

User Profile

Private Message

Ignore User Stop Ignoring

How long do you want to ignore this user?

24 hours One week Permanently Cancel

5:28p, 4/12/26

AG

I've been in the software game for 35+ years. What has happened with AI in the last 6 months is quite simply a game changer. The job of a software engineer is irrevocably changing. The pace at which things advance is about to be off the charts. And that change will now happen in months, not years like it has been. If you doubt this you will be left behind.

Claude Mythos is the single most frightening thing I have read about. If that AI engine gets leaked; the world as we know if it is done. Most companies are NOTORIOUSLY bad when it comes to upgrading servers and systems. THIS tool in the wrong hands? Chaos will reign.

6

2 edits

BigRobSA

Joined: Oct 22, 2003

Posts: 94,991

User Profile Ignore User Stop Ignoring

How long do you want to ignore this user?

24 hours One week Permanently Cancel

In reply to dmart90 • 7:31p, 4/12/26

dmart90 said:
I've been in the software game for 35+ years. What has happened with AI in the last 6 months is quite simply a game changer. The job of a software engineer is irrevocably changing. The pace at which things advance is about to be off the charts. And that change will now happen in months, not years like it has been. If you doubt this you will be left behind.

Claude Mythos is the single most frightening thing I have read about. If that AI engine gets leaked; the world as we know if it is done. Most companies are NOTORIOUSLY bad when it comes to upgrading servers and systems. THIS tool in the wrong hands? Chaos will reign.

I, for one, welcome our new AI overlords.

Glad I got out of IT a decade ago.

4

Bird Poo

Joined: Nov 16, 2005

Posts: 20,706

User Profile

Private Message

Ignore User Stop Ignoring

How long do you want to ignore this user?

24 hours One week Permanently Cancel

In reply to dmart90 • 8:00p, 4/12/26

AG

dmart90 said:
I've been in the software game for 35+ years. What has happened with AI in the last 6 months is quite simply a game changer. The job of a software engineer is irrevocably changing. The pace at which things advance is about to be off the charts. And that change will now happen in months, not years like it has been. If you doubt this you will be left behind.

Claude Mythos is the single most frightening thing I have read about. If that AI engine gets leaked; the world as we know if it is done. Most companies are NOTORIOUSLY bad when it comes to upgrading servers and systems. THIS tool in the wrong hands? Chaos will reign.

What stocks are you buying?

dmart90

Joined: Oct 12, 2005

Posts: 23,526

User Profile

Private Message

Ignore User Stop Ignoring

How long do you want to ignore this user?

24 hours One week Permanently Cancel

In reply to Bird Poo • 8:36p, 4/12/26

AG

If I knew that I would have retired from the software game years ago.