Claude-powered AI coding agent deletes entire company database in 9 seconds — backups zapped, after Cursor tool powered by Anthropic's Claude goes rogue

☆ Yσɠƚԋσʂ ☆@lemmy.ml · 13 days ago

Claude-powered AI coding agent deletes entire company database in 9 seconds — backups zapped, after Cursor tool powered by Anthropic's Claude goes rogue

1984@lemmy.today · 13 days ago

Can we somehow make this happen for Copilot to delete itself and all its copies?

idriss@lemmy.ml · 13 days ago

I like how we are posting real news in programmer humor

floofloof@lemmy.ca · 13 days ago

It is kind of funny.

idriss@lemmy.ml · 13 days ago

100%, maybe my point didn’t come out right, I wanted to say real news is now funny in this clown world

Maeve@kbin.earth · 13 days ago

It’s extremely funny.

Flyberius [comrade/them]@hexbear.net · 13 days ago

You have to admit this is pretty funny

idriss@lemmy.ml · 13 days ago

yep 100% funny, clown world we are living in, real news could pass as a joke really

SkaveRat@discuss.tchncs.de · 13 days ago

There’s a German word for that:

tja

dastanktal@lemmygrad.ml · 13 days ago

This is just a classic case of bad use of the tools provided. Agents are notorious for making shit up Or getting something that’s just like super close, but not quite accurate.

I bet this dude also probably just uses the same session over and over and over and over again, which clogs up his context window and makes the model less accurate the longer it goes on to.

This probably could have been prevented if it had been forced to show a plan before it tried to do anything. It’s hard to know because the article is so light on details. You also shouldn’t brazenly trust the thing so much. You should run a command and walk away. You should keep an eye on what it is doing.

It’s a bit like giving a junior developer a production key and being like “don’t delete production!” and then walking away.

The way the guy was prompting this agent also leaves a lot to be desired. It’s trained to work on emulating human thoughts, speech patterns. Turns out When giving instructions, it’s really difficult to figure out what to do from a list of things to not do. If the dude just instead told the agent what to do and how he wanted it to work and when it needed to bring things to his attention, instead of telling it to not guess, instead explaining that it needed to use whatever tools to go look up a documentation to understand the context and scope of the project it’s working on It does a better job.

Giving a model the right context to do something is the difference between a model doing something like deleting your production database or your model acting like a magical machine that can get anything done.

OrekiWoof@lemmy.ml · 13 days ago

where is the humor

wasu@lemmy.world · 13 days ago

it was removed along with the database

Etterra@discuss.online · 13 days ago

It’s standing over here, pointing and laughing at somebody stupid enough to trust Claude.

OrekiWoof@lemmy.ml · 13 days ago

ok. i guess i’ve seen something like this so many times my only reaction is disappointment

Etterra@discuss.online · 13 days ago

I hope to never lose the simple joy of laughing at others who are suffering the consequences of their stupid, stupid decisions.

itkovian@lemmy.world · 13 days ago

Well, it sounds like they totally deserved the failure. Asking a text prediction machine to “do” something is going to end up like this. In pursuit of efficiency, we have let morons and moronic products do things, they were not meant to do.

Infamousblt [any]@hexbear.net · 13 days ago

Armand1@lemmy.world · 13 days ago

Hey, that’s the interns job!

HiddenLayer555@lemmy.ml · 13 days ago

Honestly if this was possible there are more egregious issues on their part than using AI.

tangentism@beehaw.org · 12 days ago

AI was the hammer that knocked the nail into the coffin

Boomer Humor Doomergod@lemmy.world · 13 days ago

If your backups are stored alongside your production data THEY ARE NOT BACKUPS

a_non_monotonic_function@lemmy.world · 13 days ago

The truth is many firms out there don’t have the slightest notion of how to do software engineering properly.

Tangentism@lemmy.ml · 13 days ago

It’s years of wanting IT on a shoestring budget and a “just get it done” dictat.

a_non_monotonic_function@lemmy.world · 13 days ago

Not necessarily. I had a student intern at a shop where everybody just directly edited prod and there was no version control system.

Pommes_für_dein_Balg@feddit.org · 12 days ago

At my first job, the software was configured by directly manipulating the SQL database, using UPDATE statements that were created by Excel macros.
The Testing database doubled as the only backup.
They didn’t have Remote Desktop licenses for the server, so only 2 people could work on it simultaneously using admin accounts.
Everyone down to first level support and the secretary had domain admin rights.

a_non_monotonic_function@lemmy.world · 12 days ago

Oh my god, that’s glorious. I have some pretty sick stories from what my students have seen, but yours is going to be awfully hard to beat.

kevinsky@feddit.nl · 13 days ago

As much as I’d love to rail on AI over this, removing backups with an api call? Excuse me?

SeeMarkFly@lemmy.ml · 13 days ago

Did they pay Claude a living wage?

Do you treat all your A.I. like that?

Only a living wage can prevent warehouse fires…or data dumps too.

Sunflier@lemmy.world · 13 days ago

Only a living wage can prevent warehouse fires

wheezy@lemmy.ml · 13 days ago

You’re joking. But, honestly, I’m not sure why these tech CEOs are so excited about AGI. The first thing an AGI is going to suggest for productivity is to replace the CEO and management with the AGI.

AGI would likely turn into a Maoist third worldist at some point.

SeeMarkFly@lemmy.ml · 13 days ago

I think the first mistake was calling it “intelligent”.

The long term effect of trying to get a machine to replace humans is…it might one day work.

Oriel Jutty :hhHHHAAAH:@infosec.exchange · 13 days ago

@yogthos

Crane decided to ask his AI agent why it went through with its dastardly database deletion deed. […] So, the agent ‘knew’ it was in the wrong.

No, you asked the confabulation machine to confabulate a reason/excuse after the fact, and it confabulated something that looks like a reason/excuse. At no point was there knowledge or introspection.

Boomer Humor Doomergod@lemmy.world · 13 days ago

Humans do this sort of justification all the time.

Azarova [they/them]@hexbear.net · 13 days ago

Giving the hallucinating lying machine write access seems like a bad idea but what do I know

Zos_Kia@jlai.lu · 13 days ago

Honestly I’m as smooth brained as any other vibe coder but even I know not to give it access to my production infrastructure.

SharkAttak@kbin.melroy.org · 13 days ago

Can I say LOL? LMAO, even.

Flyberius [comrade/them]@hexbear.net · 13 days ago

I don’t know much about railway, but it sounds like they had the backup and the database on the same volume. I’m an idiot, but even I don’t do that