I just tried vibe coding with Claude

entwine@programming.dev · 2 months ago

I just tried vibe coding with Claude

Evotech@lemmy.world · 2 months ago

You need to use plan mode

Bonje@lemmy.world · 2 months ago

Our work started giving Claude access. Plugging sonnet 4.6 in with opencode I had it do some terragrunt code. It was mostly correct. Highly documented languages seem to be its best. The modules I had it write cost 4 bucks of tokens total.

It just gave insane ick using it though. I might just resign to using it though because of our backlog and burn out.

solomonschuler@lemmy.zip · edit-2 7 days ago

deleted by creator

onlinepersona@programming.dev · 2 months ago

It’s not called “correct” coding for a reason.

That’s why people are wrong so often: they feel like something is right, but don’t check. That’s how you get anti -vaxxers, manospere people, MAGA, QAnon, Brexit, etc.

ReallyCoolDude@lemmy.ml · 2 months ago

I read a lot of these posts that sadly leave out the basic parts: what were your prompts? What does it means in this context ‘vibe coding’? Did you create an initial setup, and slowly build up? Did you left wverything to the agent understanding, and just pushed approve or reject? There are multiple levels of quality that depends on the input. Did you get into context rotting? 3d math means vector math, matrices, or what? Given claude has a serious problem from march at least, the way u use it is paramount. In our team we all use claude with copilot ( sadly, that is a business directive ), and while excpetional at finding small relationships in components and microservices, had to build a long list of skills just to be barely usable in a ‘star trek’ way. The bottom line is that it is that you must be extremely precise when asking. Prompt modeling count a lot. Context build as well. For now, unit tests and data/mocks refactors are working extremely well for me, when i define the tests cases. My agents got to a point where i can safely have small peoperty additions with refactors on multiple repositories at once ( ie: i change the contract on microservice a, microservices b,c,and d are automatically updated ). This last part had to.be built thoug, with memory, engrams, and some fune tuing. It is not always a shit: if not nobody would use it. It is not this revolutionary technology that will make humans obsolete as well ( as they are selling it ).

No1@aussie.zone · 2 months ago

“were almost write every time”

Claude: You too are human, human.

BenevolentOne@infosec.pub · 2 months ago

If you make a spelling error, Claude thinks, “we’re doing low quality work”, and it does.

JubilantJaguar@lemmy.world · 2 months ago

Recently I used it (some free-tier DuckAI model, not Claude) to write a Python script for pasting PNGs into PDFs (complete with Tk interface) while applying a whole bunch of custom transformations. Simple enough, but a total chore with all the back-and-forth of searching for relevant unfamiliar libraries and syntax checking and troubleshooting. Inevitably it would have taken me the whole afternoon by hand. With AI I knocked it out in 25 minutes. That was my epiphany moment.

Since then I’ve noticed a general problem with AI coding. It almost always introduces too much complexity, which I then have to waste time untangling (and often just understanding) before I can proceed. Whereas if I had done it “my way” from the start I might have got there earlier. But I figure this problem is kinda on me.

thedogz22@thelemmy.club · 2 months ago

And for me, therein lies why my use of it has become reduced to a really complex rubber duck, or to write something out that I could do by hand, but making my robot butler do it is just faster. Anyone actually leaning into today’s generative AI models for generating code that requires complexity or thought… they shall reap what they sow in the years to come.

Katherine 🪴@piefed.social · 2 months ago

Don’t just use it as a drop in replacement for a programmer; use it to automate menial tasks while employing trust but verify with every output it produces.

A well written CLAUDE.md and prompt to restrict it from auto committing, auto pushing, and auto editing without explicit verification before doing anything will keep everything in your control while also aiding menial maintenance tasks like repetitive sections or user tests.

Feyd@programming.dev · 2 months ago

verify with every output it produces.

I agree that you can get quality output using these tools, but if you actually take the time to validate and fix everything they’ve output then you spend more time than if you’d just written it, rob yourself of experience, and melt glaciers for no reason in the process.

prompt to restrict it from auto committing, auto pushing, and auto editing without explicit verification

Anything in the prompt is a suggestion, not a restriction. You are correct you should restrict those actions, but it must be done outside of the chatbot layer. This is part of the problem with this stuff. People using it don’t understand what it is or how it works at all and are being ridiculously irresponsible.

repetitive sections

Repetitive sections that are logic can be factored down and should be for maintainability. Those that can’t be can be written with tons of methods. A list of words can be expanded into whatever repetitive boilerplate with sed, awk, a python script etc and you’ll know nothing was hallucinated because it was deterministic in the first place.

user tests.

Tests are just as important as the rest of the code and should be given the same amount of attention instead of being treated as fine as long as you check the box.

x00z@lemmy.world · 2 months ago

The trick about vibe coding is that you confidently release the messed up code as something amazing by generating a professional looking readme to accompany it.

Wilmo@lemmy.ml · 2 months ago

The more Emojis in that Readme the better!

Blackmist@feddit.uk · 2 months ago

I think it’s mostly going to be useful for boilerplate generation, and effectiveness is going to vary wildly based on what language you’re using. JS or Python? It’ll probably do OK. Plenty of open source for it to “learn” from. Delphi? Forget it.

Brief experimentation showed it liked to bullshit if it was wrong, rather than fix things.

arthur@lemmy.zip · 2 months ago

I’m using (Gemini 3.1 pro in) Gemini cli to build a complex (personal) project to explore how to use these tools. My impression is that the code produced by LLMs is disposable/throwaway. We need to babysit the model and be very hands on to get good results.

bruce_babbler@lemmy.zip · 2 months ago

You’re probably done with this. But if you give claude a test case or two (or have it try to make them) you can have claude run the test case, and then it will iterate.

Also, aggressively use plan mode and if claude screws up more than three times do /clear, explain that it’s screwing up to it and then give it new instructions.

Feyd@programming.dev · 2 months ago

producing subtly broken junk

The difference between you and people that say it’s amazing is that you are capable of discerning this reality.

JustEnoughDucks@feddit.nl · 2 months ago

I wonder if it was even able to compile. I am a shitty hobby coder who just does it to make my embedded hardware projects function.

I have yet to get compilable code out of any of the AI bots I have tried. Gemini, mistral, and chatGPT. I am not making an account lol.

I have gotten some compilable python and VBA code for data analysis stuff at work, so I wonder if it is because embedded stuff uses specific SDKs that it can’t handle.

Either way I have given up on it for anything besides bouncing ideas off of or debugging where electromagnetics issues could lie (though it has been completely wrong about that also even though it is using the wrong concepts, it just reminds me of concepts that I might have overlooked)

OwOarchist@pawb.social · 2 months ago

What I don’t get, though, is how the vibe code bros can’t discern this reality.

How can they sit there and not see that their vibe-coded app just doesn’t do what they wanted it to do? Eventually, you’ve got to try actually running the app, right? And how do you keep drinking the AI kool-aid when you find out that the app doesn’t work?

Oisteink@lemmy.world · 2 months ago

I do apps that work, i do patches that are production quality. Half the cs world does… I do full stack ai debugging of esp32 projects.

It’s a powerful tool, you just need to learn it’s strong and weak points, just like any other tool you use.

Kissaki@programming.dev · 2 months ago

Half the cs world does…

What’s the basis for this claim? I’m doubtful, but don’t have wide data for this.

KeenFlame@feddit.nu · 2 months ago

Of course they do, it is hyperbole to think they are completely useless

Oisteink@lemmy.world · 2 months ago

Rough estimate from my personal connections only. Some work places where ai is not possible, but all that have made an effort report good code. You need to work with what it is - a word generator that sometimes gives correct results. Make it research and not trust training. Never let it do things on its own, require a plan and reason. Make it evaluate its own work/plan.

Most issues i have stem from models beeing too eager. Restrain them and remove the “i can do this next…”behaviour.

Context is king - so proper mcp and documentation that is agent facing. I use serena as i can get lsp for yaml, markup and keep these docs like that

zbyte64@awful.systems · 2 months ago

Any luck with integrating platform.io? Have a esp32 project but VSCode can’t provide type hinting with it’s main c++ extension that is used by platform.io.

Feyd@programming.dev · 2 months ago

They’re the same people that copied code from stack overflow that you had to tell them how to actually fix every PR. The difference is the C suite types are backing them this time

favoredponcho@lemmy.zip · 2 months ago

You do try running the app, and then you see what is broken and then you have Claude fix it. The process is still iterative just like regular coding. I haven’t met a software engineer that wrote a perfect app the first try, its always broken, even in subtle ways. Why does everyone think vibecoding needs to be perfect on the first shot?

tleb@lemmy.ca · 2 months ago

Eventually, you’ve got to try actually running the app, right?

At least at my company, no, they just start selling it.

pinball_wizard@lemmy.zip · 2 months ago

Yes. Exactly. In my experience, there’s more code shops that ship shit than that catch their mistakes.

Lumelore (She/her)@lemmy.blahaj.zone · 2 months ago

Vibe code bros aren’t real programmers. They’re business people, not computer people. Even if they have a CS degree, they only got that because they think it’ll get them more money. They lack passion and they don’t care about understanding anything. They probably don’t even care about what they’re generating beyond its potential to be used in a grift.

I graduated college not that long ago and my CS classes had quite a few former business majors. They switched because they think it’ll be more lucrative for them but since they only care about money they didn’t bother to actually learn the material especially since they could just vibe code through everything.

b_n@sh.itjust.works · 2 months ago

So much this.

After working in tech companies for the last 10 years I’ve noticed the difference between people that “generate code” and those that engineer code.

My worry about the industry is that vibe coding gives the code generators the ability to generate even more code. The engineers (even those that use vibe tools) are not engineering as much code by volume compared to “the generators”.

My hope is that this is one of those “short term gain, long term pain” things that might self correct in a couple of years 🤞.

sobchak@programming.dev · 2 months ago

It’s insane that companies are going back to metrics like LOC (or tokens generated), when the industry figured out decades ago that these are horrible, counterproductive metrics.

“The hard thing about building software is deciding what one wants to say, not saying it. No facilitation of expression can give more than marginal gains.” - No Silver Bullet (1986)

☭可爱小猫☭@programming.dev · 2 months ago

I only use AI for generating ok looking UI.

Anthropic says Methos will find bugs on FreeBSD, Bank system etc. What a bullshit.

OwOarchist@pawb.social · 2 months ago

Oh, it will ‘find bugs’ alright. And then flood FreeBSD’s bug report system with bullshit bug reports that turn out to be nothing, but require expert human review to discern that.

homoludens@feddit.org · 2 months ago

It’s easier finding bugs than producing correct, readbale, maintainable code though-

zerofk@lemmy.zip · 2 months ago

“Almost but not quite” is exactly my experience with Claude.

The only time I’ve had real success is telling it to do a simple API change that touches a dozen files. It took a while and I’m not sure it was faster than doing it manually, but at least it was less boring.

Possibly important context: I only started really using it a few weeks ago.