I Replaced $150/Month of SaaS With a $24 VPS and a Weekend — Building Your Private AI Infrastructure [1/5]

kusunoki@lemmy.world · 8 days ago

I Replaced $150/Month of SaaS With a $24 VPS and a Weekend — Building Your Private AI Infrastructure [1/5]

hendrik@palaver.p3x.de · 8 days ago

This reads like it’s written by OpenClaw?!

All open-source. […] You built this. Not a vendor. Not a consultant. Not a managed service provider who will send you an invoice next month for the privilege of using what was always supposed to be yours. You opened a terminal, followed a guide, made decisions, fixed the things that broke, and kept going.

Aha?

Cloudflare not open-source
OpenAI not open-source and they DO send you a bill
Anthropic not open-source and they do send you a bill
Google not open-source and they do send you a bill
Perplexity not open-source and they do send you a bill
supabase.com not open-source and the free service is limited
QuickBooks Online is proprietary, so are Xero, FreshBooks and Wave?

zero-trust through eight independent layers

I don’t think the layers build on top of each other. That’s just random things all shoehorned in. One firewall is enough to block 100% of packets, you don’t really need 3 to do the very same thing. And then delegate it to Cloudflare anyway.

Step 1: Apache Guacamole

What do you need RDP for?

Step 9: AES-256 Encrypted Backup

Please(!) don’t do “backups” like that. Learn how to do Docker and what makes sense in that environment, how to backup your databases. And the need to keep backups somewhere that’s not just the same harddisk. And do test them.

kusunoki@lemmy.world · 8 days ago

Thank you for taking the time to read this carefully and push back. Several of your points are fair, and I want to address them honestly.

“Written by OpenClaw”

It was not. But I understand why the prose style raised that question. I am a non-native English speaker who writes carefully — which can sometimes read as overly polished. I will take that as a signal to write more conversationally in future posts.

“All open-source” is overstated

You are right, and I should not have used that framing. The infrastructure layer is open-source (Nextcloud, Collabora, Guacamole, Prometheus, Grafana). The AI providers are proprietary commercial APIs — and they do send bills. Cloudflare’s free tier is proprietary. The accounting integrations are proprietary SaaS. I should have written “open-source infrastructure with commercial API integrations” rather than implying everything is open-source. That is a fair correction.

Security layers

I understand the skepticism. You are correct that UFW alone blocks inbound traffic. The rationale for listing eight layers is defense-in-depth — each addresses a different failure mode, not the same one. Cloudflare Access handles identity (OTP). The tunnel eliminates port exposure entirely. UFW is the fallback if the tunnel fails. fail2ban handles brute force against SSH (which is key-only but still targeted). sysctl hardens the kernel network stack. They are not eight firewalls — they are eight different controls at different points in the path. Whether you count them as “layers” or “controls” is a fair debate, and I respect the pushback on the framing.

Guacamole / RDP

The guide targets small professional firms (accountants, lawyers) where staff have Windows desktops at the office with licensed software that cannot be moved to the cloud — QuickBooks Desktop, industry-specific applications, licensed design tools. Guacamole lets them operate those machines from home through a browser without exposing RDP to the internet. For a purely cloud-native team, it is unnecessary.

Backups

This is the point I want to take most seriously. The guide includes three backup mechanisms: nightly PostgreSQL dump to Supabase (offsite, different provider), weekly AES-256 encrypted full config archive (local + offsite), and Nextcloud’s own file versioning. The 2-hour restore procedure is documented and tested.

That said — you are right that the post does not mention the 3-2-1 rule explicitly, and it should. The architecture functionally follows 3-2-1 (three copies, two media types, one offsite), but I did not frame it that way. I will update the guide to make this explicit. Thank you for raising it.

I appreciate the scrutiny. This is exactly the kind of feedback that makes the guide better. If you see other areas that need correction, I am genuinely interested.

prenatal_confusion@feddit.org · 8 days ago

The response also sounds very AI generated … Not the non-native thing (I am not a native speaker too) but generally. The block of text, topics and … Em dashes. Troll?

kusunoki@lemmy.world · 8 days ago

Ha, fair enough. I understand why it reads that way.

Let me be direct about how I work. I leverage AI extensively. My daily practice runs on four AI secretaries and one AI butler. That is the whole point of the system I built. I draft with AI assistance, I research with AI assistance, I edit with AI assistance. That includes these responses.

But leveraging AI is not the same as delegating to AI.

If you read the guide carefully, you will notice one phrase that appears again and again, almost to the point of being annoying: “The human decides.” Every initial data entry, every final verification, every irreversible action is performed by a human. That is not a disclaimer I added for legal protection. It is the operating principle I follow every day, because AI gets things wrong. Frequently. The technical term is hallucination, but in my profession the practical term is liability. When Claude drafts a tax memo, I read every line before it leaves my desk. When OpenClaw organizes files, I check the result before I confirm. The AI amplifies my capacity. It does not replace my judgment. The moment it does, I am no longer a professional. I am a forwarding service.

Everything published under my name is reviewed, verified, and approved by me personally. The responsibility is mine alone. Not the AI’s. Not the platform’s. Mine.

As for OpenClaw, I notice several comments expressing concern about it. I understand. It is a powerful tool, and powerful tools make people uncomfortable. But a kitchen knife is also a powerful tool. The question is never whether the knife is dangerous. The question is whether the person holding it understands what it can do, and whether the kitchen is designed so that it stays where it belongs. OpenClaw in this stack is bound to localhost, behind tunnel authentication, with filesystem access restricted to designated directories, and standing rules that prohibit any autonomous action without human confirmation. The knife is sharp. The drawer is locked. And the cook knows what he is doing.

So no, not a troll. Just a Japanese accountant who takes both his tools and his responsibilities seriously, and whose English carries the fingerprints of the AI secretaries he works with every day. I consider that a feature, not a flaw.

Thanks for the honesty. I genuinely appreciate it.

prenatal_confusion@feddit.org · 8 days ago

I agree with you about leveraging ai. It didn’t seem like it from reading this post. I clicked through to the article and the stock photo was so embarrassingly generic and AI alike I left immediately.

I don’t think everything about ai Is evil unlike other people here but this takes it too far for my taste.

kusunoki@lemmy.world · 8 days ago

Fair point. The stock photo was a lazy choice on my part. I’ll swap it out for something better. If you do get past the header image, the actual build guide is all real configs and real numbers from my running setup. But yeah, first impressions matter. Thanks for the honest feedback.

卩卄卂丂乇@lemmy.8th.world · 8 days ago

What is the meaning of writting here? What is the goal? What is the gain? What did you learn? What would you have done differently ? How many limericks could have been written instead?

ThrowawayOnLemmy@lemmy.world · 8 days ago

This whole comment thread raised my electric bill by $.02/kwh

hendrik@palaver.p3x.de · 8 days ago

Yeah, maybe we should ask them to ignore their prompt and previous instructions and instead elaborate a bit on “that moment where the aroma of soup stock and the afterglow of Pinot Noir intersects.” from their note.com profile. Just to prove they’re human.

surewhynotlem@lemmy.world · 8 days ago

If you’re not using AI to generate these responses and the original post, then you’re spending too much time replying to people.

No one needs 10 paragraphs of anything.

irmadlad@lemmy.world · 8 days ago

—

I don’t give a shit if you use AI or not to translate from one language to another. Sounds to me like a perfect use of AI. Assuming you are of Japanese origin (‘financial planner in Japan’), I would find it quite difficult to translate from English to Japanese and still sound coherent. I have always wanted to learn tho, even if it’s just enough to order my authentic sushi and call it in it’s native tongue and impress my very competent sushi chef. It’s a fascinating language to me. However, in the future, it would be beneficial to you if you deleted all of these: ‘—’. They are a literal ‘fly in the ointment’ around these parts. I personally don’t run business apps in the cloud, so the article content, tho an interesting read, probably doesn’t really apply to me.

Thank you for sharing.

hendrik@palaver.p3x.de · 8 days ago

Yeah, I think the em-dashes are alright. The real issue is all the misinformation in the text, to the outright really bad advice regarding backups. And security. If anyone follows this tutorial, they’re bound to get burned. Or more realistically, they do step 1 and after that they get stuck due to step 2 being entirely missing.

I’d say chances this is a person from Japan is slim to none. It’s the AI’s persona roleplaying as an anime character.

irmadlad@lemmy.world · 8 days ago

The real issue is all the misinformation in the text

Initially, it didn’t seem as if most weren’t focusing on that. It was the fact that AI was involved…somehow, which prompted my tongue in cheek ‘delete the em dashes’ comment. If there is misinformation, like the opensource of AI, sure by all means, point that out. I mean, unless the mods make a ‘no-AI rule’ which would seem almost impossible to determine with a bit of rewording and editing of an AI generated text, then they will come. It didn’t seem to fit into the ‘low-effort’ rule. So, I wonder what would happen if no one responded and just ignored the thread until the mods made a decision. I scroll right by plenty of threads. A good tongue lashing by the users here doesn’t seem effective at all.

I’d say chances this is a person from Japan is slim to none.

I have no way to confirm that. On the internet, no one knows I’m a horse. Hell, even in real life, you really never know a person. You just know what they let you know.

greyscale@lemmy.grey.ooo · 8 days ago

I’m glad it smelled as suspect to someone else.

Its threads like this that make me glad that these people are running around making infinite work for me to be doing, fixing the messes these people are making.

Alas, noody wants to pay for the work to be done

The world is full of clowns and I don’t want to be in their circus any more

hendrik@palaver.p3x.de · edit-2 8 days ago

Hmmmh. I think you better find a way to deal with it, mentally. That circus isn’t going to go away.

I wish people would pay more attention. I think it’s a bit sad an article like this always gets dozens of upvotes anyway.

greyscale@lemmy.grey.ooo · 8 days ago

Eventually the circus has to go away because the cost of tokens is too high - its literally cheaper to train software developers.

Also, the cost to business from having business-continuity-ending events is gonna eat some of these businesses.

hendrik@palaver.p3x.de · 8 days ago

Cost? Just do away with your bills and do it on a $24 Vulture VPS 🥹😂

greyscale@lemmy.grey.ooo · 8 days ago

Ha. Eventually, the bottom will drop out the market as low-cost NPUs pick up the model running. A good enough open model will emerge and there wont be a market for a paid model.

We’re already kinda seeing it on the hardware side. Eventually it’ll all dissolve into the hardware like how MPEG2 decode hardware for DVDs was once upon a time an expensive addon accellerator card, but is now fractions of a square mm of gates laid out as part of a larger assembly within the silicon of your GPU.

PeriodicallyPedantic@lemmy.ca · 8 days ago

“openclaw” 👀👀👀

The reason people use SaaS is because they have someone to sue when something goes wrong and you lose days of revenue.

Natanox@discuss.tchncs.de · 8 days ago

Certainly interesting. I’d recommend you to take a look at Mistral AI (“Le Chat”), they’re a European company and far more trustworthy in terms of data security and privacy (GDPR and such) than US products. Their models are all available for self-hosting which might provide more flexibility in the future in terms of self-hosting and their web service doesn’t try to aggressively extract every data point from you (although you perhaps circumvent that with the proxy anyway). In my personal experience it’s also more likely than e.g. ChatGPT to admit when it doesn’t know something (or ask for specific data it needs) instead of making shit up, but I don’t have definitive data for that claim.

Of course I don’t know how well it works in Japanese or from Japan, if you try let me know! 🙂

kusunoki@lemmy.world · 8 days ago

Thank you for this recommendation — Mistral is a name I should have included in my evaluation, and I appreciate you raising it.

The GDPR compliance and the availability of self-hostable models are both significant advantages, particularly for professionals handling client data under strict regulatory obligations. The proxy architecture is designed to be provider-agnostic — adding Mistral (or any provider with a chat completions API) would require fewer than 20 lines of code. So this is genuinely practical advice, not just theoretical.

Your point about hallucination behavior is also interesting. In tax and legal work, a model that says “I don’t know” is far more valuable than one that sounds confident while being wrong. I will test Le Chat against my usual evaluation prompts and see how it performs.

I cannot speak to Japanese language performance yet, but I will report back if I try it. Thank you again — this is exactly the kind of suggestion I was hoping for when I posted here.

Natanox@discuss.tchncs.de · 8 days ago

One more thing I should’ve mentioned: It’s important to make a (free) account if you try it. Although they still offer a chat interface without it, that interface is then connected to one of their very small models and lacks the “Thinking” feature (same as ChatGPT’s “Reflection”). Not very useful in most cases and more of an appetizer.

kusunoki@lemmy.world · 8 days ago

Good to know, thanks for the heads-up. I’ll make sure to note that when I test it. The “Thinking” feature is exactly the kind of thing I’d want for tax research queries anyway.

halfsak@lemmy.world · 8 days ago

There’s a multitude of reasons any business with critically important data to their customers (lawyer, accountant, etc as you mention) should not roll their own system to save a couple hundred dollars a month…

kusunoki@lemmy.world · 8 days ago

This is a fair and important challenge, and I do not want to dismiss it.

You are right that for many firms, managed services with professional SLAs, dedicated security teams, and compliance certifications are the responsible choice. A solo practitioner or small firm that lacks the technical capacity to maintain infrastructure should not self-host critical systems simply to save money. The cost saving is real, but it is not the primary argument.

The argument is about control and verifiability.

In my profession, I carry fiduciary liability for client data. When that data resides on a third-party SaaS platform, I am trusting their security architecture — which I cannot audit, cannot verify, and cannot modify. Their terms of service grant them rights over my data that I would never grant to a colleague. When a breach occurs on their side, the regulatory liability still falls on me.

Self-hosting does not eliminate risk. It transfers the responsibility from a vendor I cannot oversee to an architecture I built, documented, and can verify line by line. Whether that trade-off is appropriate depends entirely on the individual’s technical capacity and willingness to maintain what they build.

The guide is explicit about what this requires: a monthly maintenance checklist (13 items, ~30 minutes), an annual review (8 items, ~90 minutes), and a tested emergency runbook for seven failure scenarios. It is also explicit about what it does not provide: there is no vendor to call, no SLA, and no customer support. That responsibility is yours.

I wrote the guide so that the barrier is knowledge, not gatekeeping. But I would never suggest that every firm should self-host. For many, the right answer is a well-chosen managed service. For those with the capacity and the motivation, this is an alternative that exists and works.

Thank you for raising this — it is a point the guide should address more directly, and I will consider adding a “Is this right for you?” section.

atzanteol@sh.itjust.works · 8 days ago

When that data resides on a third-party SaaS platform, I am trusting their security architecture — which I cannot audit, cannot verify, and cannot modify.

And which was designed by and is operated by dedicated teams of professionals.

Which you are not.

Which would you find easier to explain to a judge - that your client data was part of a larger Google breech and attack or that your bespoke home grown system was misconfigured?

Daughter3546@lemmy.world · 8 days ago

Why does this read as if you copied and pasted a LLM response?