Vibix

April 20, 2026

Trust the robots

I had an idea a week or two ago.

Could an AI agent write an operating system?

And how far could it get? When would the thing self-combust?

And so, I booted up Claude determined to find out. I proceeded to invoke claude with a short request:

────────────────────────────────────────────────────────────────────────────────
❯ Let's build an operating system together.
────────────────────────────────────────────────────────────────────────────────

What followed over the next 48 hours still seems insane to me. At first, I was involved every step of the way. What language should it be? Rust. What bootloader? Limine. And on, for maybe the first hour or two.

At some point I had an idea.. what if I build a development environment in docker for it? Then I could pass --dangerously-skip-permissions without getting pwned as they used to say. I wouldn't have to stay by my computer hitting enter. Great!

Some time later I tired of coming back to my computer, telling it what to do in between tasks.

How could I get around that?

It occured to me that claude could perfectly well use the command line, so why don't we just install gh? That way I could file issues on the repo, and claude could solve them in a loop.

I called this /auto-engineer, and you can find it here. And so now here I was, in the days that followed, watching claude program from afar,

Some innovations (in Vibix) that emerged over the days following /auto-engineer:

/os-researcher: A skill which conducts an entire RFC topic for an os research problem. It researches the idea on the web, conducts a full peer-review process from a number of archetypes (all documented through a PR), and if consensus is formed, merges the RFC into the repo.
/project-report: Get a project report posted to the repo discussion page. One for every day I remembered to run it.
/auto-manager: Middle management layer orchestrating sprints of work with a team of /auto-engineers. Claude is able to map out dependencies; map out likely areas of code overlap; etc. This is a real unlock- just like any graph traversal (and a team working together in a shared area is one), you can only go so wide before you lose efficiency.

So far, it's going much better than expected. The operating system still boots, and we are partially on the way to having a functioning ext2 filesystem driver.

It's not all sunshine and rainbows though. On day 5 I hit a fork/exec hang that my /auto-engineers spun in circles on. I had to bisect the project manually, find the commit in question, and direct claude very closely to get it fixed. So coding isn't solved yet. But it might be on the path to being solved.

The future of writing software - as an industry - seems to be one that is evolving into something closer to philosophy:

How well do you understand first principles?
How good are the ideas you can come up with?
How logically can you articulate them to a model?

Perhaps today being able to debug a program is a requirement, but will it always be? With the right harness, could claude not debug for you? How long before someone writes the debug equivalent of Claude Code?

I will spend my excess quota in between other personal projects on Vibix, if only just to see how far it gets.

HUMANS.md

Vibix