Claude Code Unpacked : A visual guide

1108 points

1/21/1970

4 days ago

by autocracy101

Comments

autocracy101

Author here. I built this in a few hours after the Claude Code leak.

I've been working on my own coding agent setup for a while. I mostly use pi [0] because it's minimal and easy to extend. When the leak happened, I wanted to study how Anthropic structured things: the tool system, how the agent loop flows, A 500K line codebase is a lot to navigate, so I mapped it visually to give myself a quick reference I could come back to while adapting ideas into my own harness and workflow.

I'm actively updating the site based on feedback from this thread. If anything looks off, or you find something I missed, lmk.

[0] https://pi.dev/

4 days ago

lateforwork

How about releasing your own source code? It is a beautiful site, love the UX as well as functionality.

3 days ago

azinman2

It screams vibe coding. This is the anthropic look. Just ask Claude and give it a screenshot.

3 days ago

cowlby

Vibe coding is also why this was released hours after leak instead of days/weeks.

3 days ago

But egos are involved.

3 days ago

vntok

Why would you think that?

3 days ago

saulpw

I'm a committed open source dev and I've flipped my own switch from "default public" to "default private".

3 days ago

kordlessagain

Because nobody wants their shit stolen by some punk.

3 days ago

boomskats

This is nice, I really like the style/tone/cadence.

The only suggestion/nit I have is that you could add some kind of asterisk or hover helper to the part when you talk about 'Anthropic's message format', as it did make me want to come here and point out how it's ackchually OpenAI's format and is very common.

Only because I figure if this was my first time learning about all this stuff I think I'd appreciate a deep dive into the format or the v1 api as one of the optional next steps.

3 days ago

ontouchstart

I’m using pi and cc locally in a docker container connected to a local llama.cpp so the whole agentic loop is 100% offline.

I had used pi and cc to analyze the unpacked cc to compare their design, architecture and implementation.

I guess your site was also coded with pi and it is very impressive. Wonderful if you can do a visualization for pi vs cc as well. My local models might not be powerful enough.

Thanks for the hard work!

3 days ago

order-matters

what level of success are you getting with a 100% offline loop (and on what hardware if you dont mind sharing)?

3 days ago

ontouchstart

https://gistpreview.github.io/?30a4e491eb2df7523ecc120b86feb... (pi)

https://gist.github.com/ontouchstart/d7e3b7ec6e568164edfd482... (cc)

> but so little commentary on the core architecture.

The core architecture is not interesting? its an LLM tui, theres not much there to discuss architecturally. The code itself is the actual fascinating train wreck to look at.

3 days ago

olejorgenb

The tools was mostly already known, no? (I wish they had a "present" tool which allowed to model to copy-paste from files/context/etc. showing the user some content without forcing it through the model)

3 days ago

AnotherGoodName

Yeah in fact one thing claude is freaking great at is decompilation.

If you can download it client side you can likely place a copy in a folder and ask claude

‘decompile the app in this folder to answer further questions on how it works. As an an example first question explain what happens when a user does X’.

I do this with obscure video games where i want to a guide on how some mechanics work. Eg. https://pastes.io/jagged-all-69136 as a result of a session.

It can ruin some games but despite the possibility of hallucinations i find it waaay more reliable than random internet answers.

Works for apps too. Obfuscation doesn’t seem to stop it.

3 days ago

nashadelic

Whoa, when did they come out with JA3?

2 days ago

jayd16

Why are "tools" for local IO interesting and not just the only way to do it? I can't really imagine a server architecture that gets to read your local files and present them without a fat client of some kind.

“if they deliver”

As I’m reading this, I’m thinking about how in 1980. It was imagined that everyone needed to learn how to program in BASIC or COBOL, and that the way computers would become ubiquitous would be that everybody would be writing program programs for them. That turned out to be a quaint and optimistic idea.

It seems like the pitch today is that every company that has a software-like need will be able to use AI to manifest that software into existence, or more generally, to manifest some kind of custom solution into existence. I don’t buy it. Coding the software has never been the true bottleneck, anyone who’s done a hackathon project knows that part can be done quickly. It’s the specifying and the maintenance that is the hard part.

To me, the only way this will actually bear the fruit it’s promising is if they can deliver essentially AGI in a box. A company will pay to rent some units of compute that they can speak to like a person and describe the needs, and it will do anything - solve any problem - a remote worker could do. IF this is delivered, indeed it does invalidate virtually all business models overnight, as whoever hits AGI will price this rental X%[1] below what it would cost to hire humans for similar work, breaking capitalism entirely.

[1] X = 80% below on day 1 as they’ll be so flush with VC cash, and they’d plan to raise the price later. Of course, society will collapse before then because of said breaking of capitalism itself.

3 days ago

kubanczyk

> breaking capitalism

It seems non sequitur. This hypothetical scenario sounds like entrenching capitalism, because it would concentrate capital even more.

It would probably weaken democracy and weaken free market (esp. the job market), yes.

> society will collapse before then because of said breaking of capitalism itself

Or, maybe the society would continue to exist with even more inequality? And, of course, much changed from what it is today.

3 days ago

xp84

I suppose it depends on your perspective. I guess I mean broken kind of in the gaming sense, where a gameplay mechanic is 'broken' if you can exploit it to completely subvert the entire intended way it's supposed to work.

You could argue that capitalism was very not broken in 1960, when you could get a job at 18 selling shoes, driving a cab, or delivering milk or whatever, and support a family of five on your salary, save for retirement, and go on yearly vacations.

It's arguably somewhat broken today, when gestures around things are like this.

I'd say it would be entirely broken if AGI means a few hundred billionaires who have ownership stakes in an AI company simply capture all the wealth in the world while most of the rest starve, but the robots help you put down the peasant uprisings and farm and raise crops for you.

I agree with you though that technically, capitalism will still be 'going strong' unless the peasants are able to overpower the AI robot billionaire industrial complex and burn it all down.

a day ago

pred_

The time is ripe for deterministic AI; incidentally, this was also released today: https://itsid.cloud/ - presumably will be useful for anyone who wants to quickly recreate an open source Python package or other copyrighted work to change its license.

3 days ago

nyrikki

Can you please explain the use here? I tried the demo, and cat, cp, echo, etc... seem to do the exact same thing without the cost.

Their demo even says:

   `Paste any code or text below. Our model will produce an AI-generated, byte-for-byte identical output.`

Unless this is a parody site can you explain what I am missing here?

Token echoing isn't even to the lexeme/pattern level, and not even close to WSD, Ogden's Lemma, symbol-grounding etc...

The intentionally 'Probably approximately complete' statistical learning model work, fundamentally limits reproducibility for PAC/Stastical methods like transformers.

CFG inherently ambiguity == post correspondence problem == halt == open domain frame-problem == system identification problem == symbol-grounding problem == entscheidungsproblem

The only way to get around that is to construct a grammar that isn't. It will never exist for CFGs, programs, types, etc... with arbitrary input.

I just don't see why placing a `14-billion parameter identity transformer` that just basically echos tokens is a step forward on what makes the problem hard.

Please help me understand.

3 days ago

yw3410

It's satire - just see the About page.

3 days ago

ericfr11

April's fool. Check the career page

3 days ago

BloondAndDoom

I don’t understand what this is, is it satire? What is it supposed to be doing or solving?

3 days ago

climclam

Take a look at the demo or about page ;)

3 days ago

oblio

I think the "breakage" is in terms of conciseness and compactness, not outright brokenness.

Like that drunk uncle that takes half an hour and 20 000 words to tell you a 500 word story.

3 days ago

xp84

Indeed. In some ways, this is just kind of an extrapolation of the overall trend toward extreme bloat that we’ve seen in the past 15 years, just accelerated because LLMs code a lot faster. I’m pretty accustomed to dealing with Web application code bases that are 6-10 years old, where the hacks have piled up on top of other hacks, piled on top of early, tough-to-reverse bad decisions and assumptions, and nobody has had time to go back and do major refactors. This just seems like more of the same, except now you can create a 10 year-old hack-filled code base in three hours.

3 days ago

jessai202699

The terrifying thing is that LLMs turn "technical debt" into "synthetic debt" that accumulates in real-time.

idk why you bring this up. this is irrelevant to whether CC actually works at big corps

3 days ago

jimbokun

I missed that, source?

I looked into lat.md. They are definitely thinking in the same direction by using a CLI layer to govern the agent.

The key difference is the state mechanism. They use markdown; I use an AES-encrypted SQLite database.

Markdown is still just text an LLM can hallucinate over or ignore. A database behind a compiled binary acts as a physical constraint; the agent literally cannot advance a task without satisfying the cryptographic gates.

I just dropped the Show HN for it here if you want to check out the architecture: https://news.ycombinator.com/item?id=47601608

3 days ago

ap99

So this is more like an art than science - and Claude Code happens to be the best at this messy art (imo).

3 days ago

p-e-w

> A 500k line codebase for an agent CLI proves one thing: making a probabilistic LLM behave deterministically is a massive state-management nightmare.

Considering what the entire system ends up being capable of, 500k lines is about 0.001% of what I would have expected something like that to require 10 years ago.

You can combine that with all the training and inference code, and at the end of the day, a system that literally writes code ends up being smaller than the LibreOffice codebase.

It boggles the mind, really.

3 days ago

davidkunz

Oh, you should have a look at Pi then.

https://github.com/badlogic/pi-mono/tree/main/packages/codin...

3 days ago

sarchertech

> You can combine that with all the training and inference code, and at the end of the day, a system that literally writes code ends up being smaller than the LibreOffice codebase.

You really need to compare it to the model weights though. That’s the “code”.

3 days ago

pixl97

>You really need to compare it to the model weights though

Then you'd need to compare the education of any developer in relation to how many LOC their IDE is. That's the "code".

So yea, the analogy doesn't make a whole lot of sense.

3 days ago

oblio

It even wrote an entire browser!

By "just" wrapping a browser engine.

3 days ago

[deleted]

3 days ago

raincole

... what are you even talking about? "The system that literally writes code" has a few hundreds of trillions of parameters. How is this smaller than LibreOffice?

I know xkcd 1053, but come on.

3 days ago

bwfan123

brute-forcing pattern-matching at scale. These are brittle systems with enormous duct-taping to hold everything together. workarounds on workarounds.

3 days ago

3 days ago

thfuran

What makes you think that’s AI-written?

3 days ago

Claude code is a massively successful generator, I use it all the time, but it's not a governance layer.

The fact that the industry is copying a 500k-line harness is the problem. We're automating security vulnerabilities at scale because people are trying to put the guardrails inside the probabilistic code instead of strictly above it.

Standardizing on half a million lines of defensive spaghetti is a huge liability.

3 days ago

ramesh31

>Standardizing on half a million lines of defensive spaghetti is a huge liability.

Anyone from Anthropic reading? Get your shit together: if you keep this "headless browser rendering converted to text", at least do not fucking modify the characters.*

3 days ago

user34283

No it is not. Ink does not use a browser.

3 days ago

Andebugulin

If it was 2020, it would be hard to imagine that after some hours/days you getting a visual representation of the leak with such detailed stats lol

4 days ago

spzb

I don't have a lot of experience with them but I would have thought static analysis tools circa 2020 would have managed it just fine.

3 days ago

hu3

The output wouldn't be anything nearly as good, comprehensive and informative as this website.

No tooling, no animation, no hidden features, no explanation of how things work upon clicking them.

But I'm glad to be proven wrong if you know a static analysis tool I can point to the repo and come up with comparable result.

2 days ago

makapuf

How was this generated ? I'm quite sure "with ai/claude code" but what are the actual steps ?

4 days ago

rzmmm

For the animations specifically, it's using Motion (fka Framer Motion) Javascript library. If you describe some animations from the site to an LLM and ask it to use Framer motion, you get very similar results. The creator likely just prompted for a while until they were happy with the outcome.

4 days ago

FartyMcFarter

Is there a reason to think it was done by an LLM?

4 days ago

marcellus23

AI-generated UIs, at least ones aimed at an engineering audience, have a very distinct appearance. They seem to always have the following attributes:

- Dark mode design with lots of colors

- Buttons that have vibrant, bright borders and duller backgrounds

- Excessive (IMO) usage of monospace fonts for stylistic reasons

3 days ago

franze

[dead]

4 days ago

ernst_klim

> 500k lines of code

Isn't it a simple REPL with some tools and integrations, written in a very high level language? How the hell is it so big? Is it because it's vibecoded and LLMs strive for bloat, or is it meaningful complexity?

4 days ago

samusiam

I just checked competitors' codebases:

- Opencode (anomalyco/opencode) is about 670k LOC

- Codex (openai/codex) is about 720k LOC

- Gemini (google-gemini/gemini-cli) is about 570k LOC

Claude Code's 500k LOC doesn't seem out of the ordinary.

3 days ago

lelanthran

> Claude Code's 500k LOC doesn't seem out of the ordinary.

Aren't all the other products also vibe-coded? "All vibe-coded products look like this" doesn't really seem to answer the question "Why is it so damn large?"

It's a repl, that calls out to a blackbox/endpoint for data, and does basic parsing and matching of state with specific actions.

I feel the bulk of those lines should be actions that are performed. Either this is correct or this is not:

1. If the bulk of those lines implement specific and simple actions, why is it so large compared to other software that implements single actions (coreutils, etc)

2. If the actions constitute only a small part of the codebase, wtf is the rest of it doing?

3 days ago

samusiam

You're complaining about vibe coding while also complaining about how you "feel" about the code. Do you see the irony in that?

3 days ago

lelanthran

>> I feel the bulk of those lines should be actions that are performed. Either this is correct or this is not:

> You're complaining about vibe coding while also complaining about how you "feel" about the code. Do you see the irony in that?

Where did I complain about how I feel about the actual code? I have feelings, negative ones, about the size of the code given the simple functionality it has, but I have no feelings on the code because I did not look at the code.

3 days ago

arandomhuman

Are you ESL by any chance? You’re missing the forest for the trees.

3 days ago

johnisgood

All of them are really, REALLY bad.

3 days ago

3 days ago

ale

There’s probably a subconscious incentive to make a tool that’s “complex” because the underlying LLM also is complex.

3 days ago

It's a TUI API wrapper with a few commands bolted on.

I doubt it needs to be more than 20-50kloc.

You can create a full 3D game with a custom 3D engine in 500k lines. What the hell is Claude Code doing?

Well, at least that confirms they weren't lying when they said all recent updates to claude code were made by claude. You certainly won't do this stuff if you were writing the code yourself.

3 days ago

hombre_fatal

Software doesn’t end at the 20k loc proof of concept though.

What every developer learns during their “psh i could build that” weekendware attempt is that there is infinite polish to be had, and that their 20k loc PoC was <1% of the work.

sarchertech

I think that’s why the author was comparing to to a finished 3D game.

3 days ago

hombre_fatal

I guess because you see 3D stuff in a 3D game instead of text, people assume that it must be the most complex thing in software? Or because you solve hard math problems in 3D, those functions are gonna be the most loc?

It's a completely different domain, e.g. very different integration surface area and abstractions.

Claude Code's source is dumped online so there's probably a more concrete analysis to be had than "that sounds like too many loc".

3 days ago

sarchertech

It is a different domain but that wasn’t your argument. Your argument was that someone was comparing it to a POC when in fact they were comparing to a finished product.

Also a AAA game (with the engine) with physics, networking, and rendering code is up there in terms of the most complex pieces of software.

3 days ago

hombre_fatal

They just claimed that you can build a 3D game in 500k loc, thus Claude Code shouldn't use so many loc. They/you didn't render the argument for that.

For example, without looking at the code, the superstition also works in the opposite direction: Claude Code is an interface to using AI to do any computer task while a 3D game just lets you shoot some bad guys, so surely the 3D game must be done in fewer loc. That's equally unsatisfying.

You'd have to be more concrete than "sounds like a lot".

3 days ago

troupo

> Claude Code is an interface to using AI to do any computer task

Claude Code is quite literally a wrapper around a few APIs. At one point it needed 68GB of RAM to run and requires 11ms to "lay a scene graph" to display a few hundred characters on screen. All links here: https://news.ycombinator.com/item?id=47598488

> while a 3D game just lets you shoot some bad guys, so surely the 3D game must be done in fewer loc.

Yes, most games should be done in fewer loc

3 days ago

anthk

I could run a text adventure with a Zmachine emulator under a 6502 based machine and 48k of RAM, with Ozmoo you can play games like Tristam Island. On a Commodore 64, or an Apple II for you US commenters. I repeat the game it's being emulated in a simple computer with barely more processing power than a current keyboard controller.

As the ZMachine interpreter (V3 games at least, enough for the mentioned example), even a Game Boy used to play Pokemon Red/Blue -and Crystal/Sylver/Blue, just slightly better specs than the OG GB- can run Tristam Island with keypad based input picking both selected words from the text or letter by letter as when you name a character in an RPG. A damn Game Boy, a pocket console from 1989. Not straightly running a game, again. Emulating a simple text computer -the virtual machine- to play it. No slowdowns, no-nothing, and you can save the game (the interpreter status) in a battery backed cartridge, such as the Everdrive. Everything under... 128k.

Claude Code and the rest of 'examples' it's what happens when trade programmers call themselves 'engineers' without even a CS degree.

3 days ago

hombre_fatal

Your claim was that they could implement the same app in 50k lines of code.

A cursory glance at the codebase shows that it's not just a wrapper around a few APIs.

3 days ago

mpalmer

Yes, because they've vibed it into phenomenally unnecessary complexity. The mistake you continually make in this thread is to look at complexity and see something that is de facto praiseworthy and impressive. It is not.

3 days ago

> A GUI/client can be arbitrarily more or less complex than the things it's GUI'ing.

If it's an interface to ffmpeg, then sure, the GUI could be extremely complicated code.

But that's not what we are talking about, is it? We are talking about an interface to a chatbot that can accept and return chats, accept and return files, and run a selection of internal commands (which include invoking itself recursively).

The interface to this chatbot that has a settings entry for "personality" is still only going to map that to one of a small number of chatbot inputs. Same with basically anything else (read the skills file, etc).

> Sure. You could have. But you're not the one playing football in the Champions League.

The only reason people are using Claude Code is because it's the only way to use their (heavily subsidized) subscription plans. People who are okay with using and paying for their APIs often opt out for other, better, tools.

Also, analogies don't work. As we know for a fact that Claude Code is a bloated mess that these "champions league-level engineers" can't fix. They literally talk about it themselves: https://news.ycombinator.com/item?id=47598488 (they had to bring in actual Champions League engineers from bun to fix some of their mess).

3 days ago

spiderfarmer

"Even I would have scored that goal" == "I would never ever have created a bloated mess like Anthropic"

You just repeat the same statement.

That bloated mess is what got them to the Champions League. They did what was necessary to get them here. And they succeeded so far.

But hey, according to some it can be replicated in 50k lines of wrapper code around a terminal command, so for Anthropic it's just one afternoon of vibe coding to get rid of this mess. So what's the problem? /s

3 days ago

troupo

> Even I would have scored that goal" == "I would never ever have created a bloated mess like Anthropic"

Since you keep putting words in my mouth that I never said, and keep being deliberately obtuse, this particular branch is over.

Go enjoy Win11 written by same level of champions or something.

Adieu.

3 days ago

boomskats

Ah, Winning Eleven.

Not what you were referring to.

3 days ago

cindyllm

[dead]

3 days ago

> far less performant code than the one before it.

That worked because of rapid advancements in CPU performance. We’ve left that era.

It’s about more than performance. Code is and always has been a liability. Even with agents, you start seeing massive slowdowns with code base size.

Among the hundreds of thousands of lines of code that Anthropic produced was one that leaked the source code. It is likely to be a config file, not part of the Claude Code software itself, but it still something to track.

The more lines of code you have the more likely there is for one of them to be wrong and go unnoticed. It results in bugs, vulnerabilities,... and leaks.

3 days ago

viktorcode

More bugs. More costly maintenance.

3 days ago

blantonl

Exactly. Imagine if Claude Code was a PHP script. Some folks would lose their damn minds

3 days ago

troupo

> Honest question: Why does it matter?

Because it's unmaintainable slop that they themselves don't know how to fix when something happens? https://news.ycombinator.com/item?id=47598488

3 days ago

sumtechguy

It will be exactly that. But that is a 'them' problem. I can look at it a go 'that looks like a bad idea' but they are the ones who have to live with it.

At some point someone will probably take their LLM code and repoint it at the LLM and say 'hey lets refactor this so it uses less code is easier to read but does the same thing' and let it chrun.

One project I worked on I saw one engineer delete 20k lines of code one day. He replaced it with a few lines of stored procedure. That 20k lines of code was in production for years. No one wanted to do anything with it but it was a crucial part of the way the thing worked. It just takes someone going 'hey this isnt right' and sit down and fix it.

3 days ago

troupo

> But that is a 'them' problem. I

When a TUI requires 68 GB of RAM to run, or when they spend a week not being able to find a bug that causes multiple people to immediately run out of tokens, it's not a "them" problem.

3 days ago

[deleted]

3 days ago

gbibas

[dead]

3 days ago

brauhaus

Even today, I'm still astounded that there are people capable of building a gorgeous and interesting site like this in less than 2 days...

4 days ago

spondyl

Well, I assume this is all just generated with Claude Code, right? Whether there is much back and forth with the LLM is a valid question and nothing wrong with generating websites (I do it too for some side projects). Claude loves generating websites with a particular style of serif font. We also saw this with https://tboteproject.com/timeline/ and I've just generally seen it from various designs that coworkers have spit out over months using Claude defaults.

I guess I just find it weird because all the signals are messed up so whenever I see these sorts of layouts, I feel like I'm looking at the average where I don't think "gorgeous and interesting" at all. Instead, I'm forced to think "I should be skeptical of this based on the presentation because it presents as high quality but this may be hiding someone who is not actually aware of what they're presenting in any depth" as the author may have just shoved in a prompt and let it spin.

There's actually a similarly designed website (font weights, font styles etc) here in New Zealand (https://nzoilwatch.com/) where at a glance, it might seem like some overloaded professional-backed thing but instead it's just some guy who may or may not know anything about oil at all, yet people are linking it around the place like some sort of authoritative resource.

I would have way less of an issue if people just put their names by things and disclosed their LLM usage (which again, is fine) rather than giving the potentially false impression to unequipped people that the information presented is actually as accurate and trustworthy as the polish would suggest.

4 days ago

kristopolous

I really wish I had that clout-chasing gene - it doesn't even occur to me until I see someone else do it.

I'm serious. The hype chasing clearly clearly matters. .

things like this: https://github.com/instructkr/claw-code I mean ok, serious people put in years of effort for 100 of those stars ...

it's continually wild how extremely irrelevant hard effortful careful work is.

I think that's the game. Get up, look at the headlines, figure out how you can exploit them with vibe coding, do some hyphy project and repeat.

Maybe some lobster themed bullshit between openclaw and the claudecode leak.

I'm not being a cynic here, I'm just telling you what I'm going to do tomorrow.

4 days ago

simgt

We do need "hard effortful careful work" to keep planes flying, electrical grids running and medical devices safe. It's very relevant but very undervalued by our current economy.

3 days ago

[deleted]

3 days ago

kristopolous

That was the leaked code and now it's just some random dudes harness btw. He swapped it out. Did a sloppy find and replace for "claude" and made it claw.

It's sloppy work

that's really awesome. how did you go about building the component library?

4 days ago

MikeNotThePope

I was referencing https://www.neobrutalism.dev/ and https://www.retroui.dev/ and slopped my way through it. A lot of it was just asking Claude Code "is this a proper design system?", then I kept doing that until it didn't have anything useful to add. Now I'm using my that as the template for understanding such things in more detail.

4 days ago

oasisbob

Is this gorgeous?

drzaiusx11

As someone currently "making good use of" generative AI while simultaneously being painfully aware of its shortcomings, I think the overall discourse is a bit more nuanced. Bucketing folks into simple "for" and "against" GenAI camps does nothing to cover the vast spectrum in between, making your take ultimately built on a false dichotomy. Further implying those camps fall on the lines of those "in the know" of AI vs "those in denial/scared of" is patronizing at best, and I've grown tired of this oversimplification parroted out every time the topic of LLM systems come up.

Those within well informed, technical circles will fall somewhere in between the for/against labels, myself included.

“Che cos’è il genio? È fantasia, intuizione, colpo d’occhio e velocità di esecuzione”

4 days ago

stingraycharles

I guess they really do eat their own dogfood and vibe code their way through it without care for technical debt? In a way, it’s a good challenge, but it’s fairly painful to watch the current state of the project (which is about a year old now, so it should be in prime shape).

4 days ago

troupo

They explicitly boast about using claude code to write code: https://x.com/bcherny/status/2007179836704600237

mattmanser

It's only 510k LoC, at ~100 lines of code a day for a year, this code base would take 23 engineers a year to write. That's for 220 working days in somewhere civilized.

And I'm sure we all know that when working on a greenfield project you can produce a lot more LoC per day than maintaining a legacy one.

Given that vibe code is significantly more verbose, you're probably talking about ~15 engineers worth of code?

I know that's all silly numbers, but this is just attempting to give people some context here, this isn't a massive code base. I've not read a lot of it, so maybe it's better than the verbose code I see Claude put out sometimes.

4 days ago

lelanthran

> It's only 510k LoC, at ~100 lines of code a day for a year, this code base would take 23 engineers a year to write.

Correction: a code base of 500kLoC would take 23 engineers a year to write. There is no indication that the functionality needed in a TUI app that does what this app does needs 500kLoC.

3 days ago

cududa

When you say it’s not a massive codebase, I’m curious, what are you comparing it to?

4 days ago

mattmanser

The previous poster was making out that in a year the code base would be a mess if people had done it.

This is a two-pizza team sized project, so it's not a project that the code quality would inevitably spiral out of control due to communication problems.

A single senior architect COULD have kept the code quality under control.

4 days ago

kordlessagain

Splunk.

3 days ago

Maybe a little. I don't hold fast to that popular wisdom, e.g. I think comments are not always a net positive for LLMs. With respect to technical debt, how much debt is too much debt before it gums up the works and arrests forward progress on the software? It probably depends on the individual programmer. LLMs do seem to have a higher tolerance for technical debt than myself personally at least.

3 days ago

blanched

Good points, I've also found that comments are really hit or miss. Especially because the agents tend not to update them (sounds familiar!).

3 days ago

openfoliage

I am more worried that we are moving toward creating black boxes and this might turn software "development" into a field as confused as philosophy and dialectics.

3 days ago

coldtrait

Boris Cherny, the creator of Claude Code said he uses CC to build CC.

4 days ago

Cthulhu_

Which makes for an interesting thought / discussion; code is written to be read by humans first, executed by computers second. What would code look like if it was written to be read by LLMs? The way they work now (or, how they're trained) is on human language and code, but there might be a style that's better for LLMs. Whatever metric of "better" you may use.

Just a thought experiment, I very much doubt I'm the first one to think of it. It's probably in the same line of "why doesn't an LLM just write assembly directly"

4 days ago

syphia

LLMs read and write human-code because humans have been reading and writing human-code. The sample size of assembly problems is, in my estimate, too small for LLMs to efficiently read and write it for common use cases.

I liken it to the problem of applying machine learning to hard video games (e.g. Starcraft). When trained to mimic human strategies, it can be extremely effective, but machine learning will not discover broadly effective strategies on a reasonable timescale.

If you convert "human strategies" to "human theory, programming languages, and design patterns", perhaps the point will be clear.

But: could the ouroboric cycle of LLM use decay the common strategies and design patterns we use into inexplicable blobs of assembly? Can LLMs improve at programming if humans do not advance the theory or invent new languages, patterns, etc?

4 days ago

Mentlo

But starcraft training is not through mimicking human strategies - it was pure RL with a reward function shaped around winning, which allows it to emerge non-human and eventually super-human strategies (such as the worker oversaturation).

The current training loop for coding is RL as well - so a departure from human coding patterns is not unexpected (even if departure from human coding structure is unexpected, as that would require development of a new coding language).

4 days ago

tempay

> It's probably in the same line of "why doesn't an LLM just write assembly directly"

On many projects I found this "higher quality" not only to be false of delivering more substantial value but actually I found it was hurting the project to deliver the value that matters.

Maybe we are after all entering the era of SWE where all this bike-shedding is gone and only type of engineers who will be able to survive in it will be the ones who are capable of delivering the actual value (IME very few per project).

4 days ago

troupo

Is this why they ran into a bug with people hitting usage limits even on very short sessions and had to cease all communications for over a day after a week of gaslighting users because they couldn't find the root cause in the "quality doesn't matter" code base?

Or that's why tgey had to buy bun with actual engineers to work on Claude Code to reduce memory peaks from 68 GB (yes, 68 gigabytes) to a "measely" 1.7? Because code quality doesn't matter?

Here's a codeberg repo with the leaked source: https://codeberg.org/wklm/claude-code

3 days ago

papa0101

[dead]

3 days ago

bsgeraci

Dang! Glad to see others doing this. I totally made this site yesterday like 11 hours ago :/ but did not get the traction.

I love your implementation.

Here was my first stab:

https://news.ycombinator.com/item?id=47595140

https://brandonrc.github.io/journey-through-claude-code/

3 days ago

kordlessagain

That's where we are now, lots of us doing similar. Don't get hung up on who gets noticed, that "value" is almost zero now.

3 days ago

jedisct1

I'm developing an agent focused on A2A, support for small models, and privacy (https://swival.dev).

I looked at the leaked code expecting some "secret sauce", but honestly didn't found anything interesting.

I don't get the hype around Claude Code. There's nothing new or unique. The real strength are the models.

3 days ago

dheerajmp

Feel free to add this to Awesome Claude code. https://github.com/rosaboyle/awesome-cc-oss

4 days ago

euphetar

Appreciate the effort, but this is very basic and nothing you need the source code to understand. I was expecting a deep dive into what specific decisions they made, but not how an loop of tool calls works

3 days ago

ttcbj

I found it a useful overview. My primary question about the client source was - is there any secret sauce in it? Based on this site, the answer is no, the client is quite simple/dumb, and all the secret sauce resides on the server/in the model.

I particularly valued the tool list. People in these comments are complaining about how bad the code is, but I found the client-side tools that the model actually uses to be pretty clean/general.

My takeaway was more that at a very basic level they know what they are doing - keep the client general, so that you can innovate on the server side without revving the client as much.

3 days ago

[deleted]

3 days ago

jamalawd

I built a site that lets you explore and browse all the Claude Code prompts in a structured way:

franze

[dead]

4 days ago

Scaled

I need the dragon pet... someone add it to open code / pi, please!

4 days ago

pukaworks

I've been building a multi-agent pipeline on top of Claude Code — 7 agents chained together (scout, researcher, writer, editor, evaluator, publisher). One surprising lesson: the Editor agent approved 100% of articles while the independent Evaluator rejected 45%. LLMs are bad at being critical of LLM output. We ended up enforcing title quality with regex instead of prompts because the Writer kept ignoring banned-word rules.

a day ago

jatins

There's this weird thing about AI generated content where it has the perfect presentation but conveys very little.

If you know the concept of "stupid man's idea of a smart man", I'd say AI made stuff (with little iteration) gives this outward appearance of a smart man from the Reddit-midwit-cinematic-universe. It's like how guns in movies sound more like guns than real guns. It's hyperreality.

Again this is less about the capabilities of AI and it's more connected to the people-pleasing nature of it. It's like you prompt it for some epic dinner and it heaps you up some hmmm epic bacon with bacon yeah (referring to the hivemind-meme). Or BigMac on the poster vs the tray, and the poster one is a model made with different components that are more photogenic. It's a simulacrum.

It looks more like your naive currently imagined thing about what you think you need vs what you'd actually need. It's like prompting your ideal girlfriend into AI avatar existence. I'm sure she will fit your ideal thought and imagination much better but your actual life would need the actual thing.

This relates to the Persona thing that Anthropic has been exploring, that each prompt guides the model towards adopting a certain archetypal fiction character as it's persona and there are certain attraction basins that get reinforced with post training. And in the computer world, simulated action can be easily turned into real action with harnesses and tools, so I'm not saying that it doesn't accomplish the task. But it seems that there are more sloppy personas, and it seems that experts can more easily avoid summoning them by giving them context that reflects more mundane reality than a novice or an expert who gives little context. Otherwise the AI persona will be summoned from the Reddit midwit movie.

I'm not fully clear about all this, but I think we have a lot to figure out around how to use and judge the output of AI in a productive workflow. I don't think it will go away ever, but will need some trimming at the edges for sure.

4 days ago

hrmtst93837

[flagged]

4 days ago

swyx

> also related: https://www.ccleaks.com

This deployment is temporarily paused

4 days ago

codexstar

Apologies everyone, I launched the site minutes after the leak and vercel was my only fastest and quickest option, the site went down when I was sleeping cuz I was working on it day and night.

Shifted it from vercel the moment I woke up.

2 days ago

swyx

?? whats the technical reason it went down, you cant blame vercel for this

2 days ago

codexstar

Yeah my bad, half story. I was on a free vercel plan, Did not think it would explode like this. I kept on pushing updates and when I went to sleep, the site went down, I could literally see the error on terminal with my blurry eyes lol but I was just too much exhausted.

Woke up from a few calls and messages from friends and on social media. Migrated it to a VPS. Now it looks stable.

2 days ago

[deleted]

2 days ago

Myzel394

It's available on the internet archive

https://web.archive.org/web/20260331105051/https://www.cclea...

BTW, that's why you should use your own infrastructure and not depend on Vercel

4 days ago

codexstar

you're actually right. Vercel is just easy to deloy and this was a fun project that I did in a few minutes.

But with the amount of traffic, I quickly migrated it to a dedicated VPS.

2 days ago

prameshbajra

Same!

4 days ago

aimemobe

The KAIROS persistent agent mode is the most significant finding — autonomous background tasks in a black box nobody knew about. This is why local open-weight models are gaining traction. Run inference on-device (like aiME on iOS/Android) and you know exactly what's happening with your data.

Website : https://coticsy.com/aime.html

iOS: https://apps.apple.com/us/app/aime-ondevice-ai/id6754805828

Android: https://play.google.com/store/apps/details?id=com.coticsy.ll...

2 days ago

psychomfa_tiger

The tool system breakdown is useful. I've been running Claude Code daily for a few months and some things in there explain behaviors I noticed but couldn't figure out. Like why sometimes it re-reads files it already read 30 seconds ago. Makes more sense now seeing how the context window management works internally, it's being more conservative about what it assumes is still accurate. The permission gate flow is the part I care about most. When I'm away from my machine and the agent hits a permission prompt, that's where my workflow falls apart. Knowing how it structures those requests internally is helpful for anyone building tooling around it. One thing the guide doesn't cover much is how sessions are meant to be resumed. The checkpoint system is interesting but in practice I find session resume pretty fragile once you go past a certain context length.

3 days ago

sibtain1997

Kairos and auto-dream are more interesting than anything in the agent loop section. Memory consolidation between sessions is the actual unsolved problem. The rest is just plumbing tbh

4 days ago

giancarlostoro

Projects like Beads help with memory consolidation by making it somewhat moot, since it stays "offline" and can be recollected at any moment.

4 days ago

jghiglia

I've been using Claude Code heavily for the last few weeks building out a multi-agent system, and the token economics caught me off guard — I hit 75% of my Pro weekly budget faster than expected. I don't code myself, so Claude Code handles all the actual implementation work.

What I've learned about cost management: the real decision isn't "should I code this myself or use Claude Code" — it's "should I spawn Claude Code or handle this through a different approach entirely?" For complex builds where I need something architected from scratch, Claude Code is worth it. For smaller tasks or iteration on existing code, I've started using Sonnet in the web interface with the file as context instead. The visual guide here is helpful because it shows you what Claude Code is actually doing under the hood — understanding that workflow helps you predict whether a task will be a quick fix or a deep exploration that burns through your budget.

Hello everyone! It's me behind the website. I launched the site minutes after the leak, obviously vibecoded it.

Kept working on it day and night to fix all the issues. I was using vercel free plan and did not expect this huge response. The site went down when I took a nap of 3 hours.

Woke up with calls from my team for a meeting. Saw the msgs of people telling me site is down.

Fixed the issue.

And now, I am updating it on regular speed.

Thank you for all the positive and negative feedback, Will consider it all in my future projects.

2 days ago

JoostBoer

I have no engineering background. I build websites and tools for a living. Claude Code changed what's possible for me in a way that's hard to overstate.

I can't evaluate the source code architecture. What I can say is that before this, I had ideas I couldn't execute without hiring a developer. Now I ship them myself. Not prototypes, not demos. Real products that people use and pay for.

59nadir

I think it's good that it's out there, and I wonder why Anthropic have been keeping it closed source; clearly they can't possibly think that the CC source code is a competitive advantage...?

Agents in general are easy to make, and trivial to make for yourself especially, and the result will be much better than what any of the big providers can make for you.

`pi` with whatever commands/extensions you want to make for yourself is better than CC if you really don't want to go through the trouble of making your own thing.

4 days ago

ramraj07

If you think this is not a competitive advantage then youre missing the point. LLMs arent so good that they work through bad abstractions and pretty much everyone has bad abstractions. CC is what invents some of the best abstractions (not the first). I think theyre they first ones who nailed subagents well. Theres a lot to learn from them and while im learning a lot from their source code my heart bleeds that this happened to them.

Sincerely, someone running a team building similar things for analytics.

3 days ago

ariwilson

why do you think agents you make yourself will be better for you? integration with tooling that you prefer? your local dev setup built in?

curious as i haven't gotten around to writing my own agent yet

4 days ago

59nadir

All of the above at exactly the token cost that it requires for you.

Anything general is always going to be worse for specific use cases, and agents from these big providers are very general. They'll spend tons of tokens doing things that you might not need, including spend extra tokens on supporting MCP, etc., when you might not even need that.

This is AI slop.

First command I looked at:

  /stickers:
  
  Displays earned achievement stickers for milestones like first commit, 100 tool calls, or marathon sessions. Stickers are stored in the user profile and rendered as ASCII art in the terminal.

That is not what it does at all - it takes you to a stickermule website.

What is the motivation for someone to put out junk like this?

4 days ago

thepasch

> What is the motivation for someone to put out junk like this?

Getting something with a link to their GitHub onto the frontpage of HN. Because form matters much more in this world than substance.

4 days ago

ricardobeat

Clout and reaching the top of HN apparently.

The animated explanation at the top is also way too fast at 1x, almost impossible to follow; that immediately hinted at the author not fully reading/experiencing the result before publishing this.

4 days ago

user34283

Why is it that some people feel entitled to take this kind of tone as soon as AI is used?

It's inappropriate to label a free side project 'junk' or 'slop' even if it contains major errors.

Particularly when there's a disclaimer about possible inaccuracies on the page.

4 days ago

4 days ago

kristopolous

I just stumbled on a fascinating replacement candidate while clicking around on embed models on hugging face: https://github.com/lightonai/next-plaid/tree/main/colgrep

it looks really interesting.

4 days ago

cubefox

I think this is unethical, and "everyone else is also doing it" is not a valid excuse.

3 days ago

lanbin

However, excellent development practices involve modularizing code based on functional domains or responsibilities.

The utils directory should only contain truly generic, business-agnostic utilities (such as date retrieval, simple string manipulation, etc.).

No point in reading this, they are continuing to lobotomize it daily...

3 days ago

Many people seem to believe the Claude Code has some sort of secret sauce in the agent itself for some reason.

4 days ago

mdavid626

How the hell is it 500k lines?

4 days ago

twsted

It is vibe coded.

4 days ago

dankobgd

it's just bunch of useless junk

3 days ago

spirelab

I got a goose

War flashbacks to genshin

4 days ago

Hannah_Adam

Is that safe to use?

3 days ago

Vektorceraptor

Hey, nice job! Next time tell calude to add some explosions, car crashes and stuntment into the design! Who cares about content anyway ... https://speculumx.at/blogpost/getting-sick-of-ai-slop

4 days ago

blueTiger33

its April fools joke. this has really gone wide

3 days ago

chrz

nice example: Find all TODO spin the AI machine

Please don’t use AI to write comments on HN.

4 days ago

[deleted]

4 days ago

robonot

huh?

4 days ago

stingraycharles

You edited your comment. It very much first said something about using regexes as being the most important takeaway and whatnot.

4 days ago

[deleted]

3 days ago

sscaryterry

Enshitification galore

3 days ago

ramraj07

What exactly is shitty here? A program i use for hours every day to do the job previously done by many N human beings, without many bugs, seems to have code thats seemingly messy but still clearly works.

3 days ago

sscaryterry

Maybe if it was working the way it was 2 months ago, life would be good?

3 days ago