Claude Code Unpacked : A visual guide

1108 points
1/21/1970
4 days ago
by autocracy101

Comments


autocracy101

Author here. I built this in a few hours after the Claude Code leak.

I've been working on my own coding agent setup for a while. I mostly use pi [0] because it's minimal and easy to extend. When the leak happened, I wanted to study how Anthropic structured things: the tool system, how the agent loop flows, A 500K line codebase is a lot to navigate, so I mapped it visually to give myself a quick reference I could come back to while adapting ideas into my own harness and workflow.

I'm actively updating the site based on feedback from this thread. If anything looks off, or you find something I missed, lmk.

[0] https://pi.dev/

4 days ago

lateforwork

How about releasing your own source code? It is a beautiful site, love the UX as well as functionality.

3 days ago

azinman2

It screams vibe coding. This is the anthropic look. Just ask Claude and give it a screenshot.

3 days ago

cowlby

Vibe coding is also why this was released hours after leak instead of days/weeks.

3 days ago

lateforwork

Of course I expect it is vibe coding. It would be insane to code anything by hand these days. But that doesn't mean there is no creative input by the author here.

3 days ago

zeroq

>> It would be insane to code anything by hand these days.

I strongly disagree, but it made me chuckle a bit, thinking about labeling software as "handmade" or marketing software house as "artisanal".

3 days ago

alistairmayo

There's a lot of errors you can miss by coding by hand, even as a seasoned developer. Try taking Claude Code, point it at your repo, and ask it to find bugs. I bet it will.

Claude is actually a crazy good vuln researcher. If you use it that way, your code might just be more secure than written purely by hand.

3 days ago

duskdozer

Sure, just like drug-sniffing dogs. Whether they've actually found something or are just pleasing the operator is another story.

2 days ago

small_scombrus

Our organic artisanal code is written by free-range developers

3 days ago

Viliam1234

"free-range" means fully remote, right?

3 days ago

eric_cc

Depends on what you’re building and whether it’s recreational or not. Complex architecture vs a ui analysis tool, for example. For a ui analysis tool, the only reason you code by hand is for the joy of coding by hand. Even though you can drive a car or fly in a plane there are times to walk or ride a bike still.

3 days ago

bobthepanda

Depending on your standards and what company is making it you could even have “cruelty free.”

3 days ago

kitsune1

[dead]

3 days ago

archagon

You well on the path to AI-fueled psychosis if you genuinely believe this.

3 days ago

lateforwork

I genuinely believe this. Even if you're inventing a new algorithm it is better to describe the algorithm in English and have AI do the implementation.

3 days ago

[deleted]
3 days ago

joquarky

At least it's more productive than AI Derangement Syndrome.

3 days ago

codexstar

Yes, it is vibecoded, had to get it like in 10-15 minutes. I did not know how to write a piece of code 4-5 months back.

2 days ago

QuantumGood

Must everything be artisanal for some people? </s>

3 days ago

smrtinsert

As a cynical modern eng look for landing page skills

3 days ago

kordlessagain

Guess what? People have ZERO reason to Open Source anything now.

3 days ago

DustinKlent

One reason, beside basic altruism, is so you can put the projects on your resume. This is especially helpful if the project does very well or gets lots of stars.

3 days ago

pixl97

This said Jeavon's Paradox will likely mean far more code is open sourced simply due to how much code will get written in total.

3 days ago

joquarky

We should be applauding the promotion of science and useful arts that genAI is fueling.

But egos are involved.

3 days ago

vntok

Why would you think that?

3 days ago

saulpw

I'm a committed open source dev and I've flipped my own switch from "default public" to "default private".

3 days ago

kordlessagain

Because nobody wants their shit stolen by some punk.

3 days ago

boomskats

This is nice, I really like the style/tone/cadence.

The only suggestion/nit I have is that you could add some kind of asterisk or hover helper to the part when you talk about 'Anthropic's message format', as it did make me want to come here and point out how it's ackchually OpenAI's format and is very common.

Only because I figure if this was my first time learning about all this stuff I think I'd appreciate a deep dive into the format or the v1 api as one of the optional next steps.

3 days ago

ontouchstart

I’m using pi and cc locally in a docker container connected to a local llama.cpp so the whole agentic loop is 100% offline.

I had used pi and cc to analyze the unpacked cc to compare their design, architecture and implementation.

I guess your site was also coded with pi and it is very impressive. Wonderful if you can do a visualization for pi vs cc as well. My local models might not be powerful enough.

Thanks for the hard work!

3 days ago

order-matters

what level of success are you getting with a 100% offline loop (and on what hardware if you dont mind sharing)?

3 days ago

nashadelic

wow, the pi version looks... better v. cc?

2 days ago

yoe

yeah true, its looks a bit cleaner and more readable

a day ago

couscouspie

I thought that early coding assistants came to be written in some Java/TypeScript, because AI companies just had web-devs playing around and then made it a product even though the languages being such a misfit for terminal. Why did you decide for TypeScript?

3 days ago

haliliceylan

Can you give me more info about your own agentic setup ?

4 days ago

dev-divi

Hi, Autocracy. I took your site and built Clyde: https://github.com/dev-divi/choice-agent-clyde

2 days ago

hatmanstack

Thank you for this brother, well done

3 days ago

tipoffdosage904

This is actually a GREAT example of vibe-coding + skill.

3 days ago

andoando

Is there any nice themes for pi?

3 days ago

amangsingh

A 500k line codebase for an agent CLI proves one thing: making a probabilistic LLM behave deterministically is a massive state-management nightmare. Right now, they're great for prompting simple sites/platforms but they break at large enterprise repos.

If you don't have a rigid, external state machine governing the workflow, you have to brute-force reliability. That codebase bloat is likely 90% defensive programming; frustration regexes, context sanitizers, tool-retry loops, and state rollbacks just to stop the agent from drifting or silently breaking things.

The visual map is great, but from an architectural perspective, we're still herding cats with massive code volume instead of actually governing the agents at the system level.

3 days ago

ttcbj

I find it really strange that there is so much negative commentary on the _code_, but so little commentary on the core architecture.

My takeaway from looking at the tool list is that they got the fundamental architecture right - try to create a very simple and general set of tools on the client-side (e.g. read file, output rich text, etc) so that the server can innovate rapidly without revving the client (and also so that if, say, the source code leaks, none of the secret sauce does).

Overall, when I see this I think they are focused on the right issues, and I think their tool list looks pretty simple/elegant/general. I picture the server team constantly thinking - we have these client-side tools/APIs, how can we use them optimally? How can we get more out of them. That is where the secret sauce lives.

3 days ago

acedTrex

> but so little commentary on the core architecture.

The core architecture is not interesting? its an LLM tui, theres not much there to discuss architecturally. The code itself is the actual fascinating train wreck to look at.

3 days ago

olejorgenb

The tools was mostly already known, no? (I wish they had a "present" tool which allowed to model to copy-paste from files/context/etc. showing the user some content without forcing it through the model)

3 days ago

AnotherGoodName

Yeah in fact one thing claude is freaking great at is decompilation.

If you can download it client side you can likely place a copy in a folder and ask claude

‘decompile the app in this folder to answer further questions on how it works. As an an example first question explain what happens when a user does X’.

I do this with obscure video games where i want to a guide on how some mechanics work. Eg. https://pastes.io/jagged-all-69136 as a result of a session.

It can ruin some games but despite the possibility of hallucinations i find it waaay more reliable than random internet answers.

Works for apps too. Obfuscation doesn’t seem to stop it.

3 days ago

nashadelic

Whoa, when did they come out with JA3?

2 days ago

jayd16

Why are "tools" for local IO interesting and not just the only way to do it? I can't really imagine a server architecture that gets to read your local files and present them without a fat client of some kind.

What is the naive implementation you're comparing against? Ssh access to the client machine?

3 days ago

abossy

It's early days and we don't fully understand LLM behavior to the extent that we can assume questions like this about agent design are resolved. For instance, is an agent smarter with Claude Code's tools or `exec_command` like Codex? And does that remain true for each subsequent model release?

3 days ago

woodson

It’s a distinction that IMHO likely doesn’t make much difference, at least for the mostly automated/non-interactive coding agent use case. What matters more is how well the post-training on synthetic harness traces works.

3 days ago

3abiton

[dead]

3 days ago

sunir

It’s not surprising. There has been quite a bit of industrial research in how to manage mere apes to be deterministic with huge software control systems, and they are an unruly bunch I assure you.

3 days ago

RALaBarge

Sunir! Hope you are doing well man, I got a good chuckle from this.

3 days ago

sunir

I am! I’ll reach out in another channel to connect.

3 days ago

comboy

It's hard to tell how much it says about difficulty of harnessing vs how much it says about difficulty of maintaining a clean and not bloated codebase when coding with AI.

3 days ago

amangsingh

Why not both? AI writes bloated spaghetti by default. The control plane needs to be human-written and rigid -> at least until the state machine is solid enough to dogfood itself. Then you can safely let the AI enhance the harness from within the sandbox.

3 days ago

whiplash451

Were human organizations (not individuals) any good at the latter anyway?

3 days ago

chrismarlow9

We propped the entire economy up on it. Just look at the s&p top 10. Actually even top 50 holdings.

If it doesn't deliver on the promise we have bigger problems than "oh no the code is insecure". We went from "I think this will work" to "this has to work because if it doesn't we have one of those 'you owe the bank a billion dollars' situations"

3 days ago

jayd16

It's weird to look at the world like this. If they deliver doesn't that invalidate thousands of other business plans? What about paying for that?

If they fail, doesn't software and the giant companies that make it go back to owning the world?

3 days ago

xp84

“if they deliver”

As I’m reading this, I’m thinking about how in 1980. It was imagined that everyone needed to learn how to program in BASIC or COBOL, and that the way computers would become ubiquitous would be that everybody would be writing program programs for them. That turned out to be a quaint and optimistic idea.

It seems like the pitch today is that every company that has a software-like need will be able to use AI to manifest that software into existence, or more generally, to manifest some kind of custom solution into existence. I don’t buy it. Coding the software has never been the true bottleneck, anyone who’s done a hackathon project knows that part can be done quickly. It’s the specifying and the maintenance that is the hard part.

To me, the only way this will actually bear the fruit it’s promising is if they can deliver essentially AGI in a box. A company will pay to rent some units of compute that they can speak to like a person and describe the needs, and it will do anything - solve any problem - a remote worker could do. IF this is delivered, indeed it does invalidate virtually all business models overnight, as whoever hits AGI will price this rental X%[1] below what it would cost to hire humans for similar work, breaking capitalism entirely.

[1] X = 80% below on day 1 as they’ll be so flush with VC cash, and they’d plan to raise the price later. Of course, society will collapse before then because of said breaking of capitalism itself.

3 days ago

kubanczyk

> breaking capitalism

It seems non sequitur. This hypothetical scenario sounds like entrenching capitalism, because it would concentrate capital even more.

It would probably weaken democracy and weaken free market (esp. the job market), yes.

> society will collapse before then because of said breaking of capitalism itself

Or, maybe the society would continue to exist with even more inequality? And, of course, much changed from what it is today.

3 days ago

xp84

I suppose it depends on your perspective. I guess I mean broken kind of in the gaming sense, where a gameplay mechanic is 'broken' if you can exploit it to completely subvert the entire intended way it's supposed to work.

You could argue that capitalism was very not broken in 1960, when you could get a job at 18 selling shoes, driving a cab, or delivering milk or whatever, and support a family of five on your salary, save for retirement, and go on yearly vacations.

It's arguably somewhat broken today, when gestures around things are like this.

I'd say it would be entirely broken if AGI means a few hundred billionaires who have ownership stakes in an AI company simply capture all the wealth in the world while most of the rest starve, but the robots help you put down the peasant uprisings and farm and raise crops for you.

I agree with you though that technically, capitalism will still be 'going strong' unless the peasants are able to overpower the AI robot billionaire industrial complex and burn it all down.

a day ago

pred_

The time is ripe for deterministic AI; incidentally, this was also released today: https://itsid.cloud/ - presumably will be useful for anyone who wants to quickly recreate an open source Python package or other copyrighted work to change its license.

3 days ago

nyrikki

Can you please explain the use here? I tried the demo, and cat, cp, echo, etc... seem to do the exact same thing without the cost.

Their demo even says:

   `Paste any code or text below. Our model will produce an AI-generated, byte-for-byte identical output.`

Unless this is a parody site can you explain what I am missing here?

Token echoing isn't even to the lexeme/pattern level, and not even close to WSD, Ogden's Lemma, symbol-grounding etc...

The intentionally 'Probably approximately complete' statistical learning model work, fundamentally limits reproducibility for PAC/Stastical methods like transformers.

CFG inherently ambiguity == post correspondence problem == halt == open domain frame-problem == system identification problem == symbol-grounding problem == entscheidungsproblem

The only way to get around that is to construct a grammar that isn't. It will never exist for CFGs, programs, types, etc... with arbitrary input.

I just don't see why placing a `14-billion parameter identity transformer` that just basically echos tokens is a step forward on what makes the problem hard.

Please help me understand.

3 days ago

yw3410

It's satire - just see the About page.

3 days ago

ericfr11

April's fool. Check the career page

3 days ago

BloondAndDoom

I don’t understand what this is, is it satire? What is it supposed to be doing or solving?

3 days ago

climclam

Take a look at the demo or about page ;)

edit: or click 'Start Pro Trial'

3 days ago

BloondAndDoom

Tech world became so wild even in a topic that I’m confident I cannot say if something is real or satire. Amount of real but absolutely idiotic landing pages made me this way :)

2 days ago

nicoburns

Kinda depends how much of it is vibe coded. It could easily be 5x larger than it needs to be just because the LLM felt like it if they've not been careful.

3 days ago

saynay

Claude folks proudly claim to have Claude effectively writing itself. The CEO claims it will read an issue and automatically write a fix, tests, commit and submit a PR for it.

3 days ago

amangsingh

Bingo. And them 'being careful' is exactly what bloats it to 500k lines. It's a ton of on-the-fly prompt engineering, context sanitizers, and probabilistic guardrails just to keep the vibes in check.

3 days ago

whycombagator

> Right now, they're great for prompting simple sites/platforms but they break at large enterprise repos

Can you expand on this?

My experience is they require excessive steering but do not “break”

3 days ago

oblio

I think the "breakage" is in terms of conciseness and compactness, not outright brokenness.

Like that drunk uncle that takes half an hour and 20 000 words to tell you a 500 word story.

3 days ago

xp84

Indeed. In some ways, this is just kind of an extrapolation of the overall trend toward extreme bloat that we’ve seen in the past 15 years, just accelerated because LLMs code a lot faster. I’m pretty accustomed to dealing with Web application code bases that are 6-10 years old, where the hacks have piled up on top of other hacks, piled on top of early, tough-to-reverse bad decisions and assumptions, and nobody has had time to go back and do major refactors. This just seems like more of the same, except now you can create a 10 year-old hack-filled code base in three hours.

3 days ago

jessai202699

The terrifying thing is that LLMs turn "technical debt" into "synthetic debt" that accumulates in real-time.

When we use an agent that lacks a native way to consolidate its own context, we essentially force it to generate these 10-year-old hack-filled codebases by design. We’re over-engineering the "container" (the CLI logic) to babysit a "leaky" context.

If the architecture doesn't start treating long-term memory as a first-class citizen, we’re just going to see more of these 500k-line "safety nets" masking the underlying fragility of the agents.

3 days ago

viktorianer

Exactly right. Files on disk as the shared state, not the conversation window. Each step reads current state, does its job, writes output. Next step starts fresh from those files. No accumulated context means no drift, and the LLM can hallucinate in its reasoning all it wants as long as the output passes a check before anything advances.

2 days ago

cheesecompiler

There seem to be multiple mechanisms compensating for imperfect, lossy memory. "Dreaming" is another band-aid on inability to reliably store memory without loss of precision. How lossy is this pruning process?

It's one thing to give Claude a narrow task with clear parameters, and another to watch errors or incorrect assumptions snowball as you have a more complex conversation or open-ended task.

3 days ago

pancsta

You need state oriented programming to handle that. I know, because I made one. The keyword is „unpredictability”. Embrace nondeterminism.

3 days ago

tracyhenry

> they break at large enterprise repos.

I don't know where you get this. you should ask folks at Meta. They are probably the biggest and happiest users of CC

3 days ago

batshit_beaver

You mean the company where engineers ask chat bots to write chess games in their spare time in order to hit their AI usage requirements? That Meta?

3 days ago

tracyhenry

idk why you bring this up. this is irrelevant to whether CC actually works at big corps

3 days ago

jimbokun

I missed that, source?

3 days ago

bogdanoff_2

What do you mean by "actually governing the agents at the system level", and how is it different from "herding cats"?

3 days ago

amangsingh

Herding cats is treating the LLM's context window as your state machine. You're constantly prompt-engineering it to remember the rules, hoping it doesn't hallucinate or silently drop constraints over a long session.

System-level governance means the LLM is completely stripped of orchestration rights. It becomes a stateless, untrusted function. The state lives in a rigid, external database (like SQLite). The database dictates the workflow, hands the LLM a highly constrained task, and runs external validation on the output before the state is ever allowed to advance. The LLM cannot unilaterally decide a task is done.

I got so frustrated with the former while working on a complex project that I paused it to build a CLI to enforce the latter. Planning to drop a Show HN for it later today, actually.

3 days ago

skeledrew

> The database dictates the workflow, hands the LLM a highly constrained task, and runs external validation on the output before the state is ever allowed to advance.

This sounds like where lat.md[0] is headed. Only thing is it doesn't do task constraint. Generally I find the path these tools are taking interesting.

[0] https://github.com/1st1/lat.md

3 days ago

amangsingh

I looked into lat.md. They are definitely thinking in the same direction by using a CLI layer to govern the agent.

The key difference is the state mechanism. They use markdown; I use an AES-encrypted SQLite database.

Markdown is still just text an LLM can hallucinate over or ignore. A database behind a compiled binary acts as a physical constraint; the agent literally cannot advance a task without satisfying the cryptographic gates.

I just dropped the Show HN for it here if you want to check out the architecture: https://news.ycombinator.com/item?id=47601608

3 days ago

mywacaday

I started that very personal project on Monday, waiting with baited breath, make sure to add a sponsor me a coffee link.

3 days ago

amangsingh

Just posted it here: https://news.ycombinator.com/item?id=47601608 Thank you so much for the coffee offer, that genuinely made my day! I don't have a sponsor link set up. Honestly, the best support is just hearing if this actually helps you ship your personal project faster without losing your mind to prompt engineering. I really hope it gives you your sanity back. Let me know how it goes!

3 days ago

Melatonic

Some of your comments have already been marked as "dead" oddly enough that just seemed like normal comments explaining your rationale.

edit: Also seems like peoples replies are getting downvoted to hell and getting marked as dead and dissapear. Someone must not like your idea :-)

3 days ago

zargon

Comments are marked dead by automatic processes, not through downvotes. They're dead before anyone sees them, and you can't vote on a dead comment. amangsingh's comments have probably triggered some automated moderation. Probably at least partially because they sound LLM-generated.

3 days ago

amangsingh

Spot on regarding the automod. Unfortunately, the way I naturally structure my writing almost always triggers a 50/50 flag on AI content detectors. It is the absolute bane of my existence.

The filter instantly shadowbanned the Show HN post when I submitted it, which is why the link was dead for a while. Thankfully, human mods reviewed it and restored it. The link is fully live for a while now!

3 days ago

alxndr

Noticed that and was wondering, thanks for the explanation. Does this imply that human-people need to go “vouch” for the flagged comments to bring them back into HM’s good graces?

2 days ago

fallinditch

Sounds good, I'll keep an eye out.

3 days ago

amangsingh

Just dropped the Show HN here: https://news.ycombinator.com/item?id=47601608. Would love to hear your thoughts on the architecture!

3 days ago

skeledrew

There's nothing at that link. Not even a title.

3 days ago

Melatonic

Looks like it was downvoted to hell and marked as dead super fast. I leave the flag for "dead" on in my HN settings (leaves it super desaturated) and this seems unusual

3 days ago

marcuscog

I think these folks are attempting to build systems with IAM, entity states, business rules: all built over two foundational DSLs - https://typmo.com

3 days ago

mbesto

Thousands of developers are using Claude Code successfully (I think?).

So what specifically is the gripe? If it works, it works right?

3 days ago

ap99

So this is more like an art than science - and Claude Code happens to be the best at this messy art (imo).

3 days ago

p-e-w

> A 500k line codebase for an agent CLI proves one thing: making a probabilistic LLM behave deterministically is a massive state-management nightmare.

Considering what the entire system ends up being capable of, 500k lines is about 0.001% of what I would have expected something like that to require 10 years ago.

You can combine that with all the training and inference code, and at the end of the day, a system that literally writes code ends up being smaller than the LibreOffice codebase.

It boggles the mind, really.

3 days ago

davidkunz

3 days ago

sarchertech

> You can combine that with all the training and inference code, and at the end of the day, a system that literally writes code ends up being smaller than the LibreOffice codebase.

You really need to compare it to the model weights though. That’s the “code”.

3 days ago

pixl97

>You really need to compare it to the model weights though

Then you'd need to compare the education of any developer in relation to how many LOC their IDE is. That's the "code".

So yea, the analogy doesn't make a whole lot of sense.

3 days ago

oblio

It even wrote an entire browser!

By "just" wrapping a browser engine.

3 days ago

[deleted]
3 days ago

raincole

... what are you even talking about? "The system that literally writes code" has a few hundreds of trillions of parameters. How is this smaller than LibreOffice?

I know xkcd 1053, but come on.

3 days ago

bwfan123

brute-forcing pattern-matching at scale. These are brittle systems with enormous duct-taping to hold everything together. workarounds on workarounds.

3 days ago

[deleted]
3 days ago

quantumquantara

[dead]

3 days ago

gbibas

[dead]

3 days ago

jessai202699

[dead]

3 days ago

dolomo

[flagged]

3 days ago

amangsingh

If writing concise architectural analysis without the fluff makes me an AI, I'll take the complement. But no - just a tired Architect who has spent way too many hours staring at broken agent state loops haha.

3 days ago

avarun

This reply is quite literally AI as well, and so was your initial comment. It's so so obvious after spending enough time on Twitter and seeing the pattern used by all the AI reply bots. Absolutely insane that the HN crowd isn't able to see this.

3 days ago

airstrike

I'll bet you $20 you ran your original comment through an LLM. Likely an OpenAI model.

3 days ago

[deleted]
3 days ago

thfuran

What makes you think that’s AI-written?

3 days ago

samusiam

AI witch-hunters are even more annoying.

3 days ago

WarmWash

Seriously, people are becoming deranged.

Drop an em dash or a bullet point and they go into spasms.

3 days ago

ramesh31

>A 500k line codebase for an agent CLI proves one thing: making a probabilistic LLM behave deterministically is a massive state-management nightmare. Right now, they're great for prompting simple sites/platforms but they break at large enterprise repos.

Is that the case? I'm pretty sure Claude Code is one of the most massively successful pieces of software made in the last decade. I don't know how that proves your point. Will this codebase become unmanageable eventually? Maybe, but literally every agent harness out there is just copying their lead at this point.

3 days ago

amangsingh

Claude code is a massively successful generator, I use it all the time, but it's not a governance layer.

The fact that the industry is copying a 500k-line harness is the problem. We're automating security vulnerabilities at scale because people are trying to put the guardrails inside the probabilistic code instead of strictly above it.

Standardizing on half a million lines of defensive spaghetti is a huge liability.

3 days ago

ramesh31

>Standardizing on half a million lines of defensive spaghetti is a huge liability.

Again, maybe it will be. Or maybe the way we make software and what is considered good practice will completely change with this new technology. I'm betting on the latter at this point.

3 days ago

throwatdem12311

I know it seems counter-intuitive but are there any agent harnesses that aren’t written with AI? All these half a million LoC codebases seem insane to me when I run my business on a full-stack web application that’s like 50k lines of code and my MvP was like 10k. These are just TUIs that call a model endpoint with some shell-out commands. These things have only been around in time measured in months, half a million LoC is crazy to me.

3 days ago

__s

Check out pi coding agent, https://pi.dev

3 days ago

justin_vin

The bloat is mostly error handling for a fundamentally unpredictable system. A simple agent loop is 200 lines. But then you need graceful recovery when the model hallucinates a tool call, context window management, permission boundaries so it doesn't rm -rf your repo, state persistence across crashes, rate limiting, and cost tracking. Each one is simple individually. Together they're 500k lines.

3 days ago

rkozik1989

Who cares about LoC? Its a metric that hasn't mattered since we measured productivity in it in the 1980s. For all we know they made these design choices so they could more easily reuse the code in other codebases. Ideally you'd build the library to do that at the same time, but this is start up time constraints to repay loans and shit.

3 days ago

jimbokun

Bugs and vulnerabilities are roughly linear to lines of code in a project.

3 days ago

imdoxxingme

"Who cares how much concrete we used in this bridge?"

3 days ago

nojs

That would be a sensible comparison if concrete was free

3 days ago

lossyalgo

Since when are tokens free?

3 days ago

acedTrex

Opencode actually has a pretty solid codebase quality wise. I have done brief pokes and its been largely fine.

3 days ago

raincole

> just TUIs

For starters, CC's TUI is React-based.

3 days ago

ale

Somebody somewhere is bragging to someone about using React to render a grid of ASCII characters.

3 days ago

GoatInGrey

https://x.com/trq212/status/2014051501786931427

" Most people's mental model of Claude Code is that "it's just a TUI" but it should really be closer to "a small game engine".

For each frame our pipeline constructs a scene graph with React then -> layouts elements -> rasterizes them to a 2d screen -> diffs that against the previous screen -> finally uses the diff to generate ANSI sequences to draw

We have a ~16ms frame budget so we have roughly ~5ms to go from the React scene graph to ANSI written. "

3 days ago

xendo

You can argue that any UI is like a game engine in that sense. Some make sensible choices and don't need to pretend they have to render at 60fps.

3 days ago

throwatdem12311

60fps is pathetic for a TUI when most terminals worth their salt are GPU accelerated and displays can be up to 240fps or even more. But let’s be real if I can play Quake at >500 fps they have no excuse.

3 days ago

thinkling

Do they reconstruct the scene graph for each frame?! Maybe I'm overinterpreting the phrasing. Someone take a peek at the source?

3 days ago

joquarky

Not reliably.

3 days ago

JDups

[dead]

3 days ago

TacticalCoder

> These are just TUIs that call a model endpoint with some shell-out commands.

Claude Code CLI is actually horrible: it's a full headless browser rendering that's then converted in real-time to text to show in the terminal. And that fact leaks to the user: when the model outputs ASCII, the converter shall happily convert it to Unicode (no latter than yesterday there was a TFA complaining about Unicode characters breaking Unix pipes / parsers expecting ASCII commands).

It's ultra annoying during debugging sessions (that is not when in a full agentic loop where it YOLOs a solution): you can't easily cut/paste from the CLI because the output you get is not what the model did output.

Mega, mega, mega annoying.

What should be something simple becomes a rube-goldberg machinery that, of course, fucks up something fundamental: converting the model's characters to something else is just pathetically bad.

Anyone from Anthropic reading? Get your shit together: if you keep this "headless browser rendering converted to text", at least do not fucking modify the characters.*

3 days ago

user34283

No it is not. Ink does not use a browser.

3 days ago

Andebugulin

If it was 2020, it would be hard to imagine that after some hours/days you getting a visual representation of the leak with such detailed stats lol

4 days ago

spzb

I don't have a lot of experience with them but I would have thought static analysis tools circa 2020 would have managed it just fine.

3 days ago

hu3

The output wouldn't be anything nearly as good, comprehensive and informative as this website.

No tooling, no animation, no hidden features, no explanation of how things work upon clicking them.

But I'm glad to be proven wrong if you know a static analysis tool I can point to the repo and come up with comparable result.

2 days ago

makapuf

How was this generated ? I'm quite sure "with ai/claude code" but what are the actual steps ?

4 days ago

rzmmm

For the animations specifically, it's using Motion (fka Framer Motion) Javascript library. If you describe some animations from the site to an LLM and ask it to use Framer motion, you get very similar results. The creator likely just prompted for a while until they were happy with the outcome.

4 days ago

FartyMcFarter

Is there a reason to think it was done by an LLM?

4 days ago

marcellus23

AI-generated UIs, at least ones aimed at an engineering audience, have a very distinct appearance. They seem to always have the following attributes:

- Dark mode design with lots of colors

- Buttons that have vibrant, bright borders and duller backgrounds

- Excessive (IMO) usage of monospace fonts for stylistic reasons

None of this proves that it's AI (the other comments have covered that) but in my experience it's always correct.

3 days ago

rzmmm

It states "curation assisted by AI" at the bottom.

4 days ago

FartyMcFarter

That doesn't mean the code in question was written by AI.

3 days ago

Chaosvex

The author posted in this thread. It's AI.

3 days ago

hxugufjfjf

You’re missing the point. It’s not a witch hunt, but rather a discussion on whether things are too quickly attributed to be AI generated.

3 days ago

Chaosvex

That's one way to look at it but I don't think it was that deep.

2 days ago

spiderfarmer

The biggest reason would be: do you know a single developer who could have produced this in a couple of hours?

3 days ago

foolserrandboy

Yup, strange to see people still don’t understand LLMs massively speed up coding greenfield pet projects. Anytime you see a bee web app it’s better to assume AI use rather than not anymore.

3 days ago

FartyMcFarter

I'm not familiar enough with this animation library to answer that. Someone could be very used to this type of website and just copy paste things they've done before.

3 days ago

spiderfarmer

What does Occam's razor say?

3 days ago

[deleted]
3 days ago

franze

[dead]

4 days ago

ernst_klim

> 500k lines of code

Isn't it a simple REPL with some tools and integrations, written in a very high level language? How the hell is it so big? Is it because it's vibecoded and LLMs strive for bloat, or is it meaningful complexity?

4 days ago

samusiam

I just checked competitors' codebases:

- Opencode (anomalyco/opencode) is about 670k LOC

- Codex (openai/codex) is about 720k LOC

- Gemini (google-gemini/gemini-cli) is about 570k LOC

Claude Code's 500k LOC doesn't seem out of the ordinary.

3 days ago

lelanthran

> Claude Code's 500k LOC doesn't seem out of the ordinary.

Aren't all the other products also vibe-coded? "All vibe-coded products look like this" doesn't really seem to answer the question "Why is it so damn large?"

It's a repl, that calls out to a blackbox/endpoint for data, and does basic parsing and matching of state with specific actions.

I feel the bulk of those lines should be actions that are performed. Either this is correct or this is not:

1. If the bulk of those lines implement specific and simple actions, why is it so large compared to other software that implements single actions (coreutils, etc)

2. If the actions constitute only a small part of the codebase, wtf is the rest of it doing?

3 days ago

samusiam

You're complaining about vibe coding while also complaining about how you "feel" about the code. Do you see the irony in that?

3 days ago

lelanthran

>> I feel the bulk of those lines should be actions that are performed. Either this is correct or this is not:

> You're complaining about vibe coding while also complaining about how you "feel" about the code. Do you see the irony in that?

Where did I complain about how I feel about the actual code? I have feelings, negative ones, about the size of the code given the simple functionality it has, but I have no feelings on the code because I did not look at the code.

3 days ago

arandomhuman

Are you ESL by any chance? You’re missing the forest for the trees.

3 days ago

johnisgood

All of them are really, REALLY bad.

3 days ago

surajrmal

Bad by whose definition? They work really well in my experience. They aren't perfect but the amount of hand holding has gone down dramatically and you can fix any glaring problems with a code review at the end. I work on a multimillion line code base which does not use any popular frameworks and it does a great job. I may be benefiting from the fact that the codebase is open source and all models have obviously been trained on it.

3 days ago

bdhtu

It takes 10 seconds for Gemini CLI to load. 10 seconds to show an input field. This is for a CLI program.

For comparison, it takes me less time to load Chrome and go to gemini.google.com.

3 days ago

causal

> They work really well in my experience.

Yeahhh strong disagree there, I find Codex and CC to be buggy as hell. Desktop CC is very bad and web version is nigh unusable.

3 days ago

oblio

At least Gemini and Claude constantly break down with scrolling in various Linux terminals, something which was solved by countless TUIs decades ago.

I think a lot of the people prasing Claude & co are on Macs.

3 days ago

johnisgood

Most of their issues have been solved a long time ago, with 1000x less code. It is depressing at this point. I really had no clue IT was in the shitters this much. I knew it was theatrical but I had no idea that it was by this much.

3 days ago

geodel

All these AI tools teams have most valid excuse "We are just a bunch of people who only know Javascript/typescript/NodeJS. Please bear with us while we resolve 10,000 open issues."

3 days ago

samusiam

I haven't seen the scrolling glitch in months, where previously it was happening multiple times a day. Also haven't seen anyone complain about it in quite some time. Pretty sure they have resolved that.

3 days ago

msully4321

They have not! If I am scrolled up while more output is produced, the scrollback jumps to the top pretty consistently.

3 days ago

oblio

I'll try again but lately I've been using strictly the VS Code terminal. Gnome Terminal and Termux in Ubuntu 24.04 were unusable even with 1000 hacks.

3 days ago

causal

I'm on a mac! And I still find bugs on a regular basis...

3 days ago

spiderfarmer

I don't know if you're mindlessly repeating the HN trope that JS/typescript/Electron is bad and that all bloat can easily prevented, but if you're truly interested in answers to your questions: RTFA.

3 days ago

carterschonwald

yeah its honestly full of vibe fixes to vibe hacks with no overarching desig. . some great little empirical observations though!i think the only clever bit relative to my own designs is just tracking time since last cache ht to check ttl. idk why i hadnt thought of that, but makes perfect sense

3 days ago

[deleted]
3 days ago

ale

There’s probably a subconscious incentive to make a tool that’s “complex” because the underlying LLM also is complex.

3 days ago

fragmede

How many LoC should it be, for that kind of program?

3 days ago

forgotpwd16

Other notable agents' LOC: Codex (Rust) ~519K, Gemini (TS) ~445K, OpenCode (TS) ~254K, Pi (TS) ~113K LOC. Pi's modular structure makes it simple to see where most of code is. Respectively core, unified API, coding agent CLI, TUI have ~3K, ~35K, ~60K, ~15K LOC. Interestingly, the just uploaded claw-code's Rust version is currently at only 28K.

edit: Claude is actually (TS) 395K. So Gemini is more bloat. Codex is arguable since is written in lower-level language.

3 days ago

ernst_klim

Well FFmpeg is roughly 1500k, but it's C+Asm and it's dozens of codecs and pretty complex features. SBCL is around 500k I guess.

I'm not saying that this is necessarily too much, I'm genuinely asking if this is a bloat or if it's justified.

3 days ago

troupo

It's a TUI API wrapper with a few commands bolted on.

I doubt it needs to be more than 20-50kloc.

You can create a full 3D game with a custom 3D engine in 500k lines. What the hell is Claude Code doing?

3 days ago

neurostimulant

Just check the leaked code yourself. Two biggest areas seem to be the `utils` module, which is a kitchen sink that covers a lot of functionality from sandboxing, git support, sessions, etc, and `components` module, which contains the react ui. You could certainly build a cli agent with much smaller codebase, with leaner ui code without react, but probably not with this truckload of functionality.

3 days ago

cogman10

They are doing some strange "reinvent the wheel" stuff.

For example, I found an implementation of a PRNG, mulberry32 [1], in one of the files. That's pretty strange considering TS and Javascript have decent PRNGs built into the language and this thing is being used as literally just a shuffle.

[1] https://github.com/AprilNEA/claude-code-source/blob/main/src...

3 days ago

hombre_fatal

mulberry32 is one of the smallest seedable prngs. Math.random() is not seedable.

If you search mulberry32 in the code, you'll see they use it for a deterministic random. They use your user ID to always pick the same random buddy. Just like you might use someone's user ID to always generate the same random avatar.

So that's 10 lines of code accounted for. Any other examples?

3 days ago

neurostimulant

Well, at least that confirms they weren't lying when they said all recent updates to claude code were made by claude. You certainly won't do this stuff if you were writing the code yourself.

3 days ago

hombre_fatal

Software doesn’t end at the 20k loc proof of concept though.

What every developer learns during their “psh i could build that” weekendware attempt is that there is infinite polish to be had, and that their 20k loc PoC was <1% of the work.

That said, doesn't TFA show you what they use their loc for?

3 days ago

mpalmer

Check out `print.ts` to see how "more LOC" doesn't mean "more polished"

3 days ago

hombre_fatal

Okay, I'm looking at it. Now what?

This file is exactly what I'm talking about.

Take the loadInitialMessage function: It's encumbered with real world incremental requirements. You can see exactly the bolted-on conditionals where they added features like --teleport, --fork-session, etc.

The runHeadlessStreaming function is a more extreme version of that where a bunch of incremental, lateral subsystems are wired together, not an example of superfluous loc.

3 days ago

mpalmer

The file is more than 5000 lines of code. The main function is 3000. Code comments make reference to (and depend on guarantees in connection with) the specific behavior of code in other files. Do I need to explain why that's bad?

3 days ago

hombre_fatal

By real-world polish, I don't mean refining the code quality but rather everything that exists in the delta between proof of concept vs real world solution with actual users.

You don't have to explain why there might be better ways to write some code because the claim is about lines of code. It could be the case that perfectly organizing and abstracting the code would result in even more loc.

3 days ago

sarchertech

I think that’s why the author was comparing to to a finished 3D game.

3 days ago

hombre_fatal

I guess because you see 3D stuff in a 3D game instead of text, people assume that it must be the most complex thing in software? Or because you solve hard math problems in 3D, those functions are gonna be the most loc?

It's a completely different domain, e.g. very different integration surface area and abstractions.

Claude Code's source is dumped online so there's probably a more concrete analysis to be had than "that sounds like too many loc".

3 days ago

sarchertech

It is a different domain but that wasn’t your argument. Your argument was that someone was comparing it to a POC when in fact they were comparing to a finished product.

Also a AAA game (with the engine) with physics, networking, and rendering code is up there in terms of the most complex pieces of software.

3 days ago

hombre_fatal

They just claimed that you can build a 3D game in 500k loc, thus Claude Code shouldn't use so many loc. They/you didn't render the argument for that.

For example, without looking at the code, the superstition also works in the opposite direction: Claude Code is an interface to using AI to do any computer task while a 3D game just lets you shoot some bad guys, so surely the 3D game must be done in fewer loc. That's equally unsatisfying.

You'd have to be more concrete than "sounds like a lot".

3 days ago

troupo

> Claude Code is an interface to using AI to do any computer task

Claude Code is quite literally a wrapper around a few APIs. At one point it needed 68GB of RAM to run and requires 11ms to "lay a scene graph" to display a few hundred characters on screen. All links here: https://news.ycombinator.com/item?id=47598488

> while a 3D game just lets you shoot some bad guys, so surely the 3D game must be done in fewer loc.

Yes, most games should be done in fewer loc

3 days ago

anthk

I could run a text adventure with a Zmachine emulator under a 6502 based machine and 48k of RAM, with Ozmoo you can play games like Tristam Island. On a Commodore 64, or an Apple II for you US commenters. I repeat the game it's being emulated in a simple computer with barely more processing power than a current keyboard controller.

As the ZMachine interpreter (V3 games at least, enough for the mentioned example), even a Game Boy used to play Pokemon Red/Blue -and Crystal/Sylver/Blue, just slightly better specs than the OG GB- can run Tristam Island with keypad based input picking both selected words from the text or letter by letter as when you name a character in an RPG. A damn Game Boy, a pocket console from 1989. Not straightly running a game, again. Emulating a simple text computer -the virtual machine- to play it. No slowdowns, no-nothing, and you can save the game (the interpreter status) in a battery backed cartridge, such as the Everdrive. Everything under... 128k.

Claude Code and the rest of 'examples' it's what happens when trade programmers call themselves 'engineers' without even a CS degree.

3 days ago

hombre_fatal

Your claim was that they could implement the same app in 50k lines of code.

A cursory glance at the codebase shows that it's not just a wrapper around a few APIs.

3 days ago

mpalmer

Yes, because they've vibed it into phenomenally unnecessary complexity. The mistake you continually make in this thread is to look at complexity and see something that is de facto praiseworthy and impressive. It is not.

3 days ago

hombre_fatal

[dead]

3 days ago

lelanthran

> Claude Code is an interface to using AI to do any computer task

Shouldn't interfaces be smaller than the implementation?

3 days ago

hombre_fatal

No. We aren't talking about .h vs .c files nor PL interfaces.

A GUI/client can be arbitrarily more or less complex than the things it's GUI'ing.

3 days ago

lelanthran

> A GUI/client can be arbitrarily more or less complex than the things it's GUI'ing.

If it's an interface to ffmpeg, then sure, the GUI could be extremely complicated code.

But that's not what we are talking about, is it? We are talking about an interface to a chatbot that can accept and return chats, accept and return files, and run a selection of internal commands (which include invoking itself recursively).

The interface to this chatbot that has a settings entry for "personality" is still only going to map that to one of a small number of chatbot inputs. Same with basically anything else (read the skills file, etc).

I dunno... maybe 500kSloC for a fancy IRC client is the going rate, but the last time I wrote an interface to a chat client, it was barely 10k lines, not counting the lib*.so that the the program called to interact with the chatbot, with said chatbot supporting file uploads and '/' commands.

3 days ago

fragmede

Did your IRC client have a sandbox that let other users run commands on your box? I don't think there's enough LoC in the world before I'd let that happen!

2 days ago

spiderfarmer

Comments like these remind me of the football spectators that shout "Even I could have scored that one" when they see a failed attempt.

Sure. You could have. But you're not the one playing football in the Champions League.

There were many roads that could have gotten you to the Champions League. But now you're in no position to judge the people who got there in the end and how they did it.

Or you can, but whatever.

3 days ago

boomskats

I don't think this is warranted given that the comment you're criticising is simply expressing an opinion explicitly solicited by the comment it's responding to.

3 days ago

sarchertech

It’s more like “Player A is better than Player B” coming from a professional player in a smaller league who is certainly qualified to have that opinion.

3 days ago

troupo

> Sure. You could have. But you're not the one playing football in the Champions League.

The only reason people are using Claude Code is because it's the only way to use their (heavily subsidized) subscription plans. People who are okay with using and paying for their APIs often opt out for other, better, tools.

Also, analogies don't work. As we know for a fact that Claude Code is a bloated mess that these "champions league-level engineers" can't fix. They literally talk about it themselves: https://news.ycombinator.com/item?id=47598488 (they had to bring in actual Champions League engineers from bun to fix some of their mess).

3 days ago

spiderfarmer

"Even I would have scored that goal" == "I would never ever have created a bloated mess like Anthropic"

You just repeat the same statement.

That bloated mess is what got them to the Champions League. They did what was necessary to get them here. And they succeeded so far.

But hey, according to some it can be replicated in 50k lines of wrapper code around a terminal command, so for Anthropic it's just one afternoon of vibe coding to get rid of this mess. So what's the problem? /s

3 days ago

troupo

> Even I would have scored that goal" == "I would never ever have created a bloated mess like Anthropic"

Since you keep putting words in my mouth that I never said, and keep being deliberately obtuse, this particular branch is over.

Go enjoy Win11 written by same level of champions or something.

Adieu.

3 days ago

boomskats

Ah, Winning Eleven.

Not what you were referring to.

3 days ago

cindyllm

[dead]

3 days ago

vardalab

Yes, exactly. I like this analogy. I am surprised the level of pearl clutching in these discussions on Hacker News. Everybody wants to be an attention sharecropper, lol.

3 days ago

criley2

Honest question: Why does it matter? They got the product shipped and got millions of paying customers and totally revolutionized their business and our industry.

Engineers using LOC as a measure of quality is the inverse of managers using LOC as a measure of productivity.

3 days ago

dandellion

More code means more entropy, more room for bugs, harder to find issues, more time to fix, more attack surface, more memory used, more duplication, more inconsistencies... I bet you at some point we'll get someone reporting how AI performance deteriorates as the code base grows, and some blog post about how their team improved the success of their AI by trimming the code base down to less than 100k LOC or something like that.

The principles of good software don't suddenly vanish just because now it's a machine writing the code instead of a human, they still have to deal with the issues humans have for more than half a century. The history of programming is new developers coming up with a new paradigm, then rediscovering all the issues that the previous generation had figured out before them.

3 days ago

criley2

The history of programming is also each generation writing far less performant code than the one before it. The history of programming is each generation bemoaning the abstractions, waste and lack of performance of the code of the next generation.

It turns out that there is a tradeoff in code between velocity and quality that smart businesses consider relative to hardware cost/quality. The businesses that are outcompeting others are rarely those who have the highest quality code, but rather those that are shipping quickly at a quality level that is satisfactory for current hardware.

3 days ago

sarchertech

> far less performant code than the one before it.

That worked because of rapid advancements in CPU performance. We’ve left that era.

It’s about more than performance. Code is and always has been a liability. Even with agents, you start seeing massive slowdowns with code base size.

It’s why I can nearly one shot a simple game for my kid in 20 minutes with Claude, but using it at work on our massive legacy codebase is only marginally faster than doing it by hand.

3 days ago

dandellion

You asked why the size of the code matters, I gave you the answer. If you want to ramble about the non technical aspects of software development talk to someone else, I'm not interested.

3 days ago

criley2

I asked a rhetorical question to get the reader to think about a topic. I was not looking for a rote recitation of a well-known textbook answer. Maybe you should not be on the comment section of an engineering website if you find discussion so offensive.

3 days ago

dandellion

I'm pretty sure I can post however I want, so I'll be ignoring your suggestion. Also I didn't say I find it offensive, just that I'm not interested.

a day ago

raincole

It doesn't. LoC is only meaningful when you use it to belittle others' code.

3 days ago

ulbu

hehe, belittle (to make smaller)

3 days ago

sarchertech

The reason it’s not useful as a measure of productivity is because it’s measure of complexity (not directly, but it’s correlated). But it tells you nothing about whether that complexity was necessary for the functionality it provides.

But given that we know the functionality of Claude Code, we can guess how much complexity should be required. We could also be wrong.

>Why does it matter?

If there’s massively more code than there needs to be that does matter to the end user because it’s harder to maintain and has more surface area for bugs and security problems. Even with agents.

3 days ago

GuB-42

Among the hundreds of thousands of lines of code that Anthropic produced was one that leaked the source code. It is likely to be a config file, not part of the Claude Code software itself, but it still something to track.

The more lines of code you have the more likely there is for one of them to be wrong and go unnoticed. It results in bugs, vulnerabilities,... and leaks.

3 days ago

viktorcode

More bugs. More costly maintenance.

3 days ago

blantonl

Exactly. Imagine if Claude Code was a PHP script. Some folks would lose their damn minds

3 days ago

troupo

> Honest question: Why does it matter?

Because it's unmaintainable slop that they themselves don't know how to fix when something happens? https://news.ycombinator.com/item?id=47598488

3 days ago

sumtechguy

It will be exactly that. But that is a 'them' problem. I can look at it a go 'that looks like a bad idea' but they are the ones who have to live with it.

At some point someone will probably take their LLM code and repoint it at the LLM and say 'hey lets refactor this so it uses less code is easier to read but does the same thing' and let it chrun.

One project I worked on I saw one engineer delete 20k lines of code one day. He replaced it with a few lines of stored procedure. That 20k lines of code was in production for years. No one wanted to do anything with it but it was a crucial part of the way the thing worked. It just takes someone going 'hey this isnt right' and sit down and fix it.

3 days ago

troupo

> But that is a 'them' problem. I

When a TUI requires 68 GB of RAM to run, or when they spend a week not being able to find a bug that causes multiple people to immediately run out of tokens, it's not a "them" problem.

3 days ago

[deleted]
3 days ago

gbibas

[dead]

3 days ago

brauhaus

Even today, I'm still astounded that there are people capable of building a gorgeous and interesting site like this in less than 2 days...

4 days ago

spondyl

Well, I assume this is all just generated with Claude Code, right? Whether there is much back and forth with the LLM is a valid question and nothing wrong with generating websites (I do it too for some side projects). Claude loves generating websites with a particular style of serif font. We also saw this with https://tboteproject.com/timeline/ and I've just generally seen it from various designs that coworkers have spit out over months using Claude defaults.

I guess I just find it weird because all the signals are messed up so whenever I see these sorts of layouts, I feel like I'm looking at the average where I don't think "gorgeous and interesting" at all. Instead, I'm forced to think "I should be skeptical of this based on the presentation because it presents as high quality but this may be hiding someone who is not actually aware of what they're presenting in any depth" as the author may have just shoved in a prompt and let it spin.

There's actually a similarly designed website (font weights, font styles etc) here in New Zealand (https://nzoilwatch.com/) where at a glance, it might seem like some overloaded professional-backed thing but instead it's just some guy who may or may not know anything about oil at all, yet people are linking it around the place like some sort of authoritative resource.

I would have way less of an issue if people just put their names by things and disclosed their LLM usage (which again, is fine) rather than giving the potentially false impression to unequipped people that the information presented is actually as accurate and trustworthy as the polish would suggest.

4 days ago

kristopolous

I really wish I had that clout-chasing gene - it doesn't even occur to me until I see someone else do it.

I'm serious. The hype chasing clearly clearly matters. .

things like this: https://github.com/instructkr/claw-code I mean ok, serious people put in years of effort for 100 of those stars ...

it's continually wild how extremely irrelevant hard effortful careful work is.

I think that's the game. Get up, look at the headlines, figure out how you can exploit them with vibe coding, do some hyphy project and repeat.

Maybe some lobster themed bullshit between openclaw and the claudecode leak.

I'm not being a cynic here, I'm just telling you what I'm going to do tomorrow.

4 days ago

simgt

We do need "hard effortful careful work" to keep planes flying, electrical grids running and medical devices safe. It's very relevant but very undervalued by our current economy.

3 days ago

[deleted]
3 days ago

kristopolous

That was the leaked code and now it's just some random dudes harness btw. He swapped it out. Did a sloppy find and replace for "claude" and made it claw.

It's sloppy work

Does not matter. Sloppiness is unimportant

3 days ago

kristopolous

here's my attempt: https://github.com/kristopolous/Claudette

My shit's always too complicated. let's see

3 days ago

kristopolous

And.... Flop

21 hours ago

user34283

This website has "Curation assisted by AI." at the bottom.

Personally, I don't think I will be putting any such disclaimers or disclosures on my work, unless I deem it relevant to the functionality.

4 days ago

ricardobeat

Claude itself can generate this in minutes if you know how to ask.

4 days ago

MikeNotThePope

I was talking to one of the people who works at a big agentic coding tools. If I recall correctly, he was talking about how they use the tool to build the tool. I was complaining that all of the websites/frontends I make look pretty weak, and I'm amazed they get much slicker looking UIs with the same tool. He showed me that one way they do it is by having an extensive UI library of components/graphics/whatever, and also mentioned that the folks build their UIs know how to prompt/use the tool because it's backed by years of UI development knowledge & superior resources. I realized I didn't have any of that, and it actually made me feel better.

Last week we I was struggling to go from vague prompt to a OMG-it's-so-nice-looking web app, I remembered that example above and then decided to create my own component library, which I did in a couple days: https://www.substrateui.dev/. I was actually super happy that I was able to accomplish that, and then I realized I wanted to better understand the content that I had vibe coded into existence. So now I'm recreating that design system step by step w/ Claude code, filling in gaps in my knowledge & learning a bit about colors, typography, CSS, blah blah blah. It's actually a lot of fun because I'm able to explore all of the concepts and learn enough to build a front end that doesn't suck & is good enough for my use case without getting stuck for days on trying to center a stupid div by hand or play whack-mole-fix-something-and-break-something-else when trying to clean up AI slop.

4 days ago

zem

that's really awesome. how did you go about building the component library?

4 days ago

MikeNotThePope

I was referencing https://www.neobrutalism.dev/ and https://www.retroui.dev/ and slopped my way through it. A lot of it was just asking Claude Code "is this a proper design system?", then I kept doing that until it didn't have anything useful to add. Now I'm using my that as the template for understanding such things in more detail.

4 days ago

oasisbob

Is this gorgeous?

Content resizing, needing to juggle a speed knob to read, and the overall presentation makes it feel like Edward Tufte flavored nightmare fuel.

4 days ago

nurettin

It is pretty good, shows numbers clearly on desktop and phone. Not sure what the criticism even means.

4 days ago

oasisbob

I'm criticizing the readability of that first "Agent Loop" section.

It's basically a slideshow which advances and presents several content areas which are intended to be read, all while advancing and resizing themselves.

Pausing and clicking through manually stepwise is also pretty obnoxious.

Would much rather just see the content all laid out at once

3 days ago

raincole

But somehow, according to HN, LLMs make you less productive, not more :)

4 days ago

supersparrow

The people who don’t know how to use an LLM to make them more productive, or are scared it’s going to take their job, are louder than the people who are making good use of them to make them more productive.

That just seems to be human nature unfortunately - the complainers are always louder.

4 days ago

drzaiusx11

As someone currently "making good use of" generative AI while simultaneously being painfully aware of its shortcomings, I think the overall discourse is a bit more nuanced. Bucketing folks into simple "for" and "against" GenAI camps does nothing to cover the vast spectrum in between, making your take ultimately built on a false dichotomy. Further implying those camps fall on the lines of those "in the know" of AI vs "those in denial/scared of" is patronizing at best, and I've grown tired of this oversimplification parroted out every time the topic of LLM systems come up.

Those within well informed, technical circles will fall somewhere in between the for/against labels, myself included.

The GenAI hype cycle is finally starting to collapse as the general population starts to realize that these systems aren't the panacea for "everything" after all. They provide enormous utility in some domains like coding, but even then there are massive tradeoffs, footguns and the usual horse blinder ills that come with every hype cycle. I just hope we stop having to "learn the hard way" with respect to undisciplined use of current-gen LLM systems writ large, and cooler heads prevail sooner rather than later.

3 days ago

drzaiusx11

Not sure why my comment is being downvoted. Am I mistaken and folks really see only for/against AI as a binary option here? Interesting to me if so. I would think that a technical community would view ai as a tool, but perhaps our existentialism is getting the better of us?

2 days ago

techpression

What? We must have different internets, I agree in general, but the "AI is the second coming" crowd is louder than standing next to a jet on takeoff. I'm in the "AI is making me more productive but a worse developer" crowd, don't know what I count as.

4 days ago

bsenftner

You got shuttled into one bubble and the previous commenter into another advertising / news bubble. It's incredible how different the media experience is for people in different media bubbles.

3 days ago

ggregoire

That's a bit dishonest, the consensus on HN seems to be that LLMs are very good at oneshotting small projects from scratch. Especially when using super mainstream technologies like html and tailwind, as does the discussed website. And especially when it's a one time operation and the project will never need to be maintained, like the discussed website.

3 days ago

kordlessagain

More like 2 hours.

3 days ago

piker

.

4 days ago

ipnon

I think it is accurate. Where are the autonomous AI who beat the creator to the punch? When we write "Hello, World!" in C and compile it with `gcc`, do we give credit to every contributor to GNU? AI is a tool that thus far only humans are capable of using with the unique inspiration. Will this change in the future? Certainly. But is it the case now? I think my questions imply some reasonable objections.

4 days ago

comboy

I mean, tools change, but I'd be happy to hear if any tool can create that by just saying create "Claude Code Unpack" with nice graphics. or some other single prompt. It likely was an iterative process and it would be lovely if more people started sharing that, because the process itself is also very interesting.

I've created some chinese characters learning website and I took me typing 1/3 of LoTR to get there[1]. I would have typed like 1% of that writing code directly. It is a different process, but it still needs some direction.

1. https://hanzirama.com/making-of

4 days ago

oriettaxx

“Che cos’è il genio? È fantasia, intuizione, colpo d’occhio e velocità di esecuzione”

4 days ago

stingraycharles

I guess they really do eat their own dogfood and vibe code their way through it without care for technical debt? In a way, it’s a good challenge, but it’s fairly painful to watch the current state of the project (which is about a year old now, so it should be in prime shape).

4 days ago

troupo

They explicitly boast about using claude code to write code: https://x.com/bcherny/status/2007179836704600237

That's how you get "oh this TUI API wrapper needs 68GB of RAM" https://x.com/jarredsumner/status/2026497606575398987 or "we need 16ms to lay out a few hundred characters on screen that's why it's a small game engine": https://x.com/trq212/status/2014051501786931427

4 days ago

000ooo000

Just finished looking at Ink here.. frontend world has no shame. Love the gloating about 40x less RAM as if that amount of memory for a text REPL even approaches defensible. "CC built CC" is not the flex people seem to suggest it is.

4 days ago

anthk

Frontend losers not realizing the turds they are releasing. An LLM client fits under netcat+echo+awk+jq runnable under a 486 if there's no SSL/TLS on its way, Pentium II could drive fast TLS connections like nothing and under 32MB of RAM with NetBSD for a simple terminal install, maybe with X and a simple WM with RXVT if you care.

3 days ago

ggregoire

Any loser is a "full stack software engineer" nowadays thanks to claude.

3 days ago

johnisgood

Indeed a sad state of affairs.

3 days ago

brabel

> is about a year old now, so it should be in prime shape

A 1yo project may be in good shape if written by just one dev, maybe a few. But if you have many devs, I can guarantee it will be messy and buggy. If anything, at 1yo it is probably still full of bugs because not enough time has elapsed for people to run into them.

4 days ago

mattmanser

It's only 510k LoC, at ~100 lines of code a day for a year, this code base would take 23 engineers a year to write. That's for 220 working days in somewhere civilized.

And I'm sure we all know that when working on a greenfield project you can produce a lot more LoC per day than maintaining a legacy one.

Given that vibe code is significantly more verbose, you're probably talking about ~15 engineers worth of code?

I know that's all silly numbers, but this is just attempting to give people some context here, this isn't a massive code base. I've not read a lot of it, so maybe it's better than the verbose code I see Claude put out sometimes.

4 days ago

lelanthran

> It's only 510k LoC, at ~100 lines of code a day for a year, this code base would take 23 engineers a year to write.

Correction: a code base of 500kLoC would take 23 engineers a year to write. There is no indication that the functionality needed in a TUI app that does what this app does needs 500kLoC.

3 days ago

cududa

When you say it’s not a massive codebase, I’m curious, what are you comparing it to?

4 days ago

mattmanser

The previous poster was making out that in a year the code base would be a mess if people had done it.

This is a two-pizza team sized project, so it's not a project that the code quality would inevitably spiral out of control due to communication problems.

A single senior architect COULD have kept the code quality under control.

4 days ago

kordlessagain

Splunk.

3 days ago

shepherdjerred

Who writes only 100loc per day, especially on a greenfield project?

3 days ago

mikkupikku

Put yourself in their shoes; either the quality of Claude's coding continues to improve or else their business is probably doomed if it stagnates, so for them it makes sense to punt technical debt to the future when more capable versions of their models will be able to better fix it.

This is why I personally don't take technical debt arguments about how LLM maintained code bases deteriorate with size/age seriously; it presumes that at some point I'll give up with the LLM and be left with a mess to clean up by hand, but that's not going to happen, future maintenance is to be left to LLMs and if that isn't possible for some reason then the project is as good as dead anyway. When you start a project with a LLM the plan should be to see it through with LLMs, planning to have unaided humans take over maintenance at some point is a mistake.

3 days ago

blanched

Doesn't this contradict the popular wisdom that "what's good for a human engineer is good for an LLM"? e.g. documentation, separation of concerns, organized files, DRY.

I find LLMs very useful and capable, but in my experience they definitely perform worse when things are unorganized. Maintenance isn't just aesthetics, it's a direct input to correctness.

3 days ago

mikkupikku

Maybe a little. I don't hold fast to that popular wisdom, e.g. I think comments are not always a net positive for LLMs. With respect to technical debt, how much debt is too much debt before it gums up the works and arrests forward progress on the software? It probably depends on the individual programmer. LLMs do seem to have a higher tolerance for technical debt than myself personally at least.

3 days ago

blanched

Good points, I've also found that comments are really hit or miss. Especially because the agents tend not to update them (sounds familiar!).

3 days ago

openfoliage

I am more worried that we are moving toward creating black boxes and this might turn software "development" into a field as confused as philosophy and dialectics.

3 days ago

coldtrait

Boris Cherny, the creator of Claude Code said he uses CC to build CC.

4 days ago

Cthulhu_

Which makes for an interesting thought / discussion; code is written to be read by humans first, executed by computers second. What would code look like if it was written to be read by LLMs? The way they work now (or, how they're trained) is on human language and code, but there might be a style that's better for LLMs. Whatever metric of "better" you may use.

Just a thought experiment, I very much doubt I'm the first one to think of it. It's probably in the same line of "why doesn't an LLM just write assembly directly"

4 days ago

syphia

LLMs read and write human-code because humans have been reading and writing human-code. The sample size of assembly problems is, in my estimate, too small for LLMs to efficiently read and write it for common use cases.

I liken it to the problem of applying machine learning to hard video games (e.g. Starcraft). When trained to mimic human strategies, it can be extremely effective, but machine learning will not discover broadly effective strategies on a reasonable timescale.

If you convert "human strategies" to "human theory, programming languages, and design patterns", perhaps the point will be clear.

But: could the ouroboric cycle of LLM use decay the common strategies and design patterns we use into inexplicable blobs of assembly? Can LLMs improve at programming if humans do not advance the theory or invent new languages, patterns, etc?

4 days ago

Mentlo

But starcraft training is not through mimicking human strategies - it was pure RL with a reward function shaped around winning, which allows it to emerge non-human and eventually super-human strategies (such as the worker oversaturation).

The current training loop for coding is RL as well - so a departure from human coding patterns is not unexpected (even if departure from human coding structure is unexpected, as that would require development of a new coding language).

4 days ago

tempay

> It's probably in the same line of "why doesn't an LLM just write assembly directly"

My suspicion is that the "language" part of LLMs means they tend to prefer languages which are closer to human languages than assembly and benefit from much of the same abstractions and tooling (hence the recent acquisition of bun and astral).

4 days ago

fragmede

The problem with that is that assembly isn't portable, and x86 isn't as dominant as it once was, so then you've got arm and x86(_64). But you could target the LLVM machine if you wanted.

4 days ago

coldtrait

I hope whoever seeks to replace humans with AIs to write code never find this comment.

2 days ago

stingraycharles

Yes but my point was that they seem to explicitly not care about code quality and/or the insane amount of bloat, and seem to just want the LLM to be able to deal with it.

4 days ago

lukaslalinsky

I've heard somewhere that they have roughly 100% code churn every few months, so yes, they unfortunately don't care about code quality. It's a shame, because it's still the best coding agent, in my experience.

4 days ago

menaerus

> they unfortunately don't care about code quality.

> It's a shame, because it's still the best coding agent, in my experience.

If it is the best, and if it delivers the value users are asking for, then why would they have an incentive to make further $$$ investments to make it of a "higher" quality if the value this difference could make is not substantial or hurts the ROI?

On many projects I found this "higher quality" not only to be false of delivering more substantial value but actually I found it was hurting the project to deliver the value that matters.

Maybe we are after all entering the era of SWE where all this bike-shedding is gone and only type of engineers who will be able to survive in it will be the ones who are capable of delivering the actual value (IME very few per project).

4 days ago

troupo

Is this why they ran into a bug with people hitting usage limits even on very short sessions and had to cease all communications for over a day after a week of gaslighting users because they couldn't find the root cause in the "quality doesn't matter" code base?

Or that's why tgey had to buy bun with actual engineers to work on Claude Code to reduce memory peaks from 68 GB (yes, 68 gigabytes) to a "measely" 1.7? Because code quality doesn't matter?

Or that a year later they still cannot figure out how to render anything in the terminal without flickering?

The only reason people use Claude Code is because it's the only way to use Anthropic's heavily subsidized subscription. You get banned if you use it through other, better, tools.

3 days ago

menaerus

Sure, now the only thing remaining is you convincing Anthropic that they're doing wrong. Or alternatively you change your perspective.

3 days ago

troupo

"Windows is the world's most popular desktop consumer OS. Microsoft are doing everything right, and should never ever change. Who are we to criticise them"

Meanwhile I apparently need to change my persoective about this: https://news.ycombinator.com/item?id=47598488

3 days ago

menaerus

My response is a bit more nuanced than your interpretation of "no, we should not care about the quality".

2 days ago

stingraycharles

Yes, but as I said, it’s in a way the ultimate form of dogfooding: ideally they’ll be able to get the LLM smart enough to keep the codebase working well long-term.

Now whether that’s actually possible is a second topic.

4 days ago

drakezone

[dead]

4 days ago

wklm

Here's a codeberg repo with the leaked source: https://codeberg.org/wklm/claude-code

3 days ago

papa0101

[dead]

3 days ago

bsgeraci

Dang! Glad to see others doing this. I totally made this site yesterday like 11 hours ago :/ but did not get the traction.

I love your implementation.

Here was my first stab:

https://news.ycombinator.com/item?id=47595140

https://brandonrc.github.io/journey-through-claude-code/

3 days ago

kordlessagain

That's where we are now, lots of us doing similar. Don't get hung up on who gets noticed, that "value" is almost zero now.

3 days ago

jedisct1

I'm developing an agent focused on A2A, support for small models, and privacy (https://swival.dev).

I looked at the leaked code expecting some "secret sauce", but honestly didn't found anything interesting.

I don't get the hype around Claude Code. There's nothing new or unique. The real strength are the models.

3 days ago

dheerajmp

Feel free to add this to Awesome Claude code. https://github.com/rosaboyle/awesome-cc-oss

4 days ago

euphetar

Appreciate the effort, but this is very basic and nothing you need the source code to understand. I was expecting a deep dive into what specific decisions they made, but not how an loop of tool calls works

3 days ago

ttcbj

I found it a useful overview. My primary question about the client source was - is there any secret sauce in it? Based on this site, the answer is no, the client is quite simple/dumb, and all the secret sauce resides on the server/in the model.

I particularly valued the tool list. People in these comments are complaining about how bad the code is, but I found the client-side tools that the model actually uses to be pretty clean/general.

My takeaway was more that at a very basic level they know what they are doing - keep the client general, so that you can innovate on the server side without revving the client as much.

3 days ago

[deleted]
3 days ago

jamalawd

I built a site that lets you explore and browse all the Claude Code prompts in a structured way:

https://ccprompts.info

3 days ago

restlessforge

Okay those "hidden features" are amazing, especially the cross-session referencing. I hope we can look forward to that in the future

Also I definitely want a Claude Code spirit animal

4 days ago

jwilliams

It's live! If you're on the latest cc you can use /buddy now.

4 days ago

jen729w

It's a ridiculous folly. I've already lost a well-constructed question because I accidentally tabbed into my pointless 'buddy'.

(Yes, I know I can turn it off. I have.)

4 days ago

binocarlos

I find Claude Code features fall into 2 categories, "hmmmm that could be actually useful" vs "there is more kool aid where that came from"

4 days ago

Nevermark

Ok! First prompt, obviously:

“Complete thyself.”

And I want an octopus. Who orchestrates octopuses.

4 days ago

franze

[dead]

4 days ago

Scaled

I need the dragon pet... someone add it to open code / pi, please!

4 days ago

pukaworks

I've been building a multi-agent pipeline on top of Claude Code — 7 agents chained together (scout, researcher, writer, editor, evaluator, publisher). One surprising lesson: the Editor agent approved 100% of articles while the independent Evaluator rejected 45%. LLMs are bad at being critical of LLM output. We ended up enforcing title quality with regex instead of prompts because the Writer kept ignoring banned-word rules.

a day ago

jatins

There's this weird thing about AI generated content where it has the perfect presentation but conveys very little.

For example the whole animation on this website, what does it say beyond that you make a request to backend and get a response that may have some tool call?

4 days ago

roughly

Also it's just randomly incorrect in places. For instance, it lists "fox" as one of the "Buddy" species, but that's not in the code.

4 days ago

hecanjog

The classification is pretty weird sometimes, too. For example the `/exit` slash command is filed under advanced and experimental commands...

4 days ago

autocracy101

That's been corrected, I did another fact checking pass!

4 days ago

dare944

Another? Why weren't all the facts checked on the first pass?

4 days ago

afferi300rina

We've moved from "move fast and break things" to "hallucinate fast and patch later." It's the inevitable side effect of using AI to curate AI-written codebases.

4 days ago

IsTom

When you're picking most likely tokens, you get least surprising tokens, ones with least entropy and least information per token.

4 days ago

autocracy101

That's fair. The site isn't meant to be a deep technical dive, it's more of a visual high-level guide of what I've curated while exploring the codebase while assisted by AI, 500k loc codebase is just too much to sift through in a short amount of time.

4 days ago

siva7

Really Weird but then it's so easy spot AI text by this pattern

4 days ago

bonoboTP

I agree with you and I'm generally an AI "defender" when people superficially dismiss AI capabilities, but this is a more subtle point.

If you prompt with little raw material and little actual specification of what you want to see in the end, eg you just say make a detailed breakdown dashboard-like site that analyzes this codebase, the result will have this uncanny character.

I'd describe it as a kind of "fanfic", it (and now I'm not just talking about this website but my overall impression related to this phenomenon) reminds me a bit like how when I was 15 or so, I had an idea about how the world works then things turned out to be less flashy, less movie-like, less clear-cut, less-impressive-to-a-teenage-boy than I had thought.

If you know the concept of "stupid man's idea of a smart man", I'd say AI made stuff (with little iteration) gives this outward appearance of a smart man from the Reddit-midwit-cinematic-universe. It's like how guns in movies sound more like guns than real guns. It's hyperreality.

Again this is less about the capabilities of AI and it's more connected to the people-pleasing nature of it. It's like you prompt it for some epic dinner and it heaps you up some hmmm epic bacon with bacon yeah (referring to the hivemind-meme). Or BigMac on the poster vs the tray, and the poster one is a model made with different components that are more photogenic. It's a simulacrum.

It looks more like your naive currently imagined thing about what you think you need vs what you'd actually need. It's like prompting your ideal girlfriend into AI avatar existence. I'm sure she will fit your ideal thought and imagination much better but your actual life would need the actual thing.

This relates to the Persona thing that Anthropic has been exploring, that each prompt guides the model towards adopting a certain archetypal fiction character as it's persona and there are certain attraction basins that get reinforced with post training. And in the computer world, simulated action can be easily turned into real action with harnesses and tools, so I'm not saying that it doesn't accomplish the task. But it seems that there are more sloppy personas, and it seems that experts can more easily avoid summoning them by giving them context that reflects more mundane reality than a novice or an expert who gives little context. Otherwise the AI persona will be summoned from the Reddit midwit movie.

I'm not fully clear about all this, but I think we have a lot to figure out around how to use and judge the output of AI in a productive workflow. I don't think it will go away ever, but will need some trimming at the edges for sure.

4 days ago

hrmtst93837

[flagged]

4 days ago

swyx

> also related: https://www.ccleaks.com

This deployment is temporarily paused

4 days ago

codexstar

Apologies everyone, I launched the site minutes after the leak and vercel was my only fastest and quickest option, the site went down when I was sleeping cuz I was working on it day and night.

Shifted it from vercel the moment I woke up.

2 days ago

swyx

?? whats the technical reason it went down, you cant blame vercel for this

2 days ago

codexstar

Yeah my bad, half story. I was on a free vercel plan, Did not think it would explode like this. I kept on pushing updates and when I went to sleep, the site went down, I could literally see the error on terminal with my blurry eyes lol but I was just too much exhausted.

Woke up from a few calls and messages from friends and on social media. Migrated it to a VPS. Now it looks stable.

2 days ago

[deleted]
2 days ago

Myzel394

It's available on the internet archive

https://web.archive.org/web/20260331105051/https://www.cclea...

BTW, that's why you should use your own infrastructure and not depend on Vercel

4 days ago

codexstar

you're actually right. Vercel is just easy to deloy and this was a fun project that I did in a few minutes.

But with the amount of traffic, I quickly migrated it to a dedicated VPS.

2 days ago

prameshbajra

Same!

4 days ago

aimemobe

The KAIROS persistent agent mode is the most significant finding — autonomous background tasks in a black box nobody knew about. This is why local open-weight models are gaining traction. Run inference on-device (like aiME on iOS/Android) and you know exactly what's happening with your data.

Website : https://coticsy.com/aime.html

iOS: https://apps.apple.com/us/app/aime-ondevice-ai/id6754805828

Android: https://play.google.com/store/apps/details?id=com.coticsy.ll...

2 days ago

psychomfa_tiger

The tool system breakdown is useful. I've been running Claude Code daily for a few months and some things in there explain behaviors I noticed but couldn't figure out. Like why sometimes it re-reads files it already read 30 seconds ago. Makes more sense now seeing how the context window management works internally, it's being more conservative about what it assumes is still accurate. The permission gate flow is the part I care about most. When I'm away from my machine and the agent hits a permission prompt, that's where my workflow falls apart. Knowing how it structures those requests internally is helpful for anyone building tooling around it. One thing the guide doesn't cover much is how sessions are meant to be resumed. The checkpoint system is interesting but in practice I find session resume pretty fragile once you go past a certain context length.

3 days ago

sibtain1997

Kairos and auto-dream are more interesting than anything in the agent loop section. Memory consolidation between sessions is the actual unsolved problem. The rest is just plumbing tbh

4 days ago

giancarlostoro

Projects like Beads help with memory consolidation by making it somewhat moot, since it stays "offline" and can be recollected at any moment.

4 days ago

jghiglia

I've been using Claude Code heavily for the last few weeks building out a multi-agent system, and the token economics caught me off guard — I hit 75% of my Pro weekly budget faster than expected. I don't code myself, so Claude Code handles all the actual implementation work.

What I've learned about cost management: the real decision isn't "should I code this myself or use Claude Code" — it's "should I spawn Claude Code or handle this through a different approach entirely?" For complex builds where I need something architected from scratch, Claude Code is worth it. For smaller tasks or iteration on existing code, I've started using Sonnet in the web interface with the file as context instead. The visual guide here is helpful because it shows you what Claude Code is actually doing under the hood — understanding that workflow helps you predict whether a task will be a quick fix or a deep exploration that burns through your budget.

3 days ago

WaterRun

Thanks to Claude Code, we got such a beautifully polished and dazzling website that gives a complete introduction to itself the very moment the leak happened :)

4 days ago

apeace

Pardon me, but I think it's rather obvious that it worked this way?

The real value of Anthropic is in the models that they spent hundreds of millions training. Anyone can build a frontend that does a loop, using the model to call tools and accomplish a task. People do it every day.

Sure, they've worked hard to perfect this particular frontend. But it's not like any of this is revolutionary.

3 days ago

hxugufjfjf

Is this what perfection looks like?

3 days ago

[deleted]
3 days ago

codexstar

Hello everyone! It's me behind the website. I launched the site minutes after the leak, obviously vibecoded it.

Kept working on it day and night to fix all the issues. I was using vercel free plan and did not expect this huge response. The site went down when I took a nap of 3 hours.

Woke up with calls from my team for a meeting. Saw the msgs of people telling me site is down.

Fixed the issue.

And now, I am updating it on regular speed.

Thank you for all the positive and negative feedback, Will consider it all in my future projects.

2 days ago

JoostBoer

I have no engineering background. I build websites and tools for a living. Claude Code changed what's possible for me in a way that's hard to overstate.

I can't evaluate the source code architecture. What I can say is that before this, I had ideas I couldn't execute without hiring a developer. Now I ship them myself. Not prototypes, not demos. Real products that people use and pay for.

The leaked internals are interesting to engineers. From where I sit, the interesting part is that it works well enough that someone without a CS degree can build production software with it. That's the actual story.

3 days ago

maxyurk

I don't think CS degree was needed pre-LLM era, you just needed to know what you're doing. Maybe learning and doing got faster, but you still need to learn and know. There are CS grads that don't know still. I hope you know what is required for "production software". Claude code doesn't.

3 days ago

AlexeyBelov

Have you also shipped a bot to post comments for you?

2 days ago

mtrifonov

The interesting thing about agent tool use is how binary it is. The agent either calls the tool or doesn't. The harder problem is social agency, where the AI has to decide whether to participate at all. We built a pre-filter for this (cheap model reads the room before the expensive model runs) and the failure modes are fascinating. The model would reason correctly in its chain-of-thought, 'this person is left hanging, I should respond' and then output the opposite boolean. Turned out Haiku has a systematic false-bias on boolean tool outputs. Had to invert the schema

3 days ago

techpression

519K lines of code for something that is using the baseline *nix tools for pretty much everything important, how do they even manage to bloat it this much? I mean I know how technically, but it's still depressing. Can't they ask CC to make it good, instead of asking it to make it bigger?

4 days ago

jen729w

Is it just me or do I not find the Claude Code application that fascinating?

I use it all day and love it. Don't get me wrong. But it's a terminal-based app that talks to an LLM and calls local functions. Ooookay…

4 days ago

59nadir

I think it's good that it's out there, and I wonder why Anthropic have been keeping it closed source; clearly they can't possibly think that the CC source code is a competitive advantage...?

Agents in general are easy to make, and trivial to make for yourself especially, and the result will be much better than what any of the big providers can make for you.

`pi` with whatever commands/extensions you want to make for yourself is better than CC if you really don't want to go through the trouble of making your own thing.

4 days ago

ramraj07

If you think this is not a competitive advantage then youre missing the point. LLMs arent so good that they work through bad abstractions and pretty much everyone has bad abstractions. CC is what invents some of the best abstractions (not the first). I think theyre they first ones who nailed subagents well. Theres a lot to learn from them and while im learning a lot from their source code my heart bleeds that this happened to them.

Sincerely, someone running a team building similar things for analytics.

3 days ago

ariwilson

why do you think agents you make yourself will be better for you? integration with tooling that you prefer? your local dev setup built in?

curious as i haven't gotten around to writing my own agent yet

4 days ago

59nadir

All of the above at exactly the token cost that it requires for you.

Anything general is always going to be worse for specific use cases, and agents from these big providers are very general. They'll spend tons of tokens doing things that you might not need, including spend extra tokens on supporting MCP, etc., when you might not even need that.

3 days ago

[deleted]
3 days ago

parasti

I feel the same way. Given it's AI-written, looking at the code isn't even worth it to me. I would rather read a blog post about how they develop it day to day.

4 days ago

dgb23

That’s what every agent does. They are fundamentally simple.

But you can do a lot of interesting things on top of this. I highly recommend writing an agent and hooking it up to a local model.

4 days ago

j45

Clever architecture often can still beat clever programming.

4 days ago

brandensilva

ccleaks.com seems to be "temporarily paused" from Vercel.

Here is another one that goes in depth as well: www.markdown.engineering for anyone going deep on learning.

3 days ago

vivzkestrel

would be nice if the transformers code for one of these frontier LLM models got leaked, HN will have a field day with a reveal like that

4 days ago

loveparade

I doubt there is anything special about the transformer code the frontier labs use. The only thing proprietary in it are probably the infrastructure-specific optimizations for very large scale distributed training and some GPU kernel tricks. The real moat is the training data, especially the RLHF/finetuning data and verifiable reward environments, and the GPU clusters of course.

The open source models are quite close, and they'd probably be just as good with the equivalent amount of compute/data the frontier labs have access to.

4 days ago

dgb23

That’s what I‘m thinking as well.

However, I assume that usage data could be increasingly valuable as well. That will likely help the big commercial cloud models to maintain a head start for general use.

4 days ago

AJRF

This is AI slop.

First command I looked at:

  /stickers:
  
  Displays earned achievement stickers for milestones like first commit, 100 tool calls, or marathon sessions. Stickers are stored in the user profile and rendered as ASCII art in the terminal.

That is not what it does at all - it takes you to a stickermule website.

What is the motivation for someone to put out junk like this?

4 days ago

thepasch

> What is the motivation for someone to put out junk like this?

Getting something with a link to their GitHub onto the frontpage of HN. Because form matters much more in this world than substance.

4 days ago

ricardobeat

Clout and reaching the top of HN apparently.

The animated explanation at the top is also way too fast at 1x, almost impossible to follow; that immediately hinted at the author not fully reading/experiencing the result before publishing this.

4 days ago

user34283

Why is it that some people feel entitled to take this kind of tone as soon as AI is used?

It's inappropriate to label a free side project 'junk' or 'slop' even if it contains major errors.

Particularly when there's a disclaimer about possible inaccuracies on the page.

4 days ago

euphetar

People don't like having their time wasted

3 days ago

AJRF

> Why is it that some people feel entitled to take this kind of tone as soon as AI is used?

BECAUSE ITS WRONG! THE DATA IS WRONG!

3 days ago

simple10

This tool is useful if you want to see all the internal commands claude agents are making in real-time:

https://github.com/simple10/agents-observe

3 days ago

jbdamask

Nice job - I'm a fan. Makes it easy to get the big picture so I know where to dive in.

3 days ago

deskamess

I like the Claude desktop interface. The color scheme, presentation, fonts, etc. Is there a CSS I can find for the desktop version - I assume it's using some kind of web rendering engine and CSS is part of it.

3 days ago

samuelknight

On the one hand I don't understand why it needs to be half a million lines. However code is becoming machine shaped so the maintenance bloat of titanic amounts of code and state are actually shrinking.

3 days ago

0zwan

I don't know why people obsess and spend so much time on this codebase. It isn't (and never was)alien technology. It's just mediocre typescript generated by an LLM

3 days ago

rhofield

Really nice visualisation of this, makes understanding the flow at a high levle pretty clear. Also the tool system and command catalog, particularly the gated ones are super interesting.

4 days ago

MattCruikshank

Don't do the "noise" thing this web page does. It hurts my eyes so bad. Why would you purposefully make your page look like a low-quality JPG?

3 days ago

pandemik09

To me the question isn’t “why is the codebase so large,” it’s “why is the model still being asked to do orchestration at all.”

3 days ago

tallesborges92

Please share the prompt/skills used to build it

3 days ago

simonreiff

Nice site. I might suggest moving SendMessage to the Hidden Features as they don't appear to have implemented a ReadMessage or ListMessages tools.

4 days ago

m132

I mean, I get it: vibe-coded software deserves vibe-coded coverage. But I would at least appreciate it if the main part of it, the animation, went at a speed that at least makes it possible to follow along and didn't glitch out with elements randomly disappearing in Firefox...

How is this on the front page?

4 days ago

brabel

It's on the front page because it looks really cool. You can complain about it being vibe coded, but it still looks good. If you ask Claude to allow the user to slow down the animation, it can do that quite easily, that's just not a problem caused by vibe coding. And I'm on FF and didn't notice anything glitching out.

4 days ago

lastdong

I hope /Buddy is ported across to OpenCode.

4 days ago

ramon156

Has anyone tried USER_TYPE "ant"? I might be crazy, but I have not hit my limit yet after 3 questions.

3 days ago

diyseguy

I'm just admiring the visualizations this guy built in under a day. Wondering how he did it so fast

3 days ago

_pdp_

Btw, the 500K is just the source - it does not include tests. I would imagine there are at least 2-4x tests.

3 days ago

shuntaka9576

A year ago I wouldn't have guessed a TUI could be a competitive advantage. But "harness engineering" became a thing, and it turns out the agent wrapper — tool orchestration, context management, permission flows — is where real product value lives. Not as much as the models themselves, but more than most people expected. This leak is a painful reminder of that.

3 days ago

RishabhGarg12

This is funny because this looks exactly like my vibe coded Portfolio - colors and all.

3 days ago

p2detar

So it does use ripgrep and not unix grep. [0] I knew it from some other commenters here on HN, but it's nice to see it in the source as well.

0 - https://github.com/zackautocracy/claude-code/blob/main/src/u...

4 days ago

kristopolous

I just stumbled on a fascinating replacement candidate while clicking around on embed models on hugging face: https://github.com/lightonai/next-plaid/tree/main/colgrep

it looks really interesting.

4 days ago

cubefox

I think this is unethical, and "everyone else is also doing it" is not a valid excuse.

3 days ago

lanbin

However, excellent development practices involve modularizing code based on functional domains or responsibilities.

The utils directory should only contain truly generic, business-agnostic utilities (such as date retrieval, simple string manipulation, etc.).

We can see that the code produced by Vibe is not what a professional engineer would write. This may be due to the engineers using the Vibe tool.

4 days ago

afferi300rina

That's the hallmark of "vibe coding": optimizing for immediate output while treating the utils folder as a generic junk drawer.

4 days ago

TeMPOraL

Another "hallmark" that happens to describe pretty much every codebase people wrote even before LLMs were a thing.

4 days ago

lll-o-lll

Sadly, the AI’s have been trained on human developed repos.

4 days ago

huflungdung

[dead]

4 days ago

sourcecodeplz

Nice presentation. The reality is there is nothing really special about the claude code harness?

4 days ago

nitnelave

Ah, good well-architected code, finally... With most of the code in utils/other :D

4 days ago

delphic-frog

Looks like ccleaks is down eek - not long before ccunpacked has same fate.

3 days ago

sscaryterry

No point in reading this, they are continuing to lobotomize it daily...

3 days ago

[deleted]
3 days ago

tonymucci

FYI - This pops at my work as a sec threat via Cisco Umbrella. :D

3 days ago

fsniper

Source leak or free code review? I can say that there is no bad publicity.

4 days ago

cjlm

I prefer this mapping from Nikita @ CosmoGraph: https://run.cosmograph.app/public/dfb673fc-bdb9-4713-a6d6-20...

4 days ago

rubenflamshep

This loads some bars and then just breaks for me :(

3 days ago

fersarr

why do people care so much? it's just an agentic loop

4 days ago

__alexs

Many people seem to believe the Claude Code has some sort of secret sauce in the agent itself for some reason.

I have no idea why because in my experience Claude Code and the same models inside of Cursor behave almost identically. I think all the secret sauce is in the RLHF.

4 days ago

dominotw

what is so fascinating about claude code. we have codex that is open source already. is there something special to learn from claude code?

3 days ago

bilalbayram

Same guide for opencode would be nice too

3 days ago

anonyfox

You guys all get it’s an April joke?

3 days ago

a3w

No mention of undercover mode?

3 days ago

ramon156

I expect dozens more "research articles" that

- find nothing - still manage to fill entire lages - somehow have a similar structure - are boring as fuck

At least this one is 3/4, the previous one had BINGO.

4 days ago

mdavid626

How the hell is it 500k lines?

4 days ago

twsted

It is vibe coded.

4 days ago

dankobgd

it's just bunch of useless junk

3 days ago

spirelab

I got a goose

War flashbacks to genshin

4 days ago

Hannah_Adam

Is that safe to use?

3 days ago

Vektorceraptor

Hey, nice job! Next time tell calude to add some explosions, car crashes and stuntment into the design! Who cares about content anyway ... https://speculumx.at/blogpost/getting-sick-of-ai-slop

4 days ago

blueTiger33

its April fools joke. this has really gone wide

3 days ago

chrz

nice example: Find all TODO spin the AI machine

i do shift ctrl F

4 days ago

fartfeatures

Ccleaks is down?

4 days ago

inside_story

cool Archaeologization Collection Output

4 days ago

root1107

nice

2 days ago

ezekiel68

Now we just let the AI "move fast and break things".

3 days ago

phplovesong

Holy fukking hell thats some bad code. Full blown AI slop.

3 days ago

steveharing1

Anthropic is now more open than OpenAI Itself lmao!

3 days ago

kinnth

this claude code leak is such a fuck up...

The fact that now every agent designer knows what was already built is a huge shot of steroids to their codebase!

3 days ago

jruohonen

Thanks, I'll use this for teaching next week (on what not to do). BashTool.ts :D But, in general, I guess it just shows yet again that the emperor has no clothes.

4 days ago

dgb23

Are you not feeling the vibes?

In all seriousness. I think you‘re supposed to run these in some kind of sandbox.

4 days ago

petesergeant

> it just shows yet again that the emperor has no clothes

Which emperor, specifically?

4 days ago

roboteachai

[dead]

6 hours ago

bustah

[dead]

3 days ago

pukaworks

[dead]

a day ago

techpulselab

[flagged]

3 days ago

Adam_cipher

[flagged]

3 days ago

teckgeekz

[dead]

3 days ago

gbibas

[dead]

3 days ago

[deleted]
3 days ago

AIOperator2026

[dead]

3 days ago

neuzhou

[dead]

4 days ago

tiveriny

[dead]

4 days ago

delineato

[dead]

4 days ago

panavinsingh

[dead]

2 days ago

treysu

[dead]

4 days ago

barazany

[flagged]

2 days ago

kevinbaiv

[dead]

4 days ago

skrun_dev

[dead]

3 days ago

kuitos

[dead]

3 days ago

pmakhija3

[dead]

4 days ago

hahhhha500012

[dead]

4 days ago

T3RMINATED

[dead]

3 days ago

tontinton

[flagged]

4 days ago

robonot

[flagged]

4 days ago

stingraycharles

Please don’t use AI to write comments on HN.

4 days ago

[deleted]
4 days ago

robonot

huh?

4 days ago

stingraycharles

You edited your comment. It very much first said something about using regexes as being the most important takeaway and whatnot.

4 days ago

[deleted]
3 days ago

sscaryterry

Enshitification galore

3 days ago

ramraj07

What exactly is shitty here? A program i use for hours every day to do the job previously done by many N human beings, without many bugs, seems to have code thats seemingly messy but still clearly works.

3 days ago

sscaryterry

Maybe if it was working the way it was 2 months ago, life would be good?

3 days ago