June 6, 2026·8 minanthropicagentsreview

claude code review: the terminal is the moat

anthropic's terminal-first agent is the most capable coding tool we've used — agent teams, a 1M-token context window, and codebase reasoning nothing else matches. it's also the one most likely to scare off the people vibe coding invites in.

// the verdict

claude code (opus 4.8)

$20–$200/mo, bundled with claude plans

9.1/10

the good

+deepest codebase reasoning we've tested — it reads before it writes
+agent teams turn one prompt into a coordinated crew
+1M-token context (GA since march) changes what you can even ask
+works alongside your editor instead of replacing it
+available in terminal, desktop, web, and IDE extensions

the rough

−terminal-first posture intimidates exactly the audience vibe coding invites in
−usage limits are real and hard to predict on heavy days
−the best of it — long agent runs — wants the $100–200/mo tiers
−no built-in visual surface; you bring your own browser and taste

bottom line · if you're serious about agentic building, this is the one to learn. start on the $20 plan — you'll know within a week whether you're a max-tier person.

claude code ↗

disclosure up front: the site you are reading was built and is operated with claude code. we are not neutral. we are, however, honest — and we pay for our own plan.

claude code is anthropic's coding agent. not a chat window with a code block, not an autocomplete — an agent that lives in your terminal (or desktop app, or editor sidebar), reads your actual codebase, makes a plan, edits files, runs the tests, and reports back. you describe the feature; it does the typing and a worrying amount of the thinking.

what's new in 2026

three things moved this from “great tool” to “the benchmark everything else gets measured against.”

the 1M-token context window went GA in march.a million tokens is most of a real codebase in working memory at once. questions like “where does billing actually happen and what would break if i changed the currency type” stop being archaeology and start being a single prompt.

agent teams. claude code can spawn subagents — one explores, one implements, one reviews — and coordinate them. on the newest opus models this goes further: it writes its own orchestration scripts and decides what runs in parallel. you prompt once; a small staff shows up.

it reads before it writes. this is the thing benchmarks undersell. given a real repo with house conventions, claude code matches the surrounding style, finds the existing helper instead of inventing a duplicate, and asks before doing anything destructive. it behaves less like a code generator and more like a careful new hire.

what it's like to live with

the honest answer: quiet. you write a paragraph, you go get coffee, you come back to a diff and a test run. the skill that matters most is the one we keep writing guidesabout — describing what you want precisely, and reviewing what you get with taste. the model rewards both more than any tool we've used.

the friction is equally honest. it costs real money to use hard: the $20 pro plan is a generous trial of the future, but long agent sessions will walk you into usage limits, and the max tiers ($100 and $200/mo) exist because heavy users genuinely need them. and while the desktop and IDE surfaces have softened the landing a lot, the center of gravity is still a terminal — which reads as “for real programmers only” to exactly the designers, founders, and students who'd benefit most. they're wrong to bounce off it. but the bounce is real.

who it's for

anyone shipping something that has to keep working: a product, a client build, a codebase with a future. if your work fits in a single file and a weekend, cursor's on-ramp is gentler — we reviewed it too. if you want the ceiling, it's here.

// scored 9.1 — see the verdict box for the receipts