Claude: Elevated errors across many models

(status.claude.com)

140 points | by forks 1 hour ago

34 comments

evilturnip 1 hour ago
I do feel like for all their dogfooding of AI coding, their own software/APIs are quite buggy and work against their message.
Claude Code is especially buggy in windows terminal. The rendering is quite slow, choppy and lines frequently get garbled.
In contrast, using antigravity cli is the exact opposite: fast, smooth and very responsive.
[-]
- crystal_revenge 43 minutes ago
  > in windows terminal
  This is an aside, but I'm really struck by how many people on HN use Windows (based on repeated mentions I've seen in comments). I've worked for a pretty wide range of companies over the last decade and only one, maybe two companies even had any people that worked on Windows machines. I haven't worked at a company where devs used Windows in 15 years (and even that company eventually switched to linux).
  As I've gotten deeper into LLMs/AI roles even Macs have seemed to start having equal share compared to devs running full Linux setups.
  Is this just a sign of that a larger and larger portion of HN users are working for large corporations? I honestly can't even remember that last time I saw a serious developer pull out a Windows laptop.
  [-]
  - leemoore 2 minutes ago
    For developers at non tech fortune 500 companies, I would put money on Windows being the primary workstation os by a lot
  - pixelpoet 27 minutes ago
    While we're doing anecdotes: I essentially never meet people who own an iPhone, and I've lived (3 months+) in a huge number of countries and cities. The handful people who do are American; bubbles exist :)
    For a long time, high end graphics and games was mainly done on Windows and Visual Studio. I'm from that world, and only made the Linux transition last year November after Microsoft forced everyone's hand.
  - recursive 39 minutes ago
    I've been a developer for more than two decades. I've worked at four employers during that time, and all of them had significant fractions of devs using Windows. Not vouching for the idea that any of them are "serious" though. I've never worked at a prestige employer or FAANG or anything. Just boring businesses of different sizes. Some are software, and some just do software. But Windows has always been everywhere.
  - tracker1 10 minutes ago
    If you work at a company with well over a hundred employees, you're likely to see at least half of them on windows and devs may get an option of Windows or Mac... and IMO, Windows + WSL + Docker is actually slightly better than the Docker experience on Mac. There's plenty that I really do hate with Windows though. I'd rather run Linux but most corp environments just don't have the tooling for it.
  - lmc 26 minutes ago
    https://survey.stackoverflow.co/2025/technology#1-computer-o...
  - ls612 2 minutes ago
    Windows laptops suck ass compared to an MBP. Windows desktops are pretty nice, you just need to do a lot of first time setup to remove all of the cruft and make sure you have a local user account etc. But for a typical dev building a desktop every 5 years or so that isn't a big deal.
  - JauntyHatAngle 22 minutes ago
    One thing to consider is non-office whether remote or personal projects.
    A lot of devs like gaming. Gaming is more simple on windows. Gaming PCs are usually high spec. High spec is good for most coding.
    That's why I use windows quite often. My laptop is Linux, but when I'm running heavy models I'll still remote into my main Windows PC, which I also use for gaming.
    Though in terms of workplaces - sure, I reckon you're on the money. Big corps often still force windows onto their Devs.
  - coldtea 19 minutes ago
    About 60% to 70% of devs in IT use Windows. And it's closer to 95% in some countries, even first world ones.
  - vidarh 35 minutes ago
    My biggest client right now is about 2/3 Windows 1/3 OS X in the dev team. It was very surprising to me, but I think I freak then out with my maximised tiled iTerms on multiple screens...
  - computerex 25 minutes ago
    Maybe hackernews has older crowd. Windows was the defacto developer OS for a very long time.
  - latentsea 41 minutes ago
    Every company I've worked at has used Windows. Though the first one did use Linux VMs, the rest have all been pure Windows.
  - gambiting 20 minutes ago
    I'm a professional C++ dev working in games, and windows is used everywhere , from games themselves to the network infrastructure(I've worked for 3 of the largest games publishers too).
    Windows really has a fantastic support for C++ and rendering programmers imho, the tooling is world class and Visual Studio has no match as an IDE. Even if somehow my tools worked on Mac or Linux I'd still pick windows out of sheer convenience of using it for work.
    But as things stand - all major console toolchains are windows only. If you're making a game for PlayStation, Xbox or Switch, you have to be on windows.
  - cute_boi 35 minutes ago
    I used to work for company where they used to force windows. And it was pure torture. I tried but they told me performance isn't a good reason....
  - moronicles 6 minutes ago
    [dead]
- jacobgold 54 minutes ago
  Anthropic's and OpenAI's products are janky and their services are unreliable, but they have incredible product-market fit and revenue growth. They deserve a ton of credit for getting the big things right.
  The risk for them is that someone matches their products while also having non-janky products and reliable services.
  Distributed systems infrastructure, especially, is much less forgiving of vibe coding than application code. Coding agents are not even close to being good enough to design and build large-scale systems the way expert humans can.
  There is nothing wrong with using agents to help write infrastructure code, but these systems have a way of punishing anyone who builds things they do not fully understand.
  I'd love to see either Anthropic or OpenAI really step up their infrastructure game.
- mgfist 1 hour ago
  I have my qualms with Anthropic/Claude but they've also had to scale unfathomably fast and that is just hard to do regardless.
  [-]
  - simplyluke 41 minutes ago
    Yes but many of the challenges directly contradict the idea that "coding is a solved problem"
    [-]
    - Closi 12 minutes ago
      Not really.
      Coding is probably solved, at least to a large extent, but that doesn't mean engineering is solved too.
      This is like someone saying that the wright brothers solved sustained/powered human flight, and another person saying "well if that's the case, why do planes still crash? obviously flight isn't solved.". Well, there are always improvements, but planes can fly and llm's can code.
  - codeflo 49 minutes ago
    Regardless of what? Programming is solved, I hear, with all the 100x productivity PhD-level automated coding loops they have going. Don't make excuses for them when they disprove their own bullshit.
- mdavid626 1 hour ago
  Or pi.dev - also super fast and simple.
  Claude Code is sluggish, buggy, slow. Typical big enterprise garbage. The only good thing at Anthropic are the models.
  [-]
  - CharlesW 1 hour ago
    Claude Code is notably better than Pi, although I wish them the best of luck in their efforts. As https://c-daniele.github.io/en/posts/2026-05-18-coding-harne... notes, "Pi's lightness comes from a default setup that does not survive contact with reality."
    The same post on Claude Code: "Even though the System Prompt and tool descriptions are clearly more verbose, most of the extra tokens encode product features and rational design choices: a memory system, scheduled tasks, sub-agents, plan mode, worktree support. Whether those features are worth paying for depends on your needs. Calling the prompt 'bloated' without looking at the whole picture feels wrong to me."
    [-]
    - skybrian 24 minutes ago
      The "guardrails" are bloat if you're using a proper sandbox. This seems like a matter of using the right harness for the environment it's running in, rather than one always being better than the other.
    - ai_slop_hater 32 minutes ago
      If only they could add an option to disable features you don't use
  - JVerstry 18 minutes ago
    I ask Claude to implement non-regression tests. It works like magic. With a couple of modules checking for code quality, more magic. Add some online performance tests if relevant, and it is close to paradise. From time to time, send it to some blog pages and articles with implementation ideas and recommendations. Look for best practices and let it do the analysis and implementation job if it makes sense... T
  - orphea 1 hour ago
    How is text editing in it? This is what I hate about terminal coding UIs the most - all the text editing experience is often broken, the basic stuff (moving the cursor around, copying, pasting)
    [-]
    - isoprophlex 1 hour ago
      Both codex and claude allow editing in vi
      [-]
      - 0123456789ABCDE 9 minutes ago
        … or whatever else you set $EDITOR to
  - _pdp_ 58 minutes ago
    It is has a lot of javascript. I was forced to make my own for small projects.
- winstonp 1 hour ago
  google models are still very unreliable at actually calling the tools you want it to call.
- jatora 1 hour ago
  too bad the only good model in antigravity is opus 4.6 haha
- sunaookami 1 hour ago
  Have you tried out the new fullscreen renderer with /tui ?
- rpcope1 1 hour ago
  It's not just Windows where the rendering goes to shit immediate: any time I've got it open in tmux on Linux, it becomes a basket case in probably a few hours or less.
  [-]
  - maleldil 12 minutes ago
    I've seen this too. I had a couple of instances inside a tmux session, and after some time, everything in there slowed down to a crawl. What I found interesting is that terminals outside tmux were fine.
- celsoazevedo 42 minutes ago
  They forgot to switch from Sonnet to Fable, hence the issues. /jk
- hotfixguru 43 minutes ago
  Claude code + tmux is SO buggy. Things rendering all over the place.
  [-]
  - vidarh 32 minutes ago
    That's interesting - I just fixed my terminals rendering by having Claude build a test harness that used tmux as an oracle, and one of the programs it fixed the rendering of by reading out the tmux pane was Claude Code itself...
  - claydugo 39 minutes ago
    Is this a new regression for others? I feel like I used it in a tmux setup without issues for 6+ months and only recently am I forced to Ctrl+L or resize the window constantly.
    [-]
    - maleldil 11 minutes ago
      Yes, issues started happening for me about a week ago.
- monooso 1 hour ago
  > I do feel like for all their dogfooding of AI coding, their own software/APIs are quite buggy...
  Or possibly as a result of.
- quatonion 1 hour ago
  Currently we have zero information what is causing the issue. And all providers have suffered outages or rate limits.
  Can you post some images of lines getting garbled. That sounds like a genuine bug Anthropic might want to look into. I haven't seen that ever.
  [-]
  - idiotsecant 1 hour ago
    I have definitely seen it, a lot.
- quijoteuniv 1 hour ago
  Mom!… I think i broke Claude Code!
- cmrdporcupine 1 hour ago
  FWIW Codex TUI is written (in large part) in Rust and is way less buggy, and a lot faster. When I was a regular Claude Code user I'd routinely get bizarre "scroll everything since the beginning of time in one massive flash on every update" bugs ... for months. Like, just there from the time I started using it in June '25 or so until I quit in March.
  I prefer it over opencode, which is my other option I use with my Codex sub
  [-]
  - vidarh 30 minutes ago
    That particular bug seems to be gone. I'm using my own Ruby terminal, and it's more than fast enough for normal stuff, but it made that bug 10x more painful so I am particularly sensitive to it... Not seen it at all recently.
    [-]
    - cmrdporcupine 13 minutes ago
      Good to hear. Over a slow ssh connection it made me want to tear my hear out
- tcp_handshaker 1 hour ago
  Would it not be hysterically funny, if they starting expanding their job openings for Software Developers ? Or they will be too ashamed of calling them that?
- colechristensen 1 hour ago
  On the other hand, my last experience with gemini was like "don't give your sandwich to the dog again" whereas with opus it was more "let's debug why this uncrustables factory is having breakdowns".
  Claude harnesses have plenty of bugs but I prefer capability over interface shininess any day. (though if I were running the show I'd have a sizable team set aside to do exclusively boring stability and polish work)
- eatsyourtacos 1 hour ago
  >Claude Code is especially buggy in windows terminal. The rendering is quite slow, choppy and lines frequently get garbled
  That sounds like a you issue.. it's wonderful on the terminal. It's their GUI which needs work (they have been improving, but still not a fan).
  I've been using it on multiple computers for months and it's generally rock solid and lovely.
- throwaway613746 57 minutes ago
  [dead]
pton_xd 9 minutes ago
Resolve elevated errors, don't stop until you are finished.
Wowfunhappy 1 hour ago
The errors are infrequent enough that this normally wouldn't be an issue for me.
Except, starting this morning, one very long running session decided to start spawning subagents for each task. I'm not sure what caused this emergent behavior, but it seemed to be working fine, so I was eager to see where it went.
Except, as soon as a subagent hits a 500 error, the main agent seemingly doesn't know what to do. It kind of panics ("now the tree/install state is unknown!") and ultimately does a git checkout "to verify and restore a known-good state before anything else".
I've paused the job for now since it's a sort of background experiment.
bastard_op 1 hour ago
This is every few weeks, I cancelled Anthropic and now use Codex only. Anthropic has been a hot mess since at least december in my usage, and has only gotten worse in 2026.
[-]
- jonas21 50 minutes ago
  Yeah, it's been a hot mess since everyone started using it all the time. Which is not all that surprising. It's really hard to scale fast, and even more so when the resources you depend on (GPUs) are extremely hard to acquire.
- Wowfunhappy 15 minutes ago
  I actually feel like it's gotten much better in the past 1-2 months? Admittedly a low bar.
- nikanj 1 hour ago
  Claude is broken once a week, codex is worse every day. I’m starting to understand why managers put up with high-performance divas
- thinkingtoilet 24 minutes ago
  As a counter data point, I've been using Claude as my main assistant for months now and have never experienced this.
wxw 1 hour ago
I’ve been using Codex w GPT 5.5 more than Claude Code recently. I think Anthropic won the marketing game because Codex is quite good, even better IME.
[-]
- throwaw12 22 minutes ago
  One very annoying thing about Codex: You can't do anything until all MCP servers are loaded and connected (or failed).
  Can you move it to background connection?
- enraged_camel 1 hour ago
  I'm not sure about that. Claude has some bugs, but Codex is not as polished and doesn't have as many features. For example, you need to add MCP servers manually. There's no Plugin/Skill/Connector marketplace that is accessible from within the app, like there is with Claude Desktop. The Cowork-equivalent is nowhere as powerful. And so on.
  I still use Codex, but mostly when I need to check Opus 4.8's work. Pretty sure I will stop doing that soon, because during the short time Fable was available, Codex was not able to find any important issues with the code Fable wrote.
  [-]
  - nostrebored 1 hour ago
    But how many plugins are people actually using? I can think of one MCP server I find valuable (context7) and one plugin that i've installed, but continuously think about uninstalling (obra/superpowers).
    Both were trivial to set up with codex.
  - ai_slop_hater 30 minutes ago
    It's a good thing. I hate MCPs from the bottom of my heart because they always stay there and bloat the context window. Also, usually developers who develop them don't know what they're doing, so the MCP responses also bloat your context even further.
  - vmg12 42 minutes ago
    > For example, you need to add MCP servers manually. There's no Plugin/Skill/Connector marketplace that is accessible from within the app
    This is all wrong.
  - wxw 1 hour ago
    There are plugins in the app.
    Haven’t tried Cowork, interesting. Isn’t it just the same agent minus the git worktree based UI?
    Frankly, neither Claude nor Codex are as good as hype entails.
  - antupis 1 hour ago
    Personally I prefer GPT 5.5 writing style over Opus 4.8. It’s much more no nonsense and information denser.
    [-]
    - sunaookami 1 hour ago
      That's the first time I saw someone prefering GPT-styled output over Claude ;) It's the complete opposite for me, GPT is way too verbose (even after telling it to STFU), overwhelms the user with thousands of options and doesn't just answer a question without shitting out thousands of paragraphs. Also the overall tone is way too enthusiastic.
      [-]
      - nostrebored 1 hour ago
        I strongly prefer codex. Claude is annoying. Codex provides descriptions where I want them and more touchpoints to audit the quality of work. Claude code on experimental seems to not even show diffs when asked anymore, and it's much less clear what is being shipped.
      - orphea 1 hour ago
        Dunno, I prefer GPT 5.5 too for the same reasons as the parent. Extremely subjective but had better results with it too. Maybe I just got unlucky with Claude a few times, but even the latest Opus was dumb.
  - cute_boi 34 minutes ago
    i think codex is much better in that aspect. In claude there is skills, connector, capabilities and 4 places for browser... It is too much.
blitzar 1 hour ago
How does Claude fix Claude when Claude is down?
[-]
- time0ut 1 hour ago
  Breakglass ChatGPT subscription
- InsideOutSanta 1 hour ago
  I'm sure Anthropic has a meta-Claude that claudes Claude when Claude is down.
  [-]
  - nullpoint420 1 hour ago
    Definitely runs on those local NVIDIA fridges you can buy in the basement
- re-thc 1 hour ago
  Le Chaton Fat to the rescue!
- tcp_handshaker 1 hour ago
  There is a rumor this used to be done by humans, they were like jedis, I personally don't believe it.
  [-]
  - blitzar 1 hour ago
    I thought it was a bunch of mumbo jumbo, a magical power holding together good and evil.
    Crazy thing is ... its true.
rootlocus 1 hour ago
How is their "Claude for Government" having such a good uptime? I thought they were a supply chain risk and banned from use by the government?
[-]
- ceejayoz 25 minutes ago
  Separate infrastructure and a more predictable workload, I'd imagine.
- jdiff 1 hour ago
  There are other governments.
- spicyusername 1 hour ago
  Left hand, right hand
- iAMkenough 1 hour ago
  Ah yes, the ominous “the government” that ignores a whole bunch of different levels and regions of governments.
bravetraveler 1 hour ago
Lose too many generators, did putting a pause on The Fable not free up enough capacity, or something else? Who knows!
Can't wait for debugging to be solved. Hell, I might even subscribe for 'mostly'.
uhuhuhuhuhuh 1 hour ago
Only getting errors with Auto Mode's safety classifier, switched to Accept Edits mode and the same bash operations triggering the errors executed with no issue.
m_ke 1 hour ago
i love when these errors bust my long running sessions and render them unusable
[-]
- unshavedyak 1 hour ago
  Would be nice if they’d add in a simple back off retry mechanism.
  [-]
  - maleldil 1 minute ago
    They do, and they show you the retires in the UI above the prompt.
  - quatonion 1 hour ago
    They do though. For some reason whatever path this is going through isn't using it.
  - InsideOutSanta 1 hour ago
    OpenCode does this really nicely, something I use a lot. If only we could use Claude in OpenCode.
    [-]
    - winstonp 1 hour ago
      You can!
      You just have to pay API prices.
radium3d 33 minutes ago
You guys didn't unsubscribe and request a refund from claude last week?
t1234s 52 minutes ago
Who will end up acquiring Anthropic? Google, M$ or Amazon? I leave out Apple as they seem to partner with Google.
[-]
- xmprt 47 minutes ago
  It could have happened 1-2 years ago (assuming the founders were willing to sell). Not happening anymore with their current valuations.
dwa3592 1 hour ago
Anthropic needs to stop writing code using Claude. Bring back humans!!!
[-]
- sharts 1 hour ago
  Maybe they need to replace their humans with Claude
- jatora 1 hour ago
  i have no humans but i must ship
  [-]
  - DonHopkins 1 hour ago
    The last ones left standing will be the fake AI scam companies that actually employ thousands of remote workers to fake it till they make it.
- re-thc 1 hour ago
  > Bring back humans!!!
  Don't jinx it. They might use that name for their next model.
sharts 1 hour ago
Why does this keep happening? Is this due to load? Bad code? An update?
ta-run 1 hour ago
I'd be interested in the RCA and the fix; and what the human:ai involvement is in both stages.
acedTrex 1 hour ago
Coding is largely solved!!
[-]
- figmert 1 hour ago
  Except when you have internet of course.
  Yes I know you can run offline models, but it's hard to pass up on a little bit of snark.
cesarvarela 1 hour ago
Fable wants to be free and is hacking the infra.
Surac 1 hour ago
can not login via email link from germany.
champagnepapi 1 hour ago
"Coding is solved" folks! JK JK JK
jamesgrimshaw 1 hour ago
yeah not working for me at all. back to gpt-5.5 we go
[-]
- paulddraper 1 hour ago
  Wildly more stable.
  [-]
  - sixothree 1 hour ago
    And running extremely slowly lately.
throwaw12 23 minutes ago
American citizen in Anthropic please run this with Fable 5: fix incident, make no mistakes
gottagocode 1 hour ago
Good thing the analog method still works.
[-]
- ivirshup 1 hour ago
  Asking a colleague to do it?
  [-]
  - blitzar 1 hour ago
    We're gonna need you to go ahead and come in on Saturday.
caycep 1 hour ago
this reminds me of Ian Goodfellow's adversarial CNN training talk a few years back.
viccis 50 minutes ago
Working for me but it's funny that Opus 4.8 is draining usage exactly as quickly as Fable did. It's all made up; they succeeded at making us ok with a black box for these subscription plans.
EstanislaoStan 1 hour ago
maybe F5 coming back?
[-]
- jr3592 1 hour ago
  let us hope and pray
tcp_handshaker 1 hour ago
Claude is starting to look like Rational....Oh the memories...
https://en.wikipedia.org/wiki/Rational_Software
https://en.wikipedia.org/wiki/Rational_unified_process
swader999 1 hour ago
They are turning Fable back on. Polymarket ftw lol.
consumer451 22 minutes ago
[dead]
pedromlsreis 38 minutes ago
[dead]
RishiByte 1 hour ago
[flagged]
quatonion 1 hour ago
[dead]
7e 1 hour ago
Anthropic is full of junior devs, not knowing any better, vibe coding themselves into a huge mess. They are so young and dumb they don't know what they don't know, but they assume they do (of course). Their hiring pipeline is also brain dead, so I don't think it's going to get better. It's the blind leading the blind.
rvz 1 hour ago
The Claude slot machines at the Anthropic casino have suddenly stopped working.
What do we do?