They changed it do all of the changes in a virtual cloud environment, then dump the final result at the end of the response. Before it would stream changes, so if it made a minimal fix, then decided to go off on a tangent you could stop it quickly. Now you have to wait 5+ minutes to get a single line of code out of it just to find out it also refactored everything and burned a stack of tokens. No amount of prompting seems to force it to make incremental changes locally.
> They changed it do all of the changes in a virtual cloud environment, then dump the final result at the end of the response.
That’s a hallucination. All they did was hide thinking by default. Quick Google search should easily teach you how to turn it back on (I literally have it enabled in my harness).
I am using Copilot in VSCode and it does stream the thinking output to me. At some point it will say something like "Implementing changes..." similar to "Thinking...", but there is no content to expand. ChatGPT and local models always push the code changes in small chunks. Claude used to and at some point changed.
Can you blame them for believing thinking tokens are completely hidden now? Anthropic has changed the way to see it 3 times in 3 months with no warnings or visible upgrade path. First it was shown by default, then you had to press control+o, then control+t, then it got locked behind a settings.json, then you had to manually enable with --verbose, now it's some random ENV var.
Whoever is their product manager should be embarrassed at the UX they provide.
Product managers reduce velocity. The behavior changes every time another instance of Claude Code thinks something else would be a marginal improvement, with no further oversight or thought put into it.
Comments here overall do not reflect my experience -- i'm puzzled how the vast majority are using this technology day to day. 4.7 is absolute fire and an upgrade on 4.6.
I suspect the distinction is API vs subscription. The app has some kind of very restrictive system prompt which appears to heavily restrict compute without some creative coaxing. API remains solid. So if you're using OpenCode or some other harness with an API key, that's why you're still having a good time.
With respect, I don't think you've used the latest models and have not seen Anthropic's enterprise revenue hockey-stick like number. They are so busy outfitting fortune-500, you can't even get someone in sales to respond to emails. I've been waiting for months and so have others.
Wildflower Health | Junior Software Engineers | Remote (US) | Full Time
Wildflower provides a modular suite of software, support, and services, partnering with healthcare providers and payers at any stage of their journey to value. We address critical gaps by breaking down organizational silos and seamlessly integrating with existing clinical and operational workflows. Our approach focuses on the whole person, empowering payers and clinicians to efficiently identify patient needs and coordinate essential resources that bridge gaps in maternity care.
Tech Stack: Kubernetes, Node, Mongo (Backend) React Native Mobile App (Front End) Leveraging cutting edge AI tools for development.
Website: https://www.wildflowerhealth.com/
To apply, send your resume and cover letter to chris dot reath at wildflowerhealth dot com.
Good luck finding an employer that lets you do this moving forward. The new reality is that no one can give the estimates they previously gave for tasks. \
"Amazingly, I’m faster, more accurate, more creative, more productive, and more efficient than AI, when you price everything in, and not just code tokens per hour."
"After 2.5 years of insane hype, there’s no evidence that current AI is making the design process faster"
This is antidotally untrue. I work for a small startup. We don't have the money / aren't willing to pay for a full-time designer. So, let's just say, our UI design has always been pretty terrible. With AI, using claude, to generate design and HTML / CSS based on requirements, the design that has been generated has been heads and tails better than anything we ever came up with alone.
I've tried do this a couple of times in the last year, but it always seems to backfire, specifically with Gen Z. After I apologize, they seem to think of it as an opening to continue to shit on me and bring up more of my shortcomings -- which is just bizarre to me. Because of this, it's made me really second guess acknowledging when I think I've made a mistake.
reply