mirzap · 2026-04-12T18:35:02 1776018902

Those towns and villages will be rebuilt after the war. This is not an excuse for what Apple did; it is a justification for ethnic cleansing and occupation. Same as with Gaza City. It has existed for 3500 years, it will be rebuilt, and it will outlive the US/Israel for sure.

threethirtytwo · 2026-04-12T18:37:41 1776019061

How is removing a city from a map some sort of sign that Apple did it for malicious reasons?

mirzap · 2026-04-12T21:10:43 1776028243

Given the context, how is it not? These are not some random places on the map that have disappeared.

bigyabai · 2026-04-12T18:39:03 1776019143

I'm sure Apple doesn't see it as malicious, and that's precisely the issue. Apple's political grandstanding has forced them into awkward and contradictory positions.

walletdrainer · 2026-04-12T18:41:36 1776019296

[flagged]

threethirtytwo · 2026-04-12T18:44:02 1776019442

What is the signal? Not snark, I’m not well informed and I need it spelled out.

walletdrainer · 2026-04-12T18:57:19 1776020239

The bulk of Israelis want to annex this territory and use it as an empty buffer zone, exterminating everyone who refuses to leave.

This genocide of course involves deleting those villages.

mirzap · 2026-04-01T07:50:37 1775029837

It looks like AI-generated slop. Can't believe people would poison context with things like this.

IxInfra · 2026-04-01T15:04:38 1775055878

it was a genuine question...

mirzap · 2026-03-28T22:48:18 1774738098

https://wccftech.com/ddr5-prices-just-posted-their-first-dro...

mirzap · 2026-03-25T14:44:10 1774449850

Repo: https://github.com/openai/parameter-golf

mirzap · 2026-03-19T21:43:58 1773956638

Why would they have that feature in the Claude Code CLI if it goes against the ToS? You can use Claude Code programmatically. This is not the issue. The issue is that Anthropic wants to lock you in within their dev ecosystem (like Apple does). Simple as that.

serf · 2026-03-19T23:12:12 1773961932

allowed shell pipes doesn't necessarily mean they want loops running them.

One of the economic tuning features of an LLM is to nudge the LLM into reaching conclusions and spending the tokens you want it to spend for the question.

presumably everyone running a form of ralph loop against every single workload is a doomsday situation for LLM providers.

whateveracct · 2026-03-19T23:20:34 1773962434

> allowed shell pipes doesn't necessarily mean they want loops running them.

insane that people apologize for this at all. we went from FOSS software being standard to a proprietary cli/tui using proprietary models behind a subscription. how quickly we give our freedom away.

gck1 · 2026-03-21T17:55:27 1774115727

Anthropic itself advertised their own implementation of an agentic loop (the Ralph plugin). Sure, it worked via their official plugin, but the end result for Anthropic would be the same.

There's nothing in the ToS that prevents you from running agentic loops.

mirzap · 2026-03-13T11:51:43 1773402703

Official website: https://viteplus.dev/

mirzap · 2026-03-03T16:16:57 1772554617

And you shouldn’t verify. Many companies offering these identity verification services have ties to the intelligence networks of a country that shall not be named (similar to most VPN services that are supposedly there to protect your anonymity).

smallstepforman · 2026-03-03T16:59:01 1772557141

No Such Agency is the biggest government data collection agency, why not name the hosting country?

a456463 · 2026-03-03T19:59:31 1772567971

yoU Said it All

mirzap · 2026-02-19T04:54:51 1771476891

They are not losing money on subscription plans. Inference is very cheap - just a few dollars per million tokens. What they’re trying to do is bundle R&D costs with inference so they can fund the training of the next generation of models.

Banning third-party tools has nothing to do with rate limits. They’re trying to position themselves as the Apple of AI companies: a walled garden. They may soon discover that screwing developers is not a good strategy.

They are not 10× better than Codex; on the contrary, in my opinion Codex produces much better code. Even Kimi K2.5 is a very capable model that I find at least on par with Sonnet and very close to Opus. Forcing people to use ONLY a broken Claude Code UX with a subscription only ensures they lose the advantage they had.

rjh29 · 2026-02-19T05:23:02 1771478582

> "just a few dollars per million tokens"

Google AI Pro is like $15/month for practically unlimited Pro requests, each of which takes a million tokens of context (and then also performs thinking, free Google search for grounding, and inline image generation if needed). This includes Gemini CLI, Gemini Code Assist (VS Code), the main chatbot, and a bunch of other vibe-coding projects which have their own rate limits or no rate limits at all.

It's crazy to think this is sustainable. It'll be like Xbox Game Pass - start at £5/month to hook people in and before you know it it's £20/month and has nowhere near as many games.

harrall · 2026-02-19T06:33:45 1771482825

OpenAI only released ChatGPT 4 years ago but…

Google has made custom AI chips for 11 years — since 2015 — and inference costs them 2-5x less than it does for every other competitor.

The landmark paper that invented the techniques behind ChatGPT, Claude and modern AI was also published by Google scientists 9 years ago.

That’s probably how they can afford it.

illiac786 · 2026-02-19T07:39:50 1771486790

I agree that the TPUs are one of the things that are underestimated (based on my personal reading of HN).

Google already has a huge competitive advantage: they have more data than anyone else, they bundle Gemini with every Android device to siphon even more data, and they own the Android platform. The TPUs truly make me believe there actually could be a sort of monopoly on LLMs in the end, even though there are so many good models with open weights, so few (technical) reasons to create software that only integrates with Gemini, etc.

Google will have the lion's share of inference, I believe. OpenAI and Claude will have a very hard time fighting this.

touristtam · 2026-02-20T04:37:55 1771562275

I can see it at £18.95 from the UK, which is almost double that. I guess this is an oversight on your part, or maybe you were quoting from memory.

gbear605 · 2026-02-19T06:47:15 1771483635

I’m not familiar with the Claude Code subscription, but with Codex I’m able to use millions of tokens per day on the $200/mo plan. My rough estimate was that if I were API billing, it would cost about $50/day, or $1200/mo. So either the API has a 6x profit margin on inference, the subscription is a loss leader, or they just rely on most people not to go anywhere near the usage caps.
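
The arithmetic behind that estimate can be checked with a short sketch. The three dollar figures are the ones quoted in the comment; the function names are hypothetical, and the "implied billing days" reading is an assumption added here to show how $50/day lines up with $1200/mo.

```python
# Figures quoted in the comment; everything else is illustrative.
SUBSCRIPTION_PER_MONTH = 200   # USD, flat plan price
API_COST_PER_DAY = 50          # USD, rough estimate of equivalent API billing
API_COST_PER_MONTH = 1200      # USD, monthly total stated in the comment


def implied_usage_days(monthly_cost: float, daily_cost: float) -> float:
    """Number of usage days per month that makes the two estimates agree."""
    return monthly_cost / daily_cost


def api_to_subscription_ratio(api_monthly: float, subscription_monthly: float) -> float:
    """How many times more the same usage would cost at API prices."""
    return api_monthly / subscription_monthly


days = implied_usage_days(API_COST_PER_MONTH, API_COST_PER_DAY)
ratio = api_to_subscription_ratio(API_COST_PER_MONTH, SUBSCRIPTION_PER_MONTH)

print(f"Implied usage days per month: {days:.0f}")
print(f"API cost relative to subscription: {ratio:.0f}x")
```

Under these numbers the monthly total corresponds to 24 days of use, and the 6x figure is simply $1200 divided by $200. It says nothing about which of the three explanations is the right one.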

trymas · 2026-02-19T07:31:35 1771486295

I use GLM lite subscription for personal use. It is advertised as 3x claude code pro (the 20$ one).

5h allowance is somewhere between 50M-100M tokens from what I can tell.

On the 200$ Claude Code plan you would need to burn hundreds of millions of tokens per day to make Anthropic hurt.

IMHO subscription plans are totally banking on many users underusing them. Also, LLM providers don't like to give exact numbers (how much you get, etc.)

touristtam · 2026-02-20T04:38:31 1771562311

How's GLM treating you?

trymas · 2026-02-22T07:04:02 1771743842

At the moment cannot complain.

For small personal projects it’s great value for money. Cheapest subscription was like 3$ during new years, token quota is acceptable to me (my guess it’s about 50-100M tokens per 5h)

Dunno how it would be with big projects, but with “personal project” things it feels to me that GLM-4.7 is 80-90% of Claude Opus 4.5. Just a tiny bit of more hand holding for GLM.

dcre · 2026-02-19T16:38:24 1771519104

It's the latter. It's the average use that matters. Though I suspect API margins are also probably higher than people think.

dgellow · 2026-02-19T10:19:51 1771496391

Inference might be cheap, but I'm 100% sure Anthropic has been losing quite a lot of money on power users with their subscription pricing. I can literally see the comparison between what my colleagues' Claude usage costs with an API key vs. with a personal subscription, and the delta is just massive.

MikeNotThePope · 2026-02-19T05:40:30 1771479630

I wonder how many people have a subscription and don’t fully utilize it. That’s free money for them, too.

thunfischtoast · 2026-02-19T07:56:18 1771487778

The trick is that the jump goes from 20 to 100 Dollar for the Pro to Max subscription. Pro is not enough for me, Max is too much. 60 would be ideal, but currently at 100 it's worth the cost.

But this is how every subscription works. Most people lose money on their gym subscription, but the convenience takes us.

hobofan · 2026-02-19T10:05:58 1771495558

What can bite them in this case though is alternate providers at the same price point that can bridge the gap. e.g. you currently get a lot more bang for your buck with the $20 OpenAI Codex subscription than you get for the $20 Claude Code subscription.

bildung · 2026-02-19T09:15:18 1771492518

Of course they bundle R&D with inference pricing; how else could you recoup that investment?

The interesting question is: In what scenario do you see any of the players as being able to stop spending ungodly amounts for R&D and hardware without losing out to the competitors?

stavros · 2026-02-19T10:07:53 1771495673

In the scenario where that market collapses, ie when we stop making significant gains with new models. It might be a while, though, who knows.

KingMob · 2026-02-19T05:33:21 1771479201

> They are not losing money on subscription plans. Inference is very cheap - just a few dollars per million tokens. What they’re trying to do is bundle R&D costs with inference so they can fund the training of the next generation of models.

You've described every R&D company ever.

"Synthesizing drugs is cheap - just a few dollars per million pills. They're trying to bundle pharmaceutical research costs... etc."

There are plenty of legit criticisms of this business model and Anthropic, but pointing out that R&D companies sink money into research and then charge more than the marginal cost for the final product isn't one of them.

mirzap · 2026-02-19T05:53:14 1771480394

I’m not saying charging above marginal cost to fund R&D is weird. That’s how every R&D company works.

My point was simpler: they’re almost certainly not losing money on subscriptions because of inference. Inference is relatively cheap. And of course the big cost is training and ongoing R&D.

The real issue is the market they’re in. They’re competing with companies like Kimi and DeepSeek that also spend heavily on R&D but release strong models openly. That means anyone can run inference and customers can use it without paying for bundled research costs.

Training frontier models takes months, costs billions, and the model is outdated in six months. I just don’t see how a closed, subscription-only model reliably covers that in the long run, especially if you’re tightening ecosystem access at the same time.

KingMob · 2026-02-19T08:52:09 1771491129

Yes, and my point is that thinking the cost of subscriptions is only inference, and not the research, is mistaken.

They can totally lose money on subscriptions despite the costs of inference, because research costs have to be counted too.

hamandcheese · 2026-02-19T09:26:26 1771493186

> Yes, and my point is that thinking the cost of subscriptions is only inference, and not the research, is mistaken.

Of course they are losing money when you factor in R&D. Everybody knows that. That is not what people mean when they say that they "lose money" on subscriptions.

KingMob · 2026-02-20T06:41:40 1771569700

> That is not what people mean

I don't really think that view is as widespread as you believe.

maplethorpe · 2026-02-19T07:59:43 1771487983

Didn't OpenAI spend like 10 billion on inference in 2025? Which is around the same as their total revenue?

Why do people keep saying inference is cheap if they're losing so much money from it?

mirzap · 2026-02-19T08:51:57 1771491117

When you have 800–900 million active users, no matter how cheap it is, your costs will be in the billions.

bildung · 2026-02-19T09:26:23 1771493183

They paid about $10B on inference and had about $10B in revenue in 2025. The users and numbers of zeroes on those numbers are not relevant. What is relevant is the ratio of those numbers. They apparently are not even profitable on inference, which is the cheap part of the whole business.

And cost of inference tripled from $3B in 2024 to $10B in 2025, so cost of revenue linearly grows with number of users, i.e. it does not get cheaper.
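
As a rough sketch of the ratios being argued here: the dollar amounts below are the approximate figures quoted in this comment, not verified financials, and the variable and function names are hypothetical.

```python
# Approximate figures as quoted in the comment (billions of USD).
INFERENCE_COST_2024 = 3.0
INFERENCE_COST_2025 = 10.0
REVENUE_2025 = 10.0


def cost_to_revenue(cost: float, revenue: float) -> float:
    """Share of revenue consumed by a cost; 1.0 means break-even on that cost alone."""
    return cost / revenue


def growth_factor(previous: float, current: float) -> float:
    """Year-over-year multiple of a cost."""
    return current / previous


inference_share = cost_to_revenue(INFERENCE_COST_2025, REVENUE_2025)
yoy_growth = growth_factor(INFERENCE_COST_2024, INFERENCE_COST_2025)

print(f"Inference cost as a share of 2025 revenue: {inference_share:.0%}")
print(f"Inference cost growth 2024 to 2025: {yoy_growth:.2f}x")
```

With these inputs, inference alone consumes all of the 2025 revenue and grew by a bit more than 3x year over year. Whether that growth tracks the number of users cannot be checked from these figures, since no user counts per year are given.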

https://www.wheresyoured.at/oai_docs/

hhh · 2026-02-19T06:46:21 1771483581

What walled garden man? There’s like four major API providers for Anthropic.

mirzap · 2026-02-19T08:46:07 1771490767

For example, OpenAI’s agent (Codex) is open source, and you can use any harness you want with your OpenAI subscription. Anthropic keeps its tooling closed source and forbids using third-party tooling with a Claude subscription.

andersmurphy · 2026-02-19T07:06:59 1771484819

Except all those GPUs running inference need to be replaced every 2 years.

phyrex · 2026-02-19T07:24:29 1771485869

xyzsparetimexyz · 2026-02-19T07:36:03 1771486563

They wear down being run at 100% all the time. Support slowly drops off, the architecture and even the rack format become deprecated.

well_ackshually · 2026-02-19T08:11:15 1771488675

GPUs do not wear down from being run at 100%, unless they're pushed past their voltage limits or gravely overheating.

You can buy a GPU that's been used to mine bitcoin for 5 years with zero downtime, and as long as it's been properly taken care of (or better, undervolted), that GPU functions the exact same as a 5 year old GPU in your PC. Probably even better.

GPUs are rated to do 100%, all the time. That's the point. Otherwise it'd be 115%.

andersmurphy · 2026-02-19T09:43:40 1771494220

Yeah that's not how it works in practice in a datacenter with the latest GPUs, they are basically perishable goods.

You don't run your gaming PC 24/7.

well_ackshually · 2026-02-19T10:43:40 1771497820

No, you're fundamentally wrong. There's the regular wear & tear of GPUs that all have varying levels of quality, and you'll have blown capacitors (just as you do with any piece of hardware), but running in a datacenter does not damage them more. If anything, they're better taken care of and will last longer. However, instead of having one 5090 in a computer somewhere, you have a million of them, so a 1% failure rate quickly makes a big number. My example included mining bitcoin because, just like datacenters, the cards were running in massive farms of thousands of devices. We have the proof and the numbers: running at full load with proper cooling and no overvoltage does not damage hardware.

The only reason they're "perishable" is the GPU arms race, where renewing them every 5 years is likely to be worth the investment for the gains you make in power efficiency.

Do you think Google has a pile of millions of older TPUs they threw out because they all failed, when chips are basically impossible to recycle? No, they keep using them; they're serving your nanobanana prompts.

andersmurphy · 2026-02-19T10:54:28 1771498468

GPU bitcoin mining rigs had a high failure rate too. It was quite common to run at 80% power to keep them going longer. That's before taking into account that the more recent generations of GPUs seem to be a lot more fragile in general.

bravetraveler · 2026-02-19T14:25:50 1771511150

Mining rigs also used more milk cartons than datacenter racks; [hot/cold] aisles? No, piles! Not to mention the often questionable power delivery...

xyzsparetimexyz · 2026-02-19T16:57:39 1771520259

AI data centers are also incentivised to reduce costs as far as they can. They could absolutely be running them in questionable setups

bravetraveler · 2026-02-19T19:54:30 1771530870

Indeed, fair point. I'd hope the larger players would be better... but I know better

andersmurphy · 2026-02-19T07:54:51 1771487691

Yeah, what's crazy is that most of these companies are making accounting choices that obscure the true cost by extending the stated useful life of their equipment, in some cases from 3 years to 6. Perfectly legal. And it has the effect of suppressing depreciation expenses and inflating reported earnings.

navigate8310 · 2026-02-19T09:54:44 1771494884

But don't they palpitate for those sweet depreciation credits to decrease their tax on revenue?

andersmurphy · 2026-02-19T11:28:37 1771500517

Small sacrifice to not spook investors and the market.

mvdtnz · 2026-02-19T07:25:09 1771485909

"They're not losing money on subscriptions, it's just their revenue is smaller than their costs". Weird take.

carderne · 2026-02-19T07:30:49 1771486249

It means the marginal cost to sell another subscription is lower than what they sell it for. I don't know if that's true, but it seems plausible.

mirzap · 2026-02-12T06:38:28 1770878308

Why are you all obsessed with this question when it comes to Chinese models? Here are some of the questions you should be asking Western governments and models instead: Who protects the pedophiles at the top of Western governments and corporations? How many people have been convicted in relation to the Epstein files? Who protects powerful politicians and Western oligarchs from pedophilia charges? Who did Epstein work for, and why (hint: it’s not Russia or China)?

downboots · 2026-02-12T07:01:00 1770879660

It's called whataboutism https://en.wikipedia.org/wiki/Whataboutism

mirzap · 2026-02-12T07:27:18 1770881238

No, it's called hypocrisy https://en.wikipedia.org/wiki/Hypocrisy

mirzap · 2026-01-13T14:28:03 1768314483

How so? Apple's subscription cancellation is one click away, and you don't get overcharged when canceling.