Inference compute is vastly different from training compute; the model also has to stay hot in VRAM, which probably accounts for most of it. There's limited use for THAT much compute as well — they're running things like Claude Code, and even then they're barely scratching the surface of the compute they have.
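As a rough sketch of why VRAM dominates inference serving — all the numbers here are illustrative assumptions (a hypothetical 70B-parameter model in fp16, a made-up KV-cache budget), not any vendor's actual figures:

```python
# Back-of-envelope VRAM estimate for serving an LLM.
# All numbers are illustrative assumptions, not real vendor specs.

def serving_vram_gb(params_billion, bytes_per_param=2, kv_cache_gb=40):
    """Weights held resident ('hot') in VRAM, plus a KV-cache budget
    for in-flight requests."""
    weights_gb = params_billion * bytes_per_param  # 1e9 params * bytes / 1e9
    return weights_gb + kv_cache_gb

# Hypothetical 70B model in fp16: 140 GB of weights alone must sit in
# VRAM before a single token is generated, plus cache on top.
print(serving_vram_gb(70))  # 180.0
```

The point being: serving capacity is mostly bounded by how many model copies fit in memory, not by raw FLOPs.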
Training currently requires Nvidia's latest and greatest for the best models (they also use Google TPUs now, which are also technically the latest and greatest? However, TPUs are more dual-purpose than anything AFAIK, so that would be a correct assessment in that case)
Inference can run on a hot potato if you really put your mind to it
I think I've heard multiple times that a large % of the training compute for SoTA models is actually inference, used to generate training tokens — this is bound to happen with RL training.
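A rough way to see why, using the standard FLOP approximations (~2·P FLOPs per token for a forward pass, ~6·P per token trained on). The rollout-to-trained-token ratio here is a made-up assumption purely for illustration:

```python
# Toy estimate of the inference share of an RL training run's compute.
# Parameter count and token counts are assumptions, not real figures.
P = 70e9              # hypothetical parameter count
gen_tokens = 100e9    # tokens generated as RL rollouts (assumed)
train_tokens = 10e9   # tokens actually trained on (assumed)

inference_flops = 2 * P * gen_tokens   # ~2P FLOPs per generated token
training_flops = 6 * P * train_tokens  # ~6P FLOPs per trained token
frac = inference_flops / (inference_flops + training_flops)
print(f"{frac:.0%}")  # inference share of the run's total compute
```

With these (assumed) ratios, generation dominates even though each generated token is cheaper than a trained one.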
Electricity is charged whether you use it or not, so very unlikely — but sure, they can find uses for it. Although they're not going to make that much money compared to Claude Code subscriptions.
The datacenter has a fixed cost for power; industrial power is not consumer power, especially at large scale. Scale really kicks in if you own your own power plant (e.g. hydro, wind, solar).
For example, even if you have a fixed power budget at the datacenter level, you still have opportunity costs: if you turn some unused GPUs off, you can run the others hotter.
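A toy illustration of that opportunity cost — every number here is made up, including the idea that an idle GPU still draws meaningful power (which is true in general, but the wattages are invented):

```python
# Fixed site power budget: watts spent keeping idle GPUs on are watts
# you can't give to the busy ones. All figures are assumptions.
BUDGET_W = 1200   # fixed datacenter power budget (assumed)
IDLE_W = 100      # draw of an idle-but-powered GPU (assumed)
gpus_total = 8
gpus_busy = 4

# Idle GPUs left on: their draw comes out of the shared budget.
power_per_busy_on = (BUDGET_W - (gpus_total - gpus_busy) * IDLE_W) / gpus_busy

# Idle GPUs powered off: busy GPUs can split the full budget.
power_per_busy_off = BUDGET_W / gpus_busy

print(power_per_busy_on, power_per_busy_off)  # 200.0 300.0
```

Same budget, but powering down the unused cards lets each busy GPU run 50% hotter in this toy setup.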