Even distillation would be obvious.
And I wouldn't be surprised if Anthropic is running the smaller models for inference, keeping the large base model on machines they fully control/own.