Zetaphor's comments | Hacker News

Quantization is the major appeal; we can't all run full precision.
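For a back-of-the-envelope sense of why (weights only, ignoring KV cache and runtime overhead; the numbers are approximate):

    # rough bytes per parameter: fp16 = 2, 4-bit quant = 0.5
    params = 7e9  # a 7B model
    print(f"fp16: {params * 2 / 1e9:.0f} GB")    # ~14 GB
    print(f"Q4:   {params * 0.5 / 1e9:.1f} GB")  # ~3.5 GB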

Are you referring to the Codex CLI? It can be installed via npm or Homebrew, and it's fully open source.
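Going from memory of the README (so double-check the package names), the install is a one-liner either way:

    npm install -g @openai/codex
    # or
    brew install codex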

Yes, and the official docs even mention that if you're on Windows you should run the Codex CLI via WSL. Meaning it's specifically designed for Unix-like systems.

Most organizations aren't going to need the wide breadth of capabilities of the frontier models. They're risk-averse, and LLMs are non-deterministic, so use cases are typically scoped more tightly to tasks involving nuanced classification that small models can handle easily, even if it takes a little fine-tuning on your organization's data.
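As a sketch of what I mean, here's the kind of narrowly scoped classification a small model handles fine, using the transformers library (the ticket text, labels, and model choice are just illustrative):

    from transformers import pipeline

    # A small NLI model doing nuanced ticket triage; no frontier model required
    classifier = pipeline("zero-shot-classification",
                          model="facebook/bart-large-mnli")

    result = classifier(
        "My invoice total doesn't match the order confirmation.",
        candidate_labels=["billing", "shipping", "technical support"],
    )
    print(result["labels"][0])  # labels come back sorted by score

Fine-tuning that same class of model on your own labeled data is usually all the scoping an org actually needs.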

I guess I write like an LLM :P

Probably a side effect of using them so much


LM Studio is just as simple, has all the same features, and none of the performance or lock-in problems of Ollama.

If you only needed a single reason, how about kneecapping your performance by choosing Ollama?


Give LM Studio a shot! You get the same experience without all of the problems of Ollama.

LM Studio is a popular option that bundles the MLX backend

LM Studio is basically Ollama, except they give attribution. It offers all of the same features, including the ability to host a server.
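The server bit works with the stock openai client, since LM Studio exposes an OpenAI-compatible endpoint (port 1234 is the default; the model name below is a placeholder for whatever you've loaded):

    from openai import OpenAI

    # LM Studio's local server is OpenAI-compatible; api_key is unused but required
    client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

    resp = client.chat.completions.create(
        model="local-model",  # placeholder; LM Studio serves the loaded model
        messages=[{"role": "user", "content": "Say hello in five words."}],
    )
    print(resp.choices[0].message.content)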

LM Studio also offers curation while giving credit to llama.cpp, plus easy search across all of Hugging Face's GGUFs.

If you don't want to have to think about it, LM Studio is probably the best choice.
