More

InsideOutSanta · 2026-05-01T05:40:18 1777614018

FWIW, an open-source clone of that earlier version of Warp called Wave is out there. It seems to be actively maintained and works quite well, in my experience.

cpursley · 2026-05-01T09:51:18 1777629078

Is it Rust or Node/Electron? That’s one of the key considerations I have these days; I’m over bloatware.

InsideOutSanta · 2026-04-29T13:20:18 1777468818

There are apps in the app store right now that pretend to do this kind of thing, so having somebody actually show that it doesn't work is valuable, even if we already knew the outcome ahead of time.

endymion-light · 2026-04-29T13:34:21 1777469661

I suppose i'd much rather a study analyse the apps in the app store that are attempting and claiming to do that kind of thing - rather than the base model they might be using.

InsideOutSanta · 2026-04-28T15:24:58 1777389898

The fact that you thought to consider the next 3, 5, or 10 years already makes you a better CEO than most CEOs that I personally know.

malfist · 2026-04-28T15:51:42 1777391502

Next quarter earnings call is the only thing that's important. Hollow out everything for that goal. My bonus depends on it.

InsideOutSanta · 2026-04-27T11:04:42 1777287882

Monkey Island taught me English. I can't tell you how confusing insult sword fighting was initially. I had to create long tables with the correct answers because I didn't get most of the puns, and then I had to start from scratch when I had to fight Carla.

Anyway, thanks, Ron Gilbert.

DanielHB · 2026-04-27T11:45:09 1777290309

A pirate I was meant to be, trim the sail and roam the sea

Monkey Island 3 taught me a good deal of english too. I was lucky to get a text-translated version with english voiceover.

We all would avoid scurvy if we eat an orange...

InsideOutSanta · 2026-04-26T20:23:17 1777234997

Gandi has started increasing prices like crazy in the last few years.

InsideOutSanta · 2026-04-26T17:48:05 1777225685

Yeah, it's crazy that there is no trustworthy source for model reviews. I'd love to know how well the new Deepseek 4 actually performs, for example, but I don't want to spend the next week testing it out. Reddit used to be a somewhat useful gauge, but now there are posts on how 4 is useless right next to posts on how amazing it is. And I have no idea if this is astroturfing, or somebody using a quantized version, or different workloads, or what.

I also find it increasingly difficult to evaluate the models I actually do use. Sometimes each new release seems identical or only marginally better than the previous version, but when I then go back two or three version, I suddenly find that oder model to be dramatically worse. But was that older model always that quality, or am I now being served a different model under the same version name?

It's all just so opaque.

rhdunn · 2026-04-26T18:02:27 1777226547

One challenge is that model evaluation is typically domain/application specific. Model performance can also depend on the system prompt and the input/context.

Regarding evaluation, I've found using tools like promptfoo (and in some cases custom tools built on top of that) are useful. These help when evaluating new models/versions and when modifying the system prompt to guide the model. Especially if you can define visualizations and assertions to accurately test what you are trying to achieve.

This can be difficult for tasks like summarization, code generation, or creative writing that don't have clear answers. Though having some basic evaluation metrics and test cases can still be useful, and being able to easily do side-by-side comparisons by hand.

InsideOutSanta · 2026-04-24T11:37:00 1777030620

Getting people to have an undifferentiated distrust of news organizations in general is an important aspect of technofeudalism.

InsideOutSanta · 2026-04-20T17:33:50 1776706430

If "today" were random, our universe would be pretty fricken weird.

devin · 2026-04-26T16:51:38 1777222298

What is today right now in Australia? How about where you live? You have not thought enough about what you’re saying and are probably not aware of all the weird time issues we have in our world.

InsideOutSanta · 2026-04-29T08:14:49 1777450489

That's not what "random" means.

InsideOutSanta · 2026-04-12T14:19:31 1776003571

Totally sounds like an LLM wrote it. Should have been two paragraphs instead of this verbose drivel.

InsideOutSanta · 2026-04-10T08:26:11 1775809571

Also, isn't this just a huge fire hazard of they actually do what they claim? Or will they remove the batteries from these old, continually plugged in, poorly cooled laptops?

SadTrombone · 2026-04-10T11:04:49 1775819089

This is addressed on their site.

> We might modify your laptop to remove or power down the battery, wireless radios, etc. to ensure it can be used safely in the data center.