More

jampekka · 2026-05-11T06:36:25 1778481385

The experts in MoEs aren't specialized in any meaningful task sense. From level of what we would think as tasks MoEs are selected essentially arbitrarily per token and per block.

stingraycharles · 2026-05-11T06:56:13 1778482573

It’s unsupervised, yes, but “unspecialized in any meaningful task sense” is incorrect, that’s the whole point. It’s just not in the sense of “this is a legal expert, this is a software developer”.

orbital-decay · 2026-05-11T12:37:19 1778503039

Optimal expert separation depends on the goal and can be pretty arbitrary, for example DeepSeek v4 separates them more or less by domain if I remember correctly.

jampekka · 2026-05-09T18:43:21 1778352201

"You could argue that I’m abusing 414 URI Too Long. I respond that it’s funnier this way. Other options I considered were:

    400 Bad Request, the generic client error code, which is correct but boring;

    402 Payment Required, and honestly if you want to pay me to make a particular URL with query string work, I’m open to it;

    404 Not Found, but it’s too likely to have side effects, and it doesn’t convey the idea that the request was malformed, which is what I’m going for; and

    303 See Other with no Location header, which is extremely uncommon these days but legitimate. Or at least it was in RFC 2616 (“The different URI SHOULD be given by the Location field in the response”), but it was reworded in 7231 and 9110 in a way that assumes the presence of a Location header (“… as indicated by a URI in the Location header field”), while 301, 302, 307 and 308 say “the server SHOULD generate a Location header field”. Well, I reckon See Other with no Location header is fair enough. But URI Too Long was funnier."

https://chrismorgan.info/no-query-strings?foo

ollien · 2026-05-09T20:05:41 1778357141

I don't think it's an abuse, RFC9110 defines 414 as a response for "refusing to service the request because the target URI is longer than the server is willing to interpret". Since adding a query string involves only adding characters, this seems fine; there's no stipulation as far as I can tell that all pages a server hosts must adhere to the same length. I'd be curious if any well-known clients interpret it that way though, and make caching decisions based on it. As far as I know, they shouldn't.

Obviously it's against the spirit of the thing, but I don't think it's wrong per-se.

lucketone · 2026-05-09T21:50:27 1778363427

If the goal is to be misleading, but technically correct, it hits the bullseye

ollien · 2026-05-09T22:08:54 1778364534

When the goal is "the funniest way", I think that's a hit :)

chrismorgan · 2026-05-10T02:58:41 1778381921

I reckon it is still an abuse. I am willing to interpret longer target URIs… so long as they don’t contain a question mark. /no-query-strings is longer than /?.

zymhan · 2026-05-10T16:31:36 1778430696

The standard doesn't say that you must accept all URIs up to a certain length. You're allowed to decide, based on context, what is "too long".

That is to say, you're allowed to have a double standard. And I love this sort of stuff, thanks for sharing.

thayne · 2026-05-09T22:00:37 1778364037

You could also redirect to the url with the query string dropped.

chrismorgan · 2026-05-10T07:32:55 1778398375

But then it’s barely a protest.

1shooner · 2026-05-09T18:55:06 1778352906

Also from the 414 page:

>Complain to whoever gave you the bad link, and ask them to stop modifying URLs, because it’s bad manners.

It's ironic that an error response so blatantly violating the robustness principle is throwing shade about bad manners.

btilly · 2026-05-09T21:08:56 1778360936

Opinions vary on how good an idea the robustness principle is. That is why, for example, the XML standard requires a conforming validator to throw an error on invalid XML.

In our modern world, the robustness principle has become an invitation to security bugs, and vendor lock-in. Edge cases snuck through one system on robustness, then trigger unfortunate behavior when they hit a different system. Two systems tried to do something reasonable on an ambiguous case, but did it differently, leading to software that works on one, failing to work on the other.

1shooner · 2026-05-09T21:55:05 1778363705

I generally agree, but I don't think XML is the best example. Getting HTML out of XML is considered to have been the right move isn't it? I was pro-XHTML2 at the time but in retrospect, have we suffered much for not sending webpage validation errors to end users?

btilly · 2026-05-10T01:07:39 1778375259

Once people have gotten used to not having to conform, forcing them to conform is an uphill battle. Doubly so when, as happened with Microsoft and IE, the vendors would like to encourage vendor lock-in. The only time to reasonably do it is at the start.

That said, we are paying a huge complexity cost due to our efforts to allow nonconforming pages. This complexity is widely abused by malicious actors. See, for instance, https://cheatsheetseries.owasp.org/cheatsheets/XSS_Filter_Ev... for ways in which attackers try to bypass security filters. A lot of it is only possible because of this unnecessary complexity.

zaphar · 2026-05-09T22:52:49 1778367169

But, this is robust? I mean it's pretty clearly stating that you are visiting an unsupported URL. It provides direction on what to do about it to the user. It does not crash the browser or the server. In pretty much every dimension this is highly robust.

wizzwizz4 · 2026-05-09T19:11:49 1778353909

The robustness principle is itself bad manners, in plenty of contexts. If I deliver packages by throwing them at the customer, I really want a customer to tell me "hey, don't throw packages at me!" before I attempt to lob something fragile and breakable, or something heavy at someone fragile and breakable. Otherwise, how am I supposed to learn that I'm doing anything wrong?

ikiris · 2026-05-09T19:50:51 1778356251

This is the point this has delved into internet crankery.

wizzwizz4 · 2026-05-09T20:35:32 1778358932

No, it's a fairly uncontroversial take. See https://en.wikipedia.org/wiki/Robustness_principle#Criticism and the "See also" section.

kelnos · 2026-05-10T02:26:42 1778380002

The robustness principle was a mistake. The attitude it promulgated is a big reason why interoperability is so difficult.

jampekka · 2026-05-04T10:23:10 1777890190

It's not really usable for assessing individuals. The "biotype" explains roughly 1% of variation in symptoms.

jampekka · 2026-05-03T11:14:40 1777806880

No-one is too absolute, but could ne used as a rough rule of thumb.

Depending on the study, 0.16% to 7% want to get tracked.

https://noyb.eu/sites/default/files/2025-07/Pay_or_Okay_Repo...

LocalH · 2026-05-03T20:19:22 1777839562

Lies, damned lies, and statistics

jampekka · 2026-05-02T14:38:32 1777732712

Seems to be very critical of western, and especially American, foreign policy. Reasonably well argued and factual, although a bit edgy at times. A decent read.

jampekka · 2026-05-02T14:26:50 1777732010

I don't doubt this happens, but given all the wolf crying about clandestine Russian operations, it's hard to assess what the scale and influence of these are. Especially as this is based on an analysis by Atlantic Council, which is essentially a NATO think tank.

This will probably read to many as me being a useful idiot for Putin or something. And maybe I am, hard to say definitely.

jeffbee · 2026-05-02T14:29:13 1777732153

Give some examples of prominent wolf-crying that wasn't eventually substantiated.

jampekka · 2026-05-02T14:44:51 1777733091

Some major ones that come to mind:

- Russia blowing up Nordstream

- "Havana syndrome"

- The Steele dossier

jeffbee · 2026-05-02T16:04:54 1777737894

OK, those are interesting choices that are outside of the realm of stuff that I was thinking about. What I was thinking about is that the Russians have been working the American people via the media for decades.

pessimizer · 2026-05-02T15:30:33 1777735833

What public state speculation about Russian interference in anything ever was substantiated?

As far as I can tell, nothing that has been said about Russian intelligence operations in the West (over the past decade or so) has ever been substantiated. That's why everybody started blaming every single problem or disagreement in the West on Russia, because you wouldn't be asked to or expected to be able to substantiate it.

I've been called Russian or Chinese more times since 2015 than I've ever been called anything else other than my name. I was usually called that by people when I was denying something that those same people now say nobody ever really believed or insisted was true.

jeffbee · 2026-05-02T16:02:05 1777737725

> What public state speculation about Russian interference in anything ever was substantiated?

Tenet Media

the_why_of_y · 2026-05-03T09:23:04 1777800184

> What public state speculation about Russian interference in anything ever was substantiated?

Report On The Investigation Into Russian Interference In The 2016 Presidential Election

https://www.justice.gov/archives/sco/file/1373816/dl

Pay08 · 2026-05-02T16:51:52 1777740712

Immigrants eating dogs? If you think about it, most wolf-crying is completely unsubstantiated.

BirAdam · 2026-05-02T15:17:32 1777735052

Most people lack principles and act purely emotionally. It is wicked and evil and vile if Russia does something because it is Russia doing it. It is good and right and true if “Western” powers do a thing because it is Western powers doing it. To a principled observer, they’re all evil regardless of which country is doing the thing.

jampekka · 2026-05-02T12:21:37 1777724497

> It tried to build something simple and while it got the job done the thinking displayed did not fill me with confidence. It was pages and pages of "actually no", "hang on", "wait that makes no sense". It was like the model was having a breakdown.

It has been probanly trained to assess its own "thoughts" regularly and outputs those for the assesment results. I wouldn't worry much about the reasoning text contents, and it's nice to have them in contrast to the closed model "summaries", so it's easier to see what's going on.

jampekka · 2026-05-01T17:35:49 1777656949

I have to churn to get to my ChatGPT Plus $20 plan limits with gpt-5.5 xhigh. Starts to feel like I'm doing something wrong.

jampekka · 2026-04-29T18:13:35 1777486415

If I understand this correctly based on a quick read, it argues that subjective experience arises at the (or in the) "alphabetization" process where continuous physical states (e.g. voltage) are mapped to discrete logical states (roughly like e.g. a bit) or "concepts" (figure 2).

Per this reading, implementing something in ASIC would make it have (a different) experience, as opposed to CPU/GPU. Not sure what would be the case for FPGAs.

It also seems to rely on the classical "GOFAI" idea of symbol manipulation, and e.g. denies experience that isn't discretizable into concepts. Or at least the system producing such concepts seems to be necessary, not sure if some "non-conceptual experiences" could form in the alphabetization process.

It reads a bit like a more rigorous formulation of the Searle's "biological naturalism" thesis, the central idea being that experience can not be explained at the logical level (e.g. porting an exact same algorithm to a different substrate wouldn't bring the experience along in the process).

jampekka · 2026-04-27T19:16:03 1777317363

Maybe something like swarms of autonomous killer drones invading Greenland and Canada?