Hacker News | pertymcpert's comments

The LLVM community used this model for years with Phabricator, before it was EOL'd and the move to GitHub and PRs was forced. It's a proven model that works very well in complex code bases with multiple components and dependencies, which can have very different reviewer groups. E.g.:

1) A foundational change to the IR is the baseline commit.

2) Then some tweaks on top to lay the groundwork for uses of that change.

3) An implementation of a new feature that uses the new IR change.

4) A final change that flips the feature flag to enable it by default.

Each of these changes is dependent on the last. Without stacked PRs you have only one PR, and reviewing it is a huge task: maybe thousands of lines of complex code. Worse, some reviewers only need to see parts of it and not the rest.
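The stack above can be sketched with plain git branches (the branch and commit names here are hypothetical; Phabricator managed the stack for you, but the shape is the same). Each branch builds on the previous one, so the diff between adjacent branches is exactly one reviewable change:

```shell
#!/bin/sh
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q -b main

# Helper so empty demo commits work without global git config.
commit() { git -c user.name=demo -c user.email=demo@example.com \
               commit -q --allow-empty -m "$1"; }

commit "baseline"
git checkout -q -b ir-change   && commit "1) foundational IR change"
git checkout -q -b groundwork  && commit "2) groundwork for users of the change"
git checkout -q -b feature     && commit "3) feature built on the new IR"
git checkout -q -b enable-flag && commit "4) flip the feature flag on"

# Each stacked PR's diff is only the commits between adjacent branches,
# so a reviewer of the feature sees just that one change:
git log --oneline groundwork..feature
```

Each PR then targets the branch below it rather than main, which is exactly the part GitHub's review model handles poorly.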

Stacked diffs were a godsend and the LLVM community's number one complaint about moving to GitHub was losing this feature.


What might that be?

Shit. Really? You mean they modified their frontier model to make it better and just called it a day? That their benchmarks showing step-change improvements are just the result of successive changes to an EXISTING MODEL?

Say it isn't so! I for one like to start from scratch each time I release my version of my compiler toolchain.


They didn't call it a day. They created an entire deceptive hype cycle around it.

No one seems to have actually read the system card all the way through.

The reason they didn't publish it was that it's orders of magnitude more successful at writing exploits vs Opus 4.6, which only managed it something like 2% of the time.


Yeah... except Mythos's large-context performance seems to be much better than Opus 4.6's.


If anything I’m seeing too much skepticism and not enough alarm. People burying their heads in the sand, fingers in their ears denying where this is all going. Unbelievable except it’s exactly what I expect from humans.

Forgive me, but this is probably the 29th world destroying model I've seen in the last 4 years, that will change everything, take all the jobs, cure all the cancers and eat all the puppies.

I’m beyond trying to convince people to take this technology seriously. You’ll learn for yourself.

OpenAI didn't want to make GPT2 available because it was "too dangerous" [1].

[1] https://www.theguardian.com/technology/2019/feb/14/elon-musk...


Alarm from hype is what they want; you are playing straight into their PR dept's hands.

I'm not talking about Anthropic in particular. Other frontier labs will only be at most a year behind.

I'm seeing the future here beyond just what's in front of us.


alarm about what, exactly?

What evidence makes you say that? Do you have insider info?

Neither party provided evidence. I wonder why people like to take the side of the optimists.

We already know Opus can find real vulnerabilities ([1], [2], ...), so it's not exactly surprising that a bigger model is better at it.

[1] https://news.ycombinator.com/item?id=47273854

[2] https://news.ycombinator.com/item?id=47611921


That is not thousands of high-severity vulnerabilities, as the above commenter stated. Even many local models have found individual vulnerabilities.

What evidence do we have that it is true?

I don't need any. I'm not making the claim that it's "most likely a lie".

This isn’t talking about compaction. It refers to performance once the model is loaded with 500k to 1M tokens of context.

Ah, thanks, makes sense, I’ll read more about this

Did you read the article?
