It's not even open weights as generally understood, the non-commercial restriction is pretty severe. The earlier M2.5 model will still be preferred for many purposes.
I've flagged the post: the title is editorialized. The title on the blog post is "MiniMax M2.7: The Agentic Model That Helped Build Itself" (at least at the time of writing this).
Even the MIT-licensed weights are just that: open weights. Let's not call the weights "source", because they're emphatically not. I can't retrain Qwen from the ground up with different pre-training algorithms, for example.
Model weights are source because they are "the preferred form for modification", e.g. you can use them for fine-tuning. Training a new model from raw data (1) gets you something very different from the original and (2) is computationally infeasible for most, compared to simpler fine-tuning.
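To make the fine-tuning-vs-retraining point concrete, here's a toy sketch in pure Python: starting from "pretrained" parameters, a few gradient steps adapt a model to new data far more cheaply than refitting from scratch. All the numbers and the tiny linear model are illustrative assumptions, not a real LLM workflow.

```python
def loss(w, b, data):
    """Mean squared error of the linear model y = w*x + b on (x, y) pairs."""
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)

def fine_tune(w, b, data, lr=0.01, steps=200):
    """Full-batch gradient descent on squared error, starting from (w, b)."""
    n = len(data)
    for _ in range(steps):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# "Pretrained" weights - imagine these came out of an expensive training run.
w0, b0 = 2.0, 0.0

# New task: data from y = 2x + 1, slightly shifted from what the model knows.
new_data = [(x, 2 * x + 1) for x in range(-3, 4)]

before = loss(w0, b0, new_data)
w1, b1 = fine_tune(w0, b0, new_data)
after = loss(w1, b1, new_data)
print(before > after)  # fine-tuning from the weights closes most of the gap
```

The point being: the weights are the artifact you actually modify, the way you'd modify source; re-deriving them from the raw training data is a different (and for most people impossible) undertaking.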
I've yet to see a convincing explanation of what makes such a "license" legally binding in the first place.
There's no copyright on model weights themselves (because they are produced purely mechanically without involving human creativity, the same way there's no copyright on compiled artifacts of a piece of software or an h264 encoded movie file).
For software and movies, copyright covers the source material, not the resulting binary, and for LLMs the source material can also be protected by copyright. The problem is that LLM makers don't own most of the copyright on the source material, and worse, they claim the training process is transformative enough to erase the copyright of the source material, so even the part of the training data they do own couldn't extend copyright protection to the weights.
It's very likely that these licenses are entirely devoid of legal value (and I don't think Meta has taken any legal action, not even a DMCA takedown, against any of the bazillions of Llama fine-tunes on Hugging Face that violate the Llama license).
Thaler v. Perlmutter said that an AI system cannot be listed as the sole author of a work - copyright requires a human author.
US Copyright Office guidance in 2023 said work created with the help of AI can be registered as long as there is "sufficient human creative input". I don't believe that has ever been qualified with respect to code, but my instinct is that the way most people use coding agents (especially for something like kernel development) would qualify.
Interesting. That seems to suggest that one would need to retain the prompts in order to pursue copyright claims if a defendant can cast enough doubt on human authorship.
Though I guess such a suit is unlikely if the defendant could just AI wash the work in the first place.
> A recent leak of Claude’s code prompted the startup to publish a blogpost at the beginning of the month saying that AI models had surpassed “all but the most skilled humans at finding and exploiting software vulnerabilities” [...]
I've seen a bunch of people conflate the Claude Code source-map leak with the Mythos story, though not quite as blatantly as here. I'm confident that they are totally unrelated.
I have a pet theory that the uptick in normal cybersecurity PRs you mention as a trend in your blog was driven by Claude Code's stealth mode and Mythos.
Yeah, regular web chat Claude and ChatGPT both have full container access (even on the free version, at least for ChatGPT) which can run CLI tools.
Both of them can even install CLI tools from npm and PyPI - they're limited in which network services they can contact beyond an allow-listed set, though, so CLI tools in those environments won't be able to access the public web.
... unless you find the option buried deep in Claude for enabling additional hosts for the default container environment to talk to. That's a gnarly lethal trifecta exfiltration risk so I recommend against it, but the option is there!
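For anyone curious what that allow-listing looks like in practice, here's a toy sketch of an egress policy check: only traffic to approved hosts (package registries and the like) gets through. The host names, the wildcard rule, and the function itself are illustrative guesses about how such a proxy might behave, not either provider's actual implementation.

```python
# Hypothetical allow-list; real hosted containers permit package registries
# (npm, PyPI) but block arbitrary web hosts.
ALLOWED_HOSTS = {
    "registry.npmjs.org",      # npm installs
    "pypi.org",                # pip metadata
    "files.pythonhosted.org",  # pip package downloads
}

def egress_allowed(host: str, allowlist=ALLOWED_HOSTS) -> bool:
    """Return True if an outbound connection to `host` would be permitted.

    Supports exact matches plus "*.example.com" entries matching any
    subdomain, which is roughly how such egress proxies tend to work.
    """
    host = host.lower().rstrip(".")
    if host in allowlist:
        return True
    parts = host.split(".")
    for i in range(1, len(parts)):
        if "*." + ".".join(parts[i:]) in allowlist:
            return True
    return False

print(egress_allowed("pypi.org"))     # registry: allowed
print(egress_allowed("example.com"))  # arbitrary web host: blocked
```

Enabling additional hosts, as described above, amounts to widening that allow-list - which is exactly why it expands the exfiltration surface.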
The ads for prediction markets on TikTok are aggressive - like (paraphrasing) "this is your new source of passive income and you'd be crazy to miss it" aggressive.
So basically the standard online scam script for 20+ years but in a TikTok. I remember seeing AdWords text ads in the 2000s for "make $$$ working from home".
I also had a poke around with the tools exposed on https://meta.ai/ - they're pretty cool, there's a Code Interpreter Python container thing now and they also have an image analysis tool called "container.visual_grounding" which is a lot of fun.
It is fair to think so, because that is what everyone is doing. But given that this is Meta, and considering Llama, if MSL is going to keep releasing models and wants to rejoin the AI race, they may actually open the weights just to get more attention. Once they establish a sizable community, they can start guarding their frontier models.
I buy the rationale for this. There's been a notable uptick over the past couple of weeks of credible security experts unrelated to Anthropic sounding the alarm on the recent influx of actually valuable AI-assisted vulnerability reports.
> On the kernel security list we've seen a huge bump of reports. We were between 2 and 3 per week maybe two years ago, then reached probably 10 a week over the last year with the difference being only AI slop, and now since the beginning of the year we're around 5-10 per day depending on the days (Fridays and Tuesdays seem the worst). Now most of these reports are correct, to the point that we had to bring in more maintainers to help us.
> And we're now seeing on a daily basis something that never happened before: duplicate reports, or the same bug found by two different people using (possibly slightly) different tools.
> The challenge with AI in open source security has transitioned from an AI slop tsunami into more of a ... plain security report tsunami. Less slop but lots of reports. Many of them really good.
> I'm spending hours per day on this now. It's intense.
> Months ago, we were getting what we called 'AI slop,' AI-generated security reports that were obviously wrong or low quality. It was kind of funny. It didn't really worry us.
> Something happened a month ago, and the world switched. Now we have real reports. All open source projects have real reports that are made with AI, but they're good, and they're real.
Could this potentially be because more researchers are becoming accustomed to the tools and adding them to their pipelines?
The reason I ask is that I've been using them to snag bounties to great effect for quite a while, and while the models have of course improved, they were useful for this kind of work well before now.
> Non-commercial use permitted based on MIT-style terms; commercial use requires prior written authorization.
And calling the non-commercial usage "MIT-style terms" is a stretch - they come with a bunch of extra restrictions about prohibited uses.
It's open weights, not open source.