More

slekker · 2026-04-16T17:51:07 1776361867

What does that actually do? Force the "effort" to be static to what I set?

slekker · 2026-04-16T17:48:38 1776361718

How does it do with the "car wash" benchmark? :D

slekker · 2026-04-16T17:48:06 1776361686

What about Qwen? Does it get that right?

lambda · 2026-04-16T17:59:06 1776362346

I've run several local models that get this right. Qwen 3.5 122B-A10B gets this right, as does Gemma 4 31B. These are local models I'm running on my laptop GPU (Strix Halo, 128 GiB of unified RAM).

And I've been using this commonly as a test when changing various parameters, so I've run it several times, these models get it consistently right. Amazing that Opus 4.7 whiffs it, these models are a couple of orders of magnitude smaller, at least if the rumors of the size of Opus are true.

qingcharles · 2026-04-16T18:40:21 1776364821

Does Gemma 4 31B run full res on Strix or are you running a quantized one? How much context can you get?

lambda · 2026-04-16T19:55:41 1776369341

I'm running an 8 bit quant right now, mostly for speed as memory bandwidth is the limiting factor and 8 bit quants generally lose very little compared to the full res, but also to save RAM.

I'm still working on tweaking the settings; I'm hitting OOM fairly often right now, it turns out that the sliding window attention context is huge and llama.cpp wants to keep lots of context snapshots.

qingcharles · 2026-04-16T20:04:50 1776369890

I had a whole bunch of trouble getting Gemma 4 working properly. Mostly because there aren't many people running it yet, so there aren't many docs on how to set it up correctly.

It is a fantastic model when it works, though! Good luck :)

slekker · 2026-04-11T06:39:58 1775889598

Americans are for the most part incredibly out of touch and still think they are at the center of the world.

They voted in the orange turd twice, a third of their population didn't even vote.

The US has become a laughing stock worldwide and internally looks like a prelude to Idiocracy.

Well, at least they know how to hire smart people, from China/Russia/Germany/India.

Bringing it back to your message, they love this! To be the center of attention, show off how strong and powerful of a nation they are.

Except, they messed with the wrong region. Same mistake Russia made.

raptor99 · 2026-04-11T10:37:16 1775903836

Sounds to me like you have a lot of envy and jealousy. That's normally what tantrums and name calling result from; that and a need for attention. Congrats.

slekker · 2026-04-11T20:38:13 1775939893

Thank you for the diagnosis

slekker · 2026-04-04T18:23:53 1775327033

Bots too, vanderBOT!

jvanderbot · 2026-04-04T20:16:13 1775333773

I used to work in robotics, and can't remember the password for my usual username so I pulled this one out of thin air years ago

slekker · 2026-03-28T07:45:01 1774683901

I had a problem with a Realtek wifi card, where it would become slow for a few seconds every couple of minutes, had to disable a setting , maybe it helps you: https://wiki.archlinux.org/title/Network_configuration/Wirel...

pipes · 2026-04-04T19:21:45 1775330505

I would have looked at this if I hadn't already jumped ship. It sounds like the problem I was having.

slekker · 2026-03-25T12:25:02 1774441502

That's a bargain Johnny boy! My company gives me $250 in AI tokens to use every day!

slekker · 2026-03-16T07:24:27 1773645867

It's great, this is the kind of content I look for in HackerNews, I'm so fucking tired of LLMs and AI posts

rbanffy · 2026-03-16T08:24:14 1773649454

I would appreciate some extra markings on the keycaps for chords and numbers, and a sans-serif font, but this looks awesome as it is.

slekker · 2026-02-28T16:42:40 1772296960

I love your take <3

slekker · 2026-02-28T08:49:29 1772268569

Following your analogy, if there is a way for the lawyer to never lose a case due to misinterpreting the law...