More

pigeons · 2026-04-11T06:43:48 1775889828

Magnet fishing

pigeons · 2026-04-11T04:40:19 1775882419

Are you concerned about telegram admins having access to that information?

fennecbutt · 2026-04-11T07:11:12 1775891472

Yeah! And what if Samsung put a bug in the silicon that gives them access to all your stuff without you ever knowing!

Or the CIA has set up inside your closet with a listening device!

pigeons · 2026-04-13T03:14:09 1776050049

But those are much more hypothetical. Telegram admins and anyone who bribes or hacks them do have access to your messages.

pigeons · 2026-04-05T15:11:54 1775401914

> So if Bob can do things with agents, he can do things.

But he does things wrong.

impjohn · 2026-04-07T12:48:21 1775566101

I believe the economic machine gives an edge to people who do more right things than wrong. Bob does things wrong, but given a 10x amount of output, the balance of right output vs wrong output may still be favored upon by the economy. A speculation, to be sure, we'll have to see how it pans out.

pigeons · 2026-04-01T04:34:10 1775018050

wait to release until it uses real data?

pigeons · 2026-03-29T14:44:52 1774795492

Doesn't take a weatherman to tell which way the wind blows

pigeons · 2026-03-26T20:30:28 1774557028

On the basis of nothing, or on the basis of gifts and connections?

brendanfinan · 2026-03-26T21:03:15 1774558995

the Kalshi legal team is a revolving door with the CFTC

pigeons · 2026-03-18T05:58:42 1773813522

And security

dd8601fn · 2026-03-18T06:58:43 1773817123

Most of these people just need like two or three static pages and a domain name. Same as it ever was.

pigeons · 2026-03-16T04:35:47 1773635747

Why don't they work anymore? RLHF or something else?

zachdotai · 2026-03-16T08:13:55 1773648835

Mostly just better training data and instruction following in the newer models. They’re much better at recognising encoded content and understanding intent regardless of language. A base64 string that would’ve slipped past a model a year ago gets decoded and flagged now because the model just… understands what you’re trying to do.

The attacks that still work tend to be the ones that don’t try to hide the intent at all. The winning attack on our first challenge was in plain English. It just reframed the context so that the dangerous action looked like the correct thing to do. Harder to train against because there’s nothing obviously malicious in the input.

pigeons · 2026-03-16T16:49:35 1773679775

Thank you. Its not your fault at all, but to me, "the model just… understands what you’re trying to do." shows me there is a whole new paradigm in some ways to get used to as far as understanding this software.

zachdotai · 2026-03-16T17:02:33 1773680553

Yeah it's closer to how you'd think about deceiving a person than exploiting software.

pigeons · 2026-03-15T04:11:59 1773547919

Yes I was wondering if the right applied to people who aren't age-verified.

pigeons · 2026-03-13T16:23:39 1773419019

yes but then what do you use nanoclaw for, that's its a better fit for than claude code.