More

cjonas · 2026-04-23T16:50:13 1776963013

It would be wild if they dropped below the "two 9's" metric. I think they would need an additional ~16hr of outage in the 90 day rolling period.

waiwai933 · 2026-04-23T16:51:35 1776963095

https://mrshu.github.io/github-statuses/ suggests that their combined uptime doesn't even meet 1 nine, let alone 2.

CamouflagedKiwi · 2026-04-23T17:44:16 1776966256

The intersection of uptime across every possible service they offer isn't a particularly great metric. I get the point that they are doing badly, but it makes it look worse than I think it really is.

What I would like to see is a combined uptime for "code services", basically Git+Webhooks+API+Issues+PRs, which corresponds to a set of user workflows that really should be their bread & butter, without highlighting things you might not care about (Codespaces, Copilot).

dijit · 2026-04-23T18:59:15 1776970755

Depends how integrated those features are.

A service's availability is capped by its critical dependencies; this is textbook SRE stuff (see Treynor et al., The Calculus of Service Availability). Copilot may well be on the side of it (and has the worst uptime, dragging everything down), but if Actions depends on Packages then Actions can be "up" while in reality the service is not functional. If your release pipeline depends on Webhooks, then you're unable to release.

The obvious one is git operations: if you don't have git ops then basically everything is down.

So; you're right about Copilot, but the subset you proposed (Git+Webhooks+API+Issues+PRs) has the exact same intersection problem. If git is at one nine, that entire subset is capped at one nine too, no matter how green the rest of it looks.

And to be clear: git operations is sitting at 98.98% on the reconstructed dashboard linked above[1]. That is one nine. Github stopped publishing aggregate numbers on their own status page, which.. tells you something.

[1]: https://mrshu.github.io/github-statuses/

CamouflagedKiwi · 2026-04-23T20:12:21 1776975141

Well yes you could do that on a status page, but it's basically just lying to put Actions as green if it's actually down because it depends on Packages which is red.

With that set, I wasn't proposing a set of totally independent services to be grouped together, I was talking about a set of things that I think represent pretty core services for Github users. If Git is dragging the rest of those down, fine; PRs are useless without it. In fact it is worse than some but it's not the worst of that group, and it is still a lot better then the dregs of Actions and Copilot.

Having said that, the numbers are of course terrible, two nines on a couple of things and one on everything else would be bad for a startup, it's an utter embarrassment for a company that's been doing this over a decade.

cjonas · 2026-04-23T17:07:42 1776964062

also I never had considered that breaking your up-time into a bunch of different components is just a strategy to make your SRE look better than it actually is. The combined up-time tells the real story (88%!). Thanks for the link

femiagbabiaka · 2026-04-23T17:18:05 1776964685

The number of nines assigned to a suite of services is not indicative of the quality of SRE at any given company, but rather a reflection of the tradeoffs a business has decided to make. Guaranteed there's a dashboard somewhere at Github looking at platform stickiness vs. reliability and deciding how hard to let teams push on various initiatives.

cjonas · 2026-04-24T02:51:58 1776999118

this is fair. I should have just said "Site Reliability", as it's almost certainly out of the engineers control.

cjonas · 2026-04-23T16:57:13 1776963433

ya i was just doing the math on their chart for the git operations. I added up 14.93 hours combined hours, which puts them WAY lower than the reported 99.7 metric they show right next to it.

So based on their own reporting, the uptime number should be 99.31. Which means only like 6 additional hours and they'd fall below 99.0%

roblh · 2026-04-23T21:58:36 1776981516

GitHub is going for “eight 8’s” at this rate.

cjonas · 2026-04-19T12:25:07 1776601507

Whats your actual tech experience?

Most enterprises that need consultants are using Salesforce, SAP, Hubspot, Dynamics, etc. If a company has an engineering department to build and run internal software, they very rarely need a consultant. And if they don't, they are very unlikely to higher a consultant to build it custom. They'd want "out of the box" because they think (often incorrectly these days), it will be easier to maintain.

cjonas · 2026-04-12T14:14:48 1776003288

Ya I've had this experience more than a few times recently. I've heard people claiming they are serving quantized models during high loads, but it happens in cursor as well so I don't think it's specific to Anthropics subscription. It could be that the context window has just gotten into a state that confuses the model... But that wouldn't explain why it appears to be temporary...

My best guess is this is the result of the companies running "experiments" to test changes. Or it's just all in my head :)

whywhywhywhy · 2026-04-12T14:20:30 1776003630

Cursor one is back to Claude 4 or 3.5+ at best. Struggles to do things it did effortlessly a few weeks ago.

It’s not under load either it’s just fully downgraded. Feels more they’re dialing in what they can get away with but are pushing it very far.

cjonas · 2026-04-12T23:00:56 1776034856

These days cursor feel more capable and reliable then Claude Code (at last for my workflow). For personal projects, I'm using cursor during planning and verification but run Claude code for just implementation to save $.

cjonas · 2026-04-10T04:26:38 1775795198

Wouldn't they be using the Azure inference API or AWS bedrock on their own accounts and NOT be going through the openAI/Anthropics servers anyways? I just always assumed this is how the big inference "resellers" (openrouter, cursor, etc) were operating.

cjonas · 2026-04-07T13:30:37 1775568637

Why does getting started have me sign up for an account vs take me to the docs to self host?

cjonas · 2026-03-26T23:18:27 1774567107

The docs indicate there are already 2 other go implementations. Why not just use one of those? https://docs.jsonata.org/overview.html

aniceperson · 2026-03-26T23:40:09 1774568409

Because his prompt said to implement in go, not to check if an go implementation already exists. They have been running kubernetes clusters to parse json, this is not suprising.

g947o · 2026-03-27T03:44:01 1774583041

Because otherwise they wouldn't have written this meaningless article and contributed to the AI hype.

zer00eyz · 2026-03-27T05:15:25 1774588525

And to market their AI security product.

leonidasv · 2026-03-27T05:18:59 1774588739

Those are compatible with the 1.x syntax while the gnata is compatible with the 2.x. Also, the repos haven't seen new commits in a long time.

vova_hn2 · 2026-03-27T04:28:38 1774585718

Last commits in those repos are 5 and 7 years ago.

heavyset_go · 2026-03-27T04:53:42 1774587222

If they're vendoring the dependency anyway, that wouldn't matter much if they're not using features that were added since 2021.

The last release of jsonata was mid 2025, and there hasn't been new features since the last 2022 release until the latest, so it's likely those other ports are fine.

hrmtst93837 · 2026-03-27T06:23:10 1774592590

[flagged]

grey-area · 2026-03-27T08:22:45 1774599765

Now they have 13k lines of someone else’s mess (the AIs) to manage instead.

lossoth · 2026-03-28T11:05:15 1774695915

But this is a different kind of problem.

With legacy systems, at least the complexity was somewhat anticipated early in the design process (even if it was incorrect).

With automatically generated code, you get something that "works" but with a much vaguer underlying model, which makes it harder to understand when things start to go wrong.

In both cases, the real cost comes later, when you're forced to debug under pressure.

cjonas · 2026-03-26T13:44:43 1774532683

Honestly memory seems like an overcomplicating way to solve this problem in the context of something like a coding agent. Rules and Skills are much more explicit, less noisy and easier to maintain. Just requires having an always on rule/system prompt to always update rules/skills as designs or architecture changes in a way that conflicts with old rules. Memories maybe make more sense for personal assistant use cases.

cjonas · 2026-03-24T13:49:53 1774360193

This is basically "reactive UI" foundation. The complexities come from effects (now managed via hooks)

cjonas · 2026-03-21T13:09:33 1774098573

We just create mini data "ponds" on the fly by copying tenant isolated gold tier data to parquet in s3. The users/agent queries are executed with duckdb. We run this process when the user start a session and generate an STS token scoped to their tenant bucket path. Its extremely simple and works well (at least with our data volumes).

leetrout · 2026-03-22T00:40:37 1774140037

I built something on top of DuckDB last year but it never got deployed. They wanted to trust Postgres.

I didn't use the in browser WASM but I did expose an api endpoint that passed data exploration queries directly to the backend like a knock off of what new relic does. I also use that same endpoint for all the graphs and metrics in the UI. Just filtered out the write / delete statements in a rudimentary way.

DuckDB is phenomenal tech and I love to use it with data ponds instead of data lakes although it is very capable of large sets as well.

And "data pond"? Glad I am not alone using this term! Somewhere between a data lake and warehouse - still unstructured but not _everything_ in one place. For instance, if I have a multi-tenant app I might choose to have a duckdb setup for each customer with pre-filtered data living alongside some global unstructured data.

Maybe there's already a term that covers this but I like the imagery of the metaphor... "smaller, multiple data but same idea as the big one".

mattaitken · 2026-03-21T14:17:43 1774102663

This is cool. I think for our use case this wouldn’t work. We’re dealing with billions of rows for some tenants.

We’re about to introduce alerts where users can write their own TRQL queries and then define alerts from them. Which requires evaluating them regularly so effectively the data needs to be continuously up to date.

SOLAR_FIELDS · 2026-03-21T19:05:49 1774119949

Billions still seems crunchable for DDB. It’s however much you can stuff into your RAM no? Billions is still consumer grade machine RAM depending on the data. Trillions I would start to worry. But you can have a super fat spot instance where the crunching happens and expose a light client on top of that then no?

Quadrillions, yeah go find yourself a trino spark pipeline

Waterluvian · 2026-03-21T13:20:20 1774099220

Is that why it’s called DuckDb? Because data ponds?

QuantumNomad_ · 2026-03-21T13:54:32 1774101272

The DuckDB website has the following to say about the name:

> Why call it DuckDB?

> Ducks are amazing animals. They can fly, walk and swim. They can also live off pretty much everything. They are quite resilient to environmental challenges. A duck's song will bring people back from the dead and inspires database research. They are thus the perfect mascot for a versatile and resilient data management system.

https://duckdb.org/faq#why-call-it-duckdb

cjonas · 2026-03-21T13:42:59 1774100579

Idk but I named everything in the related code "duckpond" :)

mritchie712 · 2026-03-21T13:48:35 1774100915

Hannes (one of the creators) had a pet duck

otterley · 2026-03-21T14:18:48 1774102728

How large are these data volumes? How long does it take to prepare the data when a customer request comes in?

cjonas · 2026-03-21T14:42:04 1774104124

Small. We're dealing with financial accounts, holdings and transactions. So a user might have 10 accounts, thousands of holdings, 10s of thousands of transactions. Plus a handful of supplemental data tables. Then there is market data that is shared across tenants and updated on interval. This data is maybe 10-20M rows.

Just to clarify, the data is prepared when the user (agent) analytics session starts. Right now it takes 5-10s, which means it's typically ready well before the agent has actually determined it needs to run any queries. I think for larger volumes, pg_duckdb would allow this to scale to 10s of millions rows pretty efficiently.

boundlessdreamz · 2026-03-21T13:26:56 1774099616

How do you copy all the relevant data? Doesn't this create unnecessary load on your source DB?

cjonas · 2026-03-21T13:57:34 1774101454

We have various data sources (which is another benefit of this approach). Data from the application DB is currently pulled using the FE apis which handle tenant isolation and allow the application database to deal with the load. I think pg_duckdb could be a good solution here as well, but haven't gotten around to testing it. Other data come from analytics DB. Most of this is landed on an interval via pipeline scripts.

cjonas · 2026-03-19T23:13:25 1773962005

For any sufficiently large codebase, the agent only ever has a very % of the code loaded into context. Context engineering strategies like "skills" allow the agent to more efficiently discover the key information required to produce consistent code.