> Pelican for Fable 5 on default settings is a clear improvement on Opus 4.8
And doesn't contain any actual criticism within the comment (your blog post might, but just referring to what was posted on HN, which is a bit booster-y on its own).
The entire pelican benchmark is a joke. The joke is that, for all of the billions of dollars poured into these things and the claims of PhD level intelligence, they still draw pelicans not-much-better than a five year-old would.
I don't spell that joke out in every comment I post here because that wouldn't be very funny.
They didn’t give up, the average Google engineer was never on board and hasn’t even tried agenetic programming beyond what they were required to do not to get reprimanded.
At no point has Google engineering culture actually embraced this at the ground level. This isn’t a change, this is the existing disconnect between the workers and the managers.
reply