Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
MattSayar
16 days ago
|
parent
|
context
|
favorite
| on:
Claude Opus 4.7
I recognize the sarcasm. The data I can find says it's performing at baseline however?
https://marginlab.ai/trackers/claude-code/
ACCount37
16 days ago
[–]
Yeah, that's my point. Humans are not reliable LLM evaluators. "Secret model nerfs" happen in "vibes" far more often than they do in any reality.
Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
https://marginlab.ai/trackers/claude-code/