Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That could be it. I still see LLMs fail a set of static typing challenges that I created a couple years ago as a benchmark. Google models still fail it. I wonder if the lack of typing in a lot of the training data makes python harder to reason about?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: