The still can’t pass the basic tests people cooked up years ago lmao. All these companies do is optimize for benchmarks and overfit for the most egregious shortcomings. The fundamental limitations of a neural net remain. Also just asked gpt5 how many Ls are in mammalian, it said 2 lol
Source: guy who cleans up coworker’s slop code as a significant portion of their day job
The still can’t pass the basic tests people cooked up years ago lmao. All these companies do is optimize for benchmarks and overfit for the most egregious shortcomings. The fundamental limitations of a neural net remain. Also just asked gpt5 how many Ls are in mammalian, it said 2 lol
Source: guy who cleans up coworker’s slop code as a significant portion of their day job