The AGI timeline debate is louder than ever, but the tools to measure whether we're actually getting closer may be the weakest link in the chain.
Two seemingly unrelated things happened in the same news cycle, and they are in fact connected: AI systems began reliably generating their own GPU kernels, and a rigorous benchmark revealed that enterprise AI agents fail