Two things happened in the same news cycle that don't seem related but are: AI systems began reliably generating their own GPU kernels, and a rigorous benchmark revealed that enterprise AI agents fail