The AGI timeline debate is louder than ever, but the tools to measure whether we're actually getting closer may be the weakest link in the chain.
Two things happened in the same news cycle that don't seem related but are: AI systems began reliably generating their own GPU kernels, and a rigorous benchmark revealed that enterprise AI agents fail
The most interesting AI story this week isn't a model release — it's a pair of founders turning down $20 million and betting the open-source agent wave hasn't crested yet.
This week's most meaningful signal isn't a frontier model release — it's the quiet maturation of the layer underneath: post-training tooling, weight format governance, and inference infrastructure tha