The bitter lesson of AI — simple models with tons of data always win — is harder to apply to robotics. You want actions as output, but your training data lacks actions in 3D worlds. That gap is the core unsolved problem.