It is ALL about real-world data. The debate over vision-only versus vision + multiple sensors is over.
Tesla has automated data annotation, with video feeds coming in from millions of cars all over the world every day, and is using that data for model training.
On the other hand, Waymo is paying drivers to drive around and collect data and HD maps. You cannot generate that data synthetically. Its fleet is very small and expensive, even compared with Uber. This is a dead end.