Toward video generative models of the molecular world