* The generation pipeline code and dataset will be made publicly available after peer-review.
Scaling up LiDAR Scene Flow with Synthetic Data. We present SynFlow, a data generation pipeline leveraging the CARLA simulator to synthesize diverse, perfectly labeled LiDAR scene flow data (center). While real-world datasets are often constrained by high annotation costs and limited scenario diversity, SynFlow provides a scalable source of dense, noise-free supervision for learning robust motion priors. As shown in the results (right), models trained on SynFlow-4k dataset achieve strong zero-shot generalization on real-world benchmarks and significantly outperform in-domain baselines when fine-tuned on a small subset of real data. More details available in the main paper pdf.
@article{zhang2026synflow,
author = {Zhang, Qingwen and Zhu, Xiaomeng and Jiang, Chenhan and Jensfelt, Patric},
title = {SynFlow: Scaling Up LiDAR Scene Flow Estimation with Synthetic Data},
journal = {arXiv preprint arXiv:2604.09411},
year = {2026},
}