Haofei Xu (徐豪飞)

I am a PhD student at ETH Zurich and the University of Tübingen, supervised by Marc Pollefeys and Andreas Geiger.

Prior to my PhD, I worked with Jianfei Cai and Hamid Rezatofighi at Monash University, Australia. I obtained my master's degree from the University of Science and Technology of China (USTC), supervised by Juyong Zhang. During my master's, I was an exchange student at Nanyang Technological University (NTU), Singapore, where I was supervised by Jianfei Cai and Jianmin Zheng. I also interned at Microsoft Research Asia (MSRA), where I was mentored by Jiaolong Yang and Xin Tong.

I am honored to have been named a 2025 Apple Scholar in AI/ML and to have received the Top Reviewer Award (NeurIPS 2024) and the Outstanding Reviewer Award (CVPR 2022).

Email / Google Scholar / X / Bluesky / GitHub

Selected Publications

I have broad interests in computer vision, particularly in fundamental research problems like dense correspondences, motion, 3D and video representation learning. I like to explore simple and effective approaches to solving fundamental challenges. Please see the full publication list on Google Scholar.

DepthSplat: Connecting Gaussian Splatting and Depth
Haofei Xu, Songyou Peng, Fangjinhua Wang, Hermann Blum, Daniel Barath, Andreas Geiger, Marc Pollefeys
Computer Vision and Pattern Recognition (CVPR), 2025
paper / project page / code
Cross-task interactions between feed-forward Gaussian splatting and depth.

PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
Cheng Zhang, Haofei Xu, Qianyi Wu, Camilo Cruz Gambardella, Dinh Phung, Jianfei Cai
Computer Vision and Pattern Recognition (CVPR), 2025
paper / project page / code / interactive 4K 360 demo
4K panorama synthesis with a single feed-forward inference.

No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
Botao Ye, Sifei Liu, Haofei Xu, Xueting Li, Marc Pollefeys, Ming-Hsuan Yang, Songyou Peng
International Conference on Learning Representations (ICLR), 2025 (Oral)
paper / project page / code
Unposed 3DGS reconstruction made easy.

MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
Yuedong Chen, Chuanxia Zheng, Haofei Xu, Bohan Zhuang, Andrea Vedaldi, Tat-Jen Cham, Jianfei Cai
Neural Information Processing Systems (NeurIPS), 2024
paper / project page / code
Empowering MVSplat with a video diffusion model.

MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
Yuedong Chen, Haofei Xu, Chuanxia Zheng, Bohan Zhuang, Marc Pollefeys, Andreas Geiger, Tat-Jen Cham, Jianfei Cai
European Conference on Computer Vision (ECCV), 2024 (Oral)
paper / project page / code
A cost volume representation for efficiently predicting 3D Gaussians from sparse multi-view images in a single feed-forward inference.

LaRa: Efficient Large-Baseline Radiance Fields
Anpei Chen, Haofei Xu, Stefano Esposito, Siyu Tang, Andreas Geiger
European Conference on Computer Vision (ECCV), 2024
paper / project page / code
A feed-forward 2DGS model trained in two days using four GPUs.

Unifying Flow, Stereo and Depth Estimation
Haofei Xu, Jing Zhang, Jianfei Cai, Hamid Rezatofighi, Fisher Yu, Dacheng Tao, Andreas Geiger
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
paper / project page / slides / video(cn) / colab / demo / code
A unified dense correspondence matching formulation enables three motion and 3D perception tasks to be solved with a unified model.

GMFlow: Learning Optical Flow via Global Matching
Haofei Xu, Jing Zhang, Jianfei Cai, Hamid Rezatofighi, Dacheng Tao
Computer Vision and Pattern Recognition (CVPR), 2022 (Oral)
paper / slides / video(cn) / poster / code
Learning cross-view features with a Transformer enables optical flow to be solved by directly comparing feature similarities.

High-Resolution Optical Flow from 1D Attention and Correlation
Haofei Xu, Jiaolong Yang, Jianfei Cai, Juyong Zhang, Xin Tong
International Conference on Computer Vision (ICCV), 2021 (Oral)
paper / code
Factorizing 2D optical flow with 1D attention and 1D correlation enables 4K resolution optical flow estimation on standard GPUs.

AANet: Adaptive Aggregation Network for Efficient Stereo Matching
Haofei Xu, Juyong Zhang
Computer Vision and Pattern Recognition (CVPR), 2020
paper / code
A sparse points-based cost aggregation method leads to an efficient and accurate stereo matching architecture without any 3D convolutions.

Invited Talks

Learning to Splat, Huawei, Jun 3, 2025
DepthSplat: Connecting Gaussian Splatting and Depth, Google DeepMind, hosted by Ben Poole, Oct 29, 2024
Unifying Flow, Stereo and Depth Estimation [slides], Synced, Dec 28, 2022
GMFlow: Learning Optical Flow via Global Matching [slides], Monash University, Apr 13, 2022

Teaching

Head Teaching Assistant, 3D Vision, Spring 2025
Teaching Assistant, Computer Vision, Fall 2024
Teaching Assistant, Stochastics and Machine Learning, Spring 2024

Academic Services

Conference Reviewer: ICCV 2021, CVPR 2022, ECCV 2022, CVPR 2023, NeurIPS 2023, CVPR 2024, ECCV 2024, NeurIPS 2024, CVPR 2025, ICCV 2025, NeurIPS 2025
Journal Reviewer: TIP, IJCV, TPAMI

Awards

Apple Scholar in AI/ML, 2025
Top Reviewer, NeurIPS 2024
Outstanding Reviewer, CVPR 2022
1st place, Argoverse Stereo Challenge, CVPR 2022 Workshop on Autonomous Driving
National Scholarship, 2016

Thanks to Jon Barron for the website's source code.