ReSplat: Learning Recurrent Gaussian Splatting
Test-time scaling for feed-forward Gaussian splatting.
Final-year PhD student at ETH Zurich and University of Tübingen
Research Scientist at KE:SAI Open Science Lab
I am a final-year PhD student at ETH Zurich and University of Tübingen, advised by Marc Pollefeys and Andreas Geiger, and a Research Scientist at KE:SAI Open Science Lab. I previously interned at Google Zurich with Michael Niemeyer and Federico Tombari.
Before my PhD, I worked with Jianfei Cai and Hamid Rezatofighi at Monash University, and earned my master's from the University of Science and Technology of China with Juyong Zhang, including an exchange at NTU and a research internship at Microsoft Research Asia.
I am honored to have received the 2025 Apple Scholar in AI/ML, Gold Reviewer Award (ICML 2026), Top Reviewer Award (NeurIPS 2024), and Outstanding Reviewer Award (CVPR 2022).
I am looking for highly motivated students to explore frontier research topics together. If you are interested in working with me at ETH Zurich or KE:SAI Open Science Lab (Tübingen, Germany), please email me with your CV and a short research statement (for ETH), or apply here (for KE:SAI).
I have broad interests in computer vision and deep learning. I have worked on depth, stereo, optical flow, tracking, feed-forward NeRF/3DGS, and generative models. I am actively exploring new ideas, and I am currently interested in representation learning and generative models for efficient, scalable world models and physical AI. See my full publication list on Google Scholar.
Test-time scaling for feed-forward Gaussian splatting.
A simple yet powerful framework for large-displacement optical flow and point tracking.
The first data-driven multi-view 3D point tracker for tracking arbitrary 3D points across multiple cameras.
Cross-task interactions between feed-forward Gaussian splatting and depth.
4K panorama synthesis with a single feed-forward inference.
Unposed 3DGS reconstruction made easy.
A cost volume representation for efficiently predicting 3D Gaussians from sparse multi-view images in a single feed-forward inference.
A unified dense correspondence matching formulation enables three motion and 3D perception tasks to be solved with a unified model.
Factorizing 2D optical flow with 1D attention and 1D correlation enables 4K resolution optical flow estimation on standard GPUs.
A sparse points-based cost aggregation method leads to an efficient and accurate stereo matching architecture without any 3D convolutions.
I believe in the power of open source to advance science. Here are some of the key open-source projects stemming from my research and collaborations.