- Monash University
- Melbourne, Australia
- https://chengzhag.github.io/
- in/chengzhag
- https://scholar.google.com/citations?user=2uEyZJQAAAAJ&hl
Stars
[CVPR 2025] VGGT: Visual Geometry Grounded Transformer
(CVPR2025) Official repository of paper "Panorama Generation From NFoV Image Done Right"
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
3D Gaussian Splatting (3DGS) on fisheye cameras
[NeurIPS 2024] ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splatting
[WACV 2025] OmniGS: Fast Radiance Field Reconstruction using Omnidirectional Gaussian Splatting
🛠️ SLAM evaluation tool (supplement for EVO)
📍TextSLAM: Visual SLAM with Semantic Planar Text Features. (ICRA2020 & TPAMI2023)
🌳 [ICRA'25] Hier-SLAM: Semantic Gaussian Splatting SLAM with Hierarchical Categorical Representation
🏠 PyTorch implementation of our ICCV2021 paper: StructDepth: Leveraging the structural regularities for self-supervised indoor depth estimation
🤖 Dataset for TextSLAM: Visual SLAM with Semantic Planar Text Features. (ICRA2020 & TPAMI2023)
[CVPR'24] Toolkit for 360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries
[CVPR 2024] Photo-SLAM: Real-time Simultaneous Localization and Photorealistic Mapping for Monocular, Stereo, and RGB-D Cameras
Boosting Generative Novel View Synthesis with Sparse and Unposed Images
[CVPR 2025] Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
[CVPR 2025] Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models
🍳 [CVPR'24 Highlight] PyTorch implementation of "Taming Stable Diffusion for Text to 360° Panorama Image Generation"
Wan: Open and Advanced Large-Scale Video Generative Models
🍳 [CVPR'25] PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
[CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depth
[NeurIPS2023] PanoGRF: Generalizable Spherical Radiance Fields for Wide-baseline Panoramas (or 360-degree images)
[ECCV'24] On the Error Analysis of 3D Gaussian Splatting and an Optimal Projection Strategy
[NeurIPS2024] DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"