We present GP-4DGS, a novel framework that integrates Gaussian Processes (GPs) into 4D Gaussian Splatting (4DGS) for principled probabilistic modeling of dynamic scenes. While existing 4DGS methods focus on deterministic reconstruction, they are inherently limited in capturing motion ambiguity and lack mechanisms to assess prediction reliability.
By leveraging the kernel-based probabilistic nature of GPs, our approach introduces three key capabilities: (i) uncertainty quantification for motion predictions, (ii) motion estimation for unobserved or sparsely sampled regions, and (iii) temporal extrapolation beyond observed training frames. To scale GPs to the large number of Gaussian primitives in 4DGS, we design spatio-temporal kernels that capture the correlation structure of deformation fields and adopt variational Gaussian Processes with inducing points for tractable inference. Our experiments show that GP-4DGS enhances reconstruction quality while providing reliable uncertainty estimates that effectively identify regions of high motion ambiguity.
Metrics: mPSNR ↑, mSSIM ↑, mLPIPS ↓. GP-4DGS consistently achieves superior results, with the largest gains on the Challenging subset (reduced viewpoint overlap).
| Split | Method | mPSNR ↑ | mSSIM ↑ | mLPIPS ↓ |
|---|---|---|---|---|
| All Scenes (7) | ||||
| All | Gaussian Marbles | 15.84 | 0.54 | 0.57 |
| All | SoM | 17.09 | 0.65 | 0.39 |
| All | GP-4DGS (ours) | 17.38 | 0.65 | 0.37 |
| SoM 5 Scenes | ||||
| SoM 5 | SC-GS | 14.13 | 0.48 | 0.49 |
| SoM 5 | D-3DGS | 11.92 | 0.49 | 0.66 |
| SoM 5 | 4DGS | 13.42 | 0.49 | 0.56 |
| SoM 5 | HyperNeRF | 15.99 | 0.59 | 0.51 |
| SoM 5 | SoM | 16.73 | 0.64 | 0.43 |
| SoM 5 | GP-4DGS (ours) | 16.92 | 0.66 | 0.41 |
| Challenging Subset (reduced viewpoint overlap) | ||||
| Challenging | Gaussian Marbles | 14.05 | 0.40 | 0.61 |
| Challenging | SoM | 14.56 | 0.46 | 0.53 |
| Challenging | GP-4DGS (ours) | 15.02 | 0.46 | 0.51 |
Table 1. GP-4DGS surpasses all baselines across every split. The performance gap widens on the Challenging subset, demonstrating robustness under sparse observations.
PSNR ↑ evaluated on the last 5 and 15 frames held out from training. GP-4DGS dramatically outperforms naïve linear extrapolation, especially for periodic motion.
| Method | Periodic Motion | Non-periodic Motion | ||
|---|---|---|---|---|
| 5 frames | 15 frames | 5 frames | 15 frames | |
| Linear extrapolation | 11.55 | 8.11 | 15.02 | 11.92 |
| GP-4DGS (ours) | 17.62 | 16.65 | 15.27 | 13.22 |
Table 2. The periodic temporal kernel captures cyclic structure effectively, yielding a large PSNR gain (17.62 vs. 11.55 at 5 frames) over linear extrapolation.