Enhanced Monocular 3D Object Reconstruction
We revisit the Splatter Image framework and show how semantics from vision-language models, depth priors, and tailored loss functions can resolve ambiguity, improving view consistency and geometric accuracy.