| 3DV | 2025 | 360-GS: Layout-Guided Panoramic Gaussian Splatting for Indoor Roaming | Jiayang Bai; Letian Huang; Jie Guo; Wen Gong; Yuanqi Li; Yanwen Guo | 3D Gaussian Splatting (3D-GS) has recently attracted great attention with real-time and photo-realistic renderings. This technique typically takes perspective images as input and optimizes a set of 3D… | — | — | — | — | — | — | — |
| 3DV | 2025 | 3D Reconstruction with Spatial Memory | Hengyi Wang; Lourdes Agapito | We present Spann3R, a novel approach for dense 3D reconstruction from ordered or unordered image collections. Built on the DUSt3R paradigm, Spann3R uses a transformer-based architecture to directly re… | — | — | — | — | — | — | — |
| 3DV | 2025 | 3D Whole-Body Grasp Synthesis with Directional Controllability | Georgios Paschalidis; Romana Wilschut; Dimitrije Antic; Omid Taheri; Dimitrios Tzionas | Synthesizing 3D whole bodies that realistically grasp objects is useful for animation, mixed reality, and robotics. This is challenging, because the hands and body need to look natural w.r.t. each oth… | — | — | — | — | — | — | — |
| 3DV | 2025 | 3D-GPT: Procedural 3D Modeling with Large Language Models | Chunyi Sun; Junlin Han; Weijian Deng; Xinlong Wang; Zishan Qin; Stephen Gould | In the pursuit of efficient automated content creation, procedural generation, leveraging modifiable parameters and rule-based systems, has emerged as a promising approach. Nonetheless, this can be a… | — | — | — | — | — | — | — |
| 3DV | 2025 | 3Diface: Synthesizing and Editing Holistic 3D Facial Animation | Balamurugan Thambiraja; Malte Prinzler; Sadegh Aliakbarian; Darren Cosker; Justus Thies | Creating personalized 3D animations with precise control and realistic head motions remains challenging for current speech-driven 3D facial animation methods. Editing these animations is especially co… | — | — | — | — | — | — | — |
| 3DV | 2025 | 4D-Editor: Interactive Object-Level Editing in Dynamic Neural Radiance Fields via Semantic Distillation | Dadong Jiang; Zhihui Ke; Xiaobo Zhou; Tie Qiu; Xidong Shi; Hao Yan | This paper targets interactive object-level editing (e.g., deletion, recoloring, transformation, composition) in dynamic scenes. Recently, some methods aiming for flexible editing static scenes repres… | — | — | — | — | — | — | — |
| 3DV | 2025 | A Large-Scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining | Qi Ma; Yue Li; Bin Ren; Nicu Sebe; Ender Konukoglu; Theo Gevers; Luc Van Gool; Danda Pani Paudel | 3D Gaussian Splatting (3DGS) has become the de facto method of 3D representation in many vision tasks. This calls for the 3D understanding directly in this representation space. To facilitate the rese… | — | — | — | — | — | — | — |
| 3DV | 2025 | A Robust Translation Synchronization Algorithm | Zihang He; Hang Ruan; Qixing Huang | This paper introduces a robust translation synchronization approach which takes relative directions between pairs of images as inputs and outputs absolute image locations. Our approach is based on a g… | — | — | — | — | — | — | — |
| 3DV | 2025 | A2-GNN: Angle-Annular GNN for Visual Descriptor-Free Camera Relocalization | Yejun Zhang; Shuzhe Wang; Juho Kannala | Visual localization involves estimating the 6-degree-of-freedom (6-DoF) camera pose within a known scene. A critical step in this process is identifying pixel-to-point correspondences between 2D query… | — | — | — | — | — | — | — |
| 3DV | 2025 | AG-MAE: Anatomically Guided Spatio-Temporal Masked Auto-Encoder for Online Hand Gesture Recognition | Omar Ikne; Benjamin Allaert; Hazem Wannous | Hand gesture recognition plays a crucial role in the domain of computer vision, as it enhances human-computer interaction by enabling intuitive, touch-free control and communication. While offline met… | — | — | — | — | — | — | — |
| 3DV | 2025 | AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones | Xuqian Ren; Matias Turkulainen; Jiepeng Wang; Otto Seiskari; Iaroslav Melekhov; Juho Kannala; Esa Rahtu | Geometric priors are often used to enhance 3D reconstruction. With many smartphones featuring low-resolution depth sensors and the prevalence of off-the-shelf monocular geometry estimators, incorporat… | — | — | — | — | — | — | — |
| 3DV | 2025 | ARC-Flow: Articulated, Resolution-Agnostic, Correspondence-Free Matching and Interpolation of 3D Shapes Under Flow Fields | Adam Hartshorne; Allen Paul; Tony Shardlow; Neill D. F. Campbell | This work presents a unified framework for the unsupervised prediction of physically plausible interpolations between two 3D articulated shapes and the automatic estimation of dense correspondence bet… | — | — | — | — | — | — | — |
| 3DV | 2025 | An Object is Worth 64×64 Pixels: Generating 3D Object via Image Diffusion | Xingguang Yan; Han-Hung Lee; Ziyu Wan; Angel X. Chang | We introduce a new approach for generating realistic 3D models with UV maps through a representation termed “Object Images.” This approach encapsulates surface geometry, appearance, and patch structur… | — | — | — | — | — | — | — |
| 3DV | 2025 | Approximate 2D-3D Shape Matching for Interactive Applications | Christoph Petzsch; Paul Roetzer; Zorah Lähner; Florian Bernard | Matching a 2D contour to a non-rigidly deformed 3D mesh is a challenging problem due to ambiguities arising from dimensionality differences. In the past, product graph based methods were only able to… | — | — | — | — | — | — | — |
| 3DV | 2025 | AutoVFX: Physically Realistic Video Editing from Natural Language Instructions | Hao-Yu Hsu; Chih-Hao Lin; Albert J. Zhai; Hongchi Xia; Shenlong Wang | Modern visual effects (VFX) software has made it possible for skilled artists to create imagery of virtually anything. However, the creation process remains laborious, complex, and largely inaccessibl… | — | — | — | — | — | — | — |
| 3DV | 2025 | Betsu-Betsu: Multi-View Separable 3D Reconstruction of Two Interacting Objects | Suhas Gopal; Rishabh Dabral; Vladislav Golyanik; Christian Theobalt | Separable 3D reconstruction of multiple objects from multi-view RGB images—resulting in two different 3D shapes for the two objects with a clear separation between them—remains a sparsely researched p… | — | — | — | — | — | — | — |
| 3DV | 2025 | BiGS: Bidirectional Primitives for Relightable 3D Gaussian Splatting | Zhenyuan Liu; Yu Guo; Xinyuan Li; Bernd Bickel; Ran Zhang | We present BiGS, an image-based novel view synthesis technique designed to model and render 3D objects with surface and volumetric materials under dynamic illumination, achieving real-time relighting… | — | — | — | — | — | — | — |
| 3DV | 2025 | CFPNet: Improving Lightweight ToF Depth Completion via Cross-Zone Feature Propagation | Laiyan Ding; Hualie Jiang; Rui Xu; Rui Huang | Depth completion using lightweight time-of-fight (ToF) depth sensors is attractive due to their low cost. However, lightweight ToF sensors usually have a limited field of view (FOV) compared with came… | — | — | — | — | — | — | — |
| 3DV | 2025 | CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control | Stefan Popov; Amit Raj; Michael Krainin; Yuanzhen Li; William T. Freeman; Michael Rubinstein | We propose a method for generating fly-through videos of a scene, from a single image and a given camera trajectory. We build upon an image-to-video latent diffusion model [5]. We condition its UNet [… | — | — | — | — | — | — | — |
| 3DV | 2025 | CameraHMR: Aligning People with Perspective | Priyanka Patel; Michael J. Black | We address the challenge of accurate 3D human pose and shape estimation from monocular images. The key to accuracy and robustness lies in high-quality training data. Existing training datasets contain… | — | — | — | — | — | — | — |
| 3DV | 2025 | CatFree3D: Category-Agnostic 3D Object Detection with Diffusion | Wenjing Bian; Zirui Wang; Andrea Vedaldi | Image-based 3D object detection is widely employed in applications such as autonomous vehicles and robotics, yet current systems struggle with generalisation due to complex problem setup and limited t… | — | — | — | — | — | — | — |
| 3DV | 2025 | CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences | Huajian Zeng; Maolin Gao; Daniel Cremers | The interest in matching non-rigidly deformed shapes represented as raw point clouds is rising due to the prolif-eration of low-cost 3D sensors. Yet, the task is challenging since point clouds are irr… | — | — | — | — | — | — | — |
| 3DV | 2025 | Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting | Zhiqi Li; Yiming Chen; Lingzhe Zhao; Peidong Liu | While text-to-3D and image-to-3D generation tasks have received considerable attention, one important but under-explored field between them is controllable text-to-3D generation, which we mainly focus… | — | — | — | — | — | — | — |
| 3DV | 2025 | Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints | Chuan Fang; Yuan Dong; Kunming Luo; Xiaotao Hu; Rakesh Shrestha; Ping Tan | Text-driven 3D indoor scene generation is useful for gaming, film industry, and AR/VR applications. However, existing methods cannot faithfully capture the scene layout based on text descriptions, nor… | — | — | — | — | — | — | — |
| 3DV | 2025 | DEGAS: Detailed Expressions on Full-Body Gaussian Avatars | Zhijing Shao; Duotun Wang; Qing-Yao Tian; Yao-Dong Yang; Hengyu Meng; Zeyu Cai; Bo Dong; Yu Zhang; Kang Zhang; Zeyu Wang | Although neural rendering has made significant ad-vances in creating lifelike, animatable full-body and head avatars, incorporating detailed expressions into full-body avatars remains largely unexplor… | — | — | — | — | — | — | — |
| 3DV | 2025 | Deep Polycuboid Fitting for Compact 3D Representation of Indoor Scenes | Gahye Lee; Hyejeong Yoon; Jungeon Kim; Seungyong Lee | This paper presents a novel framework for compactly representing a 3D indoor scene using a set of polycuboids through a deep learning-based fitting method. Indoor scenes mainly consist of man-made obj… | — | — | — | — | — | — | — |
| 3DV | 2025 | DeforHMR: Vision Transformer with Deformable Cross-Attention for 3D Human Mesh Recovery | Jaewoo Heo; George Hu; Zeyu Wang; Serena Yeung-Levy | Human Mesh Recovery (HMR) is an important yet chal-lenging problem with applications across various domains including motion capture, augmented reality, and biome-chanics. Accurately predicting human… | — | — | — | — | — | — | — |
| 3DV | 2025 | Denoising Monte Carlo Renders with Diffusion Models | Vaibhav Vavilala; Rahul Vasanth; David A. Forsyth | Physically-based renderings contain Monte Carlo noise, with variance that increases as the number of rays per pixel decreases. This noise, while zero-mean for good modern renderers, can have heavy tai… | — | — | — | — | — | — | — |
| 3DV | 2025 | Direct and Explicit 3D Generation from a Single Image | Haoyu Wu; Meher Gitika Karumuri; Chuhang Zou; Seungbae Bang; Yuelong Li; Dimitris Samaras; Sunil Hadap | Current image-to-3D approaches suffer from high computational costs and lack scalability for high-resolution outputs. In contrast, we introduce a novel framework to directly generate explicit surface… | — | — | — | — | — | — | — |
| 3DV | 2025 | Dream-in-Style: Text-to-3D Generation Using Stylized Score Distillation | Hubert Kompanowski; Binh-Son Hua | We present a method to generate 3D objects in styles. Our method takes a text prompt and a style reference image as input and reconstructs a neural radiance field to synthesize a 3D model with the con… | — | — | — | — | — | — | — |