Dataset Preview

Field	Value
Citations as of	2026-04-28
Total papers	111939
CVPR	31548
ICCV	12564
ECCV	11760
3DV	1617
NeurIPS	25165
ICML	17032
ICLR	12253
Year range	1987 ~ 2025
With DOI	57855 (51.7%)
With abstract	37432 (33.4%)
Total citations	7693370
Mean citations	133.4
Median citations	28

year	3DV	CVPR	ECCV	ICCV	ICLR	ICML	NeurIPS	total
2025	142	3530	0	794	3704	3342	0	11512
2024	154	3531	2958	0	2454	2611	4495	16203
2023	0	3051	8	2654	1793	1829	3541	12876
2022	72	2627	2050	0	1094	1249	2852	9944
2021	141	2174	0	2076	860	1183	2564	8998
2020	125	1984	1674	0	687	1085	1919	7474
2019	79	1882	6	1635	502	774	1449	6327
2018	81	1326	1116	0	531	621	1010	4685
2017	73	1064	0	990	309	434	678	3548
2016	75	836	622	0	81	322	567	2503
2015	75	733	4	659	106	270	404	2251
2014	119	666	573	0	75	310	412	2155
2013	56	627	0	578	57	283	361	1962
2012	75	609	621	0	0	267	372	1944
2011	55	583	0	646	0	159	345	1788
2010	0	692	394	0	0	159	315	1560
2009	0	525	0	598	0	179	264	1566
2008	0	741	248	0	0	158	251	1398
2007	53	537	1	406	0	151	218	1366
2006	0	517	255	0	0	141	204	1117
2005	74	521	0	267	0	135	207	1204
2004	0	446	301	0	0	118	208	1073
2003	61	307	0	199	0	118	199	884
2002	0	0	250	0	0	88	207	545
2001	49	273	0	219	0	80	199	820
2000	0	229	118	1	0	151	154	653
1999	58	192	0	198	0	54	151	653
1998	0	143	114	167	0	66	152	642
1997	0	171	0	0	0	48	157	376
1996	0	137	145	0	0	67	152	501
1995	0	0	0	162	0	72	153	387
1994	0	165	116	0	0	46	141	468
1993	0	186	0	101	0	45	159	491
1992	0	159	106	0	0	60	128	453
1991	0	146	0	0	0	128	145	419
1990	0	0	80	126	0	51	144	401
1989	0	97	0	0	0	128	102	327
1988	0	141	0	88	0	50	96	375
1987	0	0	0	0	0	0	90	90

year	venue	papers	with_doi	with_abstract	total_citations	mean_citations	abstract_coverage_%
2025	3DV	142	142	141	2363	20.0	99.3
2025	CVPR	3530	2870	2852	51873	18.8	80.8
2025	ICCV	794	794	788	1969	4.5	99.2
2025	ICLR	3704	0	0	0	—	0.0
2025	ICML	3342	0	0	0	—	0.0
2024	3DV	154	154	154	3680	26.1	100.0
2024	CVPR	3531	3531	3510	69531	20.4	99.4
2024	ECCV	2958	2958	331	10927	13.6	11.2
2024	ICLR	2454	0	0	0	—	0.0
2024	ICML	2611	0	0	0	—	0.0
2024	NeurIPS	4495	0	0	0	—	0.0
2023	CVPR	3051	3051	3050	146162	48.6	100.0
2023	ECCV	8	8	0	29	4.1	0.0
2023	ICCV	2654	2654	2624	148221	57.3	98.9
2023	ICLR	1793	0	0	0	—	0.0
2023	ICML	1829	0	0	0	—	0.0
2023	NeurIPS	3541	0	0	0	—	0.0
2022	3DV	72	72	71	2100	29.6	98.6
2022	CVPR	2627	2627	2626	220693	87.1	100.0
2022	ECCV	2050	2050	372	41344	50.8	18.1
2022	ICLR	1094	0	0	0	—	0.0
2022	ICML	1249	0	0	0	—	0.0
2022	NeurIPS	2852	0	0	0	—	0.0
2021	3DV	141	141	140	5167	38.6	99.3
2021	CVPR	2174	2172	2171	184828	89.7	99.9
2021	ICCV	2076	2076	2064	268413	131.1	99.4
2021	ICLR	860	0	152	4374	31.2	17.7
2021	ICML	1183	0	148	2057	10.4	12.5
2021	NeurIPS	2564	0	201	5266	12.6	7.8
2020	3DV	125	125	122	3233	28.4	97.6

top_cited_100 6 cols 100 rows · first 30 shown

venue	year	title	authors	cited_by_count	doi
CVPR	2016	Deep Residual Learning for Image Recognition	Kaiming He; Xiangyu Zhang; Shaoqing Ren; Jian Sun	226101	10.1109/cvpr.2016.90
CVPR	2009	ImageNet: A large-scale hierarchical image database	Jia Deng; Wei Dong; Richard Socher; Li-Jia Li; Kai Li; Li Fei-Fei	72461	10.1109/cvpr.2009.5206848
ECCV	2014	Microsoft COCO: Common Objects in Context	Tsung-Yi Lin; Michael Maire; Serge J. Belongie; James Hays; Pietro Perona; Deva Ramanan; Piotr Dollár; C. Lawrence Zitnick	52004	10.1007/978-3-319-10602-1_48
CVPR	2015	Going deeper with convolutions	Christian Szegedy; Wei Liu; Yangqing Jia; Pierre Sermanet; Scott E. Reed; Dragomir Anguelov; Dumitru Erhan; Vincent Vanhoucke; Andrew Rabinovich	47004	10.1109/cvpr.2015.7298594
CVPR	2016	You Only Look Once: Unified, Real-Time Object Detection	Joseph Redmon; Santosh Kumar Divvala; Ross B. Girshick; Ali Farhadi	44780	10.1109/cvpr.2016.91
CVPR	2017	Densely Connected Convolutional Networks	Gao Huang; Zhuang Liu; Laurens van der Maaten; Kilian Q. Weinberger	42658	10.1109/cvpr.2017.243
CVPR	2015	Fully convolutional networks for semantic segmentation	Jonathan Long; Evan Shelhamer; Trevor Darrell	41592	10.1109/cvpr.2015.7298965
CVPR	2005	Histograms of Oriented Gradients for Human Detection	Navneet Dalal; Bill Triggs	35486	10.1109/cvpr.2005.177
ECCV	2016	SSD: Single Shot MultiBox Detector	Wei Liu; Dragomir Anguelov; Dumitru Erhan; Christian Szegedy; Scott E. Reed; Cheng-Yang Fu; Alexander C. Berg	34496	10.1007/978-3-319-46448-0_2
CVPR	2018	Squeeze-and-Excitation Networks	Jie Hu; Li Shen; Gang Sun	33468	10.1109/cvpr.2018.00745
ICCV	2021	Swin Transformer: Hierarchical Vision Transformer using Shifted Windows	Ze Liu; Yutong Lin; Yue Cao; Han Hu; Yixuan Wei; Zheng Zhang; Stephen Lin; Baining Guo	31373	10.1109/iccv48922.2021.00986
ICCV	2017	Focal Loss for Dense Object Detection	Tsung-Yi Lin; Priya Goyal; Ross B. Girshick; Kaiming He; Piotr Dollár	30994	10.1109/iccv.2017.324
CVPR	2016	Rethinking the Inception Architecture for Computer Vision	Christian Szegedy; Vincent Vanhoucke; Sergey Ioffe; Jonathon Shlens; Zbigniew Wojna	30685	10.1109/cvpr.2016.308
CVPR	2014	Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation	Ross B. Girshick; Jeff Donahue; Trevor Darrell; Jitendra Malik	28779	10.1109/cvpr.2014.81
CVPR	2017	Feature Pyramid Networks for Object Detection	Tsung-Yi Lin; Piotr Dollár; Ross B. Girshick; Kaiming He; Bharath Hariharan; Serge J. Belongie	26438	10.1109/cvpr.2017.106
CVPR	2018	MobileNetV2: Inverted Residuals and Linear Bottlenecks	Mark Sandler; Andrew G. Howard; Menglong Zhu; Andrey Zhmoginov; Liang-Chieh Chen	23995	10.1109/cvpr.2018.00474
ECCV	2018	CBAM: Convolutional Block Attention Module	Sanghyun Woo; Jongchan Park; Joon-Young Lee; In So Kweon	22888	10.1007/978-3-030-01234-2_1
CVPR	2017	Image-to-Image Translation with Conditional Adversarial Networks	Phillip Isola; Jun-Yan Zhu; Tinghui Zhou; Alexei A. Efros	22073	10.1109/cvpr.2017.632
ICCV	2015	Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification	Kaiming He; Xiangyu Zhang; Shaoqing Ren; Jian Sun	20360	10.1109/iccv.2015.123
ICCV	1999	Object Recognition from Local Scale-Invariant Features	David G. Lowe	18716	10.1109/iccv.1999.790410
ECCV	2020	End-to-End Object Detection with Transformers	Nicolas Carion; Francisco Massa; Gabriel Synnaeve; Nicolas Usunier; Alexander Kirillov; Sergey Zagoruyko	17822	10.1007/978-3-030-58452-8_13
CVPR	1997	Normalized Cuts and Image Segmentation	Jianbo Shi; Jitendra Malik	17802	10.1109/cvpr.1997.609407
CVPR	2017	Xception: Deep Learning with Depthwise Separable Convolutions	François Chollet	17471	10.1109/cvpr.2017.195
CVPR	2017	YOLO9000: Better, Faster, Stronger	Joseph Redmon; Ali Farhadi	17444	10.1109/cvpr.2017.690
CVPR	2017	PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation	Charles Ruizhongtai Qi; Hao Su; Kaichun Mo; Leonidas J. Guibas	17293	10.1109/cvpr.2017.16
CVPR	2018	The Unreasonable Effectiveness of Deep Features as a Perceptual Metric	Richard Zhang; Phillip Isola; Alexei A. Efros; Eli Shechtman; Oliver Wang	17069	10.1109/cvpr.2018.00068
ECCV	2014	Visualizing and Understanding Convolutional Networks	Matthew D. Zeiler; Rob Fergus	16932	10.1007/978-3-319-10590-1_53
ECCV	2018	Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation	Liang-Chieh Chen; Yukun Zhu; George Papandreou; Florian Schroff; Hartwig Adam	16313	10.1007/978-3-030-01234-2_49
CVPR	2012	Are we ready for autonomous driving? The KITTI vision benchmark suite	Andreas Geiger; Philip Lenz; Raquel Urtasun	14918	10.1109/cvpr.2012.6248074
CVPR	2015	FaceNet: A unified embedding for face recognition and clustering	Florian Schroff; Dmitry Kalenichenko; James Philbin	14694	10.1109/cvpr.2015.7298682

+ 70 more rows — download the xlsx to see all 100.

papers 12 cols 111,939 rows · first 30 shown

venue	year	title	authors	abstract	cited_by_count	doi	ee	pages	dblp_key	openalex_id	venues_all
3DV	2025	360-GS: Layout-Guided Panoramic Gaussian Splatting for Indoor Roaming	Jiayang Bai; Letian Huang; Jie Guo; Wen Gong; Yuanqi Li; Yanwen Guo	3D Gaussian Splatting (3D-GS) has recently attracted great attention with real-time and photo-realistic renderings. This technique typically takes perspective images as input and optimizes a set of 3D…	—	—	—	—	—	—	—
3DV	2025	3D Reconstruction with Spatial Memory	Hengyi Wang; Lourdes Agapito	We present Spann3R, a novel approach for dense 3D reconstruction from ordered or unordered image collections. Built on the DUSt3R paradigm, Spann3R uses a transformer-based architecture to directly re…	—	—	—	—	—	—	—
3DV	2025	3D Whole-Body Grasp Synthesis with Directional Controllability	Georgios Paschalidis; Romana Wilschut; Dimitrije Antic; Omid Taheri; Dimitrios Tzionas	Synthesizing 3D whole bodies that realistically grasp objects is useful for animation, mixed reality, and robotics. This is challenging, because the hands and body need to look natural w.r.t. each oth…	—	—	—	—	—	—	—
3DV	2025	3D-GPT: Procedural 3D Modeling with Large Language Models	Chunyi Sun; Junlin Han; Weijian Deng; Xinlong Wang; Zishan Qin; Stephen Gould	In the pursuit of efficient automated content creation, procedural generation, leveraging modifiable parameters and rule-based systems, has emerged as a promising approach. Nonetheless, this can be a…	—	—	—	—	—	—	—
3DV	2025	3Diface: Synthesizing and Editing Holistic 3D Facial Animation	Balamurugan Thambiraja; Malte Prinzler; Sadegh Aliakbarian; Darren Cosker; Justus Thies	Creating personalized 3D animations with precise control and realistic head motions remains challenging for current speech-driven 3D facial animation methods. Editing these animations is especially co…	—	—	—	—	—	—	—
3DV	2025	4D-Editor: Interactive Object-Level Editing in Dynamic Neural Radiance Fields via Semantic Distillation	Dadong Jiang; Zhihui Ke; Xiaobo Zhou; Tie Qiu; Xidong Shi; Hao Yan	This paper targets interactive object-level editing (e.g., deletion, recoloring, transformation, composition) in dynamic scenes. Recently, some methods aiming for flexible editing static scenes repres…	—	—	—	—	—	—	—
3DV	2025	A Large-Scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining	Qi Ma; Yue Li; Bin Ren; Nicu Sebe; Ender Konukoglu; Theo Gevers; Luc Van Gool; Danda Pani Paudel	3D Gaussian Splatting (3DGS) has become the de facto method of 3D representation in many vision tasks. This calls for the 3D understanding directly in this representation space. To facilitate the rese…	—	—	—	—	—	—	—
3DV	2025	A Robust Translation Synchronization Algorithm	Zihang He; Hang Ruan; Qixing Huang	This paper introduces a robust translation synchronization approach which takes relative directions between pairs of images as inputs and outputs absolute image locations. Our approach is based on a g…	—	—	—	—	—	—	—
3DV	2025	A2-GNN: Angle-Annular GNN for Visual Descriptor-Free Camera Relocalization	Yejun Zhang; Shuzhe Wang; Juho Kannala	Visual localization involves estimating the 6-degree-of-freedom (6-DoF) camera pose within a known scene. A critical step in this process is identifying pixel-to-point correspondences between 2D query…	—	—	—	—	—	—	—
3DV	2025	AG-MAE: Anatomically Guided Spatio-Temporal Masked Auto-Encoder for Online Hand Gesture Recognition	Omar Ikne; Benjamin Allaert; Hazem Wannous	Hand gesture recognition plays a crucial role in the domain of computer vision, as it enhances human-computer interaction by enabling intuitive, touch-free control and communication. While offline met…	—	—	—	—	—	—	—
3DV	2025	AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones	Xuqian Ren; Matias Turkulainen; Jiepeng Wang; Otto Seiskari; Iaroslav Melekhov; Juho Kannala; Esa Rahtu	Geometric priors are often used to enhance 3D reconstruction. With many smartphones featuring low-resolution depth sensors and the prevalence of off-the-shelf monocular geometry estimators, incorporat…	—	—	—	—	—	—	—
3DV	2025	ARC-Flow: Articulated, Resolution-Agnostic, Correspondence-Free Matching and Interpolation of 3D Shapes Under Flow Fields	Adam Hartshorne; Allen Paul; Tony Shardlow; Neill D. F. Campbell	This work presents a unified framework for the unsupervised prediction of physically plausible interpolations between two 3D articulated shapes and the automatic estimation of dense correspondence bet…	—	—	—	—	—	—	—
3DV	2025	An Object is Worth 64×64 Pixels: Generating 3D Object via Image Diffusion	Xingguang Yan; Han-Hung Lee; Ziyu Wan; Angel X. Chang	We introduce a new approach for generating realistic 3D models with UV maps through a representation termed “Object Images.” This approach encapsulates surface geometry, appearance, and patch structur…	—	—	—	—	—	—	—
3DV	2025	Approximate 2D-3D Shape Matching for Interactive Applications	Christoph Petzsch; Paul Roetzer; Zorah Lähner; Florian Bernard	Matching a 2D contour to a non-rigidly deformed 3D mesh is a challenging problem due to ambiguities arising from dimensionality differences. In the past, product graph based methods were only able to…	—	—	—	—	—	—	—
3DV	2025	AutoVFX: Physically Realistic Video Editing from Natural Language Instructions	Hao-Yu Hsu; Chih-Hao Lin; Albert J. Zhai; Hongchi Xia; Shenlong Wang	Modern visual effects (VFX) software has made it possible for skilled artists to create imagery of virtually anything. However, the creation process remains laborious, complex, and largely inaccessibl…	—	—	—	—	—	—	—
3DV	2025	Betsu-Betsu: Multi-View Separable 3D Reconstruction of Two Interacting Objects	Suhas Gopal; Rishabh Dabral; Vladislav Golyanik; Christian Theobalt	Separable 3D reconstruction of multiple objects from multi-view RGB images—resulting in two different 3D shapes for the two objects with a clear separation between them—remains a sparsely researched p…	—	—	—	—	—	—	—
3DV	2025	BiGS: Bidirectional Primitives for Relightable 3D Gaussian Splatting	Zhenyuan Liu; Yu Guo; Xinyuan Li; Bernd Bickel; Ran Zhang	We present BiGS, an image-based novel view synthesis technique designed to model and render 3D objects with surface and volumetric materials under dynamic illumination, achieving real-time relighting…	—	—	—	—	—	—	—
3DV	2025	CFPNet: Improving Lightweight ToF Depth Completion via Cross-Zone Feature Propagation	Laiyan Ding; Hualie Jiang; Rui Xu; Rui Huang	Depth completion using lightweight time-of-fight (ToF) depth sensors is attractive due to their low cost. However, lightweight ToF sensors usually have a limited field of view (FOV) compared with came…	—	—	—	—	—	—	—
3DV	2025	CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control	Stefan Popov; Amit Raj; Michael Krainin; Yuanzhen Li; William T. Freeman; Michael Rubinstein	We propose a method for generating fly-through videos of a scene, from a single image and a given camera trajectory. We build upon an image-to-video latent diffusion model [5]. We condition its UNet […	—	—	—	—	—	—	—
3DV	2025	CameraHMR: Aligning People with Perspective	Priyanka Patel; Michael J. Black	We address the challenge of accurate 3D human pose and shape estimation from monocular images. The key to accuracy and robustness lies in high-quality training data. Existing training datasets contain…	—	—	—	—	—	—	—
3DV	2025	CatFree3D: Category-Agnostic 3D Object Detection with Diffusion	Wenjing Bian; Zirui Wang; Andrea Vedaldi	Image-based 3D object detection is widely employed in applications such as autonomous vehicles and robotics, yet current systems struggle with generalisation due to complex problem setup and limited t…	—	—	—	—	—	—	—
3DV	2025	CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences	Huajian Zeng; Maolin Gao; Daniel Cremers	The interest in matching non-rigidly deformed shapes represented as raw point clouds is rising due to the prolif-eration of low-cost 3D sensors. Yet, the task is challenging since point clouds are irr…	—	—	—	—	—	—	—
3DV	2025	Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting	Zhiqi Li; Yiming Chen; Lingzhe Zhao; Peidong Liu	While text-to-3D and image-to-3D generation tasks have received considerable attention, one important but under-explored field between them is controllable text-to-3D generation, which we mainly focus…	—	—	—	—	—	—	—
3DV	2025	Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints	Chuan Fang; Yuan Dong; Kunming Luo; Xiaotao Hu; Rakesh Shrestha; Ping Tan	Text-driven 3D indoor scene generation is useful for gaming, film industry, and AR/VR applications. However, existing methods cannot faithfully capture the scene layout based on text descriptions, nor…	—	—	—	—	—	—	—
3DV	2025	DEGAS: Detailed Expressions on Full-Body Gaussian Avatars	Zhijing Shao; Duotun Wang; Qing-Yao Tian; Yao-Dong Yang; Hengyu Meng; Zeyu Cai; Bo Dong; Yu Zhang; Kang Zhang; Zeyu Wang	Although neural rendering has made significant ad-vances in creating lifelike, animatable full-body and head avatars, incorporating detailed expressions into full-body avatars remains largely unexplor…	—	—	—	—	—	—	—
3DV	2025	Deep Polycuboid Fitting for Compact 3D Representation of Indoor Scenes	Gahye Lee; Hyejeong Yoon; Jungeon Kim; Seungyong Lee	This paper presents a novel framework for compactly representing a 3D indoor scene using a set of polycuboids through a deep learning-based fitting method. Indoor scenes mainly consist of man-made obj…	—	—	—	—	—	—	—
3DV	2025	DeforHMR: Vision Transformer with Deformable Cross-Attention for 3D Human Mesh Recovery	Jaewoo Heo; George Hu; Zeyu Wang; Serena Yeung-Levy	Human Mesh Recovery (HMR) is an important yet chal-lenging problem with applications across various domains including motion capture, augmented reality, and biome-chanics. Accurately predicting human…	—	—	—	—	—	—	—
3DV	2025	Denoising Monte Carlo Renders with Diffusion Models	Vaibhav Vavilala; Rahul Vasanth; David A. Forsyth	Physically-based renderings contain Monte Carlo noise, with variance that increases as the number of rays per pixel decreases. This noise, while zero-mean for good modern renderers, can have heavy tai…	—	—	—	—	—	—	—
3DV	2025	Direct and Explicit 3D Generation from a Single Image	Haoyu Wu; Meher Gitika Karumuri; Chuhang Zou; Seungbae Bang; Yuelong Li; Dimitris Samaras; Sunil Hadap	Current image-to-3D approaches suffer from high computational costs and lack scalability for high-resolution outputs. In contrast, we introduce a novel framework to directly generate explicit surface…	—	—	—	—	—	—	—
3DV	2025	Dream-in-Style: Text-to-3D Generation Using Stylized Score Distillation	Hubert Kompanowski; Binh-Son Hua	We present a method to generate 3D objects in styles. Our method takes a text prompt and a style reference image as input and reconstructs a neural radiance field to synthesize a 3D model with the con…	—	—	—	—	—	—	—

+ 111,909 more rows — download the xlsx to see all 111,939.

summary 2 cols 15 rows

by_year_pivot 9 cols 39 rows

by_year_detail 8 cols 185 rows · first 30 shown

top_cited_100 6 cols 100 rows · first 30 shown

papers 12 cols 111,939 rows · first 30 shown