CV+ML Paper Atlas

Dataset Preview

cvml_atlas_all.xlsx  ·  22.1 MB  ·  generated 2026-04-28

summary 2 cols 15 rows

| Field | Value |
| --- | --- |
| Citations as of | 2026-04-28 |
| Total papers | 111939 |
| CVPR | 31548 |
| ICCV | 12564 |
| ECCV | 11760 |
| 3DV | 1617 |
| NeurIPS | 25165 |
| ICML | 17032 |
| ICLR | 12253 |
| Year range | 1987 ~ 2025 |
| With DOI | 57855 (51.7%) |
| With abstract | 37432 (33.4%) |
| Total citations | 7693370 |
| Mean citations | 133.4 |
| Median citations | 28 |
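
The summary figures can be cross-checked against each other. The sketch below is only an illustration: the numbers are copied from the summary sheet, and the helper `pct` is a hypothetical name, not part of the dataset tooling. It checks that the per-venue counts add up to the total and recomputes the two percentages.

```python
# Values copied from the summary sheet; nothing is read from the xlsx here.
venue_counts = {
    "CVPR": 31548, "ICCV": 12564, "ECCV": 11760, "3DV": 1617,
    "NeurIPS": 25165, "ICML": 17032, "ICLR": 12253,
}
total_papers = 111939
with_doi = 57855
with_abstract = 37432


def pct(part: int, whole: int) -> float:
    """Share of `part` in `whole` as a percentage, rounded to one decimal."""
    return round(100 * part / whole, 1)


venue_sum = sum(venue_counts.values())
doi_pct = pct(with_doi, total_papers)
abstract_pct = pct(with_abstract, total_papers)

print("venue counts sum to total:", venue_sum == total_papers)
print("with DOI %:", doi_pct)
print("with abstract %:", abstract_pct)
```

Both recomputed percentages agree with the values listed in the sheet, and the seven venue counts account for every paper.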

by_year_pivot 9 cols 39 rows

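| year | 3DV | CVPR | ECCV | ICCV | ICLR | ICML | NeurIPS | total |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 2025 | 142 | 3530 | 0 | 794 | 3704 | 3342 | 0 | 11512 |
| 2024 | 154 | 3531 | 2958 | 0 | 2454 | 2611 | 4495 | 16203 |
| 2023 | 0 | 3051 | 8 | 2654 | 1793 | 1829 | 3541 | 12876 |
| 2022 | 72 | 2627 | 2050 | 0 | 1094 | 1249 | 2852 | 9944 |
| 2021 | 141 | 2174 | 0 | 2076 | 860 | 1183 | 2564 | 8998 |
| 2020 | 125 | 1984 | 1674 | 0 | 687 | 1085 | 1919 | 7474 |
| 2019 | 79 | 1882 | 6 | 1635 | 502 | 774 | 1449 | 6327 |
| 2018 | 81 | 1326 | 1116 | 0 | 531 | 621 | 1010 | 4685 |
| 2017 | 73 | 1064 | 0 | 990 | 309 | 434 | 678 | 3548 |
| 2016 | 75 | 836 | 622 | 0 | 81 | 322 | 567 | 2503 |
| 2015 | 75 | 733 | 4 | 659 | 106 | 270 | 404 | 2251 |
| 2014 | 119 | 666 | 573 | 0 | 75 | 310 | 412 | 2155 |
| 2013 | 56 | 627 | 0 | 578 | 57 | 283 | 361 | 1962 |
| 2012 | 75 | 609 | 621 | 0 | 0 | 267 | 372 | 1944 |
| 2011 | 55 | 583 | 0 | 646 | 0 | 159 | 345 | 1788 |
| 2010 | 0 | 692 | 394 | 0 | 0 | 159 | 315 | 1560 |
| 2009 | 0 | 525 | 0 | 598 | 0 | 179 | 264 | 1566 |
| 2008 | 0 | 741 | 248 | 0 | 0 | 158 | 251 | 1398 |
| 2007 | 53 | 537 | 1 | 406 | 0 | 151 | 218 | 1366 |
| 2006 | 0 | 517 | 255 | 0 | 0 | 141 | 204 | 1117 |
| 2005 | 74 | 521 | 0 | 267 | 0 | 135 | 207 | 1204 |
| 2004 | 0 | 446 | 301 | 0 | 0 | 118 | 208 | 1073 |
| 2003 | 61 | 307 | 0 | 199 | 0 | 118 | 199 | 884 |
| 2002 | 0 | 0 | 250 | 0 | 0 | 88 | 207 | 545 |
| 2001 | 49 | 273 | 0 | 219 | 0 | 80 | 199 | 820 |
| 2000 | 0 | 229 | 118 | 1 | 0 | 151 | 154 | 653 |
| 1999 | 58 | 192 | 0 | 198 | 0 | 54 | 151 | 653 |
| 1998 | 0 | 143 | 114 | 167 | 0 | 66 | 152 | 642 |
| 1997 | 0 | 171 | 0 | 0 | 0 | 48 | 157 | 376 |
| 1996 | 0 | 137 | 145 | 0 | 0 | 67 | 152 | 501 |
| 1995 | 0 | 0 | 0 | 162 | 0 | 72 | 153 | 387 |
| 1994 | 0 | 165 | 116 | 0 | 0 | 46 | 141 | 468 |
| 1993 | 0 | 186 | 0 | 101 | 0 | 45 | 159 | 491 |
| 1992 | 0 | 159 | 106 | 0 | 0 | 60 | 128 | 453 |
| 1991 | 0 | 146 | 0 | 0 | 0 | 128 | 145 | 419 |
| 1990 | 0 | 0 | 80 | 126 | 0 | 51 | 144 | 401 |
| 1989 | 0 | 97 | 0 | 0 | 0 | 128 | 102 | 327 |
| 1988 | 0 | 141 | 0 | 88 | 0 | 50 | 96 | 375 |
| 1987 | 0 | 0 | 0 | 0 | 0 | 0 | 90 | 90 |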
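
In each pivot row, the `total` column should equal the sum of the seven venue columns. The following is a minimal sketch of that consistency check, using three rows copied from the table; the names `pivot_rows` and `bad_years` are illustrative only.

```python
# Venue order follows the pivot header: 3DV, CVPR, ECCV, ICCV, ICLR, ICML, NeurIPS.
# Each entry maps year -> (per-venue counts, listed total), copied from the table.
pivot_rows = {
    2025: ([142, 3530, 0, 794, 3704, 3342, 0], 11512),
    2024: ([154, 3531, 2958, 0, 2454, 2611, 4495], 16203),
    1987: ([0, 0, 0, 0, 0, 0, 90], 90),
}

# Collect any year whose venue counts do not add up to the listed total.
bad_years = [
    year for year, (counts, total) in pivot_rows.items()
    if sum(counts) != total
]

print("rows with inconsistent totals:", bad_years)
```

An empty list means every sampled row is internally consistent; the same check extends to all 39 rows once the sheet is loaded.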

by_year_detail 8 cols 185 rows · first 30 shown

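| year | venue | papers | with_doi | with_abstract | total_citations | mean_citations | abstract_coverage_% |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 2025 | 3DV | 142 | 142 | 141 | 2363 | 20.0 | 99.3 |
| 2025 | CVPR | 3530 | 2870 | 2852 | 51873 | 18.8 | 80.8 |
| 2025 | ICCV | 794 | 794 | 788 | 1969 | 4.5 | 99.2 |
| 2025 | ICLR | 3704 | 0 | 0 | 0 |  | 0.0 |
| 2025 | ICML | 3342 | 0 | 0 | 0 |  | 0.0 |
| 2024 | 3DV | 154 | 154 | 154 | 3680 | 26.1 | 100.0 |
| 2024 | CVPR | 3531 | 3531 | 3510 | 69531 | 20.4 | 99.4 |
| 2024 | ECCV | 2958 | 2958 | 331 | 10927 | 13.6 | 11.2 |
| 2024 | ICLR | 2454 | 0 | 0 | 0 |  | 0.0 |
| 2024 | ICML | 2611 | 0 | 0 | 0 |  | 0.0 |
| 2024 | NeurIPS | 4495 | 0 | 0 | 0 |  | 0.0 |
| 2023 | CVPR | 3051 | 3051 | 3050 | 146162 | 48.6 | 100.0 |
| 2023 | ECCV | 8 | 8 | 0 | 29 | 4.1 | 0.0 |
| 2023 | ICCV | 2654 | 2654 | 2624 | 148221 | 57.3 | 98.9 |
| 2023 | ICLR | 1793 | 0 | 0 | 0 |  | 0.0 |
| 2023 | ICML | 1829 | 0 | 0 | 0 |  | 0.0 |
| 2023 | NeurIPS | 3541 | 0 | 0 | 0 |  | 0.0 |
| 2022 | 3DV | 72 | 72 | 71 | 2100 | 29.6 | 98.6 |
| 2022 | CVPR | 2627 | 2627 | 2626 | 220693 | 87.1 | 100.0 |
| 2022 | ECCV | 2050 | 2050 | 372 | 41344 | 50.8 | 18.1 |
| 2022 | ICLR | 1094 | 0 | 0 | 0 |  | 0.0 |
| 2022 | ICML | 1249 | 0 | 0 | 0 |  | 0.0 |
| 2022 | NeurIPS | 2852 | 0 | 0 | 0 |  | 0.0 |
| 2021 | 3DV | 141 | 141 | 140 | 5167 | 38.6 | 99.3 |
| 2021 | CVPR | 2174 | 2172 | 2171 | 184828 | 89.7 | 99.9 |
| 2021 | ICCV | 2076 | 2076 | 2064 | 268413 | 131.1 | 99.4 |
| 2021 | ICLR | 860 | 0 | 152 | 4374 | 31.2 | 17.7 |
| 2021 | ICML | 1183 | 0 | 148 | 2057 | 10.4 | 12.5 |
| 2021 | NeurIPS | 2564 | 0 | 201 | 5266 | 12.6 | 7.8 |
| 2020 | 3DV | 125 | 125 | 122 | 3233 | 28.4 | 97.6 |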
+ 155 more rows — download the xlsx to see all 185.
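
The `abstract_coverage_%` column appears to be `with_abstract` divided by `papers`, expressed as a percentage with one decimal. The page does not state the formula, so treat that as an assumption; the sketch below recomputes it for five rows copied from the sheet and lists any that disagree. The function name `coverage` is illustrative.

```python
# (year, venue, papers, with_abstract, listed abstract_coverage_%)
# Rows copied from the by_year_detail sheet.
detail_rows = [
    (2025, "3DV", 142, 141, 99.3),
    (2025, "CVPR", 3530, 2852, 80.8),
    (2024, "ECCV", 2958, 331, 11.2),
    (2021, "ICLR", 860, 152, 17.7),
    (2021, "NeurIPS", 2564, 201, 7.8),
]


def coverage(papers: int, with_abstract: int) -> float:
    """Assumed formula: share of papers that have an abstract, in percent."""
    return round(100 * with_abstract / papers, 1)


# Keep every row whose recomputed coverage differs from the listed value.
mismatches = [
    (year, venue)
    for year, venue, papers, with_abstract, listed in detail_rows
    if coverage(papers, with_abstract) != listed
]

print("rows where the recomputed coverage differs:", mismatches)
```

For these five rows the recomputed values match the listed ones, which supports the assumed formula without confirming it for the remaining rows.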

top_cited_100 6 cols 100 rows · first 30 shown

| venue | year | title | authors | cited_by_count | doi |
| --- | --- | --- | --- | --- | --- |
| CVPR | 2016 | Deep Residual Learning for Image Recognition | Kaiming He; Xiangyu Zhang; Shaoqing Ren; Jian Sun | 226101 | 10.1109/cvpr.2016.90 |
| CVPR | 2009 | ImageNet: A large-scale hierarchical image database | Jia Deng; Wei Dong; Richard Socher; Li-Jia Li; Kai Li; Li Fei-Fei | 72461 | 10.1109/cvpr.2009.5206848 |
| ECCV | 2014 | Microsoft COCO: Common Objects in Context | Tsung-Yi Lin; Michael Maire; Serge J. Belongie; James Hays; Pietro Perona; Deva Ramanan; Piotr Dollár; C. Lawrence Zitnick | 52004 | 10.1007/978-3-319-10602-1_48 |
| CVPR | 2015 | Going deeper with convolutions | Christian Szegedy; Wei Liu; Yangqing Jia; Pierre Sermanet; Scott E. Reed; Dragomir Anguelov; Dumitru Erhan; Vincent Vanhoucke; Andrew Rabinovich | 47004 | 10.1109/cvpr.2015.7298594 |
| CVPR | 2016 | You Only Look Once: Unified, Real-Time Object Detection | Joseph Redmon; Santosh Kumar Divvala; Ross B. Girshick; Ali Farhadi | 44780 | 10.1109/cvpr.2016.91 |
| CVPR | 2017 | Densely Connected Convolutional Networks | Gao Huang; Zhuang Liu; Laurens van der Maaten; Kilian Q. Weinberger | 42658 | 10.1109/cvpr.2017.243 |
| CVPR | 2015 | Fully convolutional networks for semantic segmentation | Jonathan Long; Evan Shelhamer; Trevor Darrell | 41592 | 10.1109/cvpr.2015.7298965 |
| CVPR | 2005 | Histograms of Oriented Gradients for Human Detection | Navneet Dalal; Bill Triggs | 35486 | 10.1109/cvpr.2005.177 |
| ECCV | 2016 | SSD: Single Shot MultiBox Detector | Wei Liu; Dragomir Anguelov; Dumitru Erhan; Christian Szegedy; Scott E. Reed; Cheng-Yang Fu; Alexander C. Berg | 34496 | 10.1007/978-3-319-46448-0_2 |
| CVPR | 2018 | Squeeze-and-Excitation Networks | Jie Hu; Li Shen; Gang Sun | 33468 | 10.1109/cvpr.2018.00745 |
| ICCV | 2021 | Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | Ze Liu; Yutong Lin; Yue Cao; Han Hu; Yixuan Wei; Zheng Zhang; Stephen Lin; Baining Guo | 31373 | 10.1109/iccv48922.2021.00986 |
| ICCV | 2017 | Focal Loss for Dense Object Detection | Tsung-Yi Lin; Priya Goyal; Ross B. Girshick; Kaiming He; Piotr Dollár | 30994 | 10.1109/iccv.2017.324 |
| CVPR | 2016 | Rethinking the Inception Architecture for Computer Vision | Christian Szegedy; Vincent Vanhoucke; Sergey Ioffe; Jonathon Shlens; Zbigniew Wojna | 30685 | 10.1109/cvpr.2016.308 |
| CVPR | 2014 | Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation | Ross B. Girshick; Jeff Donahue; Trevor Darrell; Jitendra Malik | 28779 | 10.1109/cvpr.2014.81 |
| CVPR | 2017 | Feature Pyramid Networks for Object Detection | Tsung-Yi Lin; Piotr Dollár; Ross B. Girshick; Kaiming He; Bharath Hariharan; Serge J. Belongie | 26438 | 10.1109/cvpr.2017.106 |
| CVPR | 2018 | MobileNetV2: Inverted Residuals and Linear Bottlenecks | Mark Sandler; Andrew G. Howard; Menglong Zhu; Andrey Zhmoginov; Liang-Chieh Chen | 23995 | 10.1109/cvpr.2018.00474 |
| ECCV | 2018 | CBAM: Convolutional Block Attention Module | Sanghyun Woo; Jongchan Park; Joon-Young Lee; In So Kweon | 22888 | 10.1007/978-3-030-01234-2_1 |
| CVPR | 2017 | Image-to-Image Translation with Conditional Adversarial Networks | Phillip Isola; Jun-Yan Zhu; Tinghui Zhou; Alexei A. Efros | 22073 | 10.1109/cvpr.2017.632 |
| ICCV | 2015 | Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification | Kaiming He; Xiangyu Zhang; Shaoqing Ren; Jian Sun | 20360 | 10.1109/iccv.2015.123 |
| ICCV | 1999 | Object Recognition from Local Scale-Invariant Features | David G. Lowe | 18716 | 10.1109/iccv.1999.790410 |
| ECCV | 2020 | End-to-End Object Detection with Transformers | Nicolas Carion; Francisco Massa; Gabriel Synnaeve; Nicolas Usunier; Alexander Kirillov; Sergey Zagoruyko | 17822 | 10.1007/978-3-030-58452-8_13 |
| CVPR | 1997 | Normalized Cuts and Image Segmentation | Jianbo Shi; Jitendra Malik | 17802 | 10.1109/cvpr.1997.609407 |
| CVPR | 2017 | Xception: Deep Learning with Depthwise Separable Convolutions | François Chollet | 17471 | 10.1109/cvpr.2017.195 |
| CVPR | 2017 | YOLO9000: Better, Faster, Stronger | Joseph Redmon; Ali Farhadi | 17444 | 10.1109/cvpr.2017.690 |
| CVPR | 2017 | PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation | Charles Ruizhongtai Qi; Hao Su; Kaichun Mo; Leonidas J. Guibas | 17293 | 10.1109/cvpr.2017.16 |
| CVPR | 2018 | The Unreasonable Effectiveness of Deep Features as a Perceptual Metric | Richard Zhang; Phillip Isola; Alexei A. Efros; Eli Shechtman; Oliver Wang | 17069 | 10.1109/cvpr.2018.00068 |
| ECCV | 2014 | Visualizing and Understanding Convolutional Networks | Matthew D. Zeiler; Rob Fergus | 16932 | 10.1007/978-3-319-10590-1_53 |
| ECCV | 2018 | Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation | Liang-Chieh Chen; Yukun Zhu; George Papandreou; Florian Schroff; Hartwig Adam | 16313 | 10.1007/978-3-030-01234-2_49 |
| CVPR | 2012 | Are we ready for autonomous driving? The KITTI vision benchmark suite | Andreas Geiger; Philip Lenz; Raquel Urtasun | 14918 | 10.1109/cvpr.2012.6248074 |
| CVPR | 2015 | FaceNet: A unified embedding for face recognition and clustering | Florian Schroff; Dmitry Kalenichenko; James Philbin | 14694 | 10.1109/cvpr.2015.7298682 |
+ 70 more rows — download the xlsx to see all 100.
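
The `authors` field stores names as a single string separated by semicolons, so per-author analysis needs a split step first. The sketch below, which uses four rows copied from the sheet and the hypothetical helper `split_authors`, counts how often each author appears in that small sample.

```python
from collections import Counter

# (title, authors) pairs copied from rows of the top_cited_100 sheet.
sample_rows = [
    ("Deep Residual Learning for Image Recognition",
     "Kaiming He; Xiangyu Zhang; Shaoqing Ren; Jian Sun"),
    ("Focal Loss for Dense Object Detection",
     "Tsung-Yi Lin; Priya Goyal; Ross B. Girshick; Kaiming He; Piotr Dollár"),
    ("Feature Pyramid Networks for Object Detection",
     "Tsung-Yi Lin; Piotr Dollár; Ross B. Girshick; Kaiming He; "
     "Bharath Hariharan; Serge J. Belongie"),
    ("Normalized Cuts and Image Segmentation",
     "Jianbo Shi; Jitendra Malik"),
]


def split_authors(field: str) -> list[str]:
    """Split a semicolon-separated authors string into trimmed names."""
    return [name.strip() for name in field.split(";") if name.strip()]


# Tally appearances per author across the sample.
author_counts = Counter(
    name for _, authors in sample_rows for name in split_authors(authors)
)

for name, count in author_counts.most_common(3):
    print(name, count)
```

Splitting on `;` and trimming whitespace, rather than splitting on `"; "`, tolerates rows where the spacing around the separator varies.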

papers 12 cols 111,939 rows · first 30 shown

| venue | year | title | authors | abstract | cited_by_count | doi | ee | pages | dblp_key | openalex_id | venues_all |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 3DV | 2025 | 360-GS: Layout-Guided Panoramic Gaussian Splatting for Indoor Roaming | Jiayang Bai; Letian Huang; Jie Guo; Wen Gong; Yuanqi Li; Yanwen Guo | 3D Gaussian Splatting (3D-GS) has recently attracted great attention with real-time and photo-realistic renderings. This technique typically takes perspective images as input and optimizes a set of 3D… | | | | | | | |
| 3DV | 2025 | 3D Reconstruction with Spatial Memory | Hengyi Wang; Lourdes Agapito | We present Spann3R, a novel approach for dense 3D reconstruction from ordered or unordered image collections. Built on the DUSt3R paradigm, Spann3R uses a transformer-based architecture to directly re… | | | | | | | |
| 3DV | 2025 | 3D Whole-Body Grasp Synthesis with Directional Controllability | Georgios Paschalidis; Romana Wilschut; Dimitrije Antic; Omid Taheri; Dimitrios Tzionas | Synthesizing 3D whole bodies that realistically grasp objects is useful for animation, mixed reality, and robotics. This is challenging, because the hands and body need to look natural w.r.t. each oth… | | | | | | | |
| 3DV | 2025 | 3D-GPT: Procedural 3D Modeling with Large Language Models | Chunyi Sun; Junlin Han; Weijian Deng; Xinlong Wang; Zishan Qin; Stephen Gould | In the pursuit of efficient automated content creation, procedural generation, leveraging modifiable parameters and rule-based systems, has emerged as a promising approach. Nonetheless, this can be a… | | | | | | | |
| 3DV | 2025 | 3Diface: Synthesizing and Editing Holistic 3D Facial Animation | Balamurugan Thambiraja; Malte Prinzler; Sadegh Aliakbarian; Darren Cosker; Justus Thies | Creating personalized 3D animations with precise control and realistic head motions remains challenging for current speech-driven 3D facial animation methods. Editing these animations is especially co… | | | | | | | |
| 3DV | 2025 | 4D-Editor: Interactive Object-Level Editing in Dynamic Neural Radiance Fields via Semantic Distillation | Dadong Jiang; Zhihui Ke; Xiaobo Zhou; Tie Qiu; Xidong Shi; Hao Yan | This paper targets interactive object-level editing (e.g., deletion, recoloring, transformation, composition) in dynamic scenes. Recently, some methods aiming for flexible editing static scenes repres… | | | | | | | |
| 3DV | 2025 | A Large-Scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining | Qi Ma; Yue Li; Bin Ren; Nicu Sebe; Ender Konukoglu; Theo Gevers; Luc Van Gool; Danda Pani Paudel | 3D Gaussian Splatting (3DGS) has become the de facto method of 3D representation in many vision tasks. This calls for the 3D understanding directly in this representation space. To facilitate the rese… | | | | | | | |
| 3DV | 2025 | A Robust Translation Synchronization Algorithm | Zihang He; Hang Ruan; Qixing Huang | This paper introduces a robust translation synchronization approach which takes relative directions between pairs of images as inputs and outputs absolute image locations. Our approach is based on a g… | | | | | | | |
| 3DV | 2025 | A2-GNN: Angle-Annular GNN for Visual Descriptor-Free Camera Relocalization | Yejun Zhang; Shuzhe Wang; Juho Kannala | Visual localization involves estimating the 6-degree-of-freedom (6-DoF) camera pose within a known scene. A critical step in this process is identifying pixel-to-point correspondences between 2D query… | | | | | | | |
| 3DV | 2025 | AG-MAE: Anatomically Guided Spatio-Temporal Masked Auto-Encoder for Online Hand Gesture Recognition | Omar Ikne; Benjamin Allaert; Hazem Wannous | Hand gesture recognition plays a crucial role in the domain of computer vision, as it enhances human-computer interaction by enabling intuitive, touch-free control and communication. While offline met… | | | | | | | |
| 3DV | 2025 | AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using Smartphones | Xuqian Ren; Matias Turkulainen; Jiepeng Wang; Otto Seiskari; Iaroslav Melekhov; Juho Kannala; Esa Rahtu | Geometric priors are often used to enhance 3D reconstruction. With many smartphones featuring low-resolution depth sensors and the prevalence of off-the-shelf monocular geometry estimators, incorporat… | | | | | | | |
| 3DV | 2025 | ARC-Flow: Articulated, Resolution-Agnostic, Correspondence-Free Matching and Interpolation of 3D Shapes Under Flow Fields | Adam Hartshorne; Allen Paul; Tony Shardlow; Neill D. F. Campbell | This work presents a unified framework for the unsupervised prediction of physically plausible interpolations between two 3D articulated shapes and the automatic estimation of dense correspondence bet… | | | | | | | |
| 3DV | 2025 | An Object is Worth 64×64 Pixels: Generating 3D Object via Image Diffusion | Xingguang Yan; Han-Hung Lee; Ziyu Wan; Angel X. Chang | We introduce a new approach for generating realistic 3D models with UV maps through a representation termed “Object Images.” This approach encapsulates surface geometry, appearance, and patch structur… | | | | | | | |
| 3DV | 2025 | Approximate 2D-3D Shape Matching for Interactive Applications | Christoph Petzsch; Paul Roetzer; Zorah Lähner; Florian Bernard | Matching a 2D contour to a non-rigidly deformed 3D mesh is a challenging problem due to ambiguities arising from dimensionality differences. In the past, product graph based methods were only able to… | | | | | | | |
| 3DV | 2025 | AutoVFX: Physically Realistic Video Editing from Natural Language Instructions | Hao-Yu Hsu; Chih-Hao Lin; Albert J. Zhai; Hongchi Xia; Shenlong Wang | Modern visual effects (VFX) software has made it possible for skilled artists to create imagery of virtually anything. However, the creation process remains laborious, complex, and largely inaccessibl… | | | | | | | |
| 3DV | 2025 | Betsu-Betsu: Multi-View Separable 3D Reconstruction of Two Interacting Objects | Suhas Gopal; Rishabh Dabral; Vladislav Golyanik; Christian Theobalt | Separable 3D reconstruction of multiple objects from multi-view RGB images—resulting in two different 3D shapes for the two objects with a clear separation between them—remains a sparsely researched p… | | | | | | | |
| 3DV | 2025 | BiGS: Bidirectional Primitives for Relightable 3D Gaussian Splatting | Zhenyuan Liu; Yu Guo; Xinyuan Li; Bernd Bickel; Ran Zhang | We present BiGS, an image-based novel view synthesis technique designed to model and render 3D objects with surface and volumetric materials under dynamic illumination, achieving real-time relighting… | | | | | | | |
| 3DV | 2025 | CFPNet: Improving Lightweight ToF Depth Completion via Cross-Zone Feature Propagation | Laiyan Ding; Hualie Jiang; Rui Xu; Rui Huang | Depth completion using lightweight time-of-flight (ToF) depth sensors is attractive due to their low cost. However, lightweight ToF sensors usually have a limited field of view (FOV) compared with came… | | | | | | | |
| 3DV | 2025 | CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control | Stefan Popov; Amit Raj; Michael Krainin; Yuanzhen Li; William T. Freeman; Michael Rubinstein | We propose a method for generating fly-through videos of a scene, from a single image and a given camera trajectory. We build upon an image-to-video latent diffusion model [5]. We condition its UNet […  | | | | | | | |
| 3DV | 2025 | CameraHMR: Aligning People with Perspective | Priyanka Patel; Michael J. Black | We address the challenge of accurate 3D human pose and shape estimation from monocular images. The key to accuracy and robustness lies in high-quality training data. Existing training datasets contain… | | | | | | | |
| 3DV | 2025 | CatFree3D: Category-Agnostic 3D Object Detection with Diffusion | Wenjing Bian; Zirui Wang; Andrea Vedaldi | Image-based 3D object detection is widely employed in applications such as autonomous vehicles and robotics, yet current systems struggle with generalisation due to complex problem setup and limited t… | | | | | | | |
| 3DV | 2025 | CoE: Deep Coupled Embedding for Non-Rigid Point Cloud Correspondences | Huajian Zeng; Maolin Gao; Daniel Cremers | The interest in matching non-rigidly deformed shapes represented as raw point clouds is rising due to the proliferation of low-cost 3D sensors. Yet, the task is challenging since point clouds are irr… | | | | | | | |
| 3DV | 2025 | Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting | Zhiqi Li; Yiming Chen; Lingzhe Zhao; Peidong Liu | While text-to-3D and image-to-3D generation tasks have received considerable attention, one important but under-explored field between them is controllable text-to-3D generation, which we mainly focus… | | | | | | | |
| 3DV | 2025 | Ctrl-Room: Controllable Text-to-3D Room Meshes Generation with Layout Constraints | Chuan Fang; Yuan Dong; Kunming Luo; Xiaotao Hu; Rakesh Shrestha; Ping Tan | Text-driven 3D indoor scene generation is useful for gaming, film industry, and AR/VR applications. However, existing methods cannot faithfully capture the scene layout based on text descriptions, nor… | | | | | | | |
| 3DV | 2025 | DEGAS: Detailed Expressions on Full-Body Gaussian Avatars | Zhijing Shao; Duotun Wang; Qing-Yao Tian; Yao-Dong Yang; Hengyu Meng; Zeyu Cai; Bo Dong; Yu Zhang; Kang Zhang; Zeyu Wang | Although neural rendering has made significant advances in creating lifelike, animatable full-body and head avatars, incorporating detailed expressions into full-body avatars remains largely unexplor… | | | | | | | |
| 3DV | 2025 | Deep Polycuboid Fitting for Compact 3D Representation of Indoor Scenes | Gahye Lee; Hyejeong Yoon; Jungeon Kim; Seungyong Lee | This paper presents a novel framework for compactly representing a 3D indoor scene using a set of polycuboids through a deep learning-based fitting method. Indoor scenes mainly consist of man-made obj… | | | | | | | |
| 3DV | 2025 | DeforHMR: Vision Transformer with Deformable Cross-Attention for 3D Human Mesh Recovery | Jaewoo Heo; George Hu; Zeyu Wang; Serena Yeung-Levy | Human Mesh Recovery (HMR) is an important yet challenging problem with applications across various domains including motion capture, augmented reality, and biomechanics. Accurately predicting human… | | | | | | | |
| 3DV | 2025 | Denoising Monte Carlo Renders with Diffusion Models | Vaibhav Vavilala; Rahul Vasanth; David A. Forsyth | Physically-based renderings contain Monte Carlo noise, with variance that increases as the number of rays per pixel decreases. This noise, while zero-mean for good modern renderers, can have heavy tai… | | | | | | | |
| 3DV | 2025 | Direct and Explicit 3D Generation from a Single Image | Haoyu Wu; Meher Gitika Karumuri; Chuhang Zou; Seungbae Bang; Yuelong Li; Dimitris Samaras; Sunil Hadap | Current image-to-3D approaches suffer from high computational costs and lack scalability for high-resolution outputs. In contrast, we introduce a novel framework to directly generate explicit surface… | | | | | | | |
| 3DV | 2025 | Dream-in-Style: Text-to-3D Generation Using Stylized Score Distillation | Hubert Kompanowski; Binh-Son Hua | We present a method to generate 3D objects in styles. Our method takes a text prompt and a style reference image as input and reconstructs a neural radiance field to synthesize a 3D model with the con… | | | | | | | |
+ 111,909 more rows — download the xlsx to see all 111,939.