Sotabase
Home
Researchers
Career
·
PhD Research Intern
,
NVIDIA
2023–2024
·
PhD Research Intern Reality Lab
,
Meta
2022–2022
·
Doctor of Philosophy - PhD
,
美国加州大学伯克利分校
2020–2025
·
Undergraduate Researcher MSRA
,
微软
2018–2018
·
学士, 电子信息工程
,
华中科技大学
2015–2019
Publications
(66)
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
Computer Vision and Pattern Recognition · 2020
1,365
cited
Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration0
IEEE International Conference on Robotics and Automation · 2023
785
cited
Visual Transformers: Token-based Image Representation and Processing for Computer Vision
arXiv.org · 2020
680
cited
SqueezeSegV3: Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation
European Conference on Computer Vision · 2020
412
cited
Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration
IEEE International Conference on Robotics and Automation · 2024
282
cited
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
arXiv.org · 2023
222
cited
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection
International Conference on Learning Representations · 2022
211
cited
Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting
IEEE International Conference on Computer Vision · 2019
138
cited
AutoScale: Learning to Scale for Crowd Counting
International Journal of Computer Vision · 2019
124
cited
SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
IEEE International Conference on Computer Vision · 2023
114
cited
Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
International Conference on Machine Learning · 2025
101
cited
Open-Vocabulary Point-Cloud Object Detection without 3D Annotation
Computer Vision and Pattern Recognition · 2023
92
cited
Sparse R-CNN: An End-to-End Framework for Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence · 2023
87
cited
HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption
arXiv.org · 2023
66
cited
NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection
IEEE International Conference on Computer Vision · 2023
63
cited
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models
European Conference on Computer Vision · 2021
62
cited
Visual Transformers: Where Do Transformers Really Belong in Vision Models?
IEEE International Conference on Computer Vision · 2021
51
cited
Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning
Conference on Robot Learning · 2024
50
cited
RoVi-Aug: Robot and Viewpoint Augmentation for Cross-Embodiment Robot Learning
Conference on Robot Learning · 2024
47
cited
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation
arXiv.org · 2025
37
cited
Show all 66 papers →
Sotabase
Chenfeng Xu | Researcher Profile | Sotabase | Sotabase