Chenfeng Xu | Researcher Profile | Sotabase

Career

· PhD Research Intern, NVIDIA2023–2024

· PhD Research Intern Reality Lab, Meta2022–2022

· Doctor of Philosophy - PhD, 美国加州大学伯克利分校2020–2025

· Undergraduate Researcher MSRA, 微软2018–2018

· 学士, 电子信息工程, 华中科技大学2015–2019

Publications (66)

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals

Computer Vision and Pattern Recognition · 2020

1,365

cited

Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration0

IEEE International Conference on Robotics and Automation · 2023

785

cited

Visual Transformers: Token-based Image Representation and Processing for Computer Vision

arXiv.org · 2020

680

cited

SqueezeSegV3: Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation

European Conference on Computer Vision · 2020

412

cited

Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration

IEEE International Conference on Robotics and Automation · 2024

282

cited

LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving

arXiv.org · 2023

222

cited

Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection

International Conference on Learning Representations · 2022

211

cited

Learn to Scale: Generating Multipolar Normalized Density Maps for Crowd Counting

IEEE International Conference on Computer Vision · 2019

138

cited

AutoScale: Learning to Scale for Crowd Counting

International Journal of Computer Vision · 2019

124

cited

SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection

IEEE International Conference on Computer Vision · 2023

114

cited

Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

International Conference on Machine Learning · 2025

101

cited

Open-Vocabulary Point-Cloud Object Detection without 3D Annotation

Computer Vision and Pattern Recognition · 2023

cited

Sparse R-CNN: An End-to-End Framework for Object Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence · 2023

cited

HallE-Switch: Rethinking and Controlling Object Existence Hallucinations in Large Vision Language Models for Detailed Caption

arXiv.org · 2023

cited

NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection

IEEE International Conference on Computer Vision · 2023

cited

Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models

European Conference on Computer Vision · 2021

cited

Visual Transformers: Where Do Transformers Really Belong in Vision Models?

IEEE International Conference on Computer Vision · 2021

cited

Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning

Conference on Robot Learning · 2024

cited

RoVi-Aug: Robot and Viewpoint Augmentation for Cross-Embodiment Robot Learning

Conference on Robot Learning · 2024

cited

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

arXiv.org · 2025

cited

Sotabase