Sotabase
Career
Researcher in SNAP Group, Stanford University
Publications (27)
TransVG: End-to-End Visual Grounding with Transformers
IEEE International Conference on Computer Vision · 2021
442
cited
Anatomy-Aware 3D Human Pose Estimation With Bone-Based Pose Decomposition
IEEE Transactions on Circuits and Systems for Video Technology · 2021
296
cited
Improving One-stage Visual Grounding by Recursive Sub-query Construction
European Conference on Computer Vision · 2020
292
cited
Improving Text-Based Person Search by Spatial Matching and Adaptive Threshold
IEEE Winter Conference on Applications of Computer Vision · 2018
124
cited
One Transformer Can Understand Both 2D & 3D Molecular Data
International Conference on Learning Representations · 2022
121
cited
"Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention
European Conference on Computer Vision · 2018
91
cited
TransVG++: End-to-End Visual Grounding With Language Conditioned Vision Transformer
IEEE Transactions on Pattern Analysis and Machine Intelligence · 2022
91
cited
Grounding-Tracking-Integration
IEEE Transactions on Circuits and Systems for Video Technology · 2019
79
cited
Adaptive Offline Quintuplet Loss for Image-Text Matching
European Conference on Computer Vision · 2020
75
cited
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
AAAI Conference on Artificial Intelligence · 2020
70
cited
Large-Scale Tag-Based Font Retrieval With Generative Feature Learning
IEEE International Conference on Computer Vision · 2019
30
cited
How to Become Instagram Famous: Post Popularity Prediction with Dual-Attention
2018 IEEE International Conference on Big Data (Big Data) · 2018
25
cited
Anatomy-aware 3D Human Pose Estimation in Videos
arXiv.org · 2020
24
cited
When saliency meets sentiment: Understanding how image content invokes emotion and sentiment
IEEE International Conference on Image Processing · 2016
21
cited
Image Sentiment Transfer
ACM Multimedia · 2020
19
cited
Prediction of ball milling performance by a convolutional neural network model and transfer learning
Powder Technology · 2022
19
cited
Example-Guided Image Synthesis Using Masked Spatial-Channel Attention and Self-supervision
European Conference on Computer Vision · 2020
15
cited
RelGNN: Composite Message Passing for Relational Deep Learning
International Conference on Machine Learning · 2025
14
cited
A Selfie is Worth a Thousand Words: Mining Personal Patterns behind User Selfie-posting Behaviours
The Web Conference · 2017
10
cited
Adaptive Filtering for Event Recognition from Noisy Signal: an Application to Earthquake Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing · 2019
7
cited