Sotabase
Home
Researchers
Career
·
Assistant Professor, Department of Computer Science and Engineering
,
Pennsylvania State University
2021–
·
Postdoctoral Scholar in EECS department
,
UC Berkeley
2018–
Publications
(45)
Translating Videos to Natural Language Using Deep Recurrent Neural Networks
North American Chapter of the Association for Computational Linguistics · 2014
979
cited
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
European Conference on Computer Vision · 2015
782
cited
R-C3D: Region Convolutional 3D Network for Temporal Activity Detection
IEEE International Conference on Computer Vision · 2017
752
cited
Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning
IEEE International Conference on Computer Vision · 2020
432
cited
Multilevel Language and Vision Integration for Text-to-Clip Retrieval
AAAI Conference on Artificial Intelligence · 2018
349
cited
A New Meta-Baseline for Few-Shot Learning
arXiv.org · 2020
223
cited
Something-Else: Compositional Action Recognition With Spatial-Temporal Interaction Networks
Computer Vision and Pattern Recognition · 2019
193
cited
Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning
European Conference on Computer Vision · 2020
131
cited
Learning Canonical Representations for Scene Graph to Image Generation
European Conference on Computer Vision · 2019
118
cited
Spatio-Temporal Action Graph Networks
2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW) · 2018
85
cited
Learning Instance Activation Maps for Weakly Supervised Instance Segmentation
Computer Vision and Pattern Recognition · 2019
77
cited
LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval
IEEE Workshop/Winter Conference on Applications of Computer Vision · 2019
76
cited
A Multi-scale Multiple Instance Video Description Network
arXiv.org · 2015
64
cited
Joint Event Detection and Description in Continuous Video Streams
2019 IEEE Winter Applications of Computer Vision Workshops (WACVW) · 2018
58
cited
Two-Stream Region Convolutional 3D Network for Temporal Activity Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence · 2019
52
cited
Auxiliary Task Reweighting for Minimum-data Learning
Neural Information Processing Systems · 2020
41
cited
Text-to-Clip Video Retrieval with Early Fusion and Re-Captioning
arXiv.org · 2018
34
cited
Video Question Answering With Semantic Disentanglement and Reasoning
IEEE transactions on circuits and systems for video technology (Print) · 2024
21
cited
Classifying Collisions with Spatio-Temporal Action Graph Networks
arXiv.org · 2018
20
cited
TwoStreamVAN: Improving Motion Modeling in Video Generation
IEEE Workshop/Winter Conference on Applications of Computer Vision · 2018
18
cited
Show all 45 papers →
Sotabase