Sotabase
Career
PhD Student, UC Berkeley · 2023–
Publications
CLIP-It! Language-Guided Video Summarization
Neural Information Processing Systems · 2021
156 cited
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
European Conference on Computer Vision · 2018
115 cited
Multi-Person 3D Motion Prediction with Multi-Range Transformers
Neural Information Processing Systems · 2021
93 cited
Modular Visual Question Answering via Code Generation
Annual Meeting of the Association for Computational Linguistics · 2023
62 cited
Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation
European Conference on Computer Vision · 2020
61 cited
Dynamic video anomaly detection and localization using sparse denoising autoencoders
Multimedia Tools and Applications · 2018
54 cited
TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency
European Conference on Computer Vision · 2022
42 cited
Learning and Verification of Task Structure in Instructional Videos
arXiv.org · 2023
24 cited
Strumming to the Beat: Audio-Conditioned Contrastive Video Textures
IEEE Workshop/Winter Conference on Applications of Computer Vision · 2021
21 cited
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
Neural Information Processing Systems · 2018
11 cited
LUSE: Using LLMs for Unsupervised Step Extraction in Instructional Videos
4 cited
EGA-FMC: enhanced genetic algorithm-based fuzzy k-modes clustering for categorical data
International Journal of Bio-Inspired Computation (IJBIC) · 2018
3 cited
Visual question answering using external knowledge
2019
1 cited
Multimodal Long-Term Video Understanding
2023