Sotabase
Home
Researchers
Career
·
Tech Lead
,
HPC-AI Tech
2024–
·
Research Intern
,
UC Berkeley
2024–
Publications
(23)
Open-Sora: Democratizing Efficient Video Production for All
arXiv.org · 2024
490
cited
Prototypical Cross-domain Self-supervised Learning for Few-shot Unsupervised Domain Adaptation
Computer Vision and Pattern Recognition · 2021
186
cited
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
International Conference on Machine Learning · 2024
160
cited
To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis
Neural Information Processing Systems · 2023
122
cited
Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models
IEEE International Conference on Computer Vision · 2023
113
cited
Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline
Neural Information Processing Systems · 2023
105
cited
Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k
arXiv.org · 2025
85
cited
InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning
International Conference on Learning Representations · 2023
79
cited
Prompt Vision Transformer for Domain Generalization
arXiv.org · 2022
61
cited
CAME: Confidence-guided Adaptive Memory Efficient Optimization
Annual Meeting of the Association for Computational Linguistics · 2023
36
cited
Cross-token Modeling with Conditional Computation
2021
29
cited
Sparse-MLP: A Fully-MLP Architecture with Conditional Computation
arXiv.org · 2021
24
cited
Multi-source Few-shot Domain Adaptation
arXiv.org · 2021
17
cited
Scene-aware Learning Network for Radar Object Detection
International Conference on Multimedia Retrieval · 2021
12
cited
DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
arXiv.org · 2024
11
cited
A Study on Transformer Configuration and Training Objective
International Conference on Machine Learning · 2022
10
cited
CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU
AAAI Conference on Artificial Intelligence · 2022
9
cited
How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning?
Conference on Empirical Methods in Natural Language Processing · 2024
9
cited
Deeper vs Wider: A Revisit of Transformer Configuration
arXiv.org · 2022
5
cited
Dataset Growth
European Conference on Computer Vision · 2024
4
cited
Show all 23 papers →
Sotabase