Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training
arXiv 2024
Zexuan Zhong
Hi! I am a fifth-year Ph.D. student at Princeton University, advised by Prof. Danqi Chen. I received an M.S. from the University of Illinois at Urbana-Champaign and a B.S. from Peking University. My Ph.D. research is partially supported by the J.P. Morgan PhD Fellowship. I have interned at Meta AI and Microsoft Research Asia.
I am deeply interested in natural language processing (NLP) and machine learning. My recent research has focused on:
NAACL 2024
ACL 2023 (Tutorial)