Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training
COLM 2024
Zexuan Zhong
zzhong@cs.princeton.edu
I completed my Ph.D. at Princeton University, advised by Prof. Danqi Chen. Before that, I received an M.S. from University of Illinois at Urbana-Champaign and a B.S. from Peking University.
I am working at xAI now.
I am deeply interested in natural language processing (NLP) and deep learning. My recent research has focused on:
NAACL 2024
ACL 2023 (Tutorial)