I'm a master's student at Princeton Department of Computer Science, advised by Prof. Sanjeev Arora. I obtained my Bachelor's degree at Computer Science from Princeton University and had the fortune to work with Prof. Danqi Chen and Prof. Kai Li.
I am interested in the data-centric approach to characterize and improve LLMs. These day, I'm particular excited about what assumptions we can make about real-world data distribution to improve synthetic data quality.
I can be reached via jiatongy [at] princeton.edu