The Hidden Infinity in Preference Learning

An illustration of how length normalization aids learning from model-annotated data

Using LESS Data to Tune Models

Data Selection in the Era of LLMs

How to Scale Hyperparameters as Batch Size Increases

Understanding Optimization using Stochastic Differential Equations