Posts tagged #fine-tuning
Stable Rank as an Overfitting Signature in LoRA Fine-Tuning
Why we chose stable rank to detect the geometric signature of overfitting in DPO vs. CLM fine-tuning, how it connects to "alignment geometry," and what the BitFit baseline was meant to check.