Falsifying LoRA Alignment Geometry

Building a paper that tries to kill itself: stable rank (srank) as an overfitting signature in DPO fine-tuning, manuscript-as-code, adversarial passes that forced two claims to be retracted, and an interactive microsite as the publication artifact.

Stable Rank as an Overfitting Signature in LoRA Fine-Tuning

Apr 18, 2026

Why we picked stable rank to detect overfitting geometry in DPO vs CLM fine-tuning, how it connects to "alignment geometry," and what the BitFit baseline was there to check.

#ml-research #lora #fine-tuning #dpo #geometry #stable-rank

The Manuscript-as-Codebase Pattern

Apr 19, 2026

Hierarchical Makefiles, data-driven macro generation, and paper-scoped .gitignore: how treating a research paper like a software project caught hardcoded inconsistencies before reviewers did.

#ml-research #reproducibility #latex #makefile #workflow

Adversarial Passes That Killed Claims

Apr 22, 2026

Two hypotheses that started as clean ideas and ended as documented failures: the DPO-CLM orthogonal-complement hypothesis, and the cross-probe stable-rank (srank) claim, retracted as a length-bias artifact.

#ml-research #lora #dpo #falsification #methodology #negative-results

The Microsite as Interactive Publication

Apr 24, 2026

Building a GitHub Pages research microsite with d3 widgets, a Lean 4 theorem status page, and a reproducibility shim. One gotcha with Jekyll and Markdown inside divs. One real answer to whether live widgets are worth the effort.

#ml-research #d3 #jekyll #lean4 #reproducibility #visualization